pdf.js

Author	SHA1	Message	Date
Calixte Denizet	516aea5562	[XFA] Set default max value in occur tag to -1 (bug 1998843)	2025-11-21 17:53:38 +01:00
Calixte Denizet	bc87f4e8d6	Add the possibility to create a pdf from different ones (bug 1997379) For now it's only possible to create a single pdf by selecting some pages from different pdf sources. The merge is pretty basic for now (which is why it's still a WIP); none of the following data is merged yet: - the struct trees - the page labels - the outlines - named destinations. There are 2 new ref tests where some new pdfs are created: one with some extracted pages and another one (encrypted) which is just rewritten. The ref images are generated from the original pdfs by selecting the pages we want, and the new images are taken from the generated pdfs.	2025-11-07 14:57:48 +01:00
Calixte Denizet	19ff148163	Fix incremental saving with hybrid references This patch removes some previous fixes which are now likely fixed by #17636. Fixes #20302.	2025-10-04 18:31:55 +02:00
calixteman	adf9233f46	Merge pull request #20270 from calixteman/issue20232 Consider a ttf font with both Symbolic and Nonsymbolic flags set with a Differences array in the encoding dict as non-symbolic	2025-09-14 21:37:58 +02:00
Calixte Denizet	b6d772d71d	Consider a ttf font with both Symbolic and Nonsymbolic flags set with a Differences array in the encoding dict as non-symbolic It fixes #20232.	2025-09-14 18:52:16 +02:00
Nicolò Ribaudo	e4ea2e0c79	Store ops bboxes in a linear Uint8Array This PR changes the way we store bounding boxes so that they use less memory and can be more easily shared across threads in the future. Instead of storing the bounding box and list of dependencies for each operation that renders _something_, we now only store the bounding box of _every_ operation and no dependencies list. The bounding box of each operation covers the bounding box of all the operations affected by it that render something. For example, the bounding box of a `setFont` operation will be the bounding box of all the `showText` operations that use that font. This affects the debugging experience in pdfBug, since now the bounding box of an operation may be larger than what it renders itself. To help with this, now when hovering on an operation we also highlight (in red) all its dependents. We highlight with white stripes operations that do not affect any part of the page (i.e. with an empty bbox). To save memory, we now save bounding box x/y coordinates as uint8 rather than float64. This effectively gives us a 256x256 uniform grid that covers the page, which is high enough resolution for the use case.	2025-09-09 10:24:48 +02:00
Nicolò Ribaudo	6a22da9c2e	Add logic to track rendering area of various PDF ops This commit is a first step towards #6419, and it can also help with first compute which ops can affect what is visible in that part of the page. This commit adds logic to track operations with their respective bounding boxes. Only operations that actually cause something to be rendered have a bounding box and dependencies. Consider the following example: ``` 0. setFillRGBColor 1. beginText 2. showText "Hello" 3. endText 4. constructPath [...] -> eoFill ``` here we have two rendering operations: the showText op (2) and the path (4). (2) depends on (0), (1) and (3), while (4) only depends on (0). Both (2) and (4) have a bounding box. This tracking happens when first rendering a PDF: we then use the recorded information to optimize future partial renderings of a PDF, so that we can skip operations that do not affect the PDF area on the canvas. All this logic only runs when the new `enableOptimizedPartialRendering` preference, disabled by default, is enabled. The bounding boxes and dependencies are also shown in the pdfBug stepper. When hovering over a step now: - it highlights the steps that it depends on - it highlights on the PDF itself the bounding box	2025-08-22 18:26:59 +02:00
Calixte Denizet	78391ed85a	Fix the xref table with the values we've at the beginning of a xref stream (bug 1978317)	2025-07-22 22:10:23 +02:00
Calixte Denizet	8b17e5ecd8	Use canvas context text primitives when the font file is missing It fixes #20065. The only way to get a path (from the path generator) is when the font is embedded. So when we need a path (disableFontFace: true or when we want to use a pattern for stroking/filling), it's impossible to fulfil.	2025-07-18 19:57:30 +02:00
Calixte Denizet	8fc51dc089	[Editor] Add the possibility to add a popup to an annotation when saving When saving/printing, only update the properties which are provided and set a default value only when there is no pre-existing one.	2025-07-11 21:42:21 +02:00
Calixte Denizet	ecc7096a80	Fix the default appearance of a Polygon annotation when a fill color is provided It fixes #20062.	2025-07-08 20:51:58 +02:00
Calixte Denizet	bb52a440ce	Use the creation date in the popup when there is no modification date Remove the h1 element in popup title because it caused a warning in Firefox and use a span instead.	2025-07-07 10:51:35 +02:00
calixteman	2d0ba7db08	Merge pull request #20043 from yyliu12/popup-rotation-fix Make Popup annotations always have noRotate flag set as true	2025-07-03 17:27:40 +02:00
Yuyang Liu	d8ecfad8bd	Make Popup annotations always have noRotate flag set as true Necessary because when there is no Popup annotation created along with a Text annotation, the Popup annotation created by pdf.js does not receive the noRotate flag	2025-07-03 05:52:31 +09:00
Calixte Denizet	fc9ba0cda3	Remove the shadow from the links (bug 1974436) The shadow was taken into account when computing the bounding box of the section containing the link and it was making the clip path wrong. Since the shadow is almost invisible because of the opacity, the yellow color and the clip, we can remove it without causing any visual regressions (and as a side effect it'll avoid using resources to compute it when displayed).	2025-06-30 21:39:22 +02:00
Jonas Jenwald	c5449a98e0	Ignore empty paths when optimizing `constructPath` operations (issue 19971) Note how we're handling empty paths in [src/display/canvas.js](`a8e05d82e2/src/display/canvas.js (L1423-L1428)`), hence we need to add similar code in the `QueueOptimizer` as well.	2025-05-23 13:59:05 +02:00
Jonas Jenwald	5f5d9dfc28	Support Type3 fonts with an incomplete /FontDescriptor dictionary (issue 19954) We have a fallback for the common case of Type3 fonts without a /FontDescriptor dictionary, however we also need to handle the case where it's present but lacking the required /FontName entry.	2025-05-19 12:56:14 +02:00
Calixte Denizet	49a098cb5d	Decode appearance keys of checkboxes	2025-05-09 21:46:17 +02:00
Calixte Denizet	ac925f4f1b	Downscale jpeg2000 images, if needed, while decoding them It fixes #19517.	2025-05-05 22:39:59 +02:00
Calixte Denizet	7a251b206e	Fix the bbox when saving a rotated text field (bug 1963407)	2025-04-29 18:49:07 +02:00
Jonas Jenwald	64007e777e	Ensure that the /Form XObject /Resources-entry is actually a dictionary (issue 19848)	2025-04-23 10:19:20 +02:00
Jonas Jenwald	adc9eb5a5a	Always fallback to checking all destinations, when lookup fails (issue 19835) In the referenced PDF document the keys, in the /Dests dictionary, need to account for PDFDocEncoding. To improve destination handling in general we'll now unconditionally fallback to always checking all destinations.	2025-04-20 14:53:10 +02:00
Jonas Jenwald	1048508dd1	Catch circular references in /Form XObjects (issue 19800) For simplicity we will abort /Form XObject parsing immediately when encountering a circular reference, rather than letting it continue up until some limit (as e.g. PDFium appears to do), which should be fine since there are never any guarantees if/how corrupt PDF documents will render.	2025-04-11 16:54:22 +02:00
Jonas Jenwald	835a456767	Use `adjustWidths` unconditionally for all embedded fonts (issue 19802) Previously we'd only do this for Type1/CFF fonts, see e.g. PR 6736, since the font-program may update the /FontMatrix. However, it seems that we should do this unconditionally to account for fonts with non-default /FontMatrix-entries in the font-dictionary (which seem to be pretty rare).	2025-04-11 15:01:35 +02:00
Jonas Jenwald	fbc4f4b12a	Handle non-integer and out-of-range values correctly in Indexed color spaces In PDF version 2.0 the handling of Indexed color spaces was clarified as follows: > The index value should be an integer in the range 0 to hival. If the value is a real number, it shall be rounded to the nearest integer (0.5 values shall be rounded up); if it is outside the range 0 to hival, it shall be adjusted to the nearest value within that range. Please refer to https://github.com/pdf-association/pdf-differences/tree/main/IndexedColor	2025-04-09 15:31:49 +02:00
Jonas Jenwald	667645798f	Apply char/word-spacing correctly for missing Type3-glyphs In the included PDF document the Type3-font doesn't contain any glyph definition for "space", despite that character being referenced in the /Contents stream. While missing Type3-glyphs obviously cannot be rendered, we still need to update the current canvas position such that any char/word-spacing is correctly applied. The test-case was found at https://github.com/pdf-association/pdf-differences/tree/main/Type3WordSpacing	2025-03-29 00:12:08 +01:00
Jonas Jenwald	f577271908	Simplify the `compileType3Glyph` function to just return the `Path2D` objects Originally this function would "manually" invoke the rendering commands for Type3-glyphs, however that was changed some time ago: - Initial `Path2D` support was added in PR 14858, but the old code kept for Node.js compatibility. - Since PR 15951 we've been using a `Path2D` polyfill in Node.js environments. Hence, after the previous commit, we can further simplify this function by directly returning/using the `Path2D` object when rendering Type3-glyphs; see also https://github.com/mozilla/pdf.js/pull/19731#discussion_r2018712695 While this won't improve performance significantly, when compared to the introduction of `Path2D`, it definitely cannot hurt.	2025-03-28 15:20:43 +01:00
Calixte Denizet	1d0227af62	Don't overwrite the global alpha when switching to smask mode It fixes #16287.	2025-03-24 21:35:09 +01:00
Calixte Denizet	a3c31904f1	Take into account the group bbox It fixes #16742.	2025-03-24 15:07:31 +01:00
Calixte Denizet	2369e2d84f	Take into account the path and the line width when consuming a stroked path	2025-03-23 18:08:06 +01:00
calixteman	d009e4b3a7	Merge pull request #19689 from calixteman/use_path2d [api-minor] Use a Path2D when doing a path operation in the canvas (bug 1946953)	2025-03-22 21:46:27 +01:00
Calixte Denizet	be1f5671bb	[api-minor] Use a Path2D when doing a path operation in the canvas (bug 1946953) With this patch, all the path components are collected in the worker until a path operation is met (i.e., stroke, fill, ...). Then in the canvas a Path2D is created and will replace the path data transferred from the worker, this way when rescaling, the Path2D can be reused. In terms of performance, using a Path2D very slightly improves speed when scaling the canvas.	2025-03-22 20:35:24 +01:00
Jonas Jenwald	f90732e4c3	Extend `getSupplementalGlyphMapForCalibri` with Pound-sign (issue 19695)	2025-03-20 20:12:24 +01:00
Jonas Jenwald	afb14bdc0b	For JPEG images with CMYK-data, ensure that the alpha-component is set correctly when WebAssembly is disabled (issue 19676)	2025-03-17 16:15:32 +01:00
Jonas Jenwald	ef01ceda1b	Change a couple of "password" ref-tests to "eq" tests Currently these are just "load" tests, and by also testing rendering we get slightly better test-coverage for the `src/core/crypto.js` file.	2025-03-13 11:58:13 +01:00
Jonas Jenwald	ee34c5c648	Let `Lexer.prototype.getNumber` treat more cases of a single minus sign as zero (bug 1953099) This patch extends the approach of PR 14543, by also treating e.g. minus signs followed by '(' or '<' as zero. Inside of a /Contents stream those characters will generally mean the start of one or more glyphs.	2025-03-12 17:50:13 +01:00
Calixte Denizet	4b4f85484e	Always use the absolute value of the line thickness (issue 19633)	2025-03-11 14:03:23 +01:00
Jonas Jenwald	0edfd29a3e	Improve text-selection for Type3 fonts, using `d0` operators, with empty /FontBBox-entries (issue 19624) For Type3 glyphs with `d1` operators it's easy to compute a fallback bounding box, however for `d0` the situation is more difficult. Given that we nowadays compute the min/max of basic path-rendering operators on the worker-thread, we can utilize that by parsing these Type3 operatorLists to guess a more suitable fallback bounding box.	2025-03-10 16:21:54 +01:00
Jonas Jenwald	10a99ea0a7	Let SMask/Mask images fallback to the parent image dimensions (issue 19611) One of the images has a corrupt SMask, where the /Height-entry is bogus; see the excerpt below (via https://brendandahl.github.io/pdf.js.utils/browser/). ``` SMask (stream) [id: 17, gen: 0] ColorSpace = /DeviceGray Height = /Length Subtype = /Image Filter = /FlateDecode Type = /XObject Width = 157 Matte (array) BitsPerComponent = 8 Length = 3893 <view contents> download ``` Hence we enable SMask/Mask images to fallback to the parent image dimensions, and also add more validation of the width/height to get a better error message when that data is wrong.	2025-03-10 12:37:44 +01:00
Calixte Denizet	971be48b60	Support using ICC profiles by using qcms (bug 860023)	2025-03-05 10:29:59 +01:00
Jonas Jenwald	59cb9a064e	Extend `getGlyphMapForStandardFonts` with some Cyrillic entries (issue 19550)	2025-02-26 10:16:06 +01:00
Ross Johnson	4f25d7f6cd	Fix decryption of R=4, V=4 files with < 16-byte keys by 0-padding - undocumented but matches Acrobat behavior (issue #19484)	2025-02-24 15:36:37 -06:00
Jonas Jenwald	641e2f506e	[api-minor] Re-factor how the `useWorkerFetch` option is used internally With the recently added OpenJPEG no-wasm fallback we need to send the `wasmUrl` option to the worker-thread regardless of the value of the `useWorkerFetch` option, since the fallback won't work if we don't have a URL to `import` it from. For consistency the code is re-factored to always send the factory-urls to the worker-thread, and simply check the `useWorkerFetch` option there instead. Also, as a follow-up to PR 19525, introduce a new `useWasm` option that can be used in e.g. browser-tests to forcibly disable WebAssembly usage.	2025-02-22 09:56:53 +01:00
Jonas Jenwald	6d3bb47655	Merge pull request #19525 from calixteman/bug1935076_part2 Provide a js fallback when the wasm version of openjpeg is failing to load (bug 1935076)	2025-02-22 09:34:40 +01:00
Xiphoseer	24aa39eb14	Consider textRise when showing type3 font glyphs Add test case for issue 19532	2025-02-21 21:31:04 +01:00
Calixte Denizet	36e4f5c222	Provide a js fallback when the wasm version of openjpeg is failing to load (bug 1935076)	2025-02-21 19:03:47 +01:00
Jonas Jenwald	db7cf40a30	Don't cache free/missing XRef entries (issue 19510) During the XRef stream parsing we're attempting to lookup an entry that hasn't yet been found, since parsing is currently running, and given that we'd also cache free/missing XRef entries we'd then return an incorrect value during normal PDF parsing. The simplest solution here is to just not cache free/missing XRef entries, since a properly generated PDF document shouldn't be trying to access objects it doesn't contain. Furthermore, the amount of "extra" parsing now needed for such XRef entries shouldn't be significant enough to be an issue.	2025-02-18 18:04:00 +01:00
Jonas Jenwald	65df1d336f	Check more of the stream when looking for commands after inline image (issue 19494) Currently we only check `followingBytes`, which turns out to be too short to find e.g. valid transform (cm) commands with decimal arguments.	2025-02-15 15:14:47 +01:00
Jonas Jenwald	bd05b255fa	[api-major] Apply the `userUnit` using CSS, to fix the text/annotation layers (bug 1947248) Rather than modifying the "raw" dimensions of the page, we'll instead apply the `userUnit` as an additional scale-factor via CSS. Please note: It's not clear to me if this solution is fully correct either, or if there's other problems with it, but it at least appears to work. --- With these changes, the following CSS variables are now assumed to be available/set as necessary: `--total-scale-factor`, `--scale-factor`, `--user-unit`, `--scale-round-x`, and `--scale-round-y`.	2025-02-11 14:36:06 +01:00
Calixte Denizet	24417a1a0b	[Editor] Add the ability to print and save some newly added signatures (bug 1946795)	2025-02-07 23:07:27 +01:00
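
The PDF 2.0 rule quoted in commit fbc4f4b12a (round real index values to the nearest integer with 0.5 rounded up, then adjust to the range 0 to hival) can be sketched as follows. This is only an illustration of the quoted wording, not the code from that commit; the function name `normalizeIndex` and its parameters are hypothetical.

```javascript
// Hypothetical helper, NOT taken from pdf.js: normalize a raw index value
// for an Indexed color space as described by the quoted PDF 2.0 text.
function normalizeIndex(value, hival) {
  // Math.round rounds to the nearest integer and rounds .5 values up.
  const rounded = Math.round(value);
  // Adjust out-of-range values to the nearest value within [0, hival].
  return Math.min(Math.max(rounded, 0), hival);
}

// Example inputs for a palette with hival = 255.
const samples = [2.4, 2.5, -3, 300].map(v => normalizeIndex(v, 255));
console.log(samples);
```

Clamping after rounding keeps the result an integer inside the palette, so a lookup that multiplies the index by the number of color components never reads outside the lookup table.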

1437 Commits