pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	b629bafd1c	Allow to, optionally, keep Unicode escape sequences in `stringToPDFString` (PR 17331 follow-up) Currently some of the links[1] on page three of the `issue19835.pdf` test-case aren't clickable, since the destination (of the LinkAnnotation) becomes empty. The reason is that these destinations include the character `\x1b`, which is interpreted as the start of a Unicode escape sequence specifying the language of the string; please refer to section [7.9.2.2 Text String Type](https://opensource.adobe.com/dc-acrobat-sdk-docs/pdfstandards/PDF32000_2008.pdf#G6.1957385) in the PDF specification. Hence it seems that we need a way to optionally disable that behaviour, to avoid a "badly" formatted string from becoming empty (or truncated), at least for cases where we are: - Parsing named destinations[2] and URLs. - Handling "strings" that are actually /Name-instances. - Building a lookup Object/Map based on some PDF data-structure. NOTE: The issue that prompted this patch is obviously related to destinations, however I've gone through the `src/core/` folder and updated various other `stringToPDFString` call-sites that (directly or indirectly) fit the categories listed above. --- [1] Try clicking on anything on the line containing "Item 7A. Quantitative and Qualitative Disclosures About Market Risk 27". [2] Unfortunately just skipping `stringToPDFString` in this case would cause other issues, such as the named destination becoming "unusable" in the viewer; see e.g. issues 14847 and 14864.	2025-04-30 20:51:10 +02:00
Calixte Denizet	7a251b206e	Fix the bbox when saving a rotated text field (bug 1963407)	2025-04-29 18:49:07 +02:00
Tim van der Meij	967e34046a	Fix the "must work properly when selecting undo by keyboard" integration test This integration test fails intermittently because the undo button can only be activated if focus can be put on it, and that in turn can only happen if it's visible. The test tried to make sure that the undo bar is visible, but checking for the absence of the `hidden` attribute is unfortunately not enough to assert visibility according to Puppeteer documentation [1]. Moreover, the undo button wasn't checked at all. To fix the issue we let Puppeteer do the visibility detection for the undo bar by providing the `visible: true` option to `waitForSelector` [2]. This is consistent with the other tests that already do this, and also with the existing code that detects if the undo bar is hidden (which uses the `hidden: true` option of `waitForSelector`). Moreover, we wait for the undo button to be present before putting focus on it. For consistency, and to avoid intermittent failures elsewhere, we mirror this solution to the other undo bar/button tests of the various editors. [1] https://pptr.dev/api/puppeteer.elementhandle.isvisible [2] https://pptr.dev/api/puppeteer.waitforselectoroptions	2025-04-27 14:00:50 +02:00
Jonas Jenwald	abc9522886	Avoid (most) string parsing when removing/replacing the hash property of a URL	2025-04-25 23:13:05 +02:00
Calixte Denizet	785991a97c	Fix 'print to pdf' on Mac with a cid font (bug 1961423)	2025-04-24 20:19:12 +02:00
Jonas Jenwald	de2a44a558	Merge pull request #19849 from Snuffleupagus/issue-19848 Ensure that the /Form XObject /Resources-entry is actually a dictionary (issue 19848)	2025-04-23 16:48:06 +02:00
Jonas Jenwald	ae1cbc6a9e	Use the `light-dark` CSS function in the viewer (issue 17780) This removes the need for (most) separate `@media (prefers-color-scheme: dark)` blocks when defining colors values, and also provides a simple way of forcing use of either the light or dark theme. Please refer to https://developer.mozilla.org/en-US/docs/Web/CSS/color_value/light-dark and https://developer.mozilla.org/en-US/docs/Web/CSS/color-scheme NOTE: To support this in older browsers, we utilize a [PostCSS plugin](https://github.com/csstools/postcss-plugins/tree/main/plugins/postcss-light-dark-function).	2025-04-23 15:31:39 +02:00
Jonas Jenwald	64007e777e	Ensure that the /Form XObject /Resources-entry is actually a dictionary (issue 19848)	2025-04-23 10:19:20 +02:00
Jonas Jenwald	adc9eb5a5a	Always fallback to checking all destinations, when lookup fails (issue 19835) In the referenced PDF document the keys, in the /Dests dictionary, need to account for PDFDocEncoding. To improve destination handling in general we'll now unconditionally fallback to always checking all destinations.	2025-04-20 14:53:10 +02:00
Tim van der Meij	923753a66b	Skip the "must check that canvas perfectly fits the page whatever the zoom level" integration test in Chrome This is a temporary measure to reduce noise until #19811 is fixed. Note that this shouldn't be an issue in terms of coverage because we still run the test in Firefox.	2025-04-13 19:32:39 +02:00
Tim van der Meij	efe4a3455a	Remove debug code from the integration tests Doing so simplifies the code a bit, and such code is generally not checked in because it can easily be added locally if needed.	2025-04-13 19:19:13 +02:00
Tim van der Meij	35f85c55bd	Merge pull request #19810 from Snuffleupagus/issue-19808 Update Webpack to version `5.99.5` (issue 19808)	2025-04-13 18:43:19 +02:00
Jonas Jenwald	6b961c424f	Update Webpack to version `5.99.5` (issue 19808) In Webpack version `5.99.0` the way that `export` statements are handled was changed slightly, with much less boilerplate code being generated, which unfortunately breaks our `tweakWebpackOutput` function that's used to expose the exported properties globally and that e.g. the viewer depends upon. Given that we were depending on formatting that should most likely be viewed as nothing more than an internal implementation detail in Webpack, we instead work-around this by manually defining the structures that were previously generated. Obviously this will lead to a tiny bit more manual work in the future, however we don't change the API-surface often enough that it should be a big issue and the relevant unit-tests are updated such that it shouldn't be possible to break this. NOTE: In the future we might want to consider no longer using global properties like this, and instead rely only on proper `export`s throughout the code-base. However changing this would likely be non-trivial (given edge-cases), and it'd be an `api-major` change, so let's just do the minimal amount of work to unblock Webpack updates for now.	2025-04-13 16:48:19 +02:00
Tim van der Meij	938430be5b	Remove workaround from the "must convert input to uppercase" scripting integration test It's no longer necessary after commit 1c73e52 that caused the document to be closed properly between tests, and this therefore partly reverts commit 973b67f.	2025-04-13 15:57:54 +02:00
Tim van der Meij	5f46791887	Run the integration tests in a random order This commit configures Jasmine to no longer run the tests in a fixed order, which combined with the previous isolation commits avoids being able to accidentally introduce dependencies between integration tests.	2025-04-13 15:46:03 +02:00
Tim van der Meij	75b321b000	Isolate the viewer integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:24:15 +02:00
Tim van der Meij	965949c259	Isolate the text layer integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:24:15 +02:00
Tim van der Meij	6308f5eb51	Isolate the text field integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:24:15 +02:00
Tim van der Meij	7c8262ccc9	Isolate the stamp editor integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:24:15 +02:00
Tim van der Meij	44243761a5	Isolate the highlight editor integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:20:27 +02:00
Tim van der Meij	5011d96a62	Isolate the freetext editor integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:20:23 +02:00
Tim van der Meij	f043760b54	Isolate the find integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:20:18 +02:00
Tim van der Meij	179859f54a	Isolate the copy/paste integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:20:14 +02:00
Tim van der Meij	0e9e4ec64f	Isolate the caret browsing integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:20:08 +02:00
Tim van der Meij	a60f9bc612	Isolate the autolinker integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:20:03 +02:00
Tim van der Meij	e1cbba4ded	Isolate the annotation integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:19:59 +02:00
Tim van der Meij	193bebd78a	Isolate the accessibility integration tests To avoid being able to introduce dependencies between tests this commit makes sure that we close the document between tests so that we can't accidentally rely on state set by a previous test.	2025-04-13 15:18:19 +02:00
Tim van der Meij	f482232493	Merge pull request #19790 from Snuffleupagus/test-getAnnotationStorage-rm-entries Simplify the `getAnnotationStorage` integration-test helper function	2025-04-12 13:36:20 +02:00
Jonas Jenwald	1048508dd1	Catch circular references in /Form XObjects (issue 19800) For simplicity we will abort /Form XObject parsing immediately when encountering a circular reference, rather than letting it continue up until some limit (as e.g. PDFium appears to do), which should be fine since there are never any guarantees if/how corrupt PDF documents will render.	2025-04-11 16:54:22 +02:00
Jonas Jenwald	835a456767	Use `adjustWidths` unconditionally for all embedded fonts (issue 19802) Previously we'd only do this for Type1/CFF fonts, see e.g. PR 6736, since the font-program may update the /FontMatrix. However, it seems that we should do this unconditionally to account for fonts with non-default /FontMatrix-entries in the font-dictionary (which seem to be pretty rare).	2025-04-11 15:01:35 +02:00
Jonas Jenwald	fbc4f4b12a	Handle non-integer and out-of-range values correctly in Indexed color spaces In PDF version 2.0 the handling of Indexed color spaces was clarified as follows: > The index value should be an integer in the range 0 to hival. If the value is a real number, it shall be rounded to the nearest integer (0.5 values shall be rounded up); if it is outside the range 0 to hival, it shall be adjusted to the nearest value within that range. Please refer to https://github.com/pdf-association/pdf-differences/tree/main/IndexedColor	2025-04-09 15:31:49 +02:00
Jonas Jenwald	60bf9b133a	Simplify the `getAnnotationStorage` integration-test helper function As can be seen [at MDN](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Object/fromEntries#parameters) the `Object.fromEntries` method accepts a `Map` directly, hence using the `Map.prototype.entries` method is superfluous here.	2025-04-09 13:42:22 +02:00
Jonas Jenwald	22657e2b6e	Merge pull request #19778 from Snuffleupagus/replace-getAll [api-minor] Replace various `getAll` methods with iterators	2025-04-08 17:26:24 +02:00
Jonas Jenwald	38d01250f4	Enable the `no-array-reduce` ESLint plugin rule Please see https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/no-array-reduce.md Note that this still allows "simple" usage of `Array.prototype.reduce`, however most of those cases will be possible to replace with `Math.sumPrecise` once that becomes generally available (currently not supported in Node.js or QuickJS).	2025-04-08 12:21:20 +02:00
Nicolò Ribaudo	da5b681b16	[chrome] Fix text selection with `.markedContent` The current text layer approach based on absolutely positioned `<span>` elements by default causes flickering with text selection, and we have browser-specific workarounds to solve that. In Chrome, the workaround involves moving the `.endOfContent` element to right after the last element that contains some selected content. This works well in simple PDFs, but breaks when we have `span.markedContent` elements. Given a text layer structure like the following, rendered as four consecutive lines: ```html <span class="markedContent"> <br> <span>development enter the construction phase (estimated at around</span> </span> <span class="markedContent"> <br> <span>300 MEUR).</span> </span> <span class="markedContent"> <br> <span>Kreate's EBITA increased to 2.8 MEUR (Q4'23: 2.7 MEUR) and the</span> </span> <span class="markedContent"> <br> <span>margin rose to 3.7% (Q4'23: 3.4%). However, profitability was</span> </span> ``` when starting to select from inside the first line and dragging down to the empty space after the second line, Chrome will anchor the selection at the beginning of either the `<br>` or the `<span>` inside the last `.markedContent`, depending on whether the selection is in "per-character mode" (i.e. click and drag) or "per-word mode" (i.e. double click and drag). This causes us to insert the `.endOfContent` element in the wrong place (one element too far), which causes one more line to be selected, which triggers another `"selecctionchange"` event, which causes us to move `.endOfContent` again, and so on, looping until when the whole page is selected. This commit fixes the issue by making sure that when the end of the selection range points to the _begining_ of an element, we walk back the dom finding the first non-empty element, and attatch `.endOfContent` to the end of that.	2025-04-07 17:36:40 +02:00
Jonas Jenwald	2c593b06e4	[api-minor] Replace various `getAll` methods with iterators These `getAll` methods are not used anywhere within the PDF.js code-base, outside of tests, and were mostly added (speculatively) for third-party users. To still allow access to the same data we instead introduce iterators on these classes, which (slightly) shortens the code and allows us to remove the `objectFromMap` helper function. A summary of the changes in this patch: - Replace the `getAll` methods with iterators in the following classes: `AnnotationStorage`, `Metadata`, and `OptionalContentGroup`. - Change, and also re-name, `AnnotationStorage.prototype.setAll` into a test-only method since it's not used elsewhere. - Remove the `Metadata.prototype.has` method, since it's only used in tests and can be trivially replaced by calling `Metadata.prototype.get` and checking if the returned value is `null`.	2025-04-06 21:43:16 +02:00
Tim van der Meij	72feb4c256	Merge pull request #19775 from Snuffleupagus/AFSimple_Calculate-MIN-MAX-destructuring Use Array-destructuring when computing MIN/MAX in `AFSimple_Calculate`	2025-04-06 18:32:37 +02:00
Jonas Jenwald	ec5b5184d3	Remove `Array.prototype.reduce` usage from the unit-tests Using `Array.prototype.reduce` often leads to less readable code, and in these cases we can replace it with other Array-methods instead.	2025-04-06 12:12:37 +02:00
Jonas Jenwald	0845552ff9	Use Array-destructuring when computing MIN/MAX in `AFSimple_Calculate` This appears to work fine with QuickJS, as evident by the added unit-test, and allows us to remove more `Array.prototype.reduce` usage.	2025-04-06 11:44:08 +02:00
Jonas Jenwald	dad6febc39	Pass the /Info-strings as a `Map` to the `src/core/writer.js` code We want to iterate through the data in the `computeMD5` function, and `Map`s have "nicer" support for that than generic objects. (Somewhat recently `Map` performance was improved in Firefox, however this also isn't really performance sensitive code.)	2025-04-04 13:36:13 +02:00
Tim van der Meij	ed08a6a988	Isolate the "basic operations" freetext editor integration tests This commit reduces the number of freetext editor integration test suite failures, in full isolation, from 5 to 0 by fixing the following issues in the "basic operations" block: - Most tests relied on the first test to enable freetext editing mode. For isolation we now do it explicitly in all tests. - Most tests relied on the other tests having created editors. For isolation we now create the editors explicitly in the tests themselves. - Most tests relied on previous tests for the editor numbering. For isolation we change the editor numbering to the one after initial document load. Since we can't have state (editors) from a previous test anymore we can remove various `clearAll` calls as well.	2025-03-30 18:52:19 +02:00
Tim van der Meij	45cbe8bb0d	Merge pull request #19735 from Snuffleupagus/Type3WordSpacing Apply char/word-spacing correctly for missing Type3-glyphs	2025-03-30 12:07:54 +02:00
Jonas Jenwald	5b7b94c078	Spell "properly" in viewer integration-test names	2025-03-29 16:35:22 +01:00
Jonas Jenwald	667645798f	Apply char/word-spacing correctly for missing Type3-glyphs In the included PDF document the Type3-font doesn't contain any glyph definition for "space", despite that character being referenced in the /Contents stream. While missing Type3-glyphs obviously cannot be rendered, we still need to update the current canvas position such that any char/word-spacing is correctly applied. The test-case was found at https://github.com/pdf-association/pdf-differences/tree/main/Type3WordSpacing	2025-03-29 00:12:08 +01:00
Jonas Jenwald	f577271908	Simplify the `compileType3Glyph` function to just return the `Path2D` objects Originally this function would "manually" invoke the rendering commands for Type3-glyphs, however that was changed some time ago: - Initial `Path2D` support was added in PR 14858, but the old code kept for Node.js compatibility. - Since PR 15951 we've been using a `Path2D` polyfill in Node.js environments. Hence, after the previous commit, we can further simplify this function by directly returning/using the `Path2D` object when rendering Type3-glyphs; see also https://github.com/mozilla/pdf.js/pull/19731#discussion_r2018712695 While this won't improve performance significantly, when compared to the introduction of `Path2D`, it definately cannot hurt.	2025-03-28 15:20:43 +01:00
Calixte Denizet	7d8c64b2ff	[Editor] Set an aria-description for an added signature (bug 1956513) NVDA behaves differently depending if the user is hovering or focusing an added signature. An aria-description is read in both cases while an aria-label is not.	2025-03-27 18:14:06 +01:00
Calixte Denizet	62df96894d	[Editor] in the signature UI, disable the description input if there is no signature (bug 1956114)	2025-03-26 23:02:13 +01:00
Calixte Denizet	1d0227af62	Don't overwrite the global alpha when switching to smask mode It fixes #issue16287.	2025-03-24 21:35:09 +01:00
Calixte Denizet	a3c31904f1	Take into account the group bbox It fixes #16742.	2025-03-24 15:07:31 +01:00
calixteman	bcba053a5e	Merge pull request #19715 from calixteman/issue18529 Take into account the path and the line width when consuming a stroked path	2025-03-24 14:41:29 +01:00

... 3 4 5 6 7 ...

4233 Commits