pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	f26f984fa0	Improve validation in the `Catalog.prototype.openAction` getter When the /OpenAction data is an Array we're currently using it as-is which could theoretically cause problems in corrupt PDF documents, hence we ensure that a "raw" destination is actually valid. (This change is covered by existing unit-tests.) Note: In the Dictionary case we're using the `Catalog.parseDestDictionary` method, which already handles all of the necessary validation.	2025-05-10 11:51:58 +02:00
calixteman	293506ada7	Merge pull request #19903 from Snuffleupagus/shorten-fieldObjects-getter Shorten the `PDFDocument.prototype.fieldObjects` getter slightly	2025-05-09 15:49:51 +02:00
calixteman	ff0d9b13a7	Merge pull request #19902 from Snuffleupagus/core-document-shorten Shorten the code in the `src/core/document.js` file	2025-05-09 15:48:49 +02:00
Calixte Denizet	1225c1e39a	Add a pref in order to cap the canvas area to a factor of the window one (bug 1958015) This way it helps to reduce the overall canvas dimensions and make the rendering faster. The drawback is that when scrolling, the page can be blurry in waiting for the rendering. The default value is 200% on desktop and will be 100% for GeckoView.	2025-05-09 13:57:16 +02:00
Jonas Jenwald	1f7581b5c6	Shorten the `PDFDocument.prototype.fieldObjects` getter slightly The effect is probably not even measurable, however this patch ever so slightly reduces the asynchronicity in the `fieldObjects` getter. These changes should be safe since: - We're inside of the `PDFDocument`-class and the `annotationGlobals`-getter, which will always return a (shadowed) Promise and won't throw `MissingDataException`s, can be accessed directly without going through the `BasePdfManager`-instance. - The `acroForm`-dictionary can be accessed through the `annotationGlobals`-data, removing the need to "manually" look it up and thus the need for using `Promise.all` here. - We can also lookup the /Fields-data, in the `acroForm`-dictionary, synchronously since the initial `formInfo.hasFields` check guarantees that it's available.	2025-05-07 17:47:09 +02:00
Jonas Jenwald	36fafbc05c	Use object destructuring a bit more in the `src/core/document.js` file	2025-05-07 13:41:50 +02:00
Jonas Jenwald	92b065c87e	Replace a number of semi-private fields with actual private ones in `src/core/document.js` These are fields that can be moved out of their class constructors, and be initialized directly.	2025-05-07 13:41:44 +02:00
Jonas Jenwald	39803a9f25	Replace a number of semi-private methods with actual private ones in `src/core/document.js` There's a few remaining cases that are used with either cached getters or `BasePdfManager.prototype.ensure`-methods, and those cannot be converted.	2025-05-07 13:41:36 +02:00
Jonas Jenwald	0ded85e9b3	Add a `Page` helper method to create a `PartialEvaluator`-instance Currently we repeat the same identical code five times in the `Page`-class when creating a `PartialEvaluator`-instance, which given the number of parameters it needs seems like unnecessary duplication.	2025-05-07 13:41:29 +02:00
Jonas Jenwald	62009ffa70	Simplify how the `ObjectLoader` is used The `ObjectLoader.prototype.load` method has a fast-path, which avoids any lookup/parsing if the entire PDF document is already loaded. However, we still need to create an `ObjectLoader`-instance which seems unnecessary in that case. Hence we introduce a static `ObjectLoader.load` method, which will help avoid creating `ObjectLoader`-instances needlessly and also (slightly) shortens the call-sites. To ensure that the new method will be used, we extend the `no-restricted-syntax` ESLint rule to "forbid" direct usage of `new ObjectLoader()`.	2025-05-06 15:49:59 +02:00
Jonas Jenwald	ef1ad675c2	Unify method return values in the `ObjectLoader` class Given that all the methods are already asynchronous we can just use `await` more throughout this code, rather than having to explicitly return function-calls and `undefined`. Note also how none of the `ObjectLoader.prototype.load` call-sites use the return value.	2025-05-06 15:43:00 +02:00
Calixte Denizet	ac925f4f1b	Downscale jpeg2000 images, if needed, while decoding them It fixes #19517.	2025-05-05 22:39:59 +02:00
Jonas Jenwald	d9548b1c18	Slightly re-factor how we pre-load fonts and images in XFA documents Rather than "manually" invoking the methods from the `src/core/worker.js` file we introduce a single `PDFDocument`-method that handles this for us, and make the current methods private. Since this code is only invoked at most once per document, and only for XFA documents, we can use `BasePdfManager.prototype.ensureDoc` directly rather than needing a stand-alone method.	2025-05-04 13:44:33 +02:00
Jonas Jenwald	604153957a	Reduce duplication when parsing fonts in `loadXfaFonts` Currently we repeat virtually the same code when calling the `PartialEvaluator.prototype.handleSetFont` method, which we can avoid by introducing an inline helper function.	2025-05-04 13:42:17 +02:00
Jonas Jenwald	2979e23f3c	Ensure that `XFAFactory.prototype.isValid` returns a boolean value Considering the name of the method, and how it's actually being used, you'd expect it to return a boolean value. Given how it's currently being used this inconsistency doesn't cause any issues, however we should still fix this.	2025-05-04 13:42:17 +02:00
Tim van der Meij	5ca57fbd4b	Merge pull request #19885 from Snuffleupagus/loadXfaImages-simplify Simplify the `loadXfaImages` method and related code	2025-05-04 13:41:06 +02:00
Tim van der Meij	22cb3080ee	Merge pull request #19887 from Snuffleupagus/serializeXfaData-simplify Simplify the `serializeXfaData` method and related code	2025-05-04 13:38:01 +02:00
Jonas Jenwald	b3e16800f5	Remove the `BasePdfManager.prototype.catalog` getter This is only invoked once and it can be trivially replaced by the `ensureCatalog`-method, since the code where it's used is already asynchronous.	2025-05-03 13:40:23 +02:00
Jonas Jenwald	b531720d9c	Simplify the `serializeXfaData` method and related code Rather than having a dedicated `BasePdfManager`-method for this one call-site we can instead change `PDFDocument.prototype.serializeXfaData` to a non-async method, that we invoke via `BasePdfManager.prototype.ensureDoc`.	2025-05-03 11:20:42 +02:00
Jonas Jenwald	122822a750	Simplify the `loadXfaImages` method and related code Currently we create an intermediate `Dict` during parsing, however that seems unnecessary since (note especially the second point): - The `NameOrNumberTree.prototype.getAll` method will already resolve any references, as needed, during parsing. - The `Catalog.prototype.xfaImages` getter is invoked, via the `BasePdfManager`-instance, such that any `MissingDataException`s are already handled correctly.	2025-05-02 11:53:41 +02:00
calixteman	91bfe12f38	Merge pull request #19883 from gpanakkal/checkbutton-tostyle Fix arguments in `toStyle` call in `CheckButton`	2025-05-01 22:08:03 +02:00
Gautam Panakkal	7bba3bd4ad	Add missing `this` arg to `toStyle` in `CheckButton.prototype.[$toHTML]`	2025-05-01 10:19:28 -07:00
Jonas Jenwald	b629bafd1c	Allow to, optionally, keep Unicode escape sequences in `stringToPDFString` (PR 17331 follow-up) Currently some of the links[1] on page three of the `issue19835.pdf` test-case aren't clickable, since the destination (of the LinkAnnotation) becomes empty. The reason is that these destinations include the character `\x1b`, which is interpreted as the start of a Unicode escape sequence specifying the language of the string; please refer to section [7.9.2.2 Text String Type](https://opensource.adobe.com/dc-acrobat-sdk-docs/pdfstandards/PDF32000_2008.pdf#G6.1957385) in the PDF specification. Hence it seems that we need a way to optionally disable that behaviour, to avoid a "badly" formatted string from becoming empty (or truncated), at least for cases where we are: - Parsing named destinations[2] and URLs. - Handling "strings" that are actually /Name-instances. - Building a lookup Object/Map based on some PDF data-structure. NOTE: The issue that prompted this patch is obviously related to destinations, however I've gone through the `src/core/` folder and updated various other `stringToPDFString` call-sites that (directly or indirectly) fit the categories listed above. --- [1] Try clicking on anything on the line containing "Item 7A. Quantitative and Qualitative Disclosures About Market Risk 27". [2] Unfortunately just skipping `stringToPDFString` in this case would cause other issues, such as the named destination becoming "unusable" in the viewer; see e.g. issues 14847 and 14864.	2025-04-30 20:51:10 +02:00
Jonas Jenwald	254431df1e	Avoid extra lookup/parsing when all destinations are already available Whenever we cannot find a destination we'll fallback to checking all destinations, to account for e.g. out-of-order NameTrees, and in those cases any subsequent destination-lookups can be made a tiny bit more efficient by immediately checking the already cached destinations.	2025-04-30 15:26:00 +02:00
Jonas Jenwald	0922aa9e9d	Merge pull request #19880 from Snuffleupagus/numberToString-assert-number Assert that `numberToString` is called with a number (issue 19877)	2025-04-29 20:35:32 +02:00
calixteman	262a1f9895	Merge pull request #19881 from calixteman/bug1963407 Fix the bbox when saving a rotated text field (bug 1963407)	2025-04-29 20:33:53 +02:00
Jonas Jenwald	f5faf86180	Assert that `numberToString` is called with a number (issue 19877) NOTE: Given that this is an internal function, used only in the worker-thread, it's not clear to me that this is an entirely "necessary" change.	2025-04-29 20:31:24 +02:00
Calixte Denizet	7a251b206e	Fix the bbox when saving a rotated text field (bug 1963407)	2025-04-29 18:49:07 +02:00
Jonas Jenwald	c1a398d932	Merge pull request #19876 from Snuffleupagus/Node-polyfill-navigator Add a basic `navigator` polyfill for older Node.js versions	2025-04-29 10:04:19 +02:00
calixteman	2e10ff6dd4	Merge pull request #19855 from 1Jesper1/hotfix/useractivation-response Add useractivation check for response function	2025-04-28 13:14:51 +02:00
Jonas Jenwald	3d4e8bb17e	Add a basic `navigator` polyfill for older Node.js versions Modern Node.js versions now include a `navigator` implementation, with a few basic properties, that's actually enough for the PDF.js use-cases; please see https://nodejs.org/api/globals.html#navigator Unfortunately we still support Node.js version `20`, hence we add a basic polyfill since that allows simplifying the code slightly.	2025-04-28 13:07:12 +02:00
Jonas Jenwald	abc9522886	Avoid (most) string parsing when removing/replacing the hash property of a URL	2025-04-25 23:13:05 +02:00
calixteman	efc5c3c231	Merge pull request #19862 from calixteman/bug1961423 Fix 'print to pdf' on Mac with a cid font (bug 1961423)	2025-04-25 15:11:55 +02:00
Jonas Jenwald	312c85bfd6	Merge pull request #19815 from Snuffleupagus/getMergedResources-size Ensure that "local" /Contents stream-dict /Resources aren't empty (PR 19803 follow-up)	2025-04-25 10:46:04 +02:00
Jesper	8af06a4c60	Add useractivation check for response function	2025-04-24 22:40:28 +02:00
Calixte Denizet	785991a97c	Fix 'print to pdf' on Mac with a cid font (bug 1961423)	2025-04-24 20:19:12 +02:00
Jonas Jenwald	de2a44a558	Merge pull request #19849 from Snuffleupagus/issue-19848 Ensure that the /Form XObject /Resources-entry is actually a dictionary (issue 19848)	2025-04-23 16:48:06 +02:00
Jonas Jenwald	01c1c6e60f	Merge pull request #19819 from Snuffleupagus/CSS-light-dark Use the `light-dark` CSS function in the viewer (issue 17780)	2025-04-23 16:30:03 +02:00
Jonas Jenwald	ae1cbc6a9e	Use the `light-dark` CSS function in the viewer (issue 17780) This removes the need for (most) separate `@media (prefers-color-scheme: dark)` blocks when defining colors values, and also provides a simple way of forcing use of either the light or dark theme. Please refer to https://developer.mozilla.org/en-US/docs/Web/CSS/color_value/light-dark and https://developer.mozilla.org/en-US/docs/Web/CSS/color-scheme NOTE: To support this in older browsers, we utilize a [PostCSS plugin](https://github.com/csstools/postcss-plugins/tree/main/plugins/postcss-light-dark-function).	2025-04-23 15:31:39 +02:00
Calixte Denizet	05a45346a5	Disable userActivation before executing a setTimeout/setInterval callback Fixes issue #19850.	2025-04-23 15:25:12 +02:00
Jonas Jenwald	64007e777e	Ensure that the /Form XObject /Resources-entry is actually a dictionary (issue 19848)	2025-04-23 10:19:20 +02:00
Jonas Jenwald	adc9eb5a5a	Always fallback to checking all destinations, when lookup fails (issue 19835) In the referenced PDF document the keys, in the /Dests dictionary, need to account for PDFDocEncoding. To improve destination handling in general we'll now unconditionally fallback to always checking all destinations.	2025-04-20 14:53:10 +02:00
Jonas Jenwald	91ba147317	Check that the `Object.prototype` hasn't been incorrectly extended (PR 11582 follow-up) This complements, and extends, the existing check of the `Array.prototype` in the worker-thread. To simplify the implementation we'll now abort immediately, rather than collecting all "bad" properties.	2025-04-18 12:19:29 +02:00
calixteman	4b1875c8c0	Merge pull request #19825 from calixteman/bug1961107 Avoid to create any subarrays when optimizing 'save, transform, constructPath, restore' (bug 1961107)	2025-04-17 19:42:28 +02:00
Calixte Denizet	d7cbda6cb5	Avoid to create any subarrays when optimizing 'save, transform, constructPath, restore' (bug 1961107) Removing those `subarray`calls helps to improve performance by a factor 6 on Linux and by a factor of 3 on Windows 11.	2025-04-17 19:14:01 +02:00
Jonas Jenwald	bf553f22da	Ensure that the /P-entry is actually a dictionary in `StructTreePage.prototype.addNode` This may fix issue 19822, but without a test-case it's simply impossible to know for sure.	2025-04-17 14:01:53 +02:00
Jonas Jenwald	76f23ce3b5	Catch, and ignore, errors during `Page.prototype.getStructTree` This way any errors thrown during parsing of the page-structTree will not be forwarded to the viewer.	2025-04-17 13:57:30 +02:00
Jonas Jenwald	245d9ba925	Ensure that "local" /Contents stream-dict /Resources aren't empty (PR 19803 follow-up) This is a small, and quite possibly pointless, optimization which ensures that any "local" /Resources aren't empty, to avoid needlessly trying to load and merge dictionaries.	2025-04-14 09:58:15 +02:00
Jonas Jenwald	6b961c424f	Update Webpack to version `5.99.5` (issue 19808) In Webpack version `5.99.0` the way that `export` statements are handled was changed slightly, with much less boilerplate code being generated, which unfortunately breaks our `tweakWebpackOutput` function that's used to expose the exported properties globally and that e.g. the viewer depends upon. Given that we were depending on formatting that should most likely be viewed as nothing more than an internal implementation detail in Webpack, we instead work-around this by manually defining the structures that were previously generated. Obviously this will lead to a tiny bit more manual work in the future, however we don't change the API-surface often enough that it should be a big issue and the relevant unit-tests are updated such that it shouldn't be possible to break this. NOTE: In the future we might want to consider no longer using global properties like this, and instead rely only on proper `export`s throughout the code-base. However changing this would likely be non-trivial (given edge-cases), and it'd be an `api-major` change, so let's just do the minimal amount of work to unblock Webpack updates for now.	2025-04-13 16:48:19 +02:00
Jonas Jenwald	834423b51d	Add more logical assignment in the `src/` folder This patch uses nullish coalescing assignment in cases where it's immediately obvious from surrounding code that doing so is safe, and logical OR assignment elsewhere (mostly the changes in XFA code).	2025-04-12 17:28:33 +02:00

1 2 3 4 5 ...

7099 Commits