pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	8ec399d7e1	Convert the `getPdfManager` function to be asynchronous This is fairly old code, and by making the function `async` we can handle initialization errors "automatically" without the need for try-catch statements.	2024-11-22 17:49:43 +01:00
Calixte Denizet	b0b0de98e7	Use the V entry as an option when no options in a choice widget It fixes #19083. It isn't really a fix but more a workaround (we should correctly implement the choice widget as a mix of text input+select).	2024-11-21 17:27:34 +01:00
calixteman	8a8b69f456	Merge pull request #19054 from calixteman/issue18630 When saving some annotations with the same name, set the value in the parent	2024-11-17 18:00:48 +01:00
Tim van der Meij	e2bbcb544a	Merge pull request #19045 from Snuffleupagus/api-rm-isChrome [api-minor] Disable `ImageDecoder` usage by default in Chromium browsers	2024-11-17 16:32:48 +01:00
Calixte Denizet	2da586527f	When saving some annotations with the same name, set the value in the parent It fixes #18630.	2024-11-17 15:55:20 +01:00
Jonas Jenwald	bc91985941	Merge pull request #19051 from Snuffleupagus/Dict-Map Convert the `Dict`-implementation to use a `Map` internally	2024-11-17 12:59:02 +01:00
Jonas Jenwald	823e700b3b	Merge pull request #19057 from Snuffleupagus/extendCMap-avoid-lookup Avoid redundant CMap-value lookup in `extendCMap` (PR 5101 follow-up)	2024-11-17 12:49:20 +01:00
Jonas Jenwald	2c0cc48d1b	Replace the `forEach` method in `Dict` with "proper" iteration support	2024-11-17 12:45:32 +01:00
Jonas Jenwald	691be77f65	Convert the `Dict`-implementation to use a `Map` internally With all the recent work happening under https://bugzilla.mozilla.org/show_bug.cgi?id=1851662, the performance of `Map` is already good enough that I believe that we should now be able to utilize it in the `Dict`-class without problem. This patch was tested in Firefox Nightly, specifically build https://hg.mozilla.org/mozilla-central/rev/6c508a387477e3b72db913a9e1761e9a433d06a2, with the following manifest file: ``` [ { "id": "tracemonkey-eq", "file": "pdfs/tracemonkey.pdf", "md5": "9a192d8b1a7dc652a19835f6f08098bd", "rounds": 100, "type": "eq" }, { "id": "issue2618", "file": "pdfs/issue2618.pdf", "md5": "2c554a99a52288ca1a44a422eeafb8fb", "rounds": 100, "type": "eq" } ] ``` which gave the following results, indicating no significant regression, when comparing this patch against the `master` branch: - Overall ``` -- Grouped By browser, pdf, stat -- browser \| pdf \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| -------------- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ----- \| ------------- firefox \| issue2618 \| Overall \| 100 \| 678 \| 678 \| 0 \| 0.04 \| firefox \| issue2618 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -3.88 \| firefox \| issue2618 \| Rendering \| 100 \| 677 \| 677 \| 0 \| 0.05 \| firefox \| tracemonkey-eq \| Overall \| 1400 \| 35 \| 36 \| 0 \| 0.96 \| firefox \| tracemonkey-eq \| Page Request \| 1400 \| 1 \| 1 \| 0 \| -8.08 \| firefox \| tracemonkey-eq \| Rendering \| 1400 \| 34 \| 35 \| 0 \| 1.26 \| ``` - Page-specific ``` -- Grouped By browser, pdf, page, stat -- browser \| pdf \| page \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| -------------- \| ---- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ------ \| ------------- firefox \| issue2618 \| 0 \| Overall \| 100 \| 678 \| 678 \| 0 \| 0.04 \| firefox \| issue2618 \| 0 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -3.88 \| firefox \| issue2618 \| 0 \| Rendering \| 100 \| 677 \| 677 \| 0 \| 0.05 \| firefox \| tracemonkey-eq \| 0 \| Overall \| 100 \| 23 \| 24 \| 0 \| 1.24 \| firefox \| tracemonkey-eq \| 0 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 19.77 \| firefox \| tracemonkey-eq \| 0 \| Rendering \| 100 \| 23 \| 23 \| 0 \| 0.40 \| firefox \| tracemonkey-eq \| 1 \| Overall \| 100 \| 32 \| 32 \| -1 \| -1.89 \| firefox \| tracemonkey-eq \| 1 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -28.13 \| firefox \| tracemonkey-eq \| 1 \| Rendering \| 100 \| 31 \| 31 \| 0 \| -0.77 \| firefox \| tracemonkey-eq \| 2 \| Overall \| 100 \| 17 \| 18 \| 1 \| 4.60 \| firefox \| tracemonkey-eq \| 2 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 23.53 \| slower firefox \| tracemonkey-eq \| 2 \| Rendering \| 100 \| 17 \| 17 \| 1 \| 3.71 \| firefox \| tracemonkey-eq \| 3 \| Overall \| 100 \| 23 \| 24 \| 0 \| 1.71 \| firefox \| tracemonkey-eq \| 3 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 7.79 \| firefox \| tracemonkey-eq \| 3 \| Rendering \| 100 \| 23 \| 23 \| 0 \| 1.55 \| firefox \| tracemonkey-eq \| 4 \| Overall \| 100 \| 31 \| 31 \| 1 \| 2.49 \| firefox \| tracemonkey-eq \| 4 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 48.96 \| firefox \| tracemonkey-eq \| 4 \| Rendering \| 100 \| 30 \| 30 \| 0 \| 1.05 \| firefox \| tracemonkey-eq \| 5 \| Overall \| 100 \| 31 \| 30 \| -1 \| -2.42 \| firefox \| tracemonkey-eq \| 5 \| Page Request \| 100 \| 2 \| 1 \| -1 \| -49.33 \| firefox \| tracemonkey-eq \| 5 \| Rendering \| 100 \| 29 \| 29 \| 0 \| -0.03 \| firefox \| tracemonkey-eq \| 6 \| Overall \| 100 \| 27 \| 27 \| 0 \| 1.81 \| firefox \| tracemonkey-eq \| 6 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 4.94 \| firefox \| tracemonkey-eq \| 6 \| Rendering \| 100 \| 26 \| 27 \| 0 \| 1.68 \| firefox \| tracemonkey-eq \| 7 \| Overall \| 100 \| 26 \| 26 \| 1 \| 3.13 \| firefox \| tracemonkey-eq \| 7 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 6.98 \| firefox \| tracemonkey-eq \| 7 \| Rendering \| 100 \| 25 \| 25 \| 1 \| 2.92 \| firefox \| tracemonkey-eq \| 8 \| Overall \| 100 \| 25 \| 26 \| 1 \| 5.16 \| firefox \| tracemonkey-eq \| 8 \| Page Request \| 100 \| 1 \| 1 \| -1 \| -41.84 \| firefox \| tracemonkey-eq \| 8 \| Rendering \| 100 \| 23 \| 25 \| 2 \| 8.19 \| firefox \| tracemonkey-eq \| 9 \| Overall \| 100 \| 33 \| 33 \| 0 \| 0.03 \| firefox \| tracemonkey-eq \| 9 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 0.79 \| firefox \| tracemonkey-eq \| 9 \| Rendering \| 100 \| 32 \| 32 \| 0 \| -0.10 \| firefox \| tracemonkey-eq \| 10 \| Overall \| 100 \| 144 \| 144 \| 1 \| 0.52 \| firefox \| tracemonkey-eq \| 10 \| Page Request \| 100 \| 2 \| 1 \| -1 \| -43.52 \| firefox \| tracemonkey-eq \| 10 \| Rendering \| 100 \| 141 \| 143 \| 2 \| 1.18 \| firefox \| tracemonkey-eq \| 11 \| Overall \| 100 \| 24 \| 25 \| 1 \| 2.51 \| firefox \| tracemonkey-eq \| 11 \| Page Request \| 100 \| 1 \| 1 \| 0 \| -4.71 \| firefox \| tracemonkey-eq \| 11 \| Rendering \| 100 \| 23 \| 24 \| 1 \| 2.78 \| firefox \| tracemonkey-eq \| 12 \| Overall \| 100 \| 40 \| 39 \| -1 \| -1.67 \| firefox \| tracemonkey-eq \| 12 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 14.71 \| firefox \| tracemonkey-eq \| 12 \| Rendering \| 100 \| 39 \| 38 \| -1 \| -1.98 \| firefox \| tracemonkey-eq \| 13 \| Overall \| 100 \| 19 \| 20 \| 1 \| 3.09 \| firefox \| tracemonkey-eq \| 13 \| Page Request \| 100 \| 1 \| 1 \| 0 \| 24.79 \| firefox \| tracemonkey-eq \| 13 \| Rendering \| 100 \| 18 \| 19 \| 0 \| 1.70 \| ```	2024-11-17 12:44:06 +01:00
Jonas Jenwald	8783dd0178	Avoid redundant CMap-value lookup in `extendCMap` (PR 5101 follow-up) When iterating through `useCMap` the value is already available, without having to manually invoke the `lookup`-method. While this will likely not affect performance in any noticeable way, it's nonetheless unnecessary to lookup an already available value twice.	2024-11-17 11:57:45 +01:00
Jonas Jenwald	c082169cae	Enable the ESLint `no-var` rule in the `src/core/evaluator.js` file This was previously attempted in PR 13371, but had to be reverted because of issues related to SystemJS (which has since been removed). Also, while unrelated, shortens an existing conditional assignment.	2024-11-15 12:36:51 +01:00
Jonas Jenwald	471284f51b	[api-minor] Disable `ImageDecoder` usage by default in Chromium browsers Given that there are multiple issues with `ImageDecoder` in Chromium browsers, affecting both BMP and JPEG images, for now we (by default) disable that functionality there to avoid problems. This also means that we can remove the previously added, and separate, `isChrome` API-option.	2024-11-14 12:05:15 +01:00
Jonas Jenwald	9bf9bbda0b	Merge pull request #19031 from Snuffleupagus/api-isImageDecoderSupported [api-minor] Add a `getDocument` option to disable `ImageDecoder` usage	2024-11-13 09:19:05 +01:00
Tim van der Meij	6676492920	Merge pull request #19021 from Snuffleupagus/PartialEvaluator-#fetchData Add a `PartialEvaluator` helper for fetching CMap and Standard Font data	2024-11-12 19:56:39 +01:00
Jonas Jenwald	65eedfb0fc	[api-minor] Add a `getDocument` option to disable `ImageDecoder` usage This allows end-users to forcibly disable `ImageDecoder` usage, even if the browser appears to support it (similar to the pre-existing option for `OffscreenCanvas`).	2024-11-12 17:12:42 +01:00
Jonas Jenwald	fe5967c84e	Merge pull request #19029 from nicolo-ribaudo/eslint-flat-config Migrate to ESLint flat config	2024-11-12 16:22:54 +01:00
Nicolò Ribaudo	9e6ff979db	Migrate to ESLint flat config Flat config is the new config system used by ESLint 9. To make the migration easier, they also added flat config support to ESLint 8. This commit migrates the various ESLint configs in the repository to use the new system, without upgrading to ESLint 9 yet.	2024-11-12 16:15:17 +01:00
Calixte Denizet	4bf7787084	Simplify saving added/modified annotations. Having this map to collect the different changes will allow to know if some objects have already been modified.	2024-11-12 10:59:38 +01:00
Jonas Jenwald	16e86878d2	Add a `PartialEvaluator` helper for fetching CMap and Standard Font data This avoids a little bit of code duplication, which cannot hurt.	2024-11-11 11:57:28 +01:00
Jonas Jenwald	0b864ee7d5	Shorten the `Page.prototype.userUnit` getter slightly	2024-11-10 16:30:07 +01:00
Pascal Maximilian Bremer	6d7157a875	Fix Typo:XFATemplate class Para Styling paddingight => paddingRight	2024-11-06 12:04:55 +01:00
Calixte Denizet	d59f9648a9	Simplify toRomanNumerals function	2024-11-05 22:35:35 +01:00
Jonas Jenwald	fdfcfbc351	Merge pull request #19005 from Snuffleupagus/core_utils-shorten Shorten a few helper functions in `src/core/core_utils.js`	2024-11-05 21:46:44 +01:00
Jonas Jenwald	e92a929a58	Try to improve handling of missing trailer dictionaries in `XRef.indexObjects` (issue 18986) The problem with the referenced PDF document has nothing to do with invalid dates, as the issue seems to suggest, but rather with the fact that it has neither an XRef table nor a trailer dictionary. Given that crucial parts of the internal document structure is missing, you might argue that it's not really a PDF document. In an attempt to support this kind of corruption, we'll simply iterate through all (previously found) XRef entries and pick one that might be a valid /Root dictionary. There's obviously no guarantee that this works, and it might not be fast in larger PDF documents, but at least it cannot be any worse than immediately throwing `InvalidPDFException` as we previously did here. Please note: I'm totally fine with this patch being rejected, since it's somewhat questionable if we should actually attempt to support "PDF documents" with this level of corruption.	2024-11-05 18:19:26 +01:00
Jonas Jenwald	2c90eee5a8	Shorten a few helper functions in `src/core/core_utils.js` In a few cases we can ever so slightly shorten the code without negatively impacting the readability.	2024-11-05 13:58:00 +01:00
Tim van der Meij	e930f3030c	Merge pull request #18992 from Snuffleupagus/getPdfManager-inline-flushChunks Inline the `flushChunks` helper function, used in `getPdfManager` on the worker-thread	2024-11-02 18:58:29 +01:00
Jonas Jenwald	e5485108ec	Merge pull request #18990 from Snuffleupagus/ensure-structTree-serializable Ensure that serializing of StructTree-data cannot fail during loading	2024-11-02 15:17:10 +01:00
Jonas Jenwald	2145a7b9ca	Use the `hexNumbers` structure in the `stringToUTF16HexString` helper We can re-use the `hexNumbers` structure here, since that allows us to directly lookup the hexadecimal values and shortens the code.	2024-11-02 15:00:32 +01:00
Jonas Jenwald	196f7d7df1	Inline the `flushChunks` helper function, used in `getPdfManager` on the worker-thread - This helper function has only a single call-site, and the function is fairly short. - It'll only be invoked if range requests are disabled, or if the entire PDF manages to load before the headers are resolved (which is very unlikely). Hence, by default, this helper function is not invoked. - By inlining the code we're able to utilize the existing error-handling at the call-site, rather than having to duplicate it, which further reduces the size of this code. Finally, while slightly unrelated, this patch also adds optional chaining in one spot in the file (PR 16424 follow-up).	2024-11-02 11:06:30 +01:00
Jonas Jenwald	b26dc19392	Ensure that serializing of StructTree-data cannot fail during loading I discovered that doing skip-cache re-reloading of https://opensource.adobe.com/dc-acrobat-sdk-docs/pdfstandards/PDF32000_2008.pdf would intermittently cause (some of) the AnnotationLayers to break with errors printed in the console (see below). In hindsight this bug is really obvious, however it took me quite some time to find it, since the `StructTreePage.prototype.serializable` getter will lookup various data and all of those cases can fail during loading when streaming and/or range requests are being used. Finally, to prevent any future errors, ensure that the viewer won't break in these sort of situations. ``` Uncaught (in promise) Object { message: "Missing data [19098296, 19098297)", name: "UnknownErrorException", details: "MissingDataException: Missing data [19098296, 19098297)", stack: "BaseExceptionClosure@resource://pdf.js/build/pdf.mjs:453:29\n@resource://pdf.js/build/pdf.mjs:456:2\n" } viewer.mjs:8801:55 \#renderAnnotationLayer: "UnknownErrorException: Missing data [17552729, 17552730)". viewer.mjs:8737:15 Uncaught (in promise) Object { message: "Missing data [17552729, 17552730)", name: "UnknownErrorException", details: "MissingDataException: Missing data [17552729, 17552730)", stack: "BaseExceptionClosure@resource://pdf.js/build/pdf.mjs:453:29\n@resource://pdf.js/build/pdf.mjs:456:2\n" } viewer.mjs:8801:55 ```	2024-11-01 17:43:59 +01:00
Jonas Jenwald	8f47d06d07	Add helper functions to allow using new `Uint8Array` methods This allows using the new methods in browsers that support them, e.g. Firefox 133+, while still providing fallbacks where necessary; see https://github.com/tc39/proposal-arraybuffer-base64 Please note: These are not actual polyfills, but only implements what we need in the PDF.js code-base. Eventually this patch should be reverted, once support is generally available.	2024-10-29 10:22:35 +01:00
Jonas Jenwald	bfc645bab1	Introduce some `Uint8Array.fromBase64` and `Uint8Array.prototype.toBase64` usage in the main code-base See https://github.com/tc39/proposal-arraybuffer-base64	2024-10-29 10:22:35 +01:00
Jonas Jenwald	f9fc477080	Improve the implementation of the `PDFDocument.fingerprints`-getter - Add explicit `length` validation of the /ID entries. Given the `EMPTY_FINGERPRINT` constant we're already implicitly assuming a particular length. - Move the constants into the `fingerprints`-getter, since they're not used anywhere else. - Replace the `hexString` helper function with the standard `Uint8Array.prototype.toHex` method; see https://github.com/tc39/proposal-arraybuffer-base64	2024-10-29 10:22:35 +01:00
Jonas Jenwald	48a18585f2	Allow `StreamsSequenceStream` to skip sub-streams that are not actual Streams (issue 18973) This extends PR 13796 to also handle the case where sub-streams contain invalid data, i.e. anything that isn't a Stream, however please note that in these cases there's no guarantee that we'll render the page "correctly". Note that Adobe Reader, i.e. the PDF reference implementation, cannot render the last page of the referenced PDF document.	2024-10-29 09:36:08 +01:00
Calixte Denizet	b649b6f8dd	Use a BMP decoder when resizing an image The image decoding won't block the main thread any more. For now, it isn't enabled for Chrome because issue6741.pdf leads to a crash.	2024-10-28 14:09:52 +01:00
Tim van der Meij	5418060bbc	Merge pull request #18951 from Snuffleupagus/CMap-isCompressed [api-minor] Remove the `CMapCompressionType` enumeration	2024-10-27 14:42:00 +01:00
Jonas Jenwald	8a2b95418a	Re-factor the `ImageResizer._goodSquareLength` definition Move the `ImageResizer._goodSquareLength` definition into the class itself, since the current position shouldn't be necessary, and also convert it into an actually private field.	2024-10-27 11:03:04 +01:00
Jonas Jenwald	b048420d21	[api-minor] Remove the `CMapCompressionType` enumeration After the binary CMap format had been added there were also some ideas about maybe providing other formats, see [here](https://github.com/mozilla/pdf.js/pull/8064#issuecomment-279730182), however that was over seven years ago and we still only use binary CMaps. Hence it now seems reasonable to simplify the relevant code by removing `CMapCompressionType` and instead just use a boolean to indicate the type of the built-in CMaps.	2024-10-24 11:08:16 +02:00
Jonas Jenwald	50c291eb33	Unconditionally cache built-in CMaps on the worker-thread Given that we've not shipped, nor used, anything except binary CMaps for years let's just cache them unconditionally (since that's a tiny bit less code).	2024-10-24 10:15:09 +02:00
calixteman	1ad09779f1	Merge pull request #18910 from calixteman/image_decoder1 Use ImageDecoder in order to decode jpeg images (bug 1901223)	2024-10-23 13:54:07 +02:00
Calixte Denizet	b6c4f0b69e	Use ImageDecoder in order to decode jpeg images (bug 1901223)	2024-10-23 10:42:01 +02:00
Jonas Jenwald	236c8d862e	Re-factor how we handle missing, corrupt, or empty font-file entries This improves the fixes for e.g. issue 9462 and 18941 slightly and allows better fallback behaviour for non-standard fonts.	2024-10-22 17:07:12 +02:00
Jonas Jenwald	63b34114b1	Fallback to a standard font if a font-file entry doesn't contain a Stream (issue 18941) The PDF document is clearly corrupt, since it has /FontFile2 entries that are Dictionaries which obviously isn't correct. While there's obviously no guarantee that things will look perfect this way, actually rendering the text at all should be an improvement in general.	2024-10-22 11:51:28 +02:00
Jonas Jenwald	805f962181	Reduce duplication when collecting optional content groups After PR 18825 we can easily "compute" the optional content groups, and can thus avoid tracking them manually.	2024-10-15 13:20:30 +02:00
Jonas Jenwald	424f81c4db	Merge pull request #18825 from agrahn/rbgroups implementing optional content radiobutton groups	2024-10-15 13:11:19 +02:00
Alexander Grahn	441efe456e	Optional Content (OC) radiobutton (RB) groups implemented. Resolves #18823 . The code parses the /RBGroups entry in the OC configuration dict and adds the property `rbGroups' to instances of the OptionalContentGroup class. rbGroups takes an array of Sets, where each Set instance represents an RB group the OptionalContentGroup instance is a member of. Such a Set instance contains all OCG ids within the corresponding RB group. RB groups an OCG is associated with are processed when its visibility is set to true, as required by the PDF spec.	2024-10-15 11:34:45 +02:00
Calixte Denizet	8b7b39f5d6	Some jpx images can have a mask It fixes #18896.	2024-10-14 21:50:32 +02:00
calixteman	e1f9fa4ea5	Merge pull request #18895 from calixteman/issue18894 Fallback on gray colorspace when there are no colorspace and no name in the scn/SCN arguments	2024-10-13 17:56:52 +02:00
Calixte Denizet	e7ab8cd8c1	Fallback on gray colorspace when there are no colorspace and no name in the scn/SCN arguments It fixes #18894.	2024-10-13 16:02:07 +02:00
Calixte Denizet	4dea773c5b	Clamp the hival parameter of Indexed color space to the range [0; 255] Since this value is used to allocate an array, it makes sense to avoid to use too much memory. From the specs, this value must be in the range [0; 255] (see section 8.6.6.3). This patch removes the unused property 'highVal'.	2024-10-12 23:50:58 +02:00

1 2 3 4 5 ...

3110 Commits