Beanz/pdf.js - pdf.js - Gitea: Git with a cup of tea

Beanz/pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	fc22d3afc7	Support the `maxCanvasPixels` option in the thumbnails code This addresses an inconsistency in the viewer, since the thumbnails don't respect the `maxCanvasPixels` option. Note that, as far as I know, this has not lead to any bugs since the thumbnails render with a fixed (and small) width, however it really cannot hurt to address this (especially after the introduction of the `maxCanvasDim` option). To support this a new `OutputScale`-method was added, to avoid having to duplicate code in multiple files.	2025-03-10 14:12:07 +01:00
Jonas Jenwald	c2e33307b1	Introduce some `URL.parse()` usage in the code-base This (fairly new) static method allows parsing URLs without having to wrap `new URL(...)` calls within `try...catch` blocks, thus simplifying the code; see https://developer.mozilla.org/en-US/docs/Web/API/URL/parse_static For older browsers/environments the functionality will be polyfilled, but only in `legacy` builds, via `core-js`; see https://github.com/zloirock/core-js?tab=readme-ov-file#url-and-urlsearchparams Please note: This is currently limited to the `src/`- and `web/`-folders, such that we don't break development/testing, since the functionality is not available in all Node.js versions that we support; see https://developer.mozilla.org/en-US/docs/Web/API/URL/parse_static#browser_compatibility	2025-02-15 19:10:36 +01:00
Jonas Jenwald	bd05b255fa	[api-major] Apply the `userUnit` using CSS, to fix the text/annotation layers (bug 1947248) Rather than modifying the "raw" dimensions of the page, we'll instead apply the `userUnit` as an additional scale-factor via CSS. Please note: It's not clear to me if this solution is fully correct either, or if there's other problems with it, but it at least appears to work. --- With these changes, the following CSS variables are now assumed to be available/set as necessary: `--total-scale-factor`, `--scale-factor`, `--user-unit`, `--scale-round-x`, and `--scale-round-y`.	2025-02-11 14:36:06 +01:00
Calixte Denizet	2c737bc420	Move the array containing the supported image mime types in display_utils and make it available for the viewer (it'll be used in the signature UI).	2025-01-30 15:35:17 +01:00
Jonas Jenwald	c6e3fc4fe6	Take the `userUnit` into account in the `PageViewport` class (issue 19176)	2024-12-08 15:51:04 +01:00
Calixte Denizet	e695d04ca2	[Editor] Disallow to have multiple pointers while dragging an editor It'll let the user dragging with two fingers.	2024-11-27 19:41:13 +01:00
Jonas Jenwald	af3c1f2a0d	Enable the ESLint `no-console` rule in parts of the code-base The purpose of these changes is to make it more difficult to accidentally include logging statements, used during development and debugging, when submitting patches for review. For (almost) all code residing in the `src/` folder we should use our existing helper functions to ensure that all logging can be controlled via the `verbosity` API-option. For the `test/unit/` respectively `test/integration/` folders we shouldn't need any "normal" logging, but it should be OK to print the occasional warning/error message. Please find additional details about the ESLint rule at https://eslint.org/docs/latest/rules/no-console	2024-11-14 12:41:39 +01:00
Jonas Jenwald	4e12906061	Move the various DOM-factories into their own files - Over time the number and size of these factories have increased, especially the `DOMFilterFactory` class, and this split should thus aid readability/maintainability of the code. - By introducing a couple of new import maps we can avoid bundling the `DOMCMapReaderFactory`/`DOMStandardFontDataFactory` classes in the Firefox PDF Viewer, since they are dead code there given that worker-thread fetching is always being used. - This patch has been successfully tested, by running `$ ./mach test toolkit/components/pdfjs/`, in a local Firefox artifact-build. Note: This patch reduces the size of the `gulp mozcentral` output by `1.3` kilo-bytes, which isn't a lot but still cannot hurt.	2024-11-01 13:31:28 +01:00
Jonas Jenwald	df696063b9	Change the `BaseCMapReaderFactory` fetch-helper to return a `Uint8Array` This moves more functionality into the base-class, rather than having to duplicate that in the extending classes. For consistency, also updates the `BaseStandardFontDataFactory` and introduces more `async`/`await` in various relevant code.	2024-10-21 13:20:18 +02:00
Jonas Jenwald	8afc542ae2	Move the regular expression caching into the `PDFDateString` class This code is old enough that it pre-dates availability of private class fields, hence why this wasn't done originally.	2024-10-11 15:34:58 +02:00
Jonas Jenwald	bb302dd993	[api-minor] Pass `CanvasFactory`/`FilterFactory`, rather than instances, to `getDocument` This unifies the various factory-options, since it's consistent with `CMapReaderFactory`/`StandardFontDataFactory`, and ensures that any needed parameters will always be consistently provided when creating `CanvasFactory`/`FilterFactory`-instances. As shown in the modified example this may simplify some custom implementations, since we now provide the ability to access the `CanvasFactory`-instance used with a particular `getDocument`-invocation.	2024-09-23 11:26:30 +02:00
Calixte Denizet	46fac8b2c1	[Editor] Take into account the device pixel ratio when drawing an added image Fixes #18626.	2024-09-16 14:48:26 +02:00
Calixte Denizet	68332ec236	Avoid to have a white line around the canvas The canvas must have the same dims as the page in order to avoid to see the page background.	2024-09-07 20:12:29 +02:00
Jonas Jenwald	4c45948bc4	Fix `DOMFilterFactory.#createUrl` in MOZCENTRAL builds (18417 PR follow-up) Somehow I managed to mess up the URL creation relevant to e.g. MOZCENTRAL builds, which is breaking the pending PDF.js update in mozilla-central; sorry about that! To avoid future issues, we'll now always check if absolute filter-URLs are necessary regardless of the build-target.	2024-07-15 14:10:59 +02:00
Jonas Jenwald	50a5a15088	Create absolute filter-URLs when needed in `DOMFilterFactory` (issue 18406) This functionality is purposely limited to development mode and GENERIC builds, since it's unnecessary in e.g. the built-in Firefox PDF Viewer, and will only be used when a `<base>`-element is actually present. Please note: We also have tests in mozilla-central that will indirectly ensure that relative filter-URLs work as intended in the Firefox PDF Viewer, see https://searchfox.org/mozilla-central/source/toolkit/components/pdfjs/test/browser_pdfjs_filters.js --- To test that the issue is fixed, the following code can be used: ```html <!DOCTYPE html> <html> <head> <meta charset="UTF-8"> <base href="."> <title>base href (issue 18406)</title> </head> <body> <ul> <li>Place this code in a file, named `base_href.html`, in the root of the PDF.js repository</li> <li>Run <pre>npx gulp dist-install</pre></li> <li>Run <pre>npx gulp server</pre></li> <li>Open <a href="http://localhost:8888/base_href.html">http://localhost:8888/base_href.html</a> in a browser</li> <li>Compare rendering with <a href="http://localhost:8888/web/viewer.html?file=/test/pdfs/issue16287.pdf">http://localhost:8888/web/viewer.html?file=/test/pdfs/issue16287.pdf</a></li> </ul> <canvas id="the-canvas" style="border: 1px solid black; direction: ltr;"></canvas> <script src="/node_modules/pdfjs-dist/build/pdf.mjs" type="module"></script> <script id="script" type="module"> // // If absolute URL from the remote server is provided, configure the CORS // header on that server. // const url = '/test/pdfs/issue16287.pdf'; // // The workerSrc property shall be specified. // pdfjsLib.GlobalWorkerOptions.workerSrc = '/node_modules/pdfjs-dist/build/pdf.worker.mjs'; // // Asynchronous download PDF // const loadingTask = pdfjsLib.getDocument(url); const pdf = await loadingTask.promise; // // Fetch the first page // const page = await pdf.getPage(1); const scale = 1.5; const viewport = page.getViewport({ scale }); // Support HiDPI-screens. const outputScale = window.devicePixelRatio \|\| 1; // // Prepare canvas using PDF page dimensions // const canvas = document.getElementById("the-canvas"); const context = canvas.getContext("2d"); canvas.width = Math.floor(viewport.width * outputScale); canvas.height = Math.floor(viewport.height * outputScale); canvas.style.width = Math.floor(viewport.width) + "px"; canvas.style.height = Math.floor(viewport.height) + "px"; const transform = outputScale !== 1 ? [outputScale, 0, 0, outputScale, 0, 0] : null; // // Render PDF page into canvas context // const renderContext = { canvasContext: context, transform, viewport, }; page.render(renderContext); </script> </body> </html> ```	2024-07-11 11:30:45 +02:00
Calixte Denizet	ff6180a4c9	Add an option to enable/disable hardware acceleration (bug 1902012)	2024-06-12 18:41:07 +02:00
Jonas Jenwald	a790f2df5d	[api-minor] Remove the unused `onlyStripPath` option from the `getFilenameFromUrl` helper function	2024-05-03 08:29:41 +02:00
Calixte Denizet	82989e6790	[api-minor] Remove the use of (get/put)ImageData when drawing SMasks (bug 1874013) and implement then in using some SVG filters and composition. Composing in using destination-in in order to multiply RGB components by the alpha from the mask isn't perfect: it'd be a way better to natively have alpha masks support, it induces some small rounding errors and consequently computed RGB are approximatively correct. In term of performance, it's a real improvement, for example, the pdf in issue #17779 is now rendered in few seconds. There are still some room for improvement, but overall it should be a way better.	2024-05-01 15:40:44 +02:00
Jonas Jenwald	33c5ef349e	Simplify the `XMLHttpRequest`-branch in the `fetchData` helper function (PR 17277 follow-up)	2024-02-21 23:00:24 +01:00
Jonas Jenwald	f9a384d711	Enable the `arrow-body-style` ESLint rule This manually ignores some cases where the resulting auto-formatting would not, as far as I'm concerned, constitute a readability improvement or where we'd just end up with more overall indentation. Please see https://eslint.org/docs/latest/rules/arrow-body-style	2024-01-21 16:20:55 +01:00
Jonas Jenwald	9dfe9c552c	Use shorter arrow functions where possible For arrow functions that are both simple and short, we can avoid using explicit `return` to shorten them even further without hurting readability. For the `gulp mozcentral` build-target this reduces the overall size of the output by just under 1 kilo-byte (which isn't a lot but still can't hurt).	2024-01-21 10:13:12 +01:00
Calixte Denizet	83c78dbfa8	Make the caret visible in the text layer in caret browsing mode In order to do that we must change the text layer opacity to 1 but it has several implications: - the selection color must have an alpha component, - the background color of the span used for highlighted words must have an alpha component either, but now the opacity is 1 we can use some backdrop-filters in HCM making the highlighted words more visible. - fix a regression caused by #17196: the css variable --hcm-highlight-filter has to live under the #viewer element because in HCM it's overwritten by js at this level, hence links annotations for example didn't have the right colors when hovered.	2024-01-18 19:44:21 +01:00
Jonas Jenwald	fd7a7e2859	Use the `fetchData` helper function in more cases - Extend the `fetchData` helper function to also support fetching of "blob" data. - Use the `fetchData` helper function more in the code-base, when fetching non-PDF data. Given that the Fetch API isn't supported for all protocols, this should improve compatibility for the PDF.js library.	2023-11-24 13:05:57 +01:00
Jonas Jenwald	709d89420e	Re-factor how the `GenericL10n` class fetches localization-data - Re-factor the existing `fetchData` helper function such that it can fetch more types of data, and it now supports "arraybuffer", "json", and "text". This only needed minor adjustments in the `DOMCMapReaderFactory` and `DOMStandardFontDataFactory` classes.[1] - Expose the `fetchData` helper function in the API, such that the viewer is able to access it. - Use the `fetchData` helper function in the `GenericL10n` class, since this should allow fetching of localization-data even if the default viewer is run in an environment without support for the Fetch API. --- [1] While testing this I also noticed a minor inconsistency when handling standard font-data on the worker-thread.	2023-11-14 13:45:14 +01:00
Jonas Jenwald	927e50f5d4	[api-major] Output JavaScript modules in the builds (issue 10317) At this point in time all browsers, and also Node.js, support standard `import`/`export` statements and we can now finally consider outputting modern JavaScript modules in the builds.[1] In order for this to work we can only use proper `import`/`export` statements throughout the main code-base, and (as expected) our Node.js support made this much more complicated since both the official builds and the GitHub Actions-based tests must keep working.[2] One remaining issue is that the `pdf.scripting.js` file cannot be built as a JavaScript module, since doing so breaks PDF scripting. Note that my initial goal was to try and split these changes into a couple of commits, however that unfortunately didn't really work since it turned out to be difficult for smaller patches to work correctly and pass (all) tests that way.[3] This is a classic case of every change requiring a couple of other changes, with each of those changes requiring further changes in turn and the size/scope quickly increasing as a result. One possible "issue" with these changes is that we'll now only output JavaScript modules in the builds, which could perhaps be a problem with older tools. However it unfortunately seems far too complicated/time-consuming for us to attempt to support both the old and modern module formats, hence the alternative would be to do "nothing" here and just keep our "old" builds.[4] --- [1] The final blocker was module support in workers in Firefox, which was implemented in Firefox 114; please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Statements/import#browser_compatibility [2] It's probably possible to further improve/simplify especially the Node.js-specific code, but it does appear to work as-is. [3] Having partially "broken" patches, that fail tests, as part of the commit history is really not a good idea in general. [4] Outputting JavaScript modules was first requested almost five years ago, see issue 10317, and nowadays there should be much better support for JavaScript modules in various tools.	2023-10-07 09:31:08 +02:00
Jonas Jenwald	1df31c0284	Use one `noContextMenu` function in both the src/- and web/-folders Currently we duplicate this event handler function in multiple places, which seems unnecessary.	2023-09-23 15:37:13 +02:00
Calixte Denizet	d185db2b70	Add tagged annotations in the structure tree (bug 1850797)	2023-08-31 12:35:32 +02:00
Calixte Denizet	400699687e	[Editor] Fix the dimensions of the annotation editor layer (follow-up of #16794 )	2023-08-08 11:19:04 +02:00
Jonas Jenwald	e414dfcff7	Use the `round` CSS function in the `setLayerDimensions` helper function This has now been enabled unconditionally in Firefox, see https://bugzilla.mozilla.org/show_bug.cgi?id=1814589 For the `page`-containers in the viewer, this patch should restore the behaviour prior to PR 15770; see e.g. https://github.com/mozilla/pdf.js/pull/15770/files#diff-c48e3561004f5db8f11d5ebab2fd661591222ba911cb4173fbced15f026bac6bL182-L183 Note that these changes this will lead to a tiny bit of movement in some `text` and `annotations` reference tests. Please find additional information at https://developer.mozilla.org/en-US/docs/Web/CSS/round	2023-08-04 14:33:06 +02:00
Calixte Denizet	c519cc821b	Improve highlightments and popups in HCM (bug 1830850) - Modify the text and background colors in popup to fit a11y requirements - Add a backdrop filter on clickable areas in using a svg filter mapping canvas colors to Highlight and HighlightText ones.	2023-06-26 14:45:03 +02:00
Jonas Jenwald	25a07d92d0	[Firefox] Stub out the `isValidFetchUrl` function in MOZCENTRAL builds This helper function is completely unused in the Firefox PDF Viewer.	2023-06-22 15:02:54 +02:00
Jonas Jenwald	0650be4641	Merge pull request #16550 from Snuffleupagus/rm-RenderingCancelledException-type [api-minor] Remove the `type` from `RenderingCancelledException` (PR 16226 follow-up)	2023-06-15 12:26:27 +02:00
Jonas Jenwald	225734dd00	[api-minor] Remove the `type` from `RenderingCancelledException` (PR 16226 follow-up) After PR 16226 we're only using `RenderingCancelledException` together with canvas-rendering, hence the `type`-property is no longer necessary.	2023-06-14 15:40:25 +02:00
Jonas Jenwald	fee850737b	Enable the `unicorn/prefer-optional-catch-binding` ESLint plugin rule According to MDN this format is available in all browsers/environments that we currently support, see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Statements/try...catch#browser_compatibility Please also see https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/prefer-optional-catch-binding.md	2023-06-12 11:46:11 +02:00
Jonas Jenwald	e8030752f3	Introduce even more modern JavaScript features in the code-base After PR 12563 we're now free to use e.g. logical OR assignment, nullish coalescing, and optional chaining in the entire code-base.	2023-05-18 18:55:41 +02:00
Jonas Jenwald	86cf62699c	Add a helper, in `DOMFilterFactory`, to reduce duplication when creating `<feFuncX>`s Currently we repeat the same code verbatim multiple times in the `DOMFilterFactory`, which seems completely unnecessary.	2023-03-20 11:27:47 +01:00
Calixte Denizet	da080cc26e	[api-minor] Use a SVG filter when rendering pages in HCM The idea is to apply an overall filter on each page: the main advantage is to have some filtered images which could help to make them visible for some users.	2023-03-18 12:45:10 +01:00
Jonas Jenwald	fc055dbd80	[api-minor] Extend general transfer function support to browsers without `OffscreenCanvas` This patch extends PR 16115 to work in all browsers, regardless of their `OffscreenCanvas` support, such that transfer functions will be applied to general rendering (and not just image data). In order to do this we introduce the `BaseFilterFactory` that is then extended in browsers/Node.js environments, similar to all the other factories used in the API, such that we always have the necessary factory available in `src/display/canvas.js`. These changes help simplify the existing `putBinaryImageData` function, and the new method can easily be stubbed-out in the Firefox PDF Viewer. Please note: This patch removes the old partial transfer function support, which only applied to image data, from Node.js environments since the `node-canvas` package currently doesn't support filters. However, this should hopefully be fine given that: - Transfer functions are not very commonly used in PDF documents. - Browsers in general, and Firefox in particular, are the primary development target for the PDF.js library. - The FAQ only lists Node.js as mostly supported, see https://github.com/mozilla/pdf.js/wiki/Frequently-Asked-Questions#faq-support	2023-03-14 13:09:08 +01:00
Tim van der Meij	05f29c8720	Merge pull request #16141 from Snuffleupagus/FilterFactory-docId Include the document-id in the SVG-filter names (PR 16062 follow-up)	2023-03-11 15:45:42 +01:00
Tim van der Meij	1234d207e9	Merge pull request #16142 from Snuffleupagus/issue-16134 Fix the JSDoc `returns`-type for two `PageViewport`-methods (issue 16134)	2023-03-11 14:53:21 +01:00
Calixte Denizet	f9539e57ef	Move the svg definitions in its own div This way the svg element doesn't take up space in the viewer. Fixes issue #16135.	2023-03-10 17:03:40 +01:00
Jonas Jenwald	9232264b8a	Fix the JSDoc `returns`-type for two `PageViewport`-methods (issue 16134) The affected methods have always returned Arrays, however the JSDoc did not accurately reflect that.	2023-03-09 15:57:16 +01:00
Jonas Jenwald	92296fa6a1	Include the document-id in the SVG-filter names (PR 16062 follow-up) In the general PDF.js library multiple PDF documents may be opened on the same web-page, which is why we many years ago started using document-specific identifiers to prevent issues with global data such e.g. with fonts. Hence we need to treat the identifiers generated by the `FilterFactory` in the same way, since the SVG-filters for two separate PDF documents may otherwise get identical ids.	2023-03-09 15:35:29 +01:00
Calixte Denizet	8304df2520	Apply transfer filters to any graphic commands	2023-03-07 22:17:19 +01:00
Calixte Denizet	fd03cd5493	[api-minor] Generate images in the worker instead of the main thread. We introduced the use of OffscreenCanvas in #14754 and this patch aims to use them for all kind of images. It'll slightly improve performances (and maybe slightly decrease memory use). Since an image can be rendered in using some transfer maps but because of OffscreenCanvas we don't have the underlying pixels array the transfer maps stuff is re-implemented in using the SVG filter feComponentTransfer.	2023-03-01 17:40:12 +01:00
Jonas Jenwald	91524d1a60	[api-minor] Allow specifying an extra-delay, in `RenderTask.cancel`, for worker-thread aborting of operatorList parsing This is done to support upcoming viewer-changes, and in order to prevent third-party users from outright breaking things we'll simply ignore too large values.	2022-12-14 12:34:16 +01:00
Jonas Jenwald	cafdc48147	[api-minor] Add a new `PageViewport`-getter to access the original, un-scaled, viewport dimensions While reviewing recent patches, I couldn't help but noticing that we now have a lot of call-sites that manually access the `PageViewport.viewBox`-property. Rather than repeating that verbatim all over the code-base, this patch adds a lazily computed and cached getter for this data instead.	2022-12-11 18:37:35 +01:00
Calixte Denizet	a989b5a879	Set the dimensions of the various layers at their creation - Use a unique helper function in display/display_utils.js; - Move those dimensions in css' side.	2022-12-10 14:35:06 +01:00
Jonas Jenwald	5f8598abb7	[api-minor] Normalize the `view`-getter on the worker-thread Please note: I don't really expect that this is will be an observable change, since virtually all PDF documents already order e.g. /MediaBox and /CropBox entries correctly. By normalizing boundingBoxes already on the worker-thread, we can be sure that even a corrupt document won't cause issues. Note how we're passing the `view`-getter to the `PartialEvaluator.getTextContent` method, in order to detect textContent which is outside of the page, hence it makes sense to ensure that it's formatted as expected. Furthermore, by normalizing this once on the worker-tread we should no longer have to worry about a possibly negative width/height in the `PageViewport` constructor. Finally, the patch also simplifies the `view`-getter a little bit.	2022-12-02 15:46:39 +01:00
Jonas Jenwald	f3e0f86641	Simplify the `getFilenameFromUrl` helper function	2022-11-23 11:48:08 +01:00

1 2 3