
Beanz/pdf.js

Author	SHA1	Message	Date
Calixte Denizet	d7cbda6cb5	Avoid creating any subarrays when optimizing 'save, transform, constructPath, restore' (bug 1961107) Removing those `subarray` calls helps to improve performance by a factor of 6 on Linux and by a factor of 3 on Windows 11.	2025-04-17 19:14:01 +02:00
Jonas Jenwald	22657e2b6e	Merge pull request #19778 from Snuffleupagus/replace-getAll [api-minor] Replace various `getAll` methods with iterators	2025-04-08 17:26:24 +02:00
Jonas Jenwald	d882d0869c	Move the `IDENTITY_MATRIX` constant into `src/core/core_utils.js` (PR 19772 follow-up) After the changes in PR 19772 the `IDENTITY_MATRIX` constant is now only used on the worker-thread, which leads to Webpack marking the code as unused in the built `pdf.mjs` file; see https://phabricator.services.mozilla.com/D244533#change-8oITAexCvrlQ	2025-04-07 11:40:18 +02:00
Jonas Jenwald	2c593b06e4	[api-minor] Replace various `getAll` methods with iterators These `getAll` methods are not used anywhere within the PDF.js code-base, outside of tests, and were mostly added (speculatively) for third-party users. To still allow access to the same data we instead introduce iterators on these classes, which (slightly) shortens the code and allows us to remove the `objectFromMap` helper function. A summary of the changes in this patch: - Replace the `getAll` methods with iterators in the following classes: `AnnotationStorage`, `Metadata`, and `OptionalContentGroup`. - Change, and also re-name, `AnnotationStorage.prototype.setAll` into a test-only method since it's not used elsewhere. - Remove the `Metadata.prototype.has` method, since it's only used in tests and can be trivially replaced by calling `Metadata.prototype.get` and checking if the returned value is `null`.	2025-04-06 21:43:16 +02:00
Calixte Denizet	e7a951547d	Replace UpdateRectMinMax by getAxialAlignedBoundingBox and don't use array destructuring because it induces memory and perf penalties.	2025-04-04 19:57:55 +02:00
Jonas Jenwald	4262603b06	Re-name the `Util.applyTransformToBezierInPlace` method Given that all `Util.apply...` methods are now using in/out parameters, we can slightly shorten the name of this one.	2025-04-01 23:03:50 +02:00
Jonas Jenwald	c852e877d8	Change `Util.applyInverseTransform` to use the point-argument as an in/out parameter This will help reduce the total number of Array allocations, which cannot hurt.	2025-04-01 23:03:44 +02:00
Jonas Jenwald	fa643bb22f	Change `Util.applyTransform` to use the point-argument as an in/out parameter This will help reduce the total number of Array allocations, which cannot hurt, and also allows us to remove the `Util.applyTransformInPlace` method.	2025-04-01 23:03:36 +02:00
Jonas Jenwald	e5fbf52405	Merge pull request #19736 from Snuffleupagus/compileType3Glyph-worker [api-minor] Move Type3-glyph compilation to the worker-thread	2025-04-01 19:40:30 +02:00
calixteman	25693dc0ee	Merge pull request #19712 from calixteman/optimize_save_construct Optimize save-transform-constructPath-restore	2025-04-01 18:20:59 +02:00
Jonas Jenwald	9cd5a9658a	[api-minor] Move Type3-glyph compilation to the worker-thread After PR 19731 the format of compiled Type3-glyphs is now simple enough that the compilation can be moved to the worker-thread, without introducing any significant additional complexity. This allows us to, ever so slightly, simplify the implementation in `src/display/canvas.js` since the Type3 operatorLists will now directly include standard path-rendering operators (using the format introduced in PR 19689). As part of these changes we also stop caching Type3 image masks since: we've not come across any cases where that actually helps, they're usually fairly small, and it simplifies the code. Note that one "negative" change introduced in this patch is that we'll now compile Type3-glyphs eagerly, whereas previously we'd only do that lazily upon their first use. However, this doesn't seem to impact performance in any noticeable way since the compilation is fast enough (way below 1 ms/glyph in my testing) and Type3-fonts are also limited to just 256 glyphs. Also, many (or most?) Type3-fonts don't even use image masks and are thus not affected by these changes.	2025-04-01 09:09:00 +02:00
Calixte Denizet	6e9fbd9420	Simplify singularValueDecompose2dScale in order to make it use less memory Using the Firefox profiler (with JS allocations tracking) and wuppertal.pdf, I noticed we were using a bit too much memory for a function which is supposed to just compute 2 numbers. The memory used isn't so important by itself, but having too many objects leads to time wasted garbage-collecting them. So this patch aims to simplify it a bit.	2025-03-31 10:28:01 +02:00
Calixte Denizet	6146e5fee7	Optimize save-transform-constructPath-restore The 4 operations can be replaced with just one by applying the transform to the point coordinates.	2025-03-25 15:31:45 +01:00
Jonas Jenwald	b85f0903ca	Add new bounding-box helpers in `Util` to reduce code duplication Currently we have a `Util`-helper for computing the bounding-box of a Bézier curve, however for simple points and rectangles we repeat virtually identical code in many spots throughout the code-base. - Introduce new `Util.pointBoundingBox` and `Util.rectBoundingBox` helpers. - Remove the "fallback" from `Util.bezierBoundingBox` and only support passing in a `minMax`-array, since there's only a single call-site using the other format and it could be easily updated.	2025-03-23 19:20:02 +01:00
Jonas Jenwald	e73224ded7	Introduce `Math.sumPrecise` usage in the code-base This is a new JavaScript feature that makes it easy to compute the sum of a list of values; see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/sumPrecise This allows us to remove most cases of `Array.prototype.reduce`, which helps improve readability since that (in my opinion) often isn't the most intuitive code.	2025-03-23 13:03:56 +01:00
calixteman	d009e4b3a7	Merge pull request #19689 from calixteman/use_path2d [api-minor] Use a Path2D when doing a path operation in the canvas (bug 1946953)	2025-03-22 21:46:27 +01:00
Calixte Denizet	be1f5671bb	[api-minor] Use a Path2D when doing a path operation in the canvas (bug 1946953) With this patch, all the path components are collected in the worker until a path operation is met (i.e., stroke, fill, ...). Then in the canvas a Path2D is created and will replace the path data transferred from the worker; this way, when rescaling, the Path2D can be reused. In terms of performance, using a Path2D very slightly improves speed when scaling the canvas.	2025-03-22 20:35:24 +01:00
Jonas Jenwald	028e4f7ea8	[api-minor] Add a basic `AbortSignal.any` polyfill in PDF.js `legacy` builds This is an admittedly very basic polyfill, to allow us to remove a bunch of inline feature testing, that I've thrown together based on reading https://developer.mozilla.org/en-US/docs/Web/API/AbortSignal/any_static and related MDN articles. Compared to PR 19218 it's obviously much more "primitive", however the implementation is simple and it doesn't suffer from any licensing issues (since I wrote the code myself).	2025-03-18 10:54:49 +01:00
Jonas Jenwald	07bbbf75a5	Introduce a helper function for clamping a value to a range Currently we have a number of spots in the code-base where we need to clamp a value to a [min, max] range. This is either implemented using `Math.min`/`Math.max` or with a local helper function, which leads to some unnecessary duplication. Hence this patch adds and re-uses a single helper function for this, which we'll hopefully be able to remove in the future once https://github.com/tc39/proposal-math-clamp/ becomes generally available.	2025-03-06 14:05:26 +01:00
Jonas Jenwald	165d90fe26	Re-use the `isValidExplicitDest` helper function in the worker/viewer Currently we re-implement the same helper function twice, which in hindsight seems like the wrong decision since that way it's quite easy for the implementations to accidentally diverge. The reason for doing it this way was because the code in the worker-thread is able to check for `Ref`- and `Name`-instances directly, which obviously isn't possible in the viewer but can be solved by passing validation-functions to the helper.	2025-03-01 12:08:56 +01:00
Jonas Jenwald	c2e33307b1	Introduce some `URL.parse()` usage in the code-base This (fairly new) static method allows parsing URLs without having to wrap `new URL(...)` calls within `try...catch` blocks, thus simplifying the code; see https://developer.mozilla.org/en-US/docs/Web/API/URL/parse_static For older browsers/environments the functionality will be polyfilled, but only in `legacy` builds, via `core-js`; see https://github.com/zloirock/core-js?tab=readme-ov-file#url-and-urlsearchparams Please note: This is currently limited to the `src/`- and `web/`-folders, such that we don't break development/testing, since the functionality is not available in all Node.js versions that we support; see https://developer.mozilla.org/en-US/docs/Web/API/URL/parse_static#browser_compatibility	2025-02-15 19:10:36 +01:00
Jonas Jenwald	d5d3d8b3f7	[GENERIC viewer] Add Fluent `PLATFORM` function (PR 19414 follow-up) This patch extends the `FeatureTest.platform` getter to provide the necessary information, and uses that one to implement a Fluent `PLATFORM` function based on: https://searchfox.org/mozilla-central/rev/d1fbe983fb7720f0a4aca0e748817af11c1a374e/intl/l10n/rust/fluent-ffi/src/bundle.rs#81-98	2025-02-06 18:59:35 +01:00
calixteman	42c2b7b657	Merge pull request #19339 from calixteman/signature_tools [Editor] (WIP) Add a new tool in order to add a handwritten signature to a pdf (bug 1942343)	2025-01-30 10:54:56 +01:00
Calixte Denizet	2f828c7bf4	[Editor] (WIP) Add a new tool in order to add a handwritten signature to a pdf (bug 1942343) This patch is adding some code in order to extract a drawing as curves from an image. The algorithm is basically the following: - reduce the dimensions - make it gray - apply a bilateral filter in order to add some blurriness while keeping the edges - compute the histogram - guess what's the background color which should contain a large majority of the pixels - make a binary image - extract the contours using the Suzuki algorithm - apply the Douglas-Peucker algorithm in order to reduce the number of points The algorithm is improvable but it should work pretty well if there's a clear difference between the background and the drawing. In a v2 we could use a ML model in order to improve the extraction. There are a few changes related to the UI in order to make the tool usable, but they're very basic for the moment.	2025-01-29 21:52:14 +01:00
Jonas Jenwald	9241e1be8c	[api-minor] Simplify clean-up of page resources after rendering After PR 2317, which landed in 2012, we'd immediately clean-up after rendering for pages with large image resources. This had the effect that re-rendering, e.g. after zooming, would force us to re-parse the entire page which could easily lead to bad performance. In PR 16108, which landed in 2023, we tried to lessen the impact of that by slightly delaying clean-up however that's obviously not a perfect solution (and it increased the complexity of the relevant code). Furthermore, the condition for this "immediate" clean-up seems a bit arbitrary to me since a page could easily contain a large number of smaller images whose total size vastly exceeds the threshold. Hence this patch, which suggests that we remove the conditional and delayed clean-up after rendering. Compared to the situation back in 2012, a number of things have improved since: - We have multiple caches for repeated image-resources on the worker-thread[1], which helps reduce overall memory usage and improves performance. - We downsize huge images on the worker-thread, which means that the images we're using on the main-thread cannot be arbitrarily large. - The amount of available RAM on devices should be a lot higher, since more than a decade has passed. A future improvement here, for more resource constrained environments, could be to instead clean-up when actually needed using e.g. `WeakRef`s (see issue 18148). --- [1] More specifically: - `LocalImageCache`, which caches image-data by /Name and /Ref on the `PartialEvaluator.prototype.getOperatorList` level. - `RegionalImageCache`, which caches image-data by /Ref on the `PartialEvaluator`-instance (i.e. at the page) level. - `GlobalImageCache`, which caches image-data by /Ref globally at the document level.	2025-01-22 12:19:44 +01:00
Jonas Jenwald	75cba72ca6	[api-major] Replace `MissingPDFException` and `UnexpectedResponseException` with one exception These old exceptions have a fair amount of overlap given how/where they are being used, which is likely because they were introduced at different points in time, hence we can shorten and simplify the code by replacing them with a more general `ResponseException` instead. Besides an error message, the new `ResponseException` instances also include: - A numeric `status` field containing the server response status, similar to the old `UnexpectedResponseException`. - A boolean `missing` field, to allow easily detecting the situations where `MissingPDFException` was previously thrown.	2025-01-16 22:51:05 +01:00
Calixte Denizet	2b05924504	Improve perfs of the font renderer Some SVG paths are generated from the font and used in the main thread to render the glyphs.	2024-12-08 17:12:10 +01:00
Jonas Jenwald	ef0331877d	Introduce `Promise.try()` usage in the code-base This simplifies the creation of Promises in some cases; see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Promise/try	2024-12-04 10:31:42 +01:00
Calixte Denizet	cee65fcd4e	[Editor] Add a new base class to allow to add a drawing in the SVG layer. This patch makes a clear separation between the way to draw and the editing stuff. It adds a class DrawEditor which should be extended in order to create new drawing tools. As an example, the ink tool has been rewritten in order to use it.	2024-11-28 15:23:03 +01:00
Jonas Jenwald	c290a12ce1	Simplify the `getUuid` helper function We can remove most feature testing from this helper function, with the exception of `randomUUID` since that's only available in "secure contexts", and also remove the fallback code-path. Note that this code was only added for Node.js compatibility, and it's no longer necessary now that the minimum support version is `20`; see also https://developer.mozilla.org/en-US/docs/Web/API/Crypto#browser_compatibility Finally, this patch also adds a basic unit-test for the helper function.	2024-11-21 13:11:10 +01:00
Jonas Jenwald	af3c1f2a0d	Enable the ESLint `no-console` rule in parts of the code-base The purpose of these changes is to make it more difficult to accidentally include logging statements, used during development and debugging, when submitting patches for review. For (almost) all code residing in the `src/` folder we should use our existing helper functions to ensure that all logging can be controlled via the `verbosity` API-option. For the `test/unit/` respectively `test/integration/` folders we shouldn't need any "normal" logging, but it should be OK to print the occasional warning/error message. Please find additional details about the ESLint rule at https://eslint.org/docs/latest/rules/no-console	2024-11-14 12:41:39 +01:00
Jonas Jenwald	65eedfb0fc	[api-minor] Add a `getDocument` option to disable `ImageDecoder` usage This allows end-users to forcibly disable `ImageDecoder` usage, even if the browser appears to support it (similar to the pre-existing option for `OffscreenCanvas`).	2024-11-12 17:12:42 +01:00
Jonas Jenwald	2145a7b9ca	Use the `hexNumbers` structure in the `stringToUTF16HexString` helper We can re-use the `hexNumbers` structure here, since that allows us to directly lookup the hexadecimal values and shortens the code.	2024-11-02 15:00:32 +01:00
Jonas Jenwald	db1238aae3	Re-factor the `toHexUtil` helper (PR 17862 follow-up) We can re-use the `hexNumbers` structure, since that allows us to directly lookup the hexadecimal values and shortens the code.	2024-10-29 16:35:44 +01:00
Jonas Jenwald	8f47d06d07	Add helper functions to allow using new `Uint8Array` methods This allows using the new methods in browsers that support them, e.g. Firefox 133+, while still providing fallbacks where necessary; see https://github.com/tc39/proposal-arraybuffer-base64 Please note: These are not actual polyfills, but only implements what we need in the PDF.js code-base. Eventually this patch should be reverted, once support is generally available.	2024-10-29 10:22:35 +01:00
Jonas Jenwald	b048420d21	[api-minor] Remove the `CMapCompressionType` enumeration After the binary CMap format had been added there were also some ideas about maybe providing other formats, see [here](https://github.com/mozilla/pdf.js/pull/8064#issuecomment-279730182), however that was over seven years ago and we still only use binary CMaps. Hence it now seems reasonable to simplify the relevant code by removing `CMapCompressionType` and instead just use a boolean to indicate the type of the built-in CMaps.	2024-10-24 11:08:16 +02:00
Calixte Denizet	a45e4a391a	Use Calibri and Lucida Console, when it's possible, in place of sans-serif and monospaced (bug 1922063) A recent change in Firefox induced too much difference between the text widths computed using a Canvas and the ones computed by the text layout engine when rendering the text layer. Consequently, the text selection can be bad on Windows with some fonts like Arial or Consolas. This patch is a workaround that tries to use, in the first place, some fonts which don't have the problem.	2024-10-05 20:45:25 +02:00
Jonas Jenwald	aebb8534f3	Limit base-class initialization checks to development and TESTING modes We have a number of base-classes that are only intended to be extended, but never to be used directly. To help enforce this during development these base-class constructors will check for direct usage, however that code is obviously not needed in the actual builds. Note: This patch reduces the size of the `gulp mozcentral` output by `~2.7` kilo-bytes, which isn't a lot but still cannot hurt.	2024-08-12 12:26:35 +02:00
Calixte Denizet	482994cc04	Use a transparent color when setting fill/stroke colors in a pattern context but with no colorspace	2024-07-22 09:56:10 +02:00
Jonas Jenwald	a4ffc1066c	Move the internal API/Worker `isEditing`-state into `RenderingIntentFlag` In hindsight this seems like a better idea, since it avoids the need to manually pass `isEditing` around as a boolean value. Note that `RenderingIntentFlag` is internal functionality, not exposed in the official API, which means that it can be extended and modified as necessary.	2024-07-04 23:34:30 +02:00
Calixte Denizet	551e63901c	Simplify the way to pass the glyph drawing instructions from the worker to the main thread and remove the use of eval in the font loader.	2024-04-27 21:28:31 +02:00
Jonas Jenwald	e4d0e84802	[api-minor] Replace the `PromiseCapability` with `Promise.withResolvers()` This replaces our custom `PromiseCapability`-class with the new native `Promise.withResolvers()` functionality, which does almost the same thing[1]; please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Promise/withResolvers The only difference is that `PromiseCapability` also had a `settled`-getter, which was however not widely used and the call-sites can either be removed or re-factored to avoid it. In particular: - In `src/display/api.js` we can tweak the `PDFObjects`-class to use a "special" initial data-value and just compare against that, in order to replace the `settled`-state. - In `web/app.js` we change the only case to manually track the `settled`-state, which should hopefully be OK given how this is being used. - In `web/pdf_outline_viewer.js` we can remove the `settled`-checks, since the code should work just fine without it. The only thing that could potentially happen is that we try to `resolve` a Promise multiple times, which is however not a problem since the value of a Promise cannot be changed once fulfilled or rejected. - In `web/pdf_viewer.js` we can remove the `settled`-checks, since the code should work fine without them: - For the `_onePageRenderedCapability` case the `settled`-check is used in a `EventBus`-listener which is removed on its first (valid) invocation. - For the `_pagesCapability` case the `settled`-check is used in a print-related helper that works just fine with "only" the other checks. - In `test/unit/api_spec.js` we can change the few relevant cases to manually track the `settled`-state, since this is both simple and test-only code. --- [1] In browsers/environments that lack native support, note [the compatibility data](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Promise/withResolvers#browser_compatibility), it'll be polyfilled via the `core-js` library (but only in `legacy` builds).	2024-04-01 11:42:37 +02:00
Calixte Denizet	39aeea3e94	[Editor] Add a toggle button to show/hide all the highlights (bug 1867740)	2024-03-07 13:16:59 +01:00
Jonas Jenwald	49a2aff532	Tweak how the `hexNumbers` Array, used by `Util.makeHexColor`, is built Please note: This is a micro optimization, hence I fully understand if the patch is rejected. Currently we create two temporary Arrays and have to iterate twice in total when building the final `hexNumbers` Array. With this patch there's only one temporary Array and a single iteration required to build the final `hexNumbers` Array.	2024-02-23 14:20:53 +01:00
Calixte Denizet	b5e446213e	[Editor] Disable the thickness slider for non-free highlight	2024-02-09 22:54:53 +01:00
Calixte Denizet	7f2428a77e	Reduce memory use and improve perfs when computing the bounding box of a bezier curve (bug 1875547) It isn't really a fix for the mentioned bug but it slightly improves things. By reducing the memory use, the time spent in the GC is reduced as well. The algorithm to compute the bounding box is the same as before but it has just been rewritten to be more efficient.	2024-01-24 23:41:14 +01:00
Calixte Denizet	2b8ecf5688	[Editor] Add the possibility to change the thickness of a free highlight (bug 1876096)	2024-01-24 22:04:39 +01:00
Jonas Jenwald	f9a384d711	Enable the `arrow-body-style` ESLint rule This manually ignores some cases where the resulting auto-formatting would not, as far as I'm concerned, constitute a readability improvement or where we'd just end up with more overall indentation. Please see https://eslint.org/docs/latest/rules/arrow-body-style	2024-01-21 16:20:55 +01:00
Jonas Jenwald	b37536c38c	Remove the `isArrayBuffer` helper function This old helper function can now be replaced with `ArrayBuffer.isView()` and/or `instanceof ArrayBuffer` checks, as needed depending on the situation.	2024-01-19 14:10:52 +01:00
Calixte Denizet	ff23d37fa2	[Editor] Add a color picker with predefined colors for highlighting text (bug 1866434) The doorhanger for highlighting has a basic color picker composed of 5 predefined colors to set the default color to use. These colors can be changed thanks to a preference for now but it's something which could be changed in the Firefox settings in the future. Each highlight has in its own toolbar a color picker to just change its color. The different color pickers are so similar (modulo few differences in their styles) that this patch introduces a new class ColorPicker which provides a color picker component which could be reused in future editors. All in all, a large part of this patch is dedicated to color picker itself and its style and the rest is almost a matter of wiring the component.	2023-12-05 23:27:22 +01:00
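
Several entries above (fa643bb22f, c852e877d8) describe changing `Util.applyTransform` and `Util.applyInverseTransform` to treat the point argument as an in/out parameter in order to reduce Array allocations. A minimal sketch of that pattern follows. The function names and bodies are illustrative assumptions rather than the PDF.js source, and the matrix layout `[a, b, c, d, e, f]` is assumed to follow the usual PDF/canvas convention.

```javascript
// Hypothetical sketch, not the PDF.js implementation: names are illustrative.
// Assumed matrix layout [a, b, c, d, e, f]: x' = a*x + c*y + e, y' = b*x + d*y + f.

// Allocating variant: returns a fresh two-element Array on every call.
function applyTransformAlloc(p, m) {
  return [
    p[0] * m[0] + p[1] * m[2] + m[4],
    p[0] * m[1] + p[1] * m[3] + m[5],
  ];
}

// In/out variant: overwrites `p` with the transformed point, so nothing is allocated.
function applyTransformInOut(p, m) {
  const x = p[0];
  const y = p[1];
  p[0] = x * m[0] + y * m[2] + m[4];
  p[1] = x * m[1] + y * m[3] + m[5];
}

const matrix = [2, 0, 0, 2, 10, 20]; // scale by 2, then translate by (10, 20)
const point = [1, 1];
applyTransformInOut(point, matrix);
console.log(point); // [ 12, 22 ]
```

The trade-off is that callers can no longer rely on the input point being preserved; a caller that still needs the original coordinates has to copy them first.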
