Beanz/pdf.js - pdf.js - Gitea: Git with a cup of tea

Beanz/pdf.js

Author	SHA1	Message	Date
zenocross	9a8641a59a	Merge f0964c7dfde6c0c8c006b55c5ed0cdd2bc131554 into ec71e4ed651e659b06a4fa46ef0b18ff9ab2a8c7	2025-11-25 15:21:08 -05:00
calixteman	1a8689b9be	Merge pull request #20340 from Aditi-1400/serialize-pattern-ab Serialize pattern data into ArrayBuffer	2025-10-22 11:05:22 +02:00
Calixte Denizet	199b3d04df	Fix stream use when getting the text (follow-up of #20373 )	2025-10-18 22:58:27 +02:00
Aditi	fa631806bf	Serialize pattern data into ArrayBuffer Follow up on https://github.com/mozilla/pdf.js/pull/20197, This serializes pattern data into an ArrayBuffer which is then transferred from the worker to the main thread. It sets up the stage for us to eventually switch to a SharedArrayBuffer in the future.	2025-10-11 01:58:07 +05:30
Calixte Denizet	4d15bfec0d	Only apply word spacing when there is a 0x20 in the text chunk Fixes #20319.	2025-10-03 22:18:02 +02:00
Ujjwal Sharma	4bed7370f4	[WIP] Serialize font data into an ArrayBuffer This PR serializes font data into an ArrayBuffer that is then transfered from the worker to the main thread. It's more efficient than the current solution which clones the "export data" object which includes the font data as a Uint8Array. It prepares us to switch to a SharedArrayBuffer in the future, which would allow us to share the font data with multiple agents, which would be crucial for the upcoming "renderer" worker.	2025-09-19 12:02:40 +05:30
Calixte Denizet	b6d772d71d	Consider a ttf font with both Symbolic and Nonsymbolic flags set with a Differences array in the encoding dict as non-symbolic It fixes #20232.	2025-09-14 18:52:16 +02:00
Calixte Denizet	1d4ae786f4	Check the setDash arguments It fixes #20155.	2025-08-09 22:34:44 +02:00
Noritaka Kobayashi	fa568e826d	Fix typos across the codebase	2025-07-07 09:59:36 +09:00
Calixte Denizet	3bdc5d54fe	Get the text under highlight/squiggly/underline/strikethrough annotations (bug 1885505) and add an invisible element containing the text in the annotation layer to make it readable by a screen reader.	2025-06-22 21:47:29 +02:00
ritwic	04511c607b	Parse the group blend mode from ExtGState in Form XObject Resources	2025-06-19 07:51:05 +08:00
Jonas Jenwald	36b40d959b	Merge pull request #19955 from Snuffleupagus/issue-19954 Support Type3 fonts with an incomplete /FontDescriptor dictionary (issue 19954)	2025-05-19 17:26:46 +02:00
Jonas Jenwald	c02ea0c681	Simplify how we handle Type3 fonts without a /FontDescriptor dictionary Part of this is very old code, which we can now simplify a little bit.	2025-05-19 15:26:11 +02:00
Calixte Denizet	5789afd3f8	Create the css color to use with the canvas in the worker It slightly reduces the time spent to draw and the memory used.	2025-05-19 14:52:24 +02:00
Jonas Jenwald	5f5d9dfc28	Support Type3 fonts with an incomplete /FontDescriptor dictionary (issue 19954) We have a fallback for the common case of Type3 fonts without a /FontDescriptor dictionary, however we also need to handle the case where it's present but lacking the required /FontName entry.	2025-05-19 12:56:14 +02:00
Jonas Jenwald	64007e777e	Ensure that the /Form XObject /Resources-entry is actually a dictionary (issue 19848)	2025-04-23 10:19:20 +02:00
Jonas Jenwald	1048508dd1	Catch circular references in /Form XObjects (issue 19800) For simplicity we will abort /Form XObject parsing immediately when encountering a circular reference, rather than letting it continue up until some limit (as e.g. PDFium appears to do), which should be fine since there are never any guarantees if/how corrupt PDF documents will render.	2025-04-11 16:54:22 +02:00
Jonas Jenwald	12c7c7b0af	Merge pull request #19773 from Snuffleupagus/inline-PDFImage-createRawMask Inline `PDFImage.createRawMask` in the `PDFImage.createMask` method	2025-04-08 17:19:09 +02:00
Jonas Jenwald	dc3e24a76a	Inline `PDFImage.createRawMask` in the `PDFImage.createMask` method After the introduction of `OffscreenCanvas` support we now have two separate mask-methods in the `PDFImage` class, and the reason that they were not combined is likely that we need the "raw" bytes when parsing Type3-glyph image masks. However, that case is easy to support simply by disabling `OffscreenCanvas` usage when parsing Type3-glyphs and that way we're able to reduce some code duplication. Another slightly strange property of the `PDFImage.createMask` method is that it needs various image-dictionary parameters manually provided, which is probably because this is very old code. That feels slightly unwieldy, and we instead change the method to pass in the image-stream directly and do the necessary data-lookup internally. A side-effect of this re-factoring is that we now support using the custom `isSingleOpaquePixel` operator in Type3-glyphs, which shouldn't hurt even though it seems extremely unlikely for that to ever happen in Type3-glyphs.	2025-04-08 12:01:50 +02:00
Jonas Jenwald	d882d0869c	Move the `IDENTITY_MATRIX` constant into `src/core/core_utils.js` (PR 19772 follow-up) After the changes in PR 19772 the `IDENTITY_MATRIX` constant is now only used on the worker-thread, which leads to Webpack marking the code as unused in the built `pdf.mjs` file; see https://phabricator.services.mozilla.com/D244533#change-8oITAexCvrlQ	2025-04-07 11:40:18 +02:00
Calixte Denizet	4c63905a18	Avoid to create an array when setting the text matrix	2025-04-05 20:45:26 +02:00
Jonas Jenwald	7cfb1be650	Merge pull request #19758 from Snuffleupagus/OperatorList-setOptions Initialize the `isOffscreenCanvasSupported` option, in the `OperatorList` class, once per document	2025-04-05 18:45:55 +02:00
Calixte Denizet	41bed561f0	Simplify updateRectMinMax in order to use slightly less memory	2025-04-03 17:06:58 +02:00
Jonas Jenwald	4a6c47489e	Initialize the `isOffscreenCanvasSupported` option, in the `OperatorList` class, once per document Currently we're setting this option for each small inline image, which seems unnecessary since it should suffice to do that once per document.	2025-04-03 14:00:07 +02:00
Jonas Jenwald	e5fbf52405	Merge pull request #19736 from Snuffleupagus/compileType3Glyph-worker [api-minor] Move Type3-glyph compilation to the worker-thread	2025-04-01 19:40:30 +02:00
Jonas Jenwald	9cd5a9658a	[api-minor] Move Type3-glyph compilation to the worker-thread After PR 19731 the format of compiled Type3-glyphs is now simple enough that the compilation can be moved to the worker-thread, without introducing any significant additional complexity. This allows us to, ever so slightly, simplify the implementation in `src/display/canvas.js` since the Type3 operatorLists will now directly include standard path-rendering operators (using the format introduced in PR 19689). As part of these changes we also stop caching Type3 image masks since: we've not come across any cases where that actually helps, they're usually fairly small, and it simplifies the code. Note that one "negative" change introduced in this patch is that we'll now compile Type3-glyphs eagerly, whereas previously we'd only do that lazily upon their first use. However, this doesn't seem to impact performance in any noticeable way since the compilation is fast enough (way below 1 ms/glyph in my testing) and Type3-fonts are also limited to just 256 glyphs. Also, many (or most?) Type3-fonts don't even use image masks and are thus not affected by these changes.	2025-04-01 09:09:00 +02:00
Jonas Jenwald	213830f44f	Use, and re-name, the `addLocallyCachedImageOps` helper for global images too This avoids having to "manually" set the image operators for globally cached images.	2025-03-31 10:57:04 +02:00
Jonas Jenwald	e0e59eaf01	Define the global cache-data once in `buildPaintImageXObject` Currently we duplicate the same identical code three times, which seems both unnecessary and error prone.	2025-03-31 10:29:29 +02:00
Jonas Jenwald	8e3a3387e0	Reduce duplication when specifying the fn-operations in `buildPaintImageXObject` Currently we explicitly specify the fn-`OPS` both when adding entries to the operatorList and to the image-caches, and by using a temporary variable we can reduce a bit of duplication (similar to the existing args-handling).	2025-03-29 15:56:46 +01:00
Jonas Jenwald	b85f0903ca	Add new bounding-box helpers in `Util` to reduce code duplication Currently we have a `Util`-helper for computing the bounding-box of a Bézier curve, however for simple points and rectangles we repeat virtually identical code in many spots throughout the code-base. - Introduce new `Util.pointBoundingBox` and `Util.rectBoundingBox` helpers. - Remove the "fallback" from `Util.bezierBoundingBox` and only support passing in a `minMax`-array, since there's only a single call-site using the other format and it could be easily updated.	2025-03-23 19:20:02 +01:00
Calixte Denizet	be1f5671bb	[api-minor] Use a Path2D when doing a path operation in the canvas (bug 1946953) With this patch, all the paths components are collected in the worker until a path operation is met (i.e., stroke, fill, ...). Then in the canvas a Path2D is created and will replace the path data transfered from the worker, this way when rescaling, the Path2D can be reused. In term of performances, using Path2D is very slightly improving speed when scaling the canvas.	2025-03-22 20:35:24 +01:00
Calixte Denizet	4b4f85484e	Always use the absolute value of the line thickness (issue 19633)	2025-03-11 14:03:23 +01:00
calixteman	13474aca63	Merge pull request #19620 from calixteman/cmyk_icc [api-minor] Use an icc profile for converting CMYK to RGB	2025-03-10 16:55:48 +01:00
Jonas Jenwald	0edfd29a3e	Improve text-selection for Type3 fonts, using `d0` operators, with empty /FontBBox-entries (issue 19624) For Type3 glyphs with `d1` operators it's easy to compute a fallback bounding box, however for `d0` the situation is more difficult. Given that we nowadays compute the min/max of basic path-rendering operators on the worker-thread, we can utilize that by parsing these Type3 operatorLists to guess a more suitable fallback bounding box.	2025-03-10 16:21:54 +01:00
Calixte Denizet	7280540901	[api-minor] Use an icc profile for converting CMYK to RGB	2025-03-10 14:18:20 +01:00
Jonas Jenwald	d8d7235876	Simplify the `ColorSpaceUtils.singletons` handling (PR 19564 follow-up) With the changes in PR 19564 the actual `ColorSpace`-classes where separated from the various static "helper" methods. Hence it seems that we can now simplify/shorten this old code to instead cache the "standard" ColorSpaces directly on the `ColorSpaceUtils`-class.	2025-03-05 15:02:05 +01:00
Jonas Jenwald	fbf1f2ba15	Remove `ColorSpaceUtils.parseAsync` and simplify the ColorSpace "API-surface" This patch reduces the number of `ColorSpaceUtils` static-methods, and in particular the `parseAsync` method is removed and it's now instead possible to have `parse` optionally return a Promise. This thus removes the need to manually check if a `ColorSpace`-instance is cached, note the changes in the `src/core/evaluator.js` file.	2025-03-05 12:43:58 +01:00
Calixte Denizet	971be48b60	Support using ICC profiles in using qcms (bug 860023)	2025-03-05 10:29:59 +01:00
Jonas Jenwald	4be79748c9	Add a `GlobalColorSpaceCache` to reduce unnecessary re-parsing This complements the existing `LocalColorSpaceCache`, which is unique to each `getOperatorList`-invocation since it also caches by `Name`, which should help reduce unnecessary re-parsing especially for e.g. `ICCBased` ColorSpaces once we properly support those.	2025-03-01 14:21:05 +01:00
Jonas Jenwald	bdfa96878d	Invoke `TranslatedFont.prototype.loadType3Data` only once per font Currently we're first loading the font, and then for Type3 fonts we're invoking `loadType3Data` every time that the font is encountered. That seems completely unnecessary, and it's probably connected to the age of this code, since the `loadType3Data`-method will only run once anyway (note the caching).	2025-02-26 15:17:11 +01:00
Jonas Jenwald	d428db63c3	Improve the "FontFallback" handling on the worker-thread Remove the `Catalog.prototype.fontFallback` method, and move its code into `PDFDocument.prototype.fontFallback` instead, to reduce the indirection a little bit. Pass the `evaluatorOptions` directly to the `TranslatedFont.prototype.fallback` method, since nothing else in the `TranslatedFont`-class needs it now.	2025-02-24 09:34:58 +01:00
Jonas Jenwald	839e23f5c2	Send `disableFontFace` and `fontExtraProperties` as part of the exported font-data These options are needed in the `FontFaceObject` class, and indirectly in `FontLoader` as well, which means that we currently need to pass them around manually in the API. Given that the options are (obviously) available on the worker-thread, it's very easy to just provide them when creating `Font`-instances and then send them as part of the exported font-data. This way we're able to simplify the code (primarily on the main-thread), and note that `Font`-instances even had a `disableFontFace`-field already (but it wasn't properly initialized).	2025-02-24 09:34:48 +01:00
Jonas Jenwald	641e2f506e	[api-minor] Re-factor how the `useWorkerFetch` option is used internally With the recently added OpenJPEG no-wasm fallback we need to send the `wasmUrl` option to the worker-thread regardless of the value of the `useWorkerFetch` option, since the fallback won't work if we don't have a URL to `import` it from. For consistency the code is re-factored to always send the factory-urls to the worker-thread, and simply check the `useWorkerFetch` option there instead. Also, as a follow-up to PR 19525, introduce a new `useWasm` option that can be used in e.g. browser-tests to forcibly disable WebAssembly usage.	2025-02-22 09:56:53 +01:00
Jonas Jenwald	36979e9eb2	Fix all outstanding ESLint `arrow-body-style` warnings Currently this rule is disabled in a number of spots across the code-base, and unless absolutely necessary we probably shouldn't disable linting, so let's just update the code to fix all the outstanding cases.	2025-02-17 15:45:44 +01:00
Jonas Jenwald	88e5da1e37	Combine the main-thread message handlers for CMap-, StandardFontData-, and Wasm-files Currently we have three separate and virtually identical message handlers for this data, which can easily be combined into a single message handler instead.	2025-02-07 14:33:15 +01:00
Jonas Jenwald	db53320da8	Initialize the image-options, on the worker-thread, once per document Currently we're initializing the image-options for every page, which seems unnecessary since it should suffice to do that once per document. Also, changes the `BasePdfManager` constructor to improve readability/documentation a little bit.	2025-01-30 11:52:15 +01:00
Jonas Jenwald	6038b5a992	Handle JPX wasm fetch-response errors correctly (PR 19329 follow-up) Currently we're not checking that the response is actually OK before getting the data, which means that rather than throwing an error we can get an empty `ArrayBuffer`. To avoid duplicating code we can move an existing helper into `src/core/core_utils.js` and re-use it when fetching the JPX wasm-file as well.	2025-01-17 10:20:16 +01:00
Calixte Denizet	94b4b54ef6	[api-major] Add openjpeg.wasm to pdf.js (bug 1935076) In order to fix bug 1935076, we'll have to add a pure js fallback in case wasm is disabled or simd isn't supported. Unfortunately, this fallback will take some space. So, the main goal of this patch is to reduce the overall size (by ~93k). As a side effect, it should make easier to use an other wasm file (which must export _jp2_decode, _malloc and _free).	2025-01-16 21:09:50 +01:00
Jonas Jenwald	74c1795c9f	Use `Dict` iteration more (PR 19051 follow-up) There's a few cases where we're looping through the result of `Dict.prototype.getKeys` and then manually look-up the values, which after PR 19051 can be replaced with direct iteration instead.	2025-01-02 15:09:19 +01:00
Jonas Jenwald	20d5332009	For images that include SMask/Mask entries, ignore an SMask defined in the current graphics state From section [11.6.4.3 Mask Shape and Opacity](https://opensource.adobe.com/dc-acrobat-sdk-docs/pdfstandards/PDF32000_2008.pdf#G10.4848628) in the PDF specification: - An image XObject may contain its own soft-mask image in the form of a subsidiary image XObject in the `SMask` entry of the image dictionary (see "Image Dictionaries"). This mask, if present, shall override any explicit or colour key mask specified by the image dictionary's `Mask` entry. Either form of mask in the image dictionary shall override the current soft mask in the graphics state.	2024-12-30 14:25:07 +01:00

1 2 3 4 5 ...