pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	ee34c5c648	Let `Lexer.prototype.getNumber` treat more cases of a single minus sign as zero (bug 1953099) This patch extends the approach of PR 14543, by also treating e.g. minus signs followed by '(' or '<' as zero. Inside of a /Contents stream those characters will generally mean the start of one or more glyphs.	2025-03-12 17:50:13 +01:00
Calixte Denizet	4b4f85484e	Always use the absolute value of the line thickness (issue 19633)	2025-03-11 14:03:23 +01:00
Jonas Jenwald	0edfd29a3e	Improve text-selection for Type3 fonts, using `d0` operators, with empty /FontBBox-entries (issue 19624) For Type3 glyphs with `d1` operators it's easy to compute a fallback bounding box, however for `d0` the situation is more difficult. Given that we nowadays compute the min/max of basic path-rendering operators on the worker-thread, we can utilize that by parsing these Type3 operatorLists to guess a more suitable fallback bounding box.	2025-03-10 16:21:54 +01:00
Jonas Jenwald	10a99ea0a7	Let SMask/Mask images fallback to the parent image dimensions (issue 19611) One of the images have a corrupt SMask, where the /Height-entry is bogus; see the excerpt below (via https://brendandahl.github.io/pdf.js.utils/browser/). ``` SMask (stream) [id: 17, gen: 0] ColorSpace = /DeviceGray Height = /Length Subtype = /Image Filter = /FlateDecode Type = /XObject Width = 157 Matte (array) BitsPerComponent = 8 Length = 3893 <view contents> download ``` Hence we enable SMask/Mask images to fallback to the parent image dimensions, and also add more validation of the width/height to get a better error message when that data is wrong.	2025-03-10 12:37:44 +01:00
Calixte Denizet	971be48b60	Support using ICC profiles in using qcms (bug 860023)	2025-03-05 10:29:59 +01:00
Jonas Jenwald	59cb9a064e	Extend `getGlyphMapForStandardFonts` with some Cyrillic entries (issue 19550)	2025-02-26 10:16:06 +01:00
Ross Johnson	4f25d7f6cd	Fix decryption of R=4, V=4 files with < 16-byte keys by 0-padding - undocumented but matches Acrobat behavior (issue #19484 )	2025-02-24 15:36:37 -06:00
Jonas Jenwald	641e2f506e	[api-minor] Re-factor how the `useWorkerFetch` option is used internally With the recently added OpenJPEG no-wasm fallback we need to send the `wasmUrl` option to the worker-thread regardless of the value of the `useWorkerFetch` option, since the fallback won't work if we don't have a URL to `import` it from. For consistency the code is re-factored to always send the factory-urls to the worker-thread, and simply check the `useWorkerFetch` option there instead. Also, as a follow-up to PR 19525, introduce a new `useWasm` option that can be used in e.g. browser-tests to forcibly disable WebAssembly usage.	2025-02-22 09:56:53 +01:00
Jonas Jenwald	6d3bb47655	Merge pull request #19525 from calixteman/bug1935076_part2 Provide a js fallback when the wasm version of openjpeg is failing to load (bug 1935076)	2025-02-22 09:34:40 +01:00
Xiphoseer	24aa39eb14	Consider textRise when showing type3 font glyphs Add test case for issue 19532	2025-02-21 21:31:04 +01:00
Calixte Denizet	36e4f5c222	Provide a js fallback when the wasm version of openjpeg is failing to load (bug 1935076)	2025-02-21 19:03:47 +01:00
Jonas Jenwald	db7cf40a30	Don't cache free/missing XRef entries (issue 19510) During the XRef stream parsing we're attempting to lookup an entry that hasn't yet been found, since parsing is currently running, and given that we'd also cache free/missing XRef entries we'd then return an incorrect value during normal PDF parsing. The simplest solution here is to just not cache free/missing XRef entries, since a properly generated PDF document shouldn't be trying to access objects it doesn't contain. Furthermore, the amount of "extra" parsing now needed for such XRef entries shouldn't be significant enough to be an issue.	2025-02-18 18:04:00 +01:00
Jonas Jenwald	65df1d336f	Check more of the stream when looking for commands after inline image (issue 19494) Currently we only check `followingBytes`, which turns out to be too short to find e.g. valid transform (cm) commands with decimal arguments.	2025-02-15 15:14:47 +01:00
Jonas Jenwald	bd05b255fa	[api-major] Apply the `userUnit` using CSS, to fix the text/annotation layers (bug 1947248) Rather than modifying the "raw" dimensions of the page, we'll instead apply the `userUnit` as an additional scale-factor via CSS. Please note: It's not clear to me if this solution is fully correct either, or if there's other problems with it, but it at least appears to work. --- With these changes, the following CSS variables are now assumed to be available/set as necessary: `--total-scale-factor`, `--scale-factor`, `--user-unit`, `--scale-round-x`, and `--scale-round-y`.	2025-02-11 14:36:06 +01:00
Calixte Denizet	24417a1a0b	[Editor] Add the ability to print and save some newly added signatures (bug 1946795)	2025-02-07 23:07:27 +01:00
Jonas Jenwald	6f2706fad6	Support the password field-flag in TextWidgetAnnotation (issue 19389)	2025-01-29 12:40:09 +01:00
Calixte Denizet	1ccf6ed976	Correctly render the glyph outline when it has a stroke pattern It fixes #19360. Each glyph in the test case has a fill and a stroke pattern, so the current transform used to scale the glyph outline must be the same. In setting the stroke color to green, I noticed that the last outline contains some non-closed subpaths, so when generating the glyph outline, every time we 'moveTo', we close the previous subpath.	2025-01-21 15:30:16 +01:00
Jonas Jenwald	c4ba3ac23f	Replace the EXIF-block with dummy data to prevent JPEG images being rotated (bug 1942064) The `ImageDecoder` will respect the EXIF orientation, which can lead to JPEG images being incorrectly rotated. To avoid this we replace the entire EXIF-block with dummy data, which works since it'll cause EXIF parsing to bail out early in Firefox; see https://searchfox.org/mozilla-central/rev/9a66d18cb35595c89f499a1011c9dd7e573fce77/image/decoders/EXIF.cpp#130-138	2025-01-20 16:50:22 +01:00
Calixte Denizet	94b4b54ef6	[api-major] Add openjpeg.wasm to pdf.js (bug 1935076) In order to fix bug 1935076, we'll have to add a pure js fallback in case wasm is disabled or simd isn't supported. Unfortunately, this fallback will take some space. So, the main goal of this patch is to reduce the overall size (by ~93k). As a side effect, it should make easier to use an other wasm file (which must export _jp2_decode, _malloc and _free).	2025-01-16 21:09:50 +01:00
Jonas Jenwald	e5bc760316	Access the number of components correctly in JPEG 2000 images with color space entries (issue 19326) This small typo appears to be a regression from PR 18204.	2025-01-15 10:16:06 +01:00
Jonas Jenwald	5e569cade5	Improve performance when reading very large TrueType "cmap" tables (issue 19319) In the affected font the total number of mapping-entries is `1142348`, and no less than `997473` of them are duplicates. Given that every duplicate causes a lot of Array elements to be moved this becomes extremely inefficient, which we can avoid by keeping track of seen `charCode`s and directly build the final mappings-Array instead.	2025-01-13 13:09:47 +01:00
Jonas Jenwald	916fff0e42	Access the bbox/background data correctly in the `MeshShadingPattern` class (issue 18816) This appears to have regressed in PR 13808, since it removed the `matrix`-entry from array returned by the `MeshShading.prototype.getIR` method without also updating the indexes in the `MeshShadingPattern` constructor.	2025-01-08 15:57:56 +01:00
Jonas Jenwald	6f062abb76	Skip LinkAnnotations when collecting field objects (issue 19281) The `/Root/AcroForm/Fields` array contains a "ridiculous" number of LinkAnnotations, which obviously makes no sense since those are not form fields. To improve performance we'll thus ignore those when collecting the field objects.	2025-01-04 11:54:45 +01:00
Jonas Jenwald	20d5332009	For images that include SMask/Mask entries, ignore an SMask defined in the current graphics state From section [11.6.4.3 Mask Shape and Opacity](https://opensource.adobe.com/dc-acrobat-sdk-docs/pdfstandards/PDF32000_2008.pdf#G10.4848628) in the PDF specification: - An image XObject may contain its own soft-mask image in the form of a subsidiary image XObject in the `SMask` entry of the image dictionary (see "Image Dictionaries"). This mask, if present, shall override any explicit or colour key mask specified by the image dictionary's `Mask` entry. Either form of mask in the image dictionary shall override the current soft mask in the graphics state.	2024-12-30 14:25:07 +01:00
Jonas Jenwald	189183aa1a	Add basic support for non-embedded HelveticaLTStd-Bold fonts (issue 19234)	2024-12-18 09:39:22 +01:00
Jonas Jenwald	5dc2d257ad	Merge pull request #19196 from Snuffleupagus/issue-19176 Take the `userUnit` into account in the `PageViewport` class (issue 19176)	2024-12-09 13:38:31 +01:00
Calixte Denizet	f6662d3f7c	Add a ref test for setting disableFontFace to true	2024-12-08 16:06:25 +01:00
Jonas Jenwald	c6e3fc4fe6	Take the `userUnit` into account in the `PageViewport` class (issue 19176)	2024-12-08 15:51:04 +01:00
Calixte Denizet	6fe6b6d6b7	Get the first codepoint instead of the first char when using the toUnicode map It fixes #19182.	2024-12-06 18:25:13 +01:00
Calixte Denizet	7e02c77250	[Editor] Make ink annotation editable	2024-12-02 17:15:33 +01:00
Jonas Jenwald	867aaf01fa	Merge pull request #19117 from Snuffleupagus/bot-forceNoChrome Disable the browser-tests in Google Chrome on the bots	2024-12-01 18:28:03 +01:00
Calixte Denizet	cee65fcd4e	[Editor] Add a new base class to allow to add a drawing in the SVG layer. This patch makes a clear separation between the way to draw and the editing stuff. It adds a class DrawEditor which should be extended in order to create new drawing tools. As an example, the ink tool has been rewritten in order to use it.	2024-11-28 15:23:03 +01:00
Jonas Jenwald	fd31e728f7	Disable the browser-tests in Google Chrome on the bots Given that `browsertest` repeatedly timeout in Google Chrome, and considering that Firefox is the primary development target, we stop running them on the bots to avoid having to repeatedly deal with this. Note that we already disabled these tests on Windows almost three years ago, because of stability issues; see PR 14392.	2024-11-27 17:59:51 +01:00
Calixte Denizet	1ef670411a	Rescale the image data when they're really too large It fixes #17190.	2024-11-23 22:42:30 +01:00
Calixte Denizet	b0b0de98e7	Use the V entry as an option when no options in a choice widget It fixes #19083. It isn't really a fix but more a workaround (we should correctly implement the choice widget as a mix of text input+select).	2024-11-21 17:27:34 +01:00
calixteman	7a962031e9	Merge pull request #19024 from calixteman/disable_test_chrome Disable ref test 'issue18896' for Chrome because it takes too much time	2024-11-11 19:11:30 +01:00
Calixte Denizet	92b7374aad	Disable ref test 'issue18896' for Chrome because it takes too much time	2024-11-11 18:55:56 +01:00
Calixte Denizet	79e1f155ac	Apply gradient when stroking text It fixes #19022. I noticed that the glyph contours weren't correct (for T and x) and because we forgot to close the contour.	2024-11-11 15:53:07 +01:00
Jonas Jenwald	e92a929a58	Try to improve handling of missing trailer dictionaries in `XRef.indexObjects` (issue 18986) The problem with the referenced PDF document has nothing to do with invalid dates, as the issue seems to suggest, but rather with the fact that it has neither an XRef table nor a trailer dictionary. Given that crucial parts of the internal document structure is missing, you might argue that it's not really a PDF document. In an attempt to support this kind of corruption, we'll simply iterate through all (previously found) XRef entries and pick one that might be a valid /Root dictionary. There's obviously no guarantee that this works, and it might not be fast in larger PDF documents, but at least it cannot be any worse than immediately throwing `InvalidPDFException` as we previously did here. Please note: I'm totally fine with this patch being rejected, since it's somewhat questionable if we should actually attempt to support "PDF documents" with this level of corruption.	2024-11-05 18:19:26 +01:00
Jonas Jenwald	48a18585f2	Allow `StreamsSequenceStream` to skip sub-streams that are not actual Streams (issue 18973) This extends PR 13796 to also handle the case where sub-streams contain invalid data, i.e. anything that isn't a Stream, however please note that in these cases there's no guarantee that we'll render the page "correctly". Note that Adobe Reader, i.e. the PDF reference implementation, cannot render the last page of the referenced PDF document.	2024-10-29 09:36:08 +01:00
Calixte Denizet	d114f71feb	Always fill the mask with the backdrop color It fixes #18956. In the patch #18029, for performance reasons and because I thought it was useless, I deliberately chose to not fill the mask with the backdrop color when it's full black: it was a bad idea. So in this patch we always add the backdrop color to the mask.	2024-10-26 14:14:51 +02:00
Jonas Jenwald	63b34114b1	Fallback to a standard font if a font-file entry doesn't contain a Stream (issue 18941) The PDF document is clearly corrupt, since it has /FontFile2 entries that are Dictionaries which obviously isn't correct. While there's obviously no guarantee that things will look perfect this way, actually rendering the text at all should be an improvement in general.	2024-10-22 11:51:28 +02:00
Jonas Jenwald	689ffda9df	Merge pull request #18902 from Snuffleupagus/pdkids-rm-linked-test Add the `pdkids` PDF document to the repository	2024-10-15 22:15:09 +02:00
Jonas Jenwald	424f81c4db	Merge pull request #18825 from agrahn/rbgroups implementing optional content radiobutton groups	2024-10-15 13:11:19 +02:00
Alexander Grahn	441efe456e	Optional Content (OC) radiobutton (RB) groups implemented. Resolves #18823 . The code parses the /RBGroups entry in the OC configuration dict and adds the property `rbGroups' to instances of the OptionalContentGroup class. rbGroups takes an array of Sets, where each Set instance represents an RB group the OptionalContentGroup instance is a member of. Such a Set instance contains all OCG ids within the corresponding RB group. RB groups an OCG is associated with are processed when its visibility is set to true, as required by the PDF spec.	2024-10-15 11:34:45 +02:00
Jonas Jenwald	fb3c7b6d8f	Add the `pdkids` PDF document to the repository Given that the sub-title of that document is "Public domain texts for young people." and that the images have clear sources at the end of the document, it should (hopefully) be OK to add it to the repository rather than relying on a linked test-case.	2024-10-15 10:55:17 +02:00
Calixte Denizet	8b7b39f5d6	Some jpx images can have a mask It fixes #18896.	2024-10-14 21:50:32 +02:00
Calixte Denizet	e7ab8cd8c1	Fallback on gray colorspace when there are no colorspace and no name in the scn/SCN arguments It fixes #18894.	2024-10-13 16:02:07 +02:00
Calixte Denizet	3194f3de8b	Keep the empty lines in the text fields It fixes #18036.	2024-10-05 16:19:41 +02:00
Calixte Denizet	3103deaa44	Fix missing annotation parent in using the one from the Fields entry Fixes #15096.	2024-10-04 20:00:19 +02:00

1 2 3 4 5 ...

1402 Commits