Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/py-pdf/pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
https://github.com/py-pdf/pypdf

ENH: Add link annotation (#1189)

* Add AnnotationBuilder.link(...)
* Allow creating a RectangleObject from a RectangleObject. Th...

ab01f14f9bdebecd4efe552093605d5bb81f42c5 authored over 2 years ago
MAINT: Consistant usage of warnings / log messages (#1164)

2a5a199757f97b0f85a08055414bf56871f23140 authored over 2 years ago
STY: Apply pre-commit (#1188)

89033cb37aec3a520da01b95da0a8fdd8dcf38fb authored over 2 years ago
ENH: Add capability to filter text extraction by orientation (#1175)

Closes #1071

8a27fa4eea0c072cd7c8718a4c04869223c31ef6 authored over 2 years ago
MAINT: Consistent terminology for outline items (#1156)

This PR makes sure PyPDF2 uses a consistent nomenclature for the outline:

* **Outline**: A do...

8c532a0ff13395b706d0ae1f183dd24bab577bfc authored over 2 years ago
DOC: Update changelog url in package metadata (#1180)

2d480685a72d665826dbd53f973173b34cf4c872 authored over 2 years ago
DOC: Table extraction (#1179)

85ca871b007c23e0b5dcb8ab5915b63b1d9ac7e7 authored over 2 years ago
DOC: Mention pyHanko for signing PDF documents (#1178)

8d5037c590fbab28d9980962070d28a94dfd9be5 authored over 2 years ago
DOC: We now have CMAP support (#1177)

7b852acb3350033a9f76fbc61f6e0d27f561b444 authored over 2 years ago
BUG: Named Dest in PDF1.1 (#1174)

Named destinations are stored in a dictionary in PDF 1.1

Closes #1173

9c8252d5bc876c0048b6dfe3b531bc8fa6cfd81e authored over 2 years ago
BUG: Incomplete Graphic State save/restore (#1172)

Graphic state shall store also the font, font size, ...

See #1142

d8bd12f3e1d6b5a5b0a488413dbe8ec598b84355 authored over 2 years ago
REL: 2.8.1

Bug Fixes (BUG):
- u_hash in AlgV4.compute_key (#1170)

Robustness (ROB):
- Fix loading of fil...

5b75160144a45eb75441158046edc3c5805b0749 authored over 2 years ago
BUG: u_hash in AlgV4.compute_key (#1170)

Closes #1088

3b73b34b1014a1ccf2c8ab21153b41071fa52ef0 authored over 2 years ago
ROB: Fix loading of file from #134 (#1167)

See #134

a) cmap : strip lines when processing cmap from fonts
b) look for %EOF up to beginn...

844f2380d68ef047c2f9403699a933875633af11 authored over 2 years ago
TST: Add test from #325 (#1169)

Closes #325

ebcf88940e876a4661e7cedcde247d908a4dcd5e authored over 2 years ago
ROB: Cope with empty DecodeParams (#1165)

See #1143, 2nd part

0b2728737809fa6fe253b2a4505e86cd093d7006 authored over 2 years ago
DOC: Typo in warning message (#1166)

ab7a9ada067d88f03b226072c6fce6b55f89a6b5 authored over 2 years ago
MAINT: Package updates; solve mypy strict remarks (#1163)

db3439b3a603bd370c97994df24d8ecf8711faf6 authored over 2 years ago
REL: 2.8.0

New Features (ENH):
- Add writer.add_annotation, page.annotations, and generic.AnnotationBuilde...

ec30171a9da60755763ed8b2c24c96298f9ee902 authored over 2 years ago
MAINT: Break up parse_to_unicode (#1162)

Just move parts in separate functions for easier readability

2de09730c13e3e380f0038d241e5ee81b0509a40 authored over 2 years ago
BUG: Set /AS for /Btn form fields in writer (#1161)

Closes #434

Co-authored-by: liuzhuoling <[email protected]>

fa5e3f76da2048b50c9d1dd94d7a938a11ac53e8 authored over 2 years ago
ROB: Cope with utf16 character for space calculation (#1155)

See #1143

Co-authored-by: Martin Thoma <[email protected]>

35bec4034e503cac97c23de9f923154785d48767 authored over 2 years ago
BUG: Ignore if '/Perms' verify failed (#1157)

It seems to be save to ignore the /Perms entry:

Qpdf ignores it:
https://github.com/qpdf/qpd...

2bf40f4a70a35434086eab2054d11425380b919c authored over 2 years ago
MAINT: Add diagnostic output to exception in read_from_stream (#1159)

Co-authored-by: speedplane <[email protected]>

102260d8d5d21559371f7154ec647db5ce659dc2 authored over 2 years ago
TST: Add workflow tests found by arc testing (#1154)

Done with https://github.com/py-pdf/pdf-crawler/blob/main/get_coverage_by_pdf.py

c52988489fa5d1b83e327bbeba02a7eca2e211bb authored over 2 years ago
DEV: Introduce _utils.logger_warning (#1148)

- Exceptions: User code should handle the issue
- warnings.warn: User should re-write something...

b429b395316021ff97ab41b9626287adb221f6fe authored over 2 years ago
ROB: Cope with null params for FitH /FitV destination (#1152)

iaw PDF specifications, page 583

Closes #1145

27702c2e098fcf62b37d34ee52cfeeb6c3cc4f12 authored over 2 years ago
MAINT: Reduce PdfReader.read complexity (#1151)

a6d27d754776fe1501ccffed26f0e5becf3d2faa authored over 2 years ago
ROB: Handle outlines without valid destination (#1076)

Adjust `PdfReader._build_outline(...)` and `PdfReader._build_destination(...)` to handle outline...

89c0ff2e95f76960ffa7958e956270d41d3fea79 authored over 2 years ago
TST: Decrypt file which is not encrypted (#1149)

f233c1ad5adebe405e1184afa73442c92491aa8f authored over 2 years ago
TST: Test CryptRC4 encryption class; test image extraction filters (#1147)

aaac604bce06519906f834b4e5b0d5edd6ae4924 authored over 2 years ago
ENH: Add writer.add_annotation, page.annotations, and generic.AnnotationBuilder (#1120)

* Add `page.annotations` (getter and setter)
* Add `writer.add_annotation(page_number, annotati...

1a65a4663cdd05e09d005a425ba674b0238fe0a0 authored over 2 years ago
DOC: Recognize KourFrost as a contributor

91357f047b697345cba9eb736b7e22862d9bcdaa authored over 2 years ago
REL: 2.7.0

New Features (ENH):
- Add `outline_count` property (#1129)

Bug Fixes (BUG):
- Make reader.get...

6899c7448ee6d3546b4e3afa60754bd595556ead authored over 2 years ago
STY: Variable naming / opening PDF with PdfReader (#1144)

0f520528b881688e7324ee5aab3c379dac678e1f authored over 2 years ago
BUG: Make reader.get_fields also return dropdowns with options (#1114)

Added /Opt to the checked field_attributes within reader.get_fields

Closes #391

7cba98a57789c4058898b47875d2dda0a48d6bb5 authored over 2 years ago
BUG: Add deprecated EncodedStreamObject functions back until PyPDF2==3.0.0 (#1139)

Accidentally, PyPDF2 did not follow the deprecation process:
https://pypdf2.readthedocs.io/en/l...

e1f9772693b788deae6b0fcdcb5ff49577706549 authored over 2 years ago
DEV: Add .git-blame-ignore-revs (#1141)

See https://docs.github.com/en/repositories/working-with-files/using-files/viewing-a-file#ignore...

fa96d66f6a82321ed13f2410754309f4c4c1db1c authored over 2 years ago
DOC: Recognize Lightup1 as a contributor

c667ae4355507cccde493f000238bb1724159f15 authored over 2 years ago
ROB: Cope with missing /W entry (#1136)

Closes #1134

fd00f205f0ba34290cc61e2594e55c21f4c99c23 authored over 2 years ago
ROB: Cope with invalid parent xref (#1133)

Rebuild the xref table if the parent chained xref is invalid

Closes #1089

2abae354f4ce8e1cf44f90eba8a89da5f275dd03 authored over 2 years ago
STY: Fixing typos (#1137)

There were typos in:
- docs/meta/project-governance.md
- tests/test_reader.py
- tests/test_wr...

d41201b9f76fd93484f259e359877d9b87e1d201 authored over 2 years ago
DOC: Contributors file (#1132)

We value the work of our contributors - of all of them. The CONTRIBUTORS file might give them mo...

c63a0ff24965bdbe9339ca5d837b5460f93c3c13 authored over 2 years ago
DOC: Fix type in signature of PdfWriter.add_uri (#1131)

f2983e142f504d1fde7874af78975431e287043b authored over 2 years ago
STY: Re-use code via get_outlines_property in tests (#1130)

df95aae5215c7dcf7bfb14504b153427bbf8f44a authored over 2 years ago
ENH: Add `outline_count` property (#1129)

Enables retrieval of "/Count" attribute of outline item in PdfReader.outlines by implementing pr...

25cba33f88c6708ebc50169808f02b80e96fb0ab authored over 2 years ago
REL: 2.6.0

New Features (ENH):
- Add color and font_format to PdfReader.outlines[i] (#1104)
- Extract Tex...

33634d40ffce9351f96fb35f491c2b3fe98b2406 authored over 2 years ago
TST: Remove xfail from test_outline_title_issue_1121

1800514a7e066c3a042b7d5ed93960b34c7fac2f authored over 2 years ago
BUG: Use `build_destination` for named destination outlines (#1128)

Closes #1121

7fba86b65e25809367ff169e779dbccb517e1b25 authored over 2 years ago
MAINT: Text extraction improvements (#1126)

Credits to pubpub-zz, see
https://github.com/py-pdf/PyPDF2/pull/1118#issuecomment-1186148575

...

e24b0a046635995c08c91ccf9d6900560d7fb390 authored over 2 years ago
TST: Add test for arab text (#1127)

0b693e1122d568f29f266340121915b3813eb8c2 authored over 2 years ago
BUG: Avoid a crash when a ToUnicode CMap has an empty dstString in beginbfchar (#1118)

This is not a principled fix, but it is a hack to avoid a crash when
encountering an empty dstS...

ae0ff49058e6c57a8edcfcd3d956665ddaa8a787 authored over 2 years ago
STY: Apply black and isort

baeb7d23278de0f8d00ca9f2b656bf0674f08937 authored over 2 years ago
TST: Add xfail for decryption fail (#1125)

See #1088

cd87bbb4083347dc64aafa2571f5ebbe61f445f0 authored over 2 years ago
DOC: Explanation for git submodule

8a010a5c899be2361ecd7dba29d2438425819ed4 authored over 2 years ago
TST: Add xfail test for IndexError when extracting text (#1124)

See #1091

b1d4ea1fb4364336f84f1f3add19163aab2084a6 authored over 2 years ago
TST: Add MCVE showing outline title issue (#1123)

See #1121

5ddf4cb32505cb034496ac4be13747a61fb6ce46 authored over 2 years ago
MAINT: Destination.color returns ArrayObject instead of tuple as fallback (#1119)

ed5ecd9d55cd669045fe47eadef4d049c7959b7d authored over 2 years ago
BUG: Prevent deduplication of PageObject (#1105)

Make sure that PageObject is not deduplicated if it is not exactly same page object.
Adobe Read...

dd2d69a8d89a1370753f1418b3e0df9a7908d928 authored over 2 years ago
BUG: None-check in DictionaryObject.read_from_stream (#1113)

Guard pdf.strict with check if pdf is None in DictionaryObject.read_from_stream

Closes #1107

9bbe827ab534cdbbb6e2687e0a41dda4b269d387 authored over 2 years ago
BUG: Avoid IndexError in _cmap.parse_to_unicode (#1110)

The code within the if block assumes that `lst` has index 0 and index 1.

Fixes #1091
Related...

bb2d1dbf20dbe6a77d60be46cbd8646fde6b418c authored over 2 years ago
ENH: Add color and font_format to PdfReader.outlines[i] (#1104)

5e1cc57677c2ee9b80d9ed27a41321d3cae2d7c3 authored over 2 years ago
ENH: Extract Text Enhancement (whitespaces) (#1084)

* ENH : extract width from CIDFontType0/2
* ENH : improve cr/lf and space extraction
* BUG : ...

682eff93a1250403ed08c058e65d8576772ca858 authored over 2 years ago
STY: Use IntFlag for permissions_flag / update_page_form_field_values (#1094)

af5a0c3394a33eae154c63f5ccaa403ca54dbf1b authored over 2 years ago
STY: Simplify code (#1101)

d376d0e71939decbe21de8e93d016f09b3ce2210 authored over 2 years ago
MAINT: Use add_bookmark_destination in add_bookmark (#1100)

Reduce code duplication

See #1098

c420beb32c89f17822fb0e25d23db3f25ebd9af9 authored over 2 years ago
MAINT: Use add_bookmark_destination in add_bookmark_dict (#1099)

Re-use code

See #1098

d7b64dc817a948b275f221d326746ee07130e2de authored over 2 years ago
DOC: Watermark and stamp (#1095)

See #307

b2279eeb40fa0557e6e61e3bada0de0373e6da28 authored over 2 years ago
REL: 2.5.0

New Features (ENH):
- Add PageObject._get_fonts (#1083)
- Add support for indexed color spaces...

8f47939c5d056970153ffcf428412c52727645f1 authored over 2 years ago
STY: Apply black

46ba4aeae2009ef94f2bf89047af01d28fc0807e authored over 2 years ago
MAINT: Set page.rotate(angle: int) (#1092)

Closes #1021

Co-authored-by: probel_hero <[email protected]>

c5c0b5547cc607e6c1a340873dfbc987ed0e752b authored over 2 years ago
ENH: Add PageObject._get_fonts (#1083)

Add possibility to get names of fonts

See #153

Co-authored-by: tiarno <[email protected]>

e51141d7ed735703bb07f5ffa7e5d2f4d9a79347 authored over 2 years ago
PI: Use iterative DFS in PdfWriter._sweep_indirect_references (#1072)

* Recursive Depth-first search (DFS) was changed to iterative DFS
* Removed PdfWriter.external_...

1e4c2c9b4649449241b0ae166e7e90f6bc61596d authored over 2 years ago
DOC: Stamps and watermarks (#1082)

Closes #307
Closes #410

b42e0dbdf400a0c96391c2b391708ac8eb83a311 authored over 2 years ago
DOC: OCR vs PDF text extraction (#1081)

Closes #1073

9794ef65b7ed84916c8279909026c5d970687275 authored over 2 years ago
BUG: Let Page.scale also scale the crop-/trim-/bleed-/artbox (#1066)

Closes #272

Co-authored-by: Brian Painter <[email protected]>

8edaca8f1d747ccdb2abe770da55270fd31c6295 authored over 2 years ago
ROB: Guard against None-value in _get_outlines (#1060)

See #1059

439c7499e82b268a469d8971b27ccc4ff3dbb7ef authored over 2 years ago
TST: Image extraction (#1080)

3a8199cdf9e51407e6583aa36ad7ec417d04f87d authored over 2 years ago
ENH: Add support for indexed color spaces / BitsPerComponent for decoding PNGs (#1067)

Closes #535
Closes #536

Co-authored-by: Christopher Egner <[email protected]>

f776f54c673674c31e81b8c4dba88f60e51c1006 authored over 2 years ago
BUG: Column default for CCITTFaxDecode (#1079)

f4f74c7c9415dc96333cf6fd712868edf7fcff33 authored over 2 years ago
MAINT: Issue #416 was fixed by #1015 (#1078)

9b048a266894d0ab0a549915c519220007b1d1e3 authored over 2 years ago
TST: Image extraction (#1077)

67d962d54e34453a0fb33df5a1939e68cf850130 authored over 2 years ago
STY: Typo in Changelog

67d6e098ef2d802d3fd9d265992d1dc80f582d34 authored over 2 years ago
DOC: Python Version support

d63ab2c3f2aff08936afd7702028adbf6257d384 authored over 2 years ago
DOC: Formatting of CHANGELOG

810691a13d43264c2b56f1ecac7b49bae613a8bd authored over 2 years ago
DEV: Cache downloaded files (#1070)

Also: Fix pycryptodome uninstall in CI

ff5fd88fec3a350d513f85f1ddf50c9b80e00d29 authored over 2 years ago
DEV: Speed-up for CI (#1069)

- pytest: Set pytest testpaths and norecursedirs
- dependencies: Cache pip packages (pycryptodome)

a1aaf91e6d8cde09e5efbbd17e82937be180001f authored over 2 years ago
REL: 2.4.2

New Features (ENH):
- Add PdfReader.xfa attribute (#1026)

Bug Fixes (BUG):
- Wrong page inser...

a34569089e6699b2461d71143dbb2a0a0ff1796b authored over 2 years ago
BUG: Wrong page inserted when PdfMerger.merge is done (#1063)

Caused-by: #207

Why it wasn't detected by the tests: We don't have any tests that check
for ...

621a51f9552731928e73689ae17161214f9cddcf authored over 2 years ago
STY: DOC of compress_content_streams (#1061)

1e9c4ddec01f6d5b658a687437d0caf0b09db768 authored over 2 years ago
MAINT: Remove PyPDF2._utils.bytes_type (#1053)

ffacabc7a01a975d03a444fcfa03154c5b131164 authored over 2 years ago
BUG: Resolve IndirectObject when it refers to a free entry (#1054)

From the PDF 1.7 docs https://opensource.adobe.com/dc-acrobat-sdk-docs/standards/pdfstandards/pd...

02c601c86819578d9796479a1b8953accefea92b authored over 2 years ago
TST: Scale page (indirect rect object) (#1057)

70605ae240cf1a4bb6578fcf75e95d0682b5fb92 authored over 2 years ago
TST: Simplify pathlib PdfReader test (#1056)

The Path constructor allows a variable amount of arguments to it which it joins together similar...

4a62a47a984a78449347f52512c017ca0f306f09 authored over 2 years ago
DEV: Added {posargs} to tox.ini (#1055)

This allows adding options from command line.

For example, run only one test in py39 env usi...

32ab2a3423739b3327a2781782b0134b1c0aaa9f authored over 2 years ago
TST: IndexError of VirtualList (#1052)

04d576ceece98eb5dddddc3eeb0dad9fb6479f8a authored over 2 years ago
TST: Invalid XML in xmp information (#1051)

See #1030

4097db98a7abd21308b40445c22e8e97bc3b5cda authored over 2 years ago
TST: No pycryptodome (#1050)

Co-authored-by: Matthew Peveler <[email protected]>

5d213ea31e9ee3b1b30d63d40411f38056ec12f6 authored over 2 years ago
STY: Minimize diff for #879 (#1049)

3531603c52f84fdc5511bb7535be44c9bb9972e8 authored over 2 years ago
ENH: Add PdfReader.xfa attribute (#1026)

Closes #408

Co-authored-by: George Alverson <[email protected]>

0e18938fbb7ee56a57b20352e06df719683d7b42 authored over 2 years ago