Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/py-pdf/pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
https://github.com/py-pdf/pypdf

ROB: Allow more whitespaces within linearized file (#1701)

Closes #1689

dd83660510303cad9ce659bb3e50b7a7860dc992 authored almost 2 years ago by pubpub-zz <[email protected]>
BUG: Pass logging parameters correctly in set_need_appearances_writer (#1697)

Without this, the logging module logs another error:

TypeError: not all arguments convert...

7f1a3f81db9a4e709be59e8a81cce39a98a0c943 authored almost 2 years ago by Roland Szabo <[email protected]>
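
For readers unfamiliar with this class of error, here is a minimal sketch using only the standard library. It is not the pypdf code, and the logger name and messages are hypothetical. The `logging` module formats lazily with `msg % args`, so an extra positional argument with no matching placeholder raises a `TypeError` at format time.

```python
import logging


def format_record(msg: str, *args: object) -> str:
    """Format a message the same way logging does: msg % args."""
    record = logging.LogRecord(
        name="example",          # hypothetical logger name
        level=logging.ERROR,
        pathname="example.py",   # placeholder source location
        lineno=0,
        msg=msg,
        args=args,
        exc_info=None,
    )
    return record.getMessage()


# Wrong: one argument is passed, but the message has no placeholder for it.
error = None
try:
    format_record("operation failed", "details")
except TypeError as exc:
    error = str(exc)

# Right: the placeholder count matches the argument count.
ok = format_record("operation failed: %s", "details")
```

When the same mistake goes through a real logger call, the `logging` module reports the failure on stderr instead of raising, which matches the "logs another error" symptom described above.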
BUG: Write /Root/AcroForm in set_need_appearances_writer (#1639)

PdfWriter.set_need_appearances_writer() (whether called directly or indirectly by PdfWriter.upda...

8b0f0911e4ca454728b672cdf7239d9a56923abd authored almost 2 years ago by Chris Braun <[email protected]>
REL: Version 3.5.1, 2023-03-05

Robustness (ROB)
- Some attributes not copied in DictionaryObject._clone (#1635)
- Allow mergi...

081751c468dc2d59854621bcf0b2c189cc3a7700 authored almost 2 years ago by Martin Thoma <[email protected]>
ROB: Some attributes not copied in DictionaryObject._clone (#1635)

* "/N" attributes wrongly ignored during copy process
* The object referenced via `src[field]` ...

39f52dceaef6b980d1e0e36214e373a97fb1fd15 authored almost 2 years ago by pubpub-zz <[email protected]>
DOC: Fix misuse of the word 'operand' in the docs (#1684)

0f939832da4dd4d1e588679eea3895c0c49774a2 authored almost 2 years ago by Joshua Gerstein <[email protected]>
STY: Fix spelling / names (#1682)

f20b3631f757057930cf8de39cd0ee21d5bf601f authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Enable Ruff isort rules (#1681)

ce1afcdbc22180b96a950c17d06d1d5c4d23b086 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Upgrade ruff, remove isort, naming conventions (#1680)

2882abfee2ac0d800025d1ad7237b3d7604ad042 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Enable flake8-simplify rules (#1679)

47bf835c3a3d27cb2db5f55b61cbb1e4e8b7aab6 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Ruff rules (#1678)

ff2f2f337ea88313562f4bc27796849056e4edb7 authored almost 2 years ago by Martin Thoma <[email protected]>
TST: Replace pytest.mark.external by enable_socket (#1657)

pypdf has three types of tests: (1) Tests that work only with the repository itself, (2) tests t...

8588aa161b6b535ecf9a2c934cb1d8d6f1a9b24b authored almost 2 years ago by Martin Thoma <[email protected]>
ROB: Allow merging pages with annotations multiple times (#1624)

Closes #1623

ae42106aa4433e38f5ab7fceb277a41cf16e66d2 authored almost 2 years ago by pubpub-zz <[email protected]>
STY: Use pathlib (#1668)

See https://stackoverflow.com/a/60086137/562769

4fa4cedba42f70d6c0a3aba04cffe5e10f9dc7d5 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Enable D100

b8258c3cbc9732f79e73cc9dc089be6097678e9b authored almost 2 years ago by Martin Thoma <[email protected]>
REL: Version 3.5.0, 2023-02-26

New Features (ENH)
- Add reader.attachments public interface (#1611, #1661)
- Add PdfWriter.re...

5e792c2519f101045e512ec047ebfcaf5e87ee28 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Minor code cleanups (#1667)

b0d4e859406b9d5a392bf2511ee99b9a710dc2eb authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Cleanup of the documentation (#1666)

638903c50d7761dbdbc6a5c5283ca280d5cdd6e4 authored almost 2 years ago by Martin Thoma <[email protected]>
ENH: Add reader.attachments public interface (#1661)

Add `PdfReader.attachments -> Mapping[str, List[bytes]]` as a public interface.

The heavy-lift...

d3434454dc9ca7ba1a41457eac3446b37c81c9c2 authored almost 2 years ago by Martin Thoma <[email protected]>
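
A short sketch of how a caller might consume a mapping of the shape named in this commit, `Mapping[str, List[bytes]]`. The helper `attachment_sizes` and the sample data are hypothetical. With pypdf the mapping would presumably come from `reader.attachments`; a plain dictionary stands in here so the example runs without a PDF file.

```python
from typing import Dict, List, Mapping


def attachment_sizes(attachments: Mapping[str, List[bytes]]) -> Dict[str, int]:
    """Return the total payload size in bytes for each attachment name."""
    return {
        name: sum(len(blob) for blob in blobs)
        for name, blobs in attachments.items()
    }


# Stand-in data with the same shape as the documented return type.
example = {"notes.txt": [b"hello", b"world!"], "empty.bin": [b""]}
sizes = attachment_sizes(example)
```

Each name maps to a list of byte strings, so code that reads the mapping should iterate over the list instead of assuming exactly one payload per name.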
ENH: Add PdfWriter.remove_objects_from_page(page: PageObject, to_delete: ObjectDeletionFlag) (#1648)

This fixes remove_text to set contents as indirect_objects in accordance with the PDF specificat...

67b085ba35d2fb9854563b5f69ad203ecb4e95a4 authored almost 2 years ago by pubpub-zz <[email protected]>
ENH: Allow free-text annotation to have transparent border/background (#1664)

44bc1fc10407c53c8e9b0c34b2aa6853f6c52a03 authored almost 2 years ago by Martin Thoma <[email protected]>
ROB: Cope with invalid objects in IndirectObject.clone (#1637)

Cope with invalid PDF files where an object can not be found

Fixes #1631

b14dabb63c4e6aa75d52e7582bb6f840e35a3a1b authored almost 2 years ago by pubpub-zz <[email protected]>
DOC: Recognize Louis Paternault as a contributor

0a83028e9c6684279fbcb17542f92f425bd830db authored almost 2 years ago by Martin Thoma <[email protected]>
BUG: Allow decryption with empty password for AlgV5 (#1663)

The error message seen before was:

ValueError: Incorrect IV length (it must be 16 bytes l...

14b61ea75fb83b8ad6f2c3256cea100498885c54 authored almost 2 years ago by Christopher <[email protected]>
BUG: Let PdfWriter.pages return PageObject after calling `clone_document_from_reader()` (#1613)

Currently, the PdfWriter.pages attribute returns a `DictionaryObject`. With this fix, the attri...

a2a1275587df39883e576f2d8331b88fdcdaa492 authored almost 2 years ago by pubpub-zz <[email protected]>
DOC: PdfReader private functions (#1662)

5412dd331a7d6469c855a2aaf1005bd37591d07f authored almost 2 years ago by Martin Thoma <[email protected]>
ROB: Improve tolerance to invalid Names/Dests (#1658)

Fixes #1652

5b4bb2767ef56592ca1cdb12f28a02197acdc599 authored almost 2 years ago by pubpub-zz <[email protected]>
BUG: Invalid font pointed during merge_resources (#1641)

Fixed #1640

51732385a115e447791d4b1a5534fc678d01fb92 authored almost 2 years ago by pubpub-zz <[email protected]>
ROB: Decode encoded values in get_fields (#1636)

Fixes #424

2e4b6574832b7478d29d37842818da09a2cb71da authored almost 2 years ago by pubpub-zz <[email protected]>
ENH: Complete attachments functions (get/add) (#1611)

Fixes #1047
Fixes #527
Fixes #169

215df56b16e09fae467421ebe76051fe20300a3b authored almost 2 years ago by pubpub-zz <[email protected]>
DOC: merge_transformed_page() (#1647)

Several issues could have been avoided if the example in this PR existed before, e.g. #1630, #14...

cc32b59aaabdbfdfb10bdc6c8fb67827113962a6 authored almost 2 years ago by Louis <[email protected]>
ROB: Let PdfWriter.merge cope with missing "/Fields" (#1628)

Fixes #1627

f98ff02f5346b4372ba0a2994c4af0e0ec879594 authored almost 2 years ago by pubpub-zz <[email protected]>
TST: Add tests for page labels (#1642)

2204fd6d93c56fed5259e42a3420b7db84e334dc authored almost 2 years ago by pubpub-zz <[email protected]>
TST: Mark new external tests appropriately (#1632)

0e0b94311e3340f11dca7a5121199eb90911e8c2 authored almost 2 years ago by dkg <[email protected]>
DOC: paramter -> parameter (#1634)

ad7d176f39871584c7b0b635269e7d087c5b9be7 authored almost 2 years ago by engpetermwangi <[email protected]>
REL: 3.4.1

Bug Fixes (BUG):
- Switch from trimbox to cropbox when merging pages (#1622)
- Text extraction...

4e276b28a37561a58d42027f28d1bcf475b03d0f authored almost 2 years ago by Martin Thoma <[email protected]>
TST: Add multiple retries on get_url (#1626)

This reduces CI failures caused by network issues.

4bf3e32ef61773670b78189391596b32bf324847 authored almost 2 years ago by pubpub-zz <[email protected]>
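
The retry idea can be sketched roughly as follows. This is an illustration under assumptions, not the test helper from the pypdf repository: `with_retries`, its parameters, and the `flaky` stand-in are all hypothetical names.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def with_retries(fetch: Callable[[], T], attempts: int = 3, delay: float = 0.0) -> T:
    """Call fetch() up to `attempts` times; re-raise the error of the final attempt."""
    for attempt in range(attempts):
        try:
            return fetch()
        except OSError:  # socket errors and urllib's URLError derive from OSError
            if attempt == attempts - 1:
                raise
            time.sleep(delay)
    raise ValueError("attempts must be at least 1")


# Stand-in for a download that fails twice before succeeding.
calls = {"n": 0}


def flaky() -> bytes:
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("temporary network failure")
    return b"data"


result = with_retries(flaky, attempts=3)
```

Only network-style errors are retried here; any other exception propagates immediately, so genuine test failures are not hidden.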
ROB: Fix 2 cases of "object has no attribute 'indirect_reference'" (#1616)

Fixes #1614

7a807d226aea90e60f22281a8e1d7317ba87d1d7 authored almost 2 years ago by pubpub-zz <[email protected]>
BUG: Switch from trimbox to cropbox when merging pages (#1622)

While the old behavior can be considered a bug, people might rely on trimbox being used.
To all...

c70dcd05a6d1cbfbe2298149ba1892aed3f1f508 authored almost 2 years ago by Martin Thoma <[email protected]>
BUG: Text extraction not working with one glyph to char sequence (#1620)

Fixes #1619

f5ac79bb552f2e55579854565a31a67fdfdee275 authored almost 2 years ago by pubpub-zz <[email protected]>
DOC: Complete unfinished sentence (#1617)

cfcba1a62947c398a0a55a10255dcfea434d8857 authored almost 2 years ago by Daan Koning (he/him) <[email protected]>
DOC: Add contributor Mitchell Kotler

e4853e47fa667b88f7424aeac70ba265b63fb91f authored almost 2 years ago by Martin Thoma <[email protected]>
REL: 3.4.0

NOTICE: pypdf changed the way it represents numbers parsed from PDF files.
pypdf<3.4.0 represe...

1b54821e5e02ff044fd1c535262e9522b1f70c2b authored almost 2 years ago by Martin Thoma <[email protected]>
DEV: Use ruff in CI (#1609)

d9d78da3fea0e174bdec8d92c7dfc59aafefe072 authored almost 2 years ago by Martin Thoma <[email protected]>
BUG: Fix getter of the PageObject.rotation property with an indirect object (#1602)

The setter worked. Now the getter for the `PageObject.rotation` works as well.

Fixes #337

d2c3e1bf43b67848789c3ec75e758be3ba71d38b authored almost 2 years ago by mitchelljkotler <[email protected]>
STY: Type annotations and variable names (#1539)

0134233c15dc8866819c2b76799ebedcb93e94a8 authored almost 2 years ago by Martin Thoma <[email protected]>
MAINT: Remove decimal (#1608)

This is a preparation to completely get rid of decimal at some point. For now, I don't want to b...

430f92f0c49bb998d0deca300458948d0820ec4e authored almost 2 years ago by Martin Thoma <[email protected]>
DEV: Introduce ruff (#1586)

Move code style to extra execution path. This way we can see more easily what the style issues a...

90476ef7af1442b5a3d37cbc7c5a5244014944c6 authored almost 2 years ago by Martin Thoma <[email protected]>
MAINT: Bump sample-files

9bef0d3094cb79aebb17c607f6b41b051e17eb68 authored almost 2 years ago by Martin Thoma <[email protected]>
BUG: Restore merge_transformed_page & co (#1567)

Fixes #1426
Fixes #1601

2d60c715379877404409c4fc2bbe60ca09c03332 authored almost 2 years ago by pubpub-zz <[email protected]>
BUG: Replace decimal by float (#1563)

Decimal was replaced by float in order to fix bugs.

It might also improve speed in some cases...

6ec88ad27f82ebc7ddffcd054cd1408226b4256e authored almost 2 years ago by pubpub-zz <[email protected]>
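
The commit message does not say which bugs were involved. As general background only, one well-known source of friction with `Decimal` in Python is that it cannot be mixed with `float` in arithmetic. The variable names and values below are made up for illustration.

```python
from decimal import Decimal

width = Decimal("595.28")  # hypothetical value parsed as Decimal
scale = 0.5                # ordinary float supplied by calling code

# Mixing the two types in arithmetic raises TypeError.
try:
    width * scale
    mixed_ok = True
except TypeError:
    mixed_ok = False

# With plain floats the same computation works directly.
scaled = float(width) * scale
```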
STY: Remove commented-out code (ruff:ERA001) (#1596)

3956eb3c2f119c522ca9f28e7e10155086191f87 authored almost 2 years ago by Martin Thoma <[email protected]>
ROB: PdfWriter.remove_images - /Contents might not be in page_ref (#1598)

b8d4b8fb89cb9fbf4321129969bdf0797a0e189b authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: PDFX namespace (#1592)

Closes #1587

645bd73de65ec62045bd43473e14e030e0dd433a authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Fix typo on extract text page (#1591)

56d9a1a3d451971543bfba5cadeb41cfdb9e88d7 authored almost 2 years ago by Matthew Peveler <[email protected]>
DOC: Missing semantic layer (#1590)

Closes #1588

b148b9453e4af0d70ba9ca7422ece7a574150617 authored almost 2 years ago by Martin Thoma <[email protected]>
ENH: Enable merging forms with overlapping names (#1553)

Add functions to add a top level grouping form field.
Functions to rename top level field also ...

5d1654059ab493d7d197d31a6b69cdb261be72b1 authored almost 2 years ago by pubpub-zz <[email protected]>
STY: Apply docformatter (#1584)

See https://pypi.org/project/docformatter/

I tried pre-commit, but docformatter collides with...

8e819d1d00c758d4ccb8c8cd09c79269d0f3a3b9 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Variable names in documentation

beb70b49cbb9a4caa1bfedd57e4b948702cd6083 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Docstring formatting of darglint (#1582)

f22ecd4c03f4cc9a9d225184e6c4b7ada48b854f authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Fix spelling of "requires" (#1580)

be353c8b5ad2b9f029297a4694b6fa2dd19fc8a3 authored almost 2 years ago by dkg <[email protected]>
TST: Unlink test output once successfully written (#1579)

Avoid having two tests write to the same filename "stream.pdf".

When a writing test has compl...

01f4b6ff398c786aba52573a3b3379272e1419c8 authored almost 2 years ago by dkg <[email protected]>
DOC: Added Huon Wilson as Contributor

59a265ecbd08cdd65f4f330cfdc19c95052b9397 authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Add Karen McNeil as a contributor

4cec498dc91e4b103caa3e96e489e294f75b94fd authored almost 2 years ago by Martin Thoma <[email protected]>
TST: mark external test (#1575)

755b775eea53a9c922295b1f34cb9008cc9c41f6 authored almost 2 years ago by dkg <[email protected]>
DOC: Improve documentation by fixing darglint issues (#1574)

bee73dc1d3afb683dd5f1f892cf201ed6c23e5ce authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Update sample-files

422aca48fd627d94617d7ca39271aad43300af4a authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Replace PyPDF2 by pypdf (#1573)

Fixes an issue of #1550

9cf7b0c2194a7fb66813b889297d4e3fd78241dc authored almost 2 years ago by Martin Thoma <[email protected]>
REL: 3.3.0

New Features (ENH):
- Add page label support to PdfWriter (#1558)
- Accept inline images with ...

98c5a9f83f138483cb5ce808bd509cf076b6e83f authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Apply ruff suggestions (#1570)

98511ac3bcac7574e92e3cd78ae0b1eb6f16d89f authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Add attrs because myst-parser didn't

962ce556592eb95a92e610305b15263e4de0d5e4 authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Set sphinx<5 to allow myst-parser==0.16.1

Fixes #1569

6abb50e247ab4eeab0e3969a9fcb827d12cf4a94 authored almost 2 years ago by Martin Thoma <[email protected]>
Sort computed /ProcSet in a merged page, for reproducibility (#1542)

Sorting the `/ProcSet` array when merging two pages makes the ProcSet deterministic across multi...

d1233dfa5089100c3b4916ee3478300f01c6bcd1 authored almost 2 years ago by Huon Wilson <[email protected]>
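
A minimal sketch of the idea, not the pypdf implementation: `merge_proc_sets` and the sample lists are hypothetical. Taking the union through a `set` leaves the iteration order of strings dependent on Python's per-process hash randomization, so sorting the result gives the same output on every run.

```python
from typing import Iterable, List


def merge_proc_sets(first: Iterable[str], second: Iterable[str]) -> List[str]:
    """Union two collections of names and return them in sorted order."""
    return sorted(set(first) | set(second))


page_a = ["/PDF", "/Text"]
page_b = ["/ImageC", "/PDF"]

merged = merge_proc_sets(page_a, page_b)
# The result does not depend on argument order either.
merged_swapped = merge_proc_sets(page_b, page_a)
```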
ENH: Rename resources deterministically in merge_page (#1543)

This fixes #1532 by adjusting the procedure used for renaming resources in `merge_page`, so that...

b6b6a66651c982c1e5cc0df1c826e7aba55e2afe authored almost 2 years ago by Huon Wilson <[email protected]>
STY: Minor refactoring to simplify #1543 (#1568)

3b1b9d4c7a22360dbf3ee8e614a7e93674502ae2 authored almost 2 years ago by Martin Thoma <[email protected]>
BUG: Fix error in cmap extraction (#1544)

Fixes #1533 and late #1091

c1f8742bf4e8f0341b244088dfe4f9ecd40f2fc6 authored almost 2 years ago by pubpub-zz <[email protected]>
BUG: Remove erroneous assertion check (#1564)

Closes #1559

This is due to a bad interpretation of text at bottom of page 108 of PDF 1.7 ref...

a6aad314d4937c01972f16092d391c3615047f34 authored almost 2 years ago by pubpub-zz <[email protected]>
ROB: Set ignore_eof=True for read_until_regex (#1521)

This was initially motivated by `NumberObject.read_from_stream`, which
was calling `read_until_...

53645efd8cb5a40723c1131318ccb75c9c8726d6 authored almost 2 years ago by Ronuk Raval <[email protected]>
ENH: Add page label support to PdfWriter (#1558)

Add PageLabelStyle constants

e7118462cd23226490a1907a0b22f824dd585907 authored almost 2 years ago by lorenzomanini <[email protected]>
Paper size (#1550)

e6500f8180abed30d813e314c7d21f732183ad3c authored almost 2 years ago by Sandro Panchame <[email protected]>
ENH: Add circle annotation support (#1556)

See #107

3560550eeef7a6e55a05c8d656d88b1dc05a1a2e authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Add Jonah Majumder as a contributor

d942a49074de9fb89ea374cd7f36f6d74a4a3451 authored almost 2 years ago by Martin Thoma <[email protected]>
BUG: Fix dictionary access of optional page label keys (#1562)

Fixes #1560

Co-authored-by: jonahmajumder <[email protected]>

c293b954e6f589ecfd6671df80144b83259a7fe4 authored almost 2 years ago by Martin Thoma <[email protected]>
ENH: Add Polygon annotation (#1557)

See #107

f494e61e96bef4952ebabd1817ad6a4d7cf6eef8 authored almost 2 years ago by Martin Thoma <[email protected]>
ENH: Accept inline images with space before EI (#1552)

Closes #1541

df90053c3e673dfafafba8557e1d379883b62dc0 authored almost 2 years ago by pubpub-zz <[email protected]>
STY: Simplify test (#1555)

f0c0a1d0cb0d8a72b988c63ba1d3f390e6dbe575 authored almost 2 years ago by Martin Thoma <[email protected]>
Annotate tests appropriately (#1551)

By annotating these tests, we can use pytest markers to skip external
tests and tests that depe...

767047b98ee3ea7aca331cfbd63502a284bfed93 authored almost 2 years ago by dkg <[email protected]>
DEV: Fix broken combination of dependencies of docs.txt

f1cd7ae85df9ff90361255ca3fa869eb3a457ebf authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Whitespace issue

32b5034c6c652e3cf56cca784cbaa6ea2cf1685a authored almost 2 years ago by Martin Thoma <[email protected]>
REL: 3.2.1

New Features (ENH):
- Accept hierarchical fields (#1529)

Documentation (DOC):
- Use google st...

e7e4ffc7e74fd3f1a191bc63527bee0d7986be1f authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Adjust variable names (#1540)

0ebc70314f3a708141232cafce9512336e5307bf authored almost 2 years ago by Martin Thoma <[email protected]>
ENH: Accept hierarchical fields (#1529)

Indexed names are implemented with `.` not `_` (possible mix up with names).
An optional parame...

683c130d372aafd1cfa032b3def29d44d999a88e authored almost 2 years ago by pubpub-zz <[email protected]>
DOC: Use google style docstrings (#1534)

The Google Docstring style is way more readable than ReST style docstrings. For this reason, thi...

5c0b11d6f6dfe8f7744565c59cf0ffef9f706608 authored almost 2 years ago by Martin Thoma <[email protected]>
DOC: Fix linked markdown documents (#1537)

Internal markdown links in docs do not work due to a bug in myst-parser.
https://github.com/exe...

0ba45eaf8b36c9295e1af6eb0b6e4162eca1e13c authored almost 2 years ago by Aziz Berkay Yesilyurt <[email protected]>
DOC: Add maxbeer99 as a contributor

377e18b3a4948f612d52671349fc5cea5a45cdf8 authored almost 2 years ago by Martin Thoma <[email protected]>
DEV: Install pypdf for docs

00f1d3433a73ad3c0ab1f4706492b138e7f58bcb authored almost 2 years ago by Martin Thoma <[email protected]>
DEV: Update docs config (#1535)

Credit goes to Adel Haddad
See https://github.com/py-pdf/pypdf/pull/1430

Co-authored-by: ade...

438f38e5b43c8a4291b02adb88836604921fade3 authored almost 2 years ago by Martin Thoma <[email protected]>
STY: Apply pre-commit

ea598dda99c6823d637e28f2f558ee5877499146 authored almost 2 years ago by Martin Thoma <[email protected]>
MAINT: Move URLs from mstamy2 to py-pdf

a5fae372ca8f961c59316453f1b7fc537ba3598b authored about 2 years ago by Martin Thoma <[email protected]>
REL: 3.2.0

Performance Improvement (PI)
- Help the specializing adaptive interpreter (#1522)

New Features...

c2c4be6be5fde84d9898f133c43ebeaef7665c56 authored about 2 years ago by Martin Thoma <[email protected]>