Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/py-pdf/pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
https://github.com/py-pdf/pypdf

PERF: Help the specializing adpative interpreter (#1522)

This PR aims at making minor changes that do not influence readability in a negative way and hel...

6407e1dd12aa797c55465768d9e32889d01c2cb1 authored about 2 years ago
MAINT: Minor Improvements (#1523)

MAINT: Use a generator+join instead of iterative concatenation _security.py
MAINT: Shortened if...

82f9c1edb4e6a307508018be93d05e8d86678ed2 authored about 2 years ago
BUG : upgrade clone_document_root (#1520)

DOC : Add clarifications/details in doc strings of clone_reader_document_root and clone_document...

41aa58471c440d09832b1eebfee438b419ff81d3 authored about 2 years ago
ENH: Add support for page labels (#1519)

Introduce a new PdfReader property `page_labels` that returns a list of strings.

In most case...

8c9505cca821a0d6c74590253a4407d45810331c authored about 2 years ago
TST: Adding outline items after clone_document_from_reader

See #1518

cfed01f502afd1af7ed331b32ee84ddd531f9e8a authored about 2 years ago
DOC: Comparison with pdfminer

c700617a02dfd7941817db70823f22c9ba527aaf authored about 2 years ago
DOC: Fix migration guide link (#1516)

The link to migration guide in README pointed to a diff instead of the actual guide.

f6a7b35ccecdc70011cfa1b5eda74d718b38bb3d authored about 2 years ago
DOC: Mention migration guide more often

318ace3233b25050a25b76dc45a686185913203b authored about 2 years ago
REL: 3.1.0

Move PyPDF2 to pypdf (#1513). This now it's all lowercase, no number in the
name. For installati...

7d07401f456267480eb8deac85e3fde2f6927366 authored about 2 years ago
DOC: Update logo

0844c938651c8f20154a204146fa584b4114ffbb authored about 2 years ago
MAINT: Move PyPDF2 to pypdf (#1513)

All lowercase, no number.

See #1488

3825c7b8e7890ef507de4099f5ed9c1c3cf9ec8a authored about 2 years ago
REL: 3.0.0

BREAKING CHANGES:
- Deprecate features with PyPDF2==3.0.0 (#1489)
- Refactor Fit / Zoom parame...

3214dde1d2c0f7c9071ba4157d82cdb796dd73d4 authored about 2 years ago
DOC: How to read PDFs from S3 (#1509)

3fb9b690fe326c806a126b2af20c222045d4f987 authored about 2 years ago
DOC: Use google-style docstrings in _page.py (#1503)

e5e26ad20c160350e09566049a05cc33ba4c46f9 authored about 2 years ago
DOC: Make MyST parse all links as simple hyperlinks (#1506)

See https://myst-parser.readthedocs.io/en/latest/configuration.html

Closes #1492

9b23a39b987c9003ef48b5015faf957e2bd02849 authored about 2 years ago
MAINT: Deprecate features with PyPDF2==3.0.0 (#1489)

Deprecate features, but keep helpful exceptions. That means that the names still need to be ther...

a4629d3dbdef2759ff6059446fcffd13ecf6a396 authored about 2 years ago
DOC: Add Joseph Hale as a contributor

c95eabfab0a4848edb727a8390e343e04eacb325 authored about 2 years ago
MAINT: Use typing.IO for file streams (#1498)

In addition, the following changes were made:

* DEV: Add in-project virtual envs to .gitignor...

b8f787e318125cdd0b85f3ca07c412da932b046a authored about 2 years ago
DOC: Add Olson Perrensen

10ab0854c4cb7f2d4b490ff4178fd170f5d0a9e5 authored about 2 years ago
DOC: Changed 'latest' for 'stable' generated docs (#1495)

Closes #1494

690247a430bfde85f11352387f3f34a005f13418 authored about 2 years ago
ENH: Add Cloning (#1371)

The method `.clone(pdf_dest,[force_duplicate])` clones the objects and all referenced objects.
...

74b8a6364ee806cc0510b024864502d1fdbd8204 authored about 2 years ago
Add type-ignores for cloning PR

9674f5ff31993e8cc2d684a89180a2d9a68f52d0 authored about 2 years ago
ENH: Allow int for indirect_reference in PdfWriter.get_object (#1490)

Co-authored-by: pubpub-zz <[email protected]>

2aba7188b307196592655a7f163449f6f5498fe0 authored about 2 years ago
DOC: Adjust deprecation procedure (#1487)

437aec9473adbc2b98740dd33644c6a5002657f6 authored about 2 years ago
MAINT: Refactor Fit / Zoom parameters (#1437)

Introduce a new `PyPDF2.generic.Fit` class which captures the type and parameter for how a page ...

76334777b5d54342b25eb3e030f02df9bf01b2ca authored about 2 years ago
REL: 2.12.1

Documentation (DOC)
- Deduplicate extract_text docstring (#1485)
- How to cite PyPDF2 (#1476)
...

ce0e190b702dba15fe1238f1491dbf9b6c5b080e authored about 2 years ago
DOC: Deduplicate extract_text docstring (#1485)

Closes #1455

2fe895648d361a74c1c338c95650a4aa7e752c80 authored about 2 years ago
MAINT: position ➔ page_number in Merger.merge (#1482)

Changed method parameter name for consistency as mentioned in #1187

- `position` ➔ `page_num...

40086e1fb98927320c1aa88456275d24e27f6b0b authored about 2 years ago
MAINT: indirect_ref ➔ indirect_reference (#1484)

Remove xfail from passing test
Don't consider deprecated code for coverage testing

a0abf1ec097ba8a8bb3a2d27a14f3493b7f33821 authored about 2 years ago
MAINT: Consistent method keywords (#1467)

* indirect_reference ➔ indirect_ref and ido ➔ indirect_ref in PdfReader._get_object_from_stream ...

25770098719338b2715ff1f97be18edf2467020c authored about 2 years ago
MAINT: Consistent parameter/variable names (#1483)

* owner_pwd ➔ owner_password
* user_pwd ➔ user_password

See #1187

e356a39f28331c172b7d2fc2c22a140d13da37ab authored about 2 years ago
DOC: How to cite PyPDF2 (#1476)

Co-authored-by: Matthew Peveler <[email protected]>

fe461a171e425e558c8afa3bfe1f279af5ec5689 authored about 2 years ago
REL: 2.12.0

New Features (ENH):
- Add support to extract gray scale images (#1460)
- Add 'threads' propert...

f64f36a8abfde3baeca2aff2562d0305d5cf6e14 authored about 2 years ago
BUG: Scale PDF annotations (#1479)

PDF annotations - for example hyperlinks and comments - were not properly scaled when using the ...

5fd96d01c1febe6d38ba9b90ba774e95ceaca784 authored about 2 years ago
MAINT: Package updates 🌱 (#1481)

* Update docs.txt to 3.7 as https://readthedocs.io/ seems to use 3.7
* Update dev.txt to 3.7 as...

b1938aaa9ec950bdc188c072f557e21ebd7ccc4b authored about 2 years ago
ROB: Padding issue with AES encryption (#1469)

Fixes #1221

Credit goes to Alper Ahmetoglu for the fix

Co-authored-by: Alper Ahmetoglu <ah...

f804f3af83b1856492ea94e537de1c5350585add authored about 2 years ago
STY: Docstrings and variable names (#1470)

9821468f7f6c425dc30bd961592ff4cf518f6fd7 authored about 2 years ago
Add support to extract gray scale images (#1460)

22214e83d24c346e4633f32ec6d77c62f799c79d authored about 2 years ago
ROB: Accept empty object as null objects (#1477)

Fixes #1474

7f586ae55157785ce63881add2149a1d62f2bc47 authored about 2 years ago
MAINT: Bump sample-files

635f0cae1818ca9a5a7c3c1c8f9bd026c49922d5 authored about 2 years ago
MAINT: Use 'page_number' instead of 'pagenum' (#1365)

This PR ensures PyPDF2 uses page_number instead of pagenum as a parameter name.

* It does not...

deb0667321bb1e60bea2d9b21700054f66e6b445 authored about 2 years ago
ENH: Add 'threads' property to PdfWriter (#1458)

This currently returns only an empty list . Pending to PR#1371

3e250c5b025bb82407ad10f3db9906f784b3f42e authored about 2 years ago
MAINT: Add List of pages to PageRangeSpec (#1456)

Applies to merge/append

1a9f7d9434f9c68254d00f3c91a21d5fc6d1e22b authored about 2 years ago
ENH: Make PdfReader.get_object accept integer arguments (#1459)

Also fix various type annotations

17897d986a6c678ff55ecdef5ed26e59f9d9bc39 authored about 2 years ago
TST: Cleanup temporary files (#1454)

940819ffa88223b586dc777f7d60b24a55878033 authored about 2 years ago
DOC: Add module documentation the PaperSize class (#1447)

b6bfc17c38a2a75b1eb89dbcb997ab30210fd599 authored about 2 years ago
ENH: Add PdfWriter.open_destination property (#1431)

fdd105812e201e9f466030711a20a669b9acf4af authored about 2 years ago
TST: Mark test_tounicode_is_identity as external (#1449)

89bf4c1c0750cd3b72c93953fa08dda2fa828de6 authored about 2 years ago
TST: Use Ubuntu 20.04 for running CI test suite (#1452)

Closes #1451

This sets the test suite to use ubuntu-20.04 as opposed to ubuntu-latest as the ...

420a545836630585f9e03b1df855c423e77c7731 authored about 2 years ago
REL: 2.11.2

fb34775edb80c191ac0cf5bd5906d18ec8f83947 authored about 2 years ago
DEV: Link 'Full Changelog' automatically

4b1cbde09f7b8ed02c9771c15370594ef864e02d authored about 2 years ago
MAINT: Update pre-commit-config

6f73cb198367598c32ca71a9e6fd7eded962256e authored about 2 years ago
DOC: Link PyPDF2 1.x Changelog

b14be94bd25266eaaaae5c93721efe7d89062694 authored about 2 years ago
DOC: Link 'Full Changelog' in CHANGELOG

Closes #1423

Also split PyPDF2 1.X changelog to separate page as it becomes really
long.

3d59febcba0c0b99b84cba8b408f140ce554a83c authored about 2 years ago
BUG: JavaScript executed twice (#1439)

Fixes #1425, #1420, #482

03f82d23af1e29ab90096b0442e5add965101a4b authored about 2 years ago
DOC: Typos in PDF format documentation (#1438)

906c338d4f5b031b6df7b9990c8af6ad6494184c authored about 2 years ago
DOC: Fix links

See #1436

Co-authored-by: pavlidvg <[email protected]>

1842c2128873ec64f8a467381458d3d03a0955ea authored about 2 years ago
ENH: Add remove_from_tree (#1432)

The remove_from_tree method can be used to remove an outline item

Fixes #1427

df933f2401292d19f6d612475b944724a9aad15e authored about 2 years ago
BUG: ToUnicode stores /Identity-H instead of stream (#1433)

fixes #1417

56395e9509f03ae07b5cf63a3b0ebc04372b2c02 authored about 2 years ago
DOC: Fix typos in installation.md (#1419)

0b2b3ec997b81d6bb22c5c834ef5ba9e4f2330e7 authored about 2 years ago
MAINT: Fix types in test annotations (#1411)

6a61089fce4f4ac8f0d1c5278b0d0c80e588230c authored about 2 years ago
DEV: Modify read_string_from_stream to a benchmark (#1415)

The test before was to brittle. We need to keep an open eye to the
benchmarks in future, but al...

abe19f761450756e6bc837a626f8b3afd71236e1 authored about 2 years ago
DEV: Improve error reporting of read_object (#1412)

Additionally, use time.process_time instead of time.time for test_readStringFromStream_performance

d82172374c1322ece2919d45263f7232f63d23fd authored about 2 years ago
DEV: Test Python 3.11 (#1404)

1d551af8d7bd73ccc6d6981a9b7a81026df73632 authored about 2 years ago
DEV: Extend Flake8 ignore list (#1410)

613b3706e072259f50299387945afcc0ada0cffb authored about 2 years ago
TST: Fix test_get_attachments (#1409)

752d496790a6454cb7656851e8a652b3585b2eea authored about 2 years ago
DEV: Use correct pytest markers (#1407)

It was tested like this:

## slow

$ pytest --durations=50 -m "not slow"

## external

D...

4b8d849e4d1042c139542cfbfd0c0e2ed430e831 authored about 2 years ago
MAINT: Update isort config (#1408)

b49034e8fdcd854e775e6c6e36699c0357956619 authored about 2 years ago
DEV: Move project configuration to pyproject.toml (#1382)

The pyproject.toml file is the new standard for project configuration in Python defined by PEP-6...

abe01995304470d7c110bf74f24e972073030426 authored about 2 years ago
DOC: Mention pdfalyzer

d64b578d135f7cd72b538fe4afb917be955080ef authored over 2 years ago
PKG: Declare Pillow as optional dependency (#1392)

Closes #1390

664ddf28e9c7037c5e95566a7f5538e85dad0df5 authored over 2 years ago
DOC: Github ➔ GitHub (#1391)

cc553210524a8009c91409b76f05912469cfa927 authored over 2 years ago
ENH: Add AnnotationBuilder.rectangle (#1388)

2e6ea6fa21cf654dadb48f90a542f874c716008f authored over 2 years ago
REL: 2.11.1

Bug Fixes (BUG):
- td matrix (#1373)
- Cope with cmap from #1322 (#1372)

Robustness (ROB):
- C...

d14f1deb54d5767407b5f51c51cc1f560008bb63 authored over 2 years ago
STY: Use relative imports (#1387)

d5111e1a056e17ea32f01429258ad12aa02d7450 authored over 2 years ago
BUG: td matrix (#1373)

50c1b527361ff9da4ea8ce6db77b67f2e2d837a4 authored over 2 years ago
DOC: Remove watermark PageObject declaration as it is already present inside for-loop (#1384)

7faa9b321c694fdadaba71f4fe1700fe81cf6ccd authored over 2 years ago
ROB: Cope with str returned from get_data in cmap (#1380)

Fixes #1379

92c894d247e803dd5df4535ec1180d51305d3508 authored over 2 years ago
TST: Adding test for #1370 (#1375)

9d870a2b01bcdc2816f3fde3bcb1e51d0a9da398 authored over 2 years ago
FIX : cope with cmap from #1322 (#1372)

Cope with cmap where the range contains first and last code are on variable length.

Also fixe...

f3b6d0e7d723aebc72c8de638ad852f5c98e9168 authored over 2 years ago
DOC: Black formatting and variable naming

d9aa64c2a41c379c4b1210e3689e4edb4f694a77 authored over 2 years ago
DOC: How to use visitor functions (#1369)

Add two samples using visitor-functions when calling extract_text.

The first example may be o...

a5f4f41e5675f78902f9881545e2afb9d8455112 authored over 2 years ago
STY: Variable naming, black, and isort (#1367)

Type annotations as well

d508c69c3dba15bc834d3d43df14415d46643ffa authored over 2 years ago
REL: 2.11.0

New Features (ENH):
- Addition of optional visitor-functions in extract_text() (#1252)
- Add m...

67a5ccfad2664c7854c3fac4fc6fac6500a52232 authored over 2 years ago
BUG: Lookup index in _xobj_to_image can be ByteStringObject (#1366)

DEV: Adjusted File classes __str__ and __repr__ to easy debugging

eca1a848633871548dd700a147060dfbfa0da7c2 authored over 2 years ago
DOC: Add Sascha Rogmann as a contributor

26bdc6b36e4bcd8bae11f73dfdba0926abd62ae3 authored over 2 years ago
MAINT: Update sample-files

cf76824e3324d1bf7ad38710be7118f1689d272c authored over 2 years ago
ENH: Addition of optional visitor-functions in extract_text() (#1252)

Optional visitor-callback were added to extract_text().

_extract_text() calls these visitor-m...

ebb3b83c3aa6e2305b32710c609e0e7534186934 authored over 2 years ago
TST: read_string_from_stream performance (#1355)

There is a saftey margin of a factor of 10 in both directions,
so the test should be fairly sta...

30a9e68beecb0d710339e85b991add63d9141c93 authored over 2 years ago
ENH: Add metadata.creation_date and modification_date (#1364)

Closes #1222

2845c6d94a3fb56a13393191fc8e050ad9d6bb37 authored over 2 years ago
ENH: Add PageObject.images attribute (#1330)

85b3e8785830bf35fc1c21f5c2edf29636281290 authored over 2 years ago
BUG: 'IndexError: index out of range' when using extract_text (#1361)

Fixes #1358

Co-authored-by: diavral <[email protected]>

dcab241e4ab386834d6864538b99b556b04a3f7c authored over 2 years ago
DOC: Add programmarchy to contributors

4ff5e0a3faf40b8eb1632c0708366591c1b41ea3 authored over 2 years ago
BUG: Errors in transfer_rotation_to_content() (#1356)

See https://github.com/py-pdf/PyPDF2/issues/1280#issuecomment-1251018614

7803a72c1e446b957b7919b8097f272b14ae9fdc authored over 2 years ago
ROB: Ensure update_page_form_field_values does not fail if no fields (#1346)

Fixes #1343

141a765621235a395ce04ef078ab2060d359d2fa authored over 2 years ago
REL: 2.10.9

New Features (ENH):
- Add rotation property and transfer_rotate_to_content (#1348)

Performance...

620d2fa03c8b87e2923d9f41e2bc0da8a4321c6a authored over 2 years ago
BUG: Format floats using their intrinsic decimal precision (#1267)

Since FloatObject is represented as a decimal, format numbers using their intrinsic precision, i...

5aeb92633769785e50470a3acecffe883112a99d authored over 2 years ago
ENH: Add rotation property and transfer_rotate_to_content (#1348)

See #1280 for the context of this change

71de6c8d2792c25e40d4d82dba1de4a788991196 authored over 2 years ago
ROB: Fix merge_page for pages without resources (#1349)

Closes #270

6f51189d880980b19fed900e43f73f91ac119239 authored over 2 years ago
PI: Avoid string concatenation with large embedded base64-encoded images (#1350)

Certain PDF libraries do embed images as base64 strings. This causes performance issues
in `rea...

3be01fda141817c90a0df18fb22e1747ac5832b3 authored over 2 years ago