Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
https://github.com/py-pdf/pypdf
This PR aims at making minor changes that do not influence readability in a negative way and hel...
6407e1dd12aa797c55465768d9e32889d01c2cb1 authored about 2 years ago
MAINT: Use a generator+join instead of iterative concatenation _security.py
MAINT: Shortened if...
DOC : Add clarifications/details in doc strings of clone_reader_document_root and clone_document...
41aa58471c440d09832b1eebfee438b419ff81d3 authored about 2 years agoIntroduce a new PdfReader property `page_labels` that returns a list of strings.
In most case...
8c9505cca821a0d6c74590253a4407d45810331c authored about 2 years agoSee #1518
cfed01f502afd1af7ed331b32ee84ddd531f9e8a authored about 2 years agoc700617a02dfd7941817db70823f22c9ba527aaf authored about 2 years ago
The link to migration guide in README pointed to a diff instead of the actual guide.
f6a7b35ccecdc70011cfa1b5eda74d718b38bb3d authored about 2 years ago318ace3233b25050a25b76dc45a686185913203b authored about 2 years ago
Move PyPDF2 to pypdf (#1513). This now it's all lowercase, no number in the
name. For installati...
0844c938651c8f20154a204146fa584b4114ffbb authored about 2 years ago
All lowercase, no number.
See #1488
3825c7b8e7890ef507de4099f5ed9c1c3cf9ec8a authored about 2 years ago
BREAKING CHANGES:
- Deprecate features with PyPDF2==3.0.0 (#1489)
- Refactor Fit / Zoom parame...
3fb9b690fe326c806a126b2af20c222045d4f987 authored about 2 years ago
e5e26ad20c160350e09566049a05cc33ba4c46f9 authored about 2 years ago
See https://myst-parser.readthedocs.io/en/latest/configuration.html
Closes #1492
9b23a39b987c9003ef48b5015faf957e2bd02849 authored about 2 years agoDeprecate features, but keep helpful exceptions. That means that the names still need to be ther...
a4629d3dbdef2759ff6059446fcffd13ecf6a396 authored about 2 years agoc95eabfab0a4848edb727a8390e343e04eacb325 authored about 2 years ago
In addition, the following changes were made:
* DEV: Add in-project virtual envs to .gitignor...
b8f787e318125cdd0b85f3ca07c412da932b046a authored about 2 years ago10ab0854c4cb7f2d4b490ff4178fd170f5d0a9e5 authored about 2 years ago
Closes #1494
690247a430bfde85f11352387f3f34a005f13418 authored about 2 years ago
The method `.clone(pdf_dest,[force_duplicate])` clones the objects and all referenced objects.
...
9674f5ff31993e8cc2d684a89180a2d9a68f52d0 authored about 2 years ago
Co-authored-by: pubpub-zz <[email protected]>
2aba7188b307196592655a7f163449f6f5498fe0 authored about 2 years ago437aec9473adbc2b98740dd33644c6a5002657f6 authored about 2 years ago
Introduce a new `PyPDF2.generic.Fit` class which captures the type and parameter for how a page ...
76334777b5d54342b25eb3e030f02df9bf01b2ca authored about 2 years ago
Documentation (DOC)
- Deduplicate extract_text docstring (#1485)
- How to cite PyPDF2 (#1476)
...
Closes #1455
2fe895648d361a74c1c338c95650a4aa7e752c80 authored about 2 years agoChanged method parameter name for consistency as mentioned in #1187
- `position` ➔ `page_num...
40086e1fb98927320c1aa88456275d24e27f6b0b authored about 2 years ago
Remove xfail from passing test
Don't consider deprecated code for coverage testing
* indirect_reference ➔ indirect_ref and ido ➔ indirect_ref in PdfReader._get_object_from_stream ...
25770098719338b2715ff1f97be18edf2467020c authored about 2 years ago
* owner_pwd ➔ owner_password
* user_pwd ➔ user_password
See #1187
e356a39f28331c172b7d2fc2c22a140d13da37ab authored about 2 years agoCo-authored-by: Matthew Peveler <[email protected]>
fe461a171e425e558c8afa3bfe1f279af5ec5689 authored about 2 years ago
New Features (ENH):
- Add support to extract gray scale images (#1460)
- Add 'threads' propert...
PDF annotations - for example hyperlinks and comments - were not properly scaled when using the ...
5fd96d01c1febe6d38ba9b90ba774e95ceaca784 authored about 2 years ago
* Update docs.txt to 3.7 as https://readthedocs.io/ seems to use 3.7
* Update dev.txt to 3.7 as...
Fixes #1221
Credit goes to Alper Ahmetoglu for the fix
Co-authored-by: Alper Ahmetoglu <ah...
f804f3af83b1856492ea94e537de1c5350585add authored about 2 years ago9821468f7f6c425dc30bd961592ff4cf518f6fd7 authored about 2 years ago
22214e83d24c346e4633f32ec6d77c62f799c79d authored about 2 years ago
Fixes #1474
7f586ae55157785ce63881add2149a1d62f2bc47 authored about 2 years ago635f0cae1818ca9a5a7c3c1c8f9bd026c49922d5 authored about 2 years ago
This PR ensures PyPDF2 uses page_number instead of pagenum as a parameter name.
* It does not...
deb0667321bb1e60bea2d9b21700054f66e6b445 authored about 2 years agoThis currently returns only an empty list . Pending to PR#1371
3e250c5b025bb82407ad10f3db9906f784b3f42e authored about 2 years agoApplies to merge/append
1a9f7d9434f9c68254d00f3c91a21d5fc6d1e22b authored about 2 years agoAlso fix various type annotations
17897d986a6c678ff55ecdef5ed26e59f9d9bc39 authored about 2 years ago940819ffa88223b586dc777f7d60b24a55878033 authored about 2 years ago
b6bfc17c38a2a75b1eb89dbcb997ab30210fd599 authored about 2 years ago
fdd105812e201e9f466030711a20a669b9acf4af authored about 2 years ago
89bf4c1c0750cd3b72c93953fa08dda2fa828de6 authored about 2 years ago
Closes #1451
This sets the test suite to use ubuntu-20.04 as opposed to ubuntu-latest as the ...
420a545836630585f9e03b1df855c423e77c7731 authored about 2 years agofb34775edb80c191ac0cf5bd5906d18ec8f83947 authored about 2 years ago
4b1cbde09f7b8ed02c9771c15370594ef864e02d authored about 2 years ago
6f73cb198367598c32ca71a9e6fd7eded962256e authored about 2 years ago
b14be94bd25266eaaaae5c93721efe7d89062694 authored about 2 years ago
Closes #1423
Also split PyPDF2 1.X changelog to separate page as it becomes really
long.
Fixes #1425, #1420, #482
03f82d23af1e29ab90096b0442e5add965101a4b authored about 2 years ago906c338d4f5b031b6df7b9990c8af6ad6494184c authored about 2 years ago
See #1436
Co-authored-by: pavlidvg <[email protected]>
1842c2128873ec64f8a467381458d3d03a0955ea authored about 2 years agoThe remove_from_tree method can be used to remove an outline item
Fixes #1427
df933f2401292d19f6d612475b944724a9aad15e authored about 2 years agofixes #1417
56395e9509f03ae07b5cf63a3b0ebc04372b2c02 authored about 2 years ago0b2b3ec997b81d6bb22c5c834ef5ba9e4f2330e7 authored about 2 years ago
6a61089fce4f4ac8f0d1c5278b0d0c80e588230c authored about 2 years ago
The test before was to brittle. We need to keep an open eye to the
benchmarks in future, but al...
Additionally, use time.process_time instead of time.time for test_readStringFromStream_performance
d82172374c1322ece2919d45263f7232f63d23fd authored about 2 years ago1d551af8d7bd73ccc6d6981a9b7a81026df73632 authored about 2 years ago
613b3706e072259f50299387945afcc0ada0cffb authored about 2 years ago
752d496790a6454cb7656851e8a652b3585b2eea authored about 2 years ago
It was tested like this:
## slow
$ pytest --durations=50 -m "not slow"
## external
D...
4b8d849e4d1042c139542cfbfd0c0e2ed430e831 authored about 2 years agob49034e8fdcd854e775e6c6e36699c0357956619 authored about 2 years ago
The pyproject.toml file is the new standard for project configuration in Python defined by PEP-6...
abe01995304470d7c110bf74f24e972073030426 authored about 2 years agod64b578d135f7cd72b538fe4afb917be955080ef authored over 2 years ago
Closes #1390
664ddf28e9c7037c5e95566a7f5538e85dad0df5 authored over 2 years agocc553210524a8009c91409b76f05912469cfa927 authored over 2 years ago
2e6ea6fa21cf654dadb48f90a542f874c716008f authored over 2 years ago
Bug Fixes (BUG):
- td matrix (#1373)
- Cope with cmap from #1322 (#1372)
Robustness (ROB):
- C...
d5111e1a056e17ea32f01429258ad12aa02d7450 authored over 2 years ago
50c1b527361ff9da4ea8ce6db77b67f2e2d837a4 authored over 2 years ago
7faa9b321c694fdadaba71f4fe1700fe81cf6ccd authored over 2 years ago
Fixes #1379
92c894d247e803dd5df4535ec1180d51305d3508 authored over 2 years ago9d870a2b01bcdc2816f3fde3bcb1e51d0a9da398 authored over 2 years ago
Cope with cmap where the range contains first and last code are on variable length.
Also fixe...
f3b6d0e7d723aebc72c8de638ad852f5c98e9168 authored over 2 years agod9aa64c2a41c379c4b1210e3689e4edb4f694a77 authored over 2 years ago
Add two samples using visitor-functions when calling extract_text.
The first example may be o...
a5f4f41e5675f78902f9881545e2afb9d8455112 authored over 2 years agoType annotations as well
d508c69c3dba15bc834d3d43df14415d46643ffa authored over 2 years ago
New Features (ENH):
- Addition of optional visitor-functions in extract_text() (#1252)
- Add m...
DEV: Adjusted File classes __str__ and __repr__ to easy debugging
eca1a848633871548dd700a147060dfbfa0da7c2 authored over 2 years ago26bdc6b36e4bcd8bae11f73dfdba0926abd62ae3 authored over 2 years ago
cf76824e3324d1bf7ad38710be7118f1689d272c authored over 2 years ago
Optional visitor-callback were added to extract_text().
_extract_text() calls these visitor-m...
ebb3b83c3aa6e2305b32710c609e0e7534186934 authored over 2 years ago
There is a saftey margin of a factor of 10 in both directions,
so the test should be fairly sta...
Closes #1222
2845c6d94a3fb56a13393191fc8e050ad9d6bb37 authored over 2 years ago85b3e8785830bf35fc1c21f5c2edf29636281290 authored over 2 years ago
Fixes #1358
Co-authored-by: diavral <[email protected]>
dcab241e4ab386834d6864538b99b556b04a3f7c authored over 2 years ago4ff5e0a3faf40b8eb1632c0708366591c1b41ea3 authored over 2 years ago
See https://github.com/py-pdf/PyPDF2/issues/1280#issuecomment-1251018614
7803a72c1e446b957b7919b8097f272b14ae9fdc authored over 2 years agoFixes #1343
141a765621235a395ce04ef078ab2060d359d2fa authored over 2 years ago
New Features (ENH):
- Add rotation property and transfer_rotate_to_content (#1348)
Performance...
620d2fa03c8b87e2923d9f41e2bc0da8a4321c6a authored over 2 years agoSince FloatObject is represented as a decimal, format numbers using their intrinsic precision, i...
5aeb92633769785e50470a3acecffe883112a99d authored over 2 years agoSee #1280 for the context of this change
71de6c8d2792c25e40d4d82dba1de4a788991196 authored over 2 years agoCloses #270
6f51189d880980b19fed900e43f73f91ac119239 authored over 2 years ago
Certain PDF libraries do embed images as base64 strings. This causes performance issues
in `rea...