Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
https://github.com/py-pdf/pypdf
Removed duplicate `Destination` class
I was experimenting with `PdfFileMerger` and when I tried to
write the data to a file the follow...
Merge remote-tracking branch 'upstream/master'
30f0a51a1c211c20312b1842a8f8b80957dd5f4e authored over 11 years ago by amorris <[email protected]>
30f0a51a1c211c20312b1842a8f8b80957dd5f4e authored over 11 years ago by amorris <[email protected]>
Corrected version number.
e304aa0a6ad7079f189cf32436f8690b7612c60a authored over 11 years ago by Matthew Stamy <[email protected]>
e304aa0a6ad7079f189cf32436f8690b7612c60a authored over 11 years ago by Matthew Stamy <[email protected]>
Merge branch 'master' of https://github.com/knowah/PyPDF2
e07b0573570b7487cffdff376b8c73de6c5bb6b5 authored over 11 years ago by mstamy2 <[email protected]>
e07b0573570b7487cffdff376b8c73de6c5bb6b5 authored over 11 years ago by mstamy2 <[email protected]>
Defined 'PyPDF2.__version__'. 'PyPDF2/__init__.py' and 'setup.py' were modified to handle the new file 'PyPDF2/_version.py'. The official version number has been set to '1.9.0'
93836758d056bffd9e8942f1aae6a0b46f43a01d authored over 11 years ago by mstamy2 <[email protected]>
93836758d056bffd9e8942f1aae6a0b46f43a01d authored over 11 years ago by mstamy2 <[email protected]>
improved error handling for PDF files with truncated streams
cc953ba9bbf8836ae10539b6a214aa2bf391d088 authored over 11 years ago by cecilkorik <[email protected]>
cc953ba9bbf8836ae10539b6a214aa2bf391d088 authored over 11 years ago by cecilkorik <[email protected]>
Implementation of LZWDecode filter
c3906e2d279ea514e6e868a6919691896c7560f3 authored over 11 years ago by Michal Horejsek <[email protected]>
c3906e2d279ea514e6e868a6919691896c7560f3 authored over 11 years ago by Michal Horejsek <[email protected]>
Don't assume that strings starting with a BOM are valid UTF-16.
Move the logic to catch invalid encoding outside the attempt
at UTF-16 decoding to catch byte ar...
Correcting syntax error from last commit upon notification by github user rippedathlete.
2857d4825ce217d74bb1a2c893940bc3083dda9b authored almost 12 years ago by Noah Kessler <[email protected]>
2857d4825ce217d74bb1a2c893940bc3083dda9b authored almost 12 years ago by Noah Kessler <[email protected]>
In reading a DictionaryObject, a \r character was assumed to be followed by a \n character, and the latter was read over if a \r was found. However, this is not always the case. The character after the \r is now checked of it is actually \n, and if not, then the stream is seeked backwards one byte to compensate for the erroneous read.
0840c6af64b8f06d605641c906e67f0d81b215b8 authored almost 12 years ago by Noah Kessler <[email protected]>
0840c6af64b8f06d605641c906e67f0d81b215b8 authored almost 12 years ago by Noah Kessler <[email protected]>
Added support for constructing PdfFileReader object with string of PDF file path. Previously a file/stream instance of the PDF was required; now one is created within the constructor to provide this functionality.
ced2890104dd4633132fe981b7f6bdd67aa17ed8 authored almost 12 years ago by Noah Kessler <[email protected]>
ced2890104dd4633132fe981b7f6bdd67aa17ed8 authored almost 12 years ago by Noah Kessler <[email protected]>
Bugfix: _alg34 and _alg35 are expecting ByteStringObject, not str as id.._entry argument.
efc8e14c6344bc5617f9a2ad1ffb50967333601a authored almost 12 years ago by Martijn The <[email protected]>
efc8e14c6344bc5617f9a2ad1ffb50967333601a authored almost 12 years ago by Martijn The <[email protected]>
For page merging, added additional argument to _mergePage so that it can access the CTM (transformation matrix); also added 'expand' argument to allow the new page to adopt the dimensons of the merged (and possibly transformed) page; defaults to false and is currently not used by any function, but will be once other methods are adapted to take the 'expand' parameter.
65d57ddfee5775ff5036038cb306488cf97cb3d8 authored almost 12 years ago by Noah Kessler <[email protected]>
65d57ddfee5775ff5036038cb306488cf97cb3d8 authored almost 12 years ago by Noah Kessler <[email protected]>
Added support for converting FloatObject to a float and NumberObject to an int; method is called as_numeric() in both classes.
c857f9d9df134807b68fc4ff89b7a04c8f83b14f authored almost 12 years ago by Noah Kessler <[email protected]>
c857f9d9df134807b68fc4ff89b7a04c8f83b14f authored almost 12 years ago by Noah Kessler <[email protected]>
Added support for getNumPages() for encrypted PDF's; this uses the /Count of the Page Tree object of the Root catalog. For non-encrypted PDF's, the implementation is unchanged.
2fc218927b9d3950106807d5aeba72eaf953f83a authored almost 12 years ago by Noah Kessler <[email protected]>
2fc218927b9d3950106807d5aeba72eaf953f83a authored almost 12 years ago by Noah Kessler <[email protected]>
A PDF was unable to be read due to a non-standard Xref table. Evidently, some programs will create an xref table with 21-byte entries due to the use of CR+LF endlines instead of single-character ones. As per the standard, PyPDF2 reads Xref entries in 20-byte chunks, regardless of endlines. Thus, each line (after the first) of the 21-byte Xref table contained the LF character from the end of the previous line. To remedy this, if a line starts with a CR or LF character, the 20-byte line is re-read, starting one byte further than before.
783f3be43dcad979574d3ff4b424fc294573e559 authored almost 12 years ago by Noah Kessler <[email protected]>
783f3be43dcad979574d3ff4b424fc294573e559 authored almost 12 years ago by Noah Kessler <[email protected]>
Changed README to use open() instead of file() [for Python 3 compatibility]
9f4c58006c21c2b657b678a25f6016d2e37a2d1f authored almost 12 years ago by Noah Kessler <[email protected]>
9f4c58006c21c2b657b678a25f6016d2e37a2d1f authored almost 12 years ago by Noah Kessler <[email protected]>
Added warning to PdfFileReader constructor; if it is passed a stream object that is not in binary mode, the user is warned of this and of the fact that the stream may not be read correctly.
ac2d80e036836d5820c1149d87ad56cedb84711a authored about 12 years ago by Noah Kessler <[email protected]>
ac2d80e036836d5820c1149d87ad56cedb84711a authored about 12 years ago by Noah Kessler <[email protected]>
Merge branch 'master' of https://github.com/knowah/PyPDF2
c513b0ac3a9f94cebad2e67a1041734387689708 authored about 12 years ago by Noah Kessler <[email protected]>
c513b0ac3a9f94cebad2e67a1041734387689708 authored about 12 years ago by Noah Kessler <[email protected]>
Fix typo in pdf.py
48a76962559139b4c260acb127b8ab7e726f2ab0 authored about 12 years ago by Fabian Henze <[email protected]>
48a76962559139b4c260acb127b8ab7e726f2ab0 authored about 12 years ago by Fabian Henze <[email protected]>
Fixed example code.
c95de72aa51123e697dbd9ed0b96ad28b3f91ca9 authored about 12 years ago by Jérémy Bethmont <[email protected]>
c95de72aa51123e697dbd9ed0b96ad28b3f91ca9 authored about 12 years ago by Jérémy Bethmont <[email protected]>
Merge pull request #2 from flyser/patch-1
Fix typo in pdf.py (flyser)
1caec56be18c5a2a76af07f375b0ddf552967e8e authored about 12 years ago by knowah <[email protected]>
Merge pull request #3 from jerem/patch-1
Fixed example code. (jerem)
03b670d17a46a0cff25b79b9943d18cc79787390 authored about 12 years ago by knowah <[email protected]>
Modified merger.py so that files can be closed after they are appended/merged to a PdfFileMerger object and before the merged PDF is written out. This is accomplished by creating a StringIO stream of the content of the file and using that as the source for the merge. The same is done with PdfFileReader objects that are to be merged/appended; the stream is wrapped into a separate StringIO object.
9cd94b64498261175615d1270156aa1abf25e5c9 authored about 12 years ago by Noah Kessler <[email protected]>
9cd94b64498261175615d1270156aa1abf25e5c9 authored about 12 years ago by Noah Kessler <[email protected]>
Fixed example code.
ca67f2add0d2f7fcb68244a66eb13202f122c45c authored about 12 years ago by Jérémy Bethmont <[email protected]>
ca67f2add0d2f7fcb68244a66eb13202f122c45c authored about 12 years ago by Jérémy Bethmont <[email protected]>
Fix for FloatObject error with Decimal context
fe8518097785a79fca2baf4a053f10d88805e91d authored about 12 years ago by amorris <[email protected]>
fe8518097785a79fca2baf4a053f10d88805e91d authored about 12 years ago by amorris <[email protected]>
Added classifiers in setup file for Python 2 and 3 support
1d08ee5d482e4675a4fceea57e8f73b51dfdd883 authored over 12 years ago by Noah Kessler <[email protected]>
1d08ee5d482e4675a4fceea57e8f73b51dfdd883 authored over 12 years ago by Noah Kessler <[email protected]>
Updated version info (pyPdf 2.0 ==> PyPDF2 1.15) for listing on PyPI
841e864381ed17ef12b4ae540312f2126b093151 authored over 12 years ago by Noah Kessler <[email protected]>
841e864381ed17ef12b4ae540312f2126b093151 authored over 12 years ago by Noah Kessler <[email protected]>
Merge pull request #1 from kushal-kumaran/master
Initial python 3 support
ad44feb082345032389694c87556727c7cb41121 authored over 12 years ago by knowah <[email protected]>
Fix typo in pdf.py
ca23e87544156af507fb5efdee289d67a312a8bf authored over 12 years ago by Fabian Henze <[email protected]>
ca23e87544156af507fb5efdee289d67a312a8bf authored over 12 years ago by Fabian Henze <[email protected]>
Compare first 16 bytes of U and real_U when security handler revision is 3 or greater only.
88cb80f2ed431f6b0ee01a939dafd2e01b3886b3 authored over 12 years ago by Kushal Kumaran <[email protected]>
88cb80f2ed431f6b0ee01a939dafd2e01b3886b3 authored over 12 years ago by Kushal Kumaran <[email protected]>
made additional changes where text needs encoding to bytes before writing to output stream.
757bb48073b8b73606b59159843f0152d22f5e5f authored over 12 years ago by Kushal Kumaran <[email protected]>
757bb48073b8b73606b59159843f0152d22f5e5f authored over 12 years ago by Kushal Kumaran <[email protected]>
further changes to make pdf writing work in python3
869c626074017195d506c977c52840feed874caf authored over 12 years ago by Kushal Kumaran <[email protected]>
869c626074017195d506c977c52840feed874caf authored over 12 years ago by Kushal Kumaran <[email protected]>
added support for building python3 package
tested pdf file reading with password-encrypted files
bcafcff11374e11abaaf6608eb5715bc9a7dff59 authored over 12 years ago by Kushal Kumaran <[email protected]>
Renamed customwarning to _formatwarning
1b5a208f95644636c1d9da3e0f4c3bdde6c693c2 authored over 12 years ago by Noah Kessler <[email protected]>
1b5a208f95644636c1d9da3e0f4c3bdde6c693c2 authored over 12 years ago by Noah Kessler <[email protected]>
Renamed zeroXref since it's an internal method; added *actual* support for warnings redirection by redefining showwarning method.
2ab68320d7b1137c40e9bc4d3168ba1b46e44e2d authored over 12 years ago by Noah Kessler <[email protected]>
2ab68320d7b1137c40e9bc4d3168ba1b46e44e2d authored over 12 years ago by Noah Kessler <[email protected]>
Moved strerr redirection to be first call in PdfFileReader constructor
9adc508201d3ce9fc51cb786c4f4c56221fd71c1 authored over 12 years ago by Noah Kessler <[email protected]>
9adc508201d3ce9fc51cb786c4f4c56221fd71c1 authored over 12 years ago by Noah Kessler <[email protected]>
Made spacing between methods uniform
b2458da5f406def6846bb648a9b7a6090005a554 authored over 12 years ago by Noah Kessler <[email protected]>
b2458da5f406def6846bb648a9b7a6090005a554 authored over 12 years ago by Noah Kessler <[email protected]>
Modified setup.py to reflect renaming/Phaseit takeover
f0589c781b44e32631dcaa8d12d89d8bc4019e50 authored over 12 years ago by Noah Kessler <[email protected]>
f0589c781b44e32631dcaa8d12d89d8bc4019e50 authored over 12 years ago by Noah Kessler <[email protected]>
Allows for redirection of stderr. This has been implemented to assist increase the functionality of the online validator.
17784214c98ec6ccf3fe5f3e5f6b16284465a209 authored over 12 years ago by Noah Kessler <[email protected]>
17784214c98ec6ccf3fe5f3e5f6b16284465a209 authored over 12 years ago by Noah Kessler <[email protected]>
Added support for new strict/relaxed version of PdfFileReader; defaults to strict.
1e352334c7ce5fa509ce90a790814b5e589a9a8e authored almost 13 years ago by Noah Kessler <[email protected]>
1e352334c7ce5fa509ce90a790814b5e589a9a8e authored almost 13 years ago by Noah Kessler <[email protected]>
Changed default PdfFileReader behavior to strict.
693f759ea03e73c7ec87a7752d24fcd0e8bb2e45 authored almost 13 years ago by Noah Kessler <[email protected]>
693f759ea03e73c7ec87a7752d24fcd0e8bb2e45 authored almost 13 years ago by Noah Kessler <[email protected]>
Removed unnecessary and confusing comment regarding previous commit's changes.
5c689b65363fac2cc0e4be4eb704b483544fb644 authored almost 13 years ago by Noah Kessler <[email protected]>
5c689b65363fac2cc0e4be4eb704b483544fb644 authored almost 13 years ago by Noah Kessler <[email protected]>
Cross reference tables that should be zero-indexed are corrected on-the-fly in relaxed mode allowing some previously-unreadable files to be read. If strict, error is thrown and user is warned.
9ef49845180496c16c0714adb6ec9ed3b0e517fb authored almost 13 years ago by Noah Kessler <[email protected]>
9ef49845180496c16c0714adb6ec9ed3b0e517fb authored almost 13 years ago by Noah Kessler <[email protected]>
Merge branch 'master' of https://github.com/knowah/PyPDF2
7204bf2e5d4fa9398056736a3ef6fc150ae6c47d authored almost 13 years ago by Noah Kessler <[email protected]>
7204bf2e5d4fa9398056736a3ef6fc150ae6c47d authored almost 13 years ago by Noah Kessler <[email protected]>
Attempting to remove index.html
eb9c02bcb718cdec3aaed699e47d624650e9fd4f authored almost 13 years ago by Noah Kessler <[email protected]>
eb9c02bcb718cdec3aaed699e47d624650e9fd4f authored almost 13 years ago by Noah Kessler <[email protected]>
Added strict parameter to pydoc for PdfFileReader class. Also made a nondefined object a fatal error in strict mode.
a4769677b40f9ecbb81bbd56aafa1f71410c6606 authored almost 13 years ago by Noah Kessler <[email protected]>
a4769677b40f9ecbb81bbd56aafa1f71410c6606 authored almost 13 years ago by Noah Kessler <[email protected]>
First real content.
e90446607bc80e6d6df8b753c37398a88db75167 authored almost 13 years ago by Cameron Laird <claird@claird-Qosmio-G15R.(none)>
e90446607bc80e6d6df8b753c37398a88db75167 authored almost 13 years ago by Cameron Laird <claird@claird-Qosmio-G15R.(none)>
Merge branch 'master' of ssh://github.com/knowah/PyPDF2
d7d9bb6fa741bf0d7f7667c26074a6428ac148d3 authored almost 13 years ago by Cameron Laird <claird@claird-Qosmio-G15R.(none)>
d7d9bb6fa741bf0d7f7667c26074a6428ac148d3 authored almost 13 years ago by Cameron Laird <claird@claird-Qosmio-G15R.(none)>
First experiment with Web page.
d3f624e539bd3aa236c27e6bac1e9b32311a3eb0 authored almost 13 years ago by Cameron Laird <claird@claird-Qosmio-G15R.(none)>
d3f624e539bd3aa236c27e6bac1e9b32311a3eb0 authored almost 13 years ago by Cameron Laird <claird@claird-Qosmio-G15R.(none)>
Added byte-location of errors in PDF when raising PdfReadError
a443a62d2ba9a689fef882c315cee5e4dbb475f9 authored almost 13 years ago by Noah Kessler <[email protected]>
a443a62d2ba9a689fef882c315cee5e4dbb475f9 authored almost 13 years ago by Noah Kessler <[email protected]>
Began implementing warnings.warn; includes line at top of file to use utils.customformat for warn output
f7f699e3ad2caa59a9dbf23eeaf45a0854799368 authored almost 13 years ago by Noah Kessler <[email protected]>
f7f699e3ad2caa59a9dbf23eeaf45a0854799368 authored almost 13 years ago by Noah Kessler <[email protected]>
Added modified method for formatting warnings.warn output to be implemented for all non-fatal errors in the future; also added a method to remove 'L' suffix from built-in hex() method.
032f2554e00aae9a4fab1b298503808348a9e209 authored almost 13 years ago by Noah Kessler <[email protected]>
032f2554e00aae9a4fab1b298503808348a9e209 authored almost 13 years ago by Noah Kessler <[email protected]>
Added strict parameter to PdfFileReader to allow for relaxed (default) or strict reading. Also modified the readObjectHeader to not warn about extra whitespace unless strict is specified. Some TODO statements about future strict implementation.
5ab7eed1f6d7537be23ac2c8afe1dbe707d649ba authored almost 13 years ago by Noah Kessler <[email protected]>
5ab7eed1f6d7537be23ac2c8afe1dbe707d649ba authored almost 13 years ago by Noah Kessler <[email protected]>
User now warned if an object referenced by another is not in the xref table (most likely because it does not exist)
79d5b647c32ec4ad97ae6d5fd69baacbd7c495c4 authored almost 13 years ago by Noah Kessler <[email protected]>
79d5b647c32ec4ad97ae6d5fd69baacbd7c495c4 authored almost 13 years ago by Noah Kessler <[email protected]>
Verbose error message upon reading an object that has an ID different from what would be expected from the XRef table.
ad55f0e10485ded4689323e517c5e8b1778e0dd3 authored almost 13 years ago by Noah Kessler <[email protected]>
ad55f0e10485ded4689323e517c5e8b1778e0dd3 authored almost 13 years ago by Noah Kessler <[email protected]>
Reverted to calling PdfFileWriter.getReference() since it has been re-implemented in pdf.py
7a25ab690656ea594e1c4dee47a7cf03e366cc94 authored almost 13 years ago by Noah Kessler <[email protected]>
7a25ab690656ea594e1c4dee47a7cf03e366cc94 authored almost 13 years ago by Noah Kessler <[email protected]>
Added many methods (destination/bookmark, etc.) back to PdfFileWriter; also made a workaround to read object headers with superfluous whitespace (readObjectHeader); various debugging statements added
73437b11069580e77714605a01dfe0f64007b75f authored almost 13 years ago by Noah Kessler <[email protected]>
73437b11069580e77714605a01dfe0f64007b75f authored almost 13 years ago by Noah Kessler <[email protected]>
Added new method to skip over whitespace in stream and return whether whitespace was read
2867d77970118b595e03669aecf0041036125497 authored almost 13 years ago by Noah Kessler <[email protected]>
2867d77970118b595e03669aecf0041036125497 authored almost 13 years ago by Noah Kessler <[email protected]>
Added TreeObject, Destination, and Bookmark classes from the NIWR local copy of PyPDF.
496461e91ed9ae2197ebdfff49b6c52a60cb7394 authored almost 13 years ago by Noah Kessler <[email protected]>
496461e91ed9ae2197ebdfff49b6c52a60cb7394 authored almost 13 years ago by Noah Kessler <[email protected]>
Added a workaround to avoid reference to obsolete method getReference() in PdfFileWriter
80ff51ae6786359d27f0e8ca6f9403b4627c49dd authored almost 13 years ago by Noah Kessler <[email protected]>
80ff51ae6786359d27f0e8ca6f9403b4627c49dd authored almost 13 years ago by Noah Kessler <[email protected]>
pdf.py updated to that of 1.13 AND further modified to support reading PDFs that use the CR+LF convention
428c6a90a3718cfc2be65df8bdf5f84e5b6a137e authored almost 13 years ago by Noah Kessler <[email protected]>
428c6a90a3718cfc2be65df8bdf5f84e5b6a137e authored almost 13 years ago by Noah Kessler <[email protected]>
Updated generic.py from the 1.12 to the 1.13 version
50426a4ac5957769056f00237a6b12000f27518b authored almost 13 years ago by Noah Kessler <[email protected]>
50426a4ac5957769056f00237a6b12000f27518b authored almost 13 years ago by Noah Kessler <[email protected]>
Updated CHANGELOG to that of PyPDF 1.13
dac8e7950f0905534a10057aa8e5b264a5ddea77 authored almost 13 years ago by Noah Kessler <[email protected]>
dac8e7950f0905534a10057aa8e5b264a5ddea77 authored almost 13 years ago by Noah Kessler <[email protected]>
First commit
Original PyPDF code. Updates should be coming from Noah soon.
c59a212a4cda512c184a987312970b464404d17b authored about 13 years ago by Adam Coleman <[email protected]>