Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Collective -
Host: opensource -
https://opencollective.com/ocrmypdf
- Code: https://github.com/jbarlow83/OCRmyPDF
Closes #275
github.com/ocrmypdf/OCRmyPDF - 58642aa98b7df984b42e7a292d8f60e5c02e6c69 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 7baaf00a38d7db757710d8627b262a6f28349bbd authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 5cc23dbf2413ec2a296d3af58f70a8ed85e7d410 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 216d60ea2c2f227a6772fe5eedbbbd8d761cb103 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 8b0496d35ed94f37d52303de39f182951342c96e authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - e44001641c49324ac7ff4888dffb14f07ba048fa authored over 6 years ago
Since pikepdf is doing the work the initial repair takes time and gives
little benefit.
It turn...
github.com/ocrmypdf/OCRmyPDF - 47885f4230dd0f0618eeb4be025c56d2fd9c1040 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 921767e82e89f859694f0d300b97a570568376ab authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 85f96b7fb0179363ad7a0ba02a9d1b9637772a5a authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 890c7fd0f659a55567e49e10dce5824450a55631 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 39c44bdd2f3438436e40d484d98e72cfd7ea96f0 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 5f99f7f6ca39a5cf9e892667138b517c89964a85 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 4f864bce983646d1e362c1ea130ebaf45f6fc77d authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 2974929b26036d0ac24956a48f0bf5996ceabc38 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - db837aa55cbb316e45ddd40299d181a21549a9a3 authored over 6 years ago
Need to use private fork of ruffus for Python 3.7. Backward compatible with Python 3.6 for ruffu...
github.com/ocrmypdf/OCRmyPDF - 72006230078727f166f595e3ffde02613a422209 authored over 6 years agohttps://github.com/eliben/pycparser/issues/251
github.com/ocrmypdf/OCRmyPDF - 73e02ae4ea12397847b34306a8230e12766a2ea7 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - d4cbef94571917beb1ade4e01843df6ccadd8511 authored over 6 years ago
Only partially optimize multipage.pdf so that it hopefully
improves speed of test suite without ...
github.com/ocrmypdf/OCRmyPDF - e725f64b6a2c656df2c170b1b500b45f4e4f2d54 authored over 6 years ago
At some point the color gets flipped, we have to flip it again,
for mono.
Incidentally this exp...
github.com/ocrmypdf/OCRmyPDF - 0029cc4fe75214f7ab4a356646286becff1a19d3 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 9637696a546d4fd115d65393762b5719388a3315 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 02b3ca6862be7a4036460d2fa71df80d04cf55a1 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - bc90f40a8fd76d3a30311e6d55a2bd188dd34a29 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 3d727ff4c03c884e55c1ad692826fc3cc725076a authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - b0eacd6586c9e8217f9b4924e7093ae976360e7d authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 779570159581e3c9509ad50b64106ddc719ca8a3 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - bf214eecb36e6914cb040e2a4897b70fa2fcafea authored over 6 years ago
These are fairly rare
github.com/ocrmypdf/OCRmyPDF - 434b96d7348c691b348ecd5af0bdf55f16533148 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - b9dc1098928e0886f54b09a59eeb7cc9322493e0 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 1f40a7055408e30628470c724ff694af3d67f7da authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - e14ffbf03f7be91319e856de66315b9475279cd2 authored over 6 years ago
This change happened sometime after the 4.0.0-beta1 release in
Ubuntu 18.04
The difference doesn't matter in 7.0.0 anymore.
github.com/ocrmypdf/OCRmyPDF - bf96171b6514c010189fd80e5baca34b8f18e141 authored over 6 years ago
This change happened sometime after the 4.0.0-beta1 release in
Ubuntu 18.04
github.com/ocrmypdf/OCRmyPDF - b81daf71d1b8d651f9158eae28b4f968adc0e440 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - faad1fc58ae85b82c60c19a9451dd1f810ef87d9 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 6f48181a56abc608a3c88ce81c88ed213660f2f1 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - f1305e5a375018c2093c633b2b1f5d023ad48d1e authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - f0e0f92776aafffc2cac7fea14995c1f198bd868 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 807c8b072638c7f8852f559ef640f88ed1b3f3d7 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 6333ec928ca679b79d3070310eb22704671a8871 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - cd220d9ed9b82d94812ecfdd00cea8c1662048f3 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 76532649b829cdbf2650fdbfe7ae9911780157f5 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - b0dbaeafc52de3f77211e0cd7f618961858e9490 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 2530d1791bc608010ba061a2e3b07a5a3523d5d9 authored over 6 years ago
We no longer need to merge pages this way. Much of the functionality
was there to implement page...
split_pages would still run if repair_pdf failed, for some reason.
Since we are no longer splitt...
Helps make it more explicit. Did not do this to tests because use of paths
is more involved there.
This helper function only had a single usage, this was always an awkward
way to support Python 3...
github.com/ocrmypdf/OCRmyPDF - 9e765ddf4644c70b8bf85433c947d76810d5c239 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 6ac9e92f17f86e4aac38d6c7e804f4103921caee authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - faaa4a1def562ce8ca83884e4cc9f9a9952d6685 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 0aa51f0f3ab3685f35b368533db747cf38dc3e26 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 73431d9761eb10be02acd06e602b8627e9c72c56 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 45cb4525cf83894919842b67c82a6cd8ba0bfe1a authored over 6 years ago
Ghostscript txtwrite seems to be quite effective at the task.
Eliminates dependency on fitz
github.com/ocrmypdf/OCRmyPDF - 8c84c515b6e5c06290b40cce708c5b57765fdb87 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 1dfbbdebf4affdf1afd1ebecc2fcadc4392dfe7a authored over 6 years ago
Note that Ghostscript always overrides Producer
github.com/ocrmypdf/OCRmyPDF - 740918daeedd46b2a6ea072afdfd6c4260a501b4 authored over 6 years ago[ci skip]
github.com/ocrmypdf/OCRmyPDF - 1d10eac764dc16db7ab34dcccf08493fb42ad6de authored over 6 years ago[ci skip]
github.com/ocrmypdf/OCRmyPDF - 3f868118cd117e95375d30824713d86ad8c86974 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 04d79b15b4551a248186ac11d1485f02148d7d88 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - a13c398c064743371ba3e1245a858a7b59d53474 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - e3b3f716ee39b12c4bb3e3daac949ecf78feaf1e authored over 6 years ago
Eliminates PyPDF2 and defusedxml as dependencies.
github.com/ocrmypdf/OCRmyPDF - cf43c06f46504d4d53ab62e7b850cd74b9a39d50 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 74a5a18607e32abd27c2081363d9e9bec3435717 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 44241c6dd531b28885e3a8fcdb24e59923b5d0ab authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 8fff496ffd0c5ff935169a490669ec078b81950a authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - edf75c519cb81f985d6a62e7439cdca6289cb43a authored over 6 years ago
Leave PDF/A check alone for now, since pikepdf has no equivalent.
github.com/ocrmypdf/OCRmyPDF - 9608b22d347019faf52ce6fe8cce961fab3d1f4d authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 8ba4968c4825ff5d4d4d138841759979f78b89db authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - ffdd78f1a56c19a76ea04cd5aba71f6351402043 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - ad9f8ca78e0ccafefe0c426b8998111e9fafd5cf authored over 6 years ago
qpdf opens files with null user password, so do the same.
github.com/ocrmypdf/OCRmyPDF - 78a686ecb446855eccaa60feb4d156ed708763ee authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 59e786eb3c4170788466ec92b8c6681c908eec3c authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 6d0461435f9908025bb87749a89aa2988625966a authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 0a04a60f69f441d303ea14283ef32e1435049b86 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 68d864298804b35f6edf0747d65e6e8dd751906a authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 16f70ff054854815422d7967e43866d151875dda authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - c00aeafff0b0fef423a1d7ae10cb5bf2adbcced8 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 83f35e00f3edaf06725e7af86ce859501053e391 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 786a2ad65a7069f6416d99c38a42870b3b9cb826 authored over 6 years ago
Removes all use of PyMuPDF in optimize
github.com/ocrmypdf/OCRmyPDF - 9425506c2aac1693962830602e17fd7cc9f3c638 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 93b858afd1df04ad1aae6de3daf157eba95e7315 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 7b0a3ec3653defd06305f68cfe317a171e99cee4 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 083d442529b40e149a0f38541ea171d8870c4829 authored over 6 years ago
Eliminates another usage of PyMuPDF in the main path.
github.com/ocrmypdf/OCRmyPDF - b52eb95cf8dd210dc8260634eee978e7e4713d48 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - f4571e25083977e9e8d114dab4a41b7b700d1321 authored over 6 years ago
(Recalculating would fail if the image is not centered.)
github.com/ocrmypdf/OCRmyPDF - b06ef03aac29588e2526f487e260425289a03762 authored over 6 years agorotation % 90 == 0 is always true.
github.com/ocrmypdf/OCRmyPDF - 1d1962a106b34c76ae4e2d921a87fdb0baad7f14 authored over 6 years agogithub.com/ocrmypdf/OCRmyPDF - 4b98e9ff08b1123139571be90a1956014768d529 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - f83ca5d8ac9c57f287cc8bd05819359b5165c02f authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 95cb4d22d7eb1877860f510f3ce823e31b2c7fc4 authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 0c279b01a4bad739234596a1cce7c35d1ef63ccc authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 3b820ffa7b26d3895634fbfc8d4704622d0b92a5 authored over 6 years ago
Prior to unsplit, if we were rebuilding the PDF we'd lose the
table of contents. With unsplit we...
github.com/ocrmypdf/OCRmyPDF - cdb737259c09b260f222bc2698fe2400b55dafdf authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 0843b5939ce2bb01353613c3f0cb86e9452e03cd authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 2b5f23a2d1f5303bb2bc8c1fe2a80cde88bca27d authored over 6 years ago
github.com/ocrmypdf/OCRmyPDF - 5e20d1d5540629bc769573db8cdcbb12a4e4ea49 authored over 6 years ago