Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Collective: https://opencollective.com/ocrmypdf
Host: opensource
Code: https://github.com/jbarlow83/OCRmyPDF
github.com/ocrmypdf/OCRmyPDF - 5de107d44cbd2b42428d1a846ab6f4b096fa7771 authored over 7 years ago by James R. Barlow <[email protected]>
Seems better to not claim the existence of several entities that don’t
exist as the older one does
github.com/ocrmypdf/OCRmyPDF - 65b89687a9715b319d384d2076776e4a06094ba2 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 048ae40e75fa8f26688bf90f81a589f829ca2ceb authored over 7 years ago by James R. Barlow <[email protected]>
“txt hocr” is not acceptable and does not produce expected output .txt
while “hocr text” works f...
github.com/ocrmypdf/OCRmyPDF - fb067dc97bf4b8b5a376e54e0202a8bb0fde39d9 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - a1fea0ce160e77f2bb03193df33d87c969d5c152 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - e1e9135e93a8d17461e6c6135955c3e66aa21980 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - aff982036bed4f0eb7575332a1ac62e9153825be authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - d087649eab0efa1f81a6c5688c9ffd8b83f9a840 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 7f3fa46a40f61f1e78071bc48b108ef4cf13b198 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - b1f79e4d9797a1d2c7ab84f491d02ae4ca2db9f8 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 115d6df94ff48e6dc4f05da5976775b2fc4f3596 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 559af9635fa3e7afbeb9db66e2eae05d09b9d6ff authored over 7 years ago by James R. Barlow <[email protected]>
Travis is too slow without it, and perhaps it’s overly paranoid to
never cache Tess4. Maybe nuke...
github.com/ocrmypdf/OCRmyPDF - 5e26bb29d974a442f0f13adc357fe2005b2014c7 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - b0e95842b89585281348761b9fc9648ea28643e8 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 08e678f21f81497ede59ed49c35c5b550ae8526b authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - c17817810f6db11a7965a2b935e288db5b0a2a95 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - ff5c38b1f78d2c7dead3357692bcd74a4d4ed7ae authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 64314c1b827380326a8b6441880cc38895cee906 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 83230097aebfc13ba856f72a5d55405b1679d6a1 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 8f91acf95693d7eb743e4ff0cd3d6b770a6cc26f authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - d211722a2ff8b7207285eae424986e6f34698466 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 56e6ed1249d300608674d87d32425ca8713cf62e authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 21982cf1cb62df9b549d146c2bad76df607b4140 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - edc01408da8c0ba1832fd3bd00bfdfb207dce210 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - aee33c87eda2b9c3ea5b50563a12281779beea15 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 0dae1602c705da4e125eb619d01d223c9fc9ed0e authored over 7 years ago by James R. Barlow <[email protected]>
Grrr.
github.com/ocrmypdf/OCRmyPDF - d926f07ac1a7df3c4cdb653af9e04d4c29a55bb5 authored over 7 years ago by James R. Barlow <[email protected]>
We’re well out of the “trivial updates” zone
github.com/ocrmypdf/OCRmyPDF - 96045e98f493c334f0f64009b803bb2e18bda9ff authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 01b7205e2c0940577b795dbb0405c30af74723a6 authored over 7 years ago by James R. Barlow <[email protected]>
Forgot to update tesseract spoofers to account for change in tesseract
parameters. Also the cha...
github.com/ocrmypdf/OCRmyPDF - 16b6442b23cc7a45a416dc9a8b094e45f49f8a6f authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 183eafa587b1952fbded889918cf75fa9b20ff1c authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 47a2997538780a52bda4d3cae8ef7d228e20df8a authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 37ebcadfa16ffaacd1c1120d1b246e056efbf34d authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 74d98216f150aa58a194b8a62dcb1f9fb5c015f5 authored over 7 years ago by James R. Barlow <[email protected]>
Let’s see if this helps the build go faster
github.com/ocrmypdf/OCRmyPDF - 4bdebf573e0f8cdf68697701fbf00c203f5a09b8 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 1606b6a383a82c2015477305bdf10d9cb4bd6d47 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 2a61902df59111a5ea4502dfbae701ddd3b178a8 authored over 7 years ago by James R. Barlow <[email protected]>
Fixes #163
github.com/ocrmypdf/OCRmyPDF - 01a1c2b57642ab8e687e3b187251f18ba5302e41 authored over 7 years ago by James R. Barlow <[email protected]>
[ci skip]
github.com/ocrmypdf/OCRmyPDF - c4f01de231d22da5cea02c25aa581a965a37640b authored over 7 years ago by Ingo Feinerer <[email protected]>
The change introduced regressions, so find another way to fix.
This reverts commit d077c0368698...
github.com/ocrmypdf/OCRmyPDF - 63a4a761dd47e9b8c86074545c6846377f1a74d6 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - d077c03686981c1601305cac2eb7b97e7f823a34 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - c97ea1f2a98f81b266dbb4a78014c1be63b4c9b3 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - fd27df2abb57ea7c7c0f0bdcbc5fb8d28803e1f7 authored over 7 years ago by James R. Barlow <[email protected]>
This file is not currently used in any tests, but could be, so replace
corrupt version with a us...
github.com/ocrmypdf/OCRmyPDF - 93e802f473184deec68a39f91986c5a836da5d59 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 1464b9087ac253ea46081ed3939de2d2f346960c authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - e8cc8fc87989ff230e70f7c6718d87f369677091 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - fae2119b1ef16e1a10bbb083def90f1e13780709 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - aa859a4139271e4f44c46894b6fcb42f5b61ddc1 authored over 7 years ago by James R. Barlow <[email protected]>
Past behavior was to continue and let ruffus puke eventually
github.com/ocrmypdf/OCRmyPDF - b9b12e28798f6fe0614859887cf297f99fa3fc18 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - cf643c9f431cf5d68df8b275c6210072cd9c53b2 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 5b1a7880a94cd85588fae55d3aa347a4b220680e authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 474b6b050031f31548a0be3623098af8b983fdfc authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 6c8c1d8173047a3ae9241d4c8b30ce39c442383c authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 6a91fa637f7bedd48a0b368a0f91c0e9356c07e5 authored over 7 years ago by James R. Barlow <[email protected]>
The 4th argument of re.sub() is maximum number of substitutions,
not flags.
Moreover, re.MUL...
github.com/ocrmypdf/OCRmyPDF - 2846fb4e310c331c502367f13011396335c803f6 authored over 7 years ago by Jakub Wilk <[email protected]>
github.com/ocrmypdf/OCRmyPDF - a1033cdc64f48a907cc136e8ec44bc267eefde7e authored over 7 years ago by James R. Barlow <[email protected]>
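The re.sub() pitfall noted in the commit message above can be illustrated with a minimal sketch. This is not the code from the commit: the sample text, the pattern, and the choice of re.IGNORECASE are assumptions made purely for illustration.

```python
import re
import warnings

# Hypothetical sample input, not taken from OCRmyPDF.
text = "Foo\nfoo\nFOO"

with warnings.catch_warnings():
    # Newer Python versions warn when count is passed positionally.
    warnings.simplefilter("ignore", DeprecationWarning)
    # Pitfall: the 4th positional parameter of re.sub() is `count`.
    # re.IGNORECASE has the integer value 2, so this call means
    # "replace at most 2 matches, case-sensitively".
    wrong = re.sub(r"foo", "bar", text, re.IGNORECASE)

# Passing the flag by keyword applies it as a flag.
right = re.sub(r"foo", "bar", text, flags=re.IGNORECASE)

print(repr(wrong))  # only the lowercase "foo" is replaced
print(repr(right))  # all three spellings are replaced
```

Passing flags by keyword makes the intent explicit and keeps the call from silently turning into a substitution limit.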
github.com/ocrmypdf/OCRmyPDF - 204336e1a5fb6f49447070e11cc57ceeb2453b6a authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 8954e6c3b9e609f78021943f84235c2c3b39fcea authored over 7 years ago by James R. Barlow <[email protected]>
all(<empty generator>) is True.
github.com/ocrmypdf/OCRmyPDF - fee22b6b0b8efc3451b9cc4e50e4d70b1fed35b1 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 2b82c31b85f422dba502fe54198306514065106d authored over 7 years ago by James R. Barlow <[email protected]>
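The remark that all(<empty generator>) is True can be shown with a short sketch. The function names and the size check below are hypothetical and are not taken from the OCRmyPDF source; they only show how an empty input passes an all() check vacuously, and one possible guard.

```python
def every_item_below(sizes, limit):
    """Naive check: vacuously True when `sizes` is empty."""
    return all(size < limit for size in sizes)


def every_item_below_nonempty(sizes, limit):
    """Guarded check: an empty input counts as False."""
    sizes = list(sizes)  # materialize so emptiness can be tested
    return bool(sizes) and all(size < limit for size in sizes)


empty_naive = every_item_below([], 10)
empty_guarded = every_item_below_nonempty([], 10)
full_guarded = every_item_below_nonempty([1, 2, 3], 10)
print(empty_naive, empty_guarded, full_guarded)
```

Whether an empty input should count as passing depends on the caller; the guard is only one way to make that decision explicit.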
github.com/ocrmypdf/OCRmyPDF - 9a4813089c3c80b75a019d0a68f848c5e9fd9f23 authored over 7 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 554fcc8b9dd228ebc791f2d2749fb4a17770c030 authored over 7 years ago by James R. Barlow <[email protected]>
* fixed skip-big when there are no images in pdf
* added only_text pdf
* updated only_text...
github.com/ocrmypdf/OCRmyPDF - 345256ee99c5ac6b4de552437c4960782cf96d47 authored over 7 years ago by Tom <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 58d1042147ec7a430caa530d36e841e0e5966612 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 7b7e3a3e03d4052982c69b69caed32fb16315244 authored almost 8 years ago by James R. Barlow <[email protected]>
If tess4 renderer needed to skip OCR on a page it would end up
duplicating the page contents ont...
github.com/ocrmypdf/OCRmyPDF - 6e907856f233cece47dfecb60a3559fda0116462 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 8bc601917252d6f7af431f36ab958c01f882b3da authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 059f79242e174cb5d4b994855708a09ce1b82298 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 89599b4812b71b5985049d2f5e580efc6930dafe authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - a9f4047a978d7fcc8064ca824948c1939fcc5cd4 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 23227ae763e20120b5921b6fed4eb040d7b081cb authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 4a9e9e9db2117688e3371e2cd92339da40228dda authored almost 8 years ago by James R. Barlow <[email protected]>
Ghostscript 9.21 does not seem to accept Unicode above U+FFFF. Previous
versions did, but it now...
Avoid inserting docinfo keys that would be translated to null strings,
to avoid running afoul of...
At recommendation of Artifex people, don’t use the filename pdfa_def.ps
because if given without...
github.com/ocrmypdf/OCRmyPDF - 2954e72652dd7d6240d0246263b57060dbde33f3 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 199de96cff214afe3b972027c49f1f03f32b8c29 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 8ddbe8151311243a6060728a90319a960ebef30b authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - a3e26e049819a43d77ebb4b4b81d0ebfe3e8379e authored almost 8 years ago by James R. Barlow <[email protected]>
Not needed since reportlab 3.4 comes with a wheel, and that was the main difficulty.
github.com/ocrmypdf/OCRmyPDF - 4ad129d8d8a39a6c50022100e8e3b91652c641b2 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - dfb9fa0736159a7dac5fd32b160f28c2d2a60848 authored almost 8 years ago by James R. Barlow <[email protected]>
[ci skip]
github.com/ocrmypdf/OCRmyPDF - eb036898e9e3710700607b9f5fcf87c70588f87e authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 7c6aa76a2a64444d48a28d69a49f2763d7e588c7 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - f035cb10886a26394f10c11efe1a4b9a9bcdc123 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 35162166c5de5bf5430975e5ee38b6274efd48c7 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 107f6abcb19e96ec5c10918ee9038e1294285971 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 760a939e7d809f3bc357d65c0f430da81946b464 authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 72660d0deca8cd69fb3ebcc85df2722be98c47ce authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 8444a8f211d57797e528a76d8966800be1cfd0fa authored almost 8 years ago by James R. Barlow <[email protected]>
Squashed commits:
[3f06c1e] Try setting up homebrew tap autobuilding
[01532f1] Strict mode error...
Unfortunately because of this issue
https://github.com/docker/hub-feedback/issues/292
Docker Hu...
github.com/ocrmypdf/OCRmyPDF - 42547f601756393e5538225e28fc1c8b5ea59585 authored almost 8 years ago by James R. Barlow <[email protected]>
This reverts commit 3d3b3abc1bccf05eefb656309543312c19e3fb47.
github.com/ocrmypdf/OCRmyPDF - 0ccf564f03395b58304ddca62f197b3ba09c4f91 authored almost 8 years ago by James R. Barlow <[email protected]>
Unfortunately because of this issue
https://github.com/docker/hub-feedback/issues/292
Docker Hu...
github.com/ocrmypdf/OCRmyPDF - 65c9a07ddedb021b317383cfc1d8dc156869c92b authored almost 8 years ago by James R. Barlow <[email protected]>
github.com/ocrmypdf/OCRmyPDF - 4700a193225b00da909b3abad672d3923adb6e6c authored almost 8 years ago by James R. Barlow <[email protected]>