Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Collective -
Host: opensource -
https://opencollective.com/ocrmypdf
- Code: https://github.com/jbarlow83/OCRmyPDF
Feature Request: Provide for downloading of language models
github.com/ocrmypdf/OCRmyPDF - simsong opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - simsong opened this issue over 1 year ago
Feature Request: Provide for usage with cloud-based OCR engines
github.com/ocrmypdf/OCRmyPDF - simsong opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - simsong opened this issue over 1 year ago
How to handle already ocred files efficiently?
github.com/ocrmypdf/OCRmyPDF - drnicolas opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - drnicolas opened this issue over 1 year ago
[HELP] Inconsistent Reading order
github.com/ocrmypdf/OCRmyPDF - emtee14 opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - emtee14 opened this issue over 1 year ago
Snap package shouldn't ship all of the Tesseract OCR language files
github.com/ocrmypdf/OCRmyPDF - brlin-tw opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - brlin-tw opened this issue over 1 year ago
Fix snap package building (#1082)
github.com/ocrmypdf/OCRmyPDF - brlin-tw opened this pull request over 1 year ago
github.com/ocrmypdf/OCRmyPDF - brlin-tw opened this pull request over 1 year ago
[BUG] #addopts = pytest -n "auto" no option?
github.com/ocrmypdf/OCRmyPDF - shaynababe opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - shaynababe opened this issue over 1 year ago
Only generate text files without generating PDF files
github.com/ocrmypdf/OCRmyPDF - rodrigomorales1 opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - rodrigomorales1 opened this issue over 1 year ago
Use Github Releases for notifications
github.com/ocrmypdf/OCRmyPDF - fabiante opened this issue over 1 year ago
github.com/ocrmypdf/OCRmyPDF - fabiante opened this issue over 1 year ago
ocrmypdf generating white patch in output pdf?
github.com/ocrmypdf/OCRmyPDF - gogineniravikumar opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - gogineniravikumar opened this issue almost 2 years ago
Improve PDF rasterisation safety
github.com/ocrmypdf/OCRmyPDF - sihil opened this pull request almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - sihil opened this pull request almost 2 years ago
[BUG] Snap Package not Working
github.com/ocrmypdf/OCRmyPDF - lhhel9l3 opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - lhhel9l3 opened this issue almost 2 years ago
Correct way to deskew PDF already processed by OCRmyPDF?
github.com/ocrmypdf/OCRmyPDF - pimlottc opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - pimlottc opened this issue almost 2 years ago
PDFs not created with fast web view
github.com/ocrmypdf/OCRmyPDF - dklinger opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - dklinger opened this issue almost 2 years ago
[BUG] Pathological output: PDF expands to 50x size after half an hour of processing
github.com/ocrmypdf/OCRmyPDF - gwern opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - gwern opened this issue almost 2 years ago
[BUG] pikepdf warning about missing decoders
github.com/ocrmypdf/OCRmyPDF - ajweber opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - ajweber opened this issue almost 2 years ago
JBIG2 not legally secure in many countries
github.com/ocrmypdf/OCRmyPDF - dklinger opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - dklinger opened this issue almost 2 years ago
[BUG] PIL.Image.DecompressionBombError
github.com/ocrmypdf/OCRmyPDF - JohnLockeG opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - JohnLockeG opened this issue almost 2 years ago
[BUG] crashes with `TypeError: 'NoneType' object is not subscriptable`
github.com/ocrmypdf/OCRmyPDF - frrad opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - frrad opened this issue almost 2 years ago
[BUG] cannot ocr the numbers on left side of page
github.com/ocrmypdf/OCRmyPDF - sushmitxo opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - sushmitxo opened this issue almost 2 years ago
Optimize images with SMask
github.com/ocrmypdf/OCRmyPDF - benbro opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - benbro opened this issue almost 2 years ago
Use paddleocr instead of tesseract
github.com/ocrmypdf/OCRmyPDF - aymenmtibaa opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - aymenmtibaa opened this issue almost 2 years ago
Feature Request: GPU OCR pipeline e.g. via EasyOCR
github.com/ocrmypdf/OCRmyPDF - systemofapwne opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - systemofapwne opened this issue almost 2 years ago
[BUG] Wrong optimize ratio and savings
github.com/ocrmypdf/OCRmyPDF - homocomputeris opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - homocomputeris opened this issue almost 2 years ago
[BUG] Possible to force OCR without losing vector data?
github.com/ocrmypdf/OCRmyPDF - moksamedia opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - moksamedia opened this issue almost 2 years ago
Avoid deleting /dev/null when run as root
github.com/ocrmypdf/OCRmyPDF - jbarlow83 opened this pull request almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - jbarlow83 opened this pull request almost 2 years ago
[BUG] /dev/null gets deleted when run as root (inside a Docker container)
github.com/ocrmypdf/OCRmyPDF - andymwood opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - andymwood opened this issue almost 2 years ago
handle case when candidate is None
github.com/ocrmypdf/OCRmyPDF - frrad opened this pull request almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - frrad opened this pull request almost 2 years ago
[QUESTION] Render hocr with python
github.com/ocrmypdf/OCRmyPDF - jcuenod opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - jcuenod opened this issue almost 2 years ago
tesseract-osd is also required on fedora
github.com/ocrmypdf/OCRmyPDF - white-gecko opened this pull request almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - white-gecko opened this pull request almost 2 years ago
added setting RETRIES_LOADING_FILE to watcher.py
github.com/ocrmypdf/OCRmyPDF - comzine opened this pull request almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - comzine opened this pull request almost 2 years ago
[BUG] tesseract returns SIGFPE Signal
github.com/ocrmypdf/OCRmyPDF - C0D3D3V opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - C0D3D3V opened this issue almost 2 years ago
Error processing shell script on file
github.com/ocrmypdf/OCRmyPDF - danilichti opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - danilichti opened this issue almost 2 years ago
Allow title, subject, author, and keywords to be unset with an empty string argument
github.com/ocrmypdf/OCRmyPDF - f-hansen opened this pull request almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - f-hansen opened this pull request almost 2 years ago
substitute broken link (#1057)
github.com/ocrmypdf/OCRmyPDF - LucasLarson opened this pull request almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - LucasLarson opened this pull request almost 2 years ago
[BUG] docs: links to brewformulas.org no longer work
github.com/ocrmypdf/OCRmyPDF - LucasLarson opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - LucasLarson opened this issue almost 2 years ago
output JSON format
github.com/ocrmypdf/OCRmyPDF - emresaracoglu opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - emresaracoglu opened this issue almost 2 years ago
Is it possible to add paddleocr as an option for ocr?
github.com/ocrmypdf/OCRmyPDF - nissansz opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - nissansz opened this issue almost 2 years ago
[BUG] ValueError: invalid arguments: (pikepdf._qpdf._ObjectList([]),)
github.com/ocrmypdf/OCRmyPDF - dli7319 opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - dli7319 opened this issue almost 2 years ago
[BUG] crash when trying to process a pdf
github.com/ocrmypdf/OCRmyPDF - frrad opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - frrad opened this issue almost 2 years ago
Feature request: Ask user what likely-incorrect words are
github.com/ocrmypdf/OCRmyPDF - mattention opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - mattention opened this issue almost 2 years ago
Is it possible to capture Tesseract messages and suggestions either as exceptions or exit codes?
github.com/ocrmypdf/OCRmyPDF - sergeyyurkov1 opened this issue almost 2 years ago
github.com/ocrmypdf/OCRmyPDF - sergeyyurkov1 opened this issue almost 2 years ago
[BUG] `--deskew` not compatible with blank pages or with tesseract_timeout = 0
github.com/ocrmypdf/OCRmyPDF - deexpabada opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - deexpabada opened this issue about 2 years ago
Fixed the source installation instructions
github.com/ocrmypdf/OCRmyPDF - yasoob opened this pull request about 2 years ago
github.com/ocrmypdf/OCRmyPDF - yasoob opened this pull request about 2 years ago
Fix tesseract documentation url
github.com/ocrmypdf/OCRmyPDF - CGarces opened this pull request about 2 years ago
github.com/ocrmypdf/OCRmyPDF - CGarces opened this pull request about 2 years ago
Memory leak ocrmypdf.ocr vs subprocess.run
github.com/ocrmypdf/OCRmyPDF - CGarces opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - CGarces opened this issue about 2 years ago
log completion message
github.com/ocrmypdf/OCRmyPDF - drinckes opened this pull request about 2 years ago
github.com/ocrmypdf/OCRmyPDF - drinckes opened this pull request about 2 years ago
Way to test PDF to see if there is any text?
github.com/ocrmypdf/OCRmyPDF - spedinfargo opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - spedinfargo opened this issue about 2 years ago
OCR for Comic Book PDFs -- Possible Solution
github.com/ocrmypdf/OCRmyPDF - yosamsimiti opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - yosamsimiti opened this issue about 2 years ago
Ignore Digital Signed Documents
github.com/ocrmypdf/OCRmyPDF - flaviobrunopereira opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - flaviobrunopereira opened this issue about 2 years ago
Fixed interchanged words
github.com/ocrmypdf/OCRmyPDF - yasoob opened this pull request about 2 years ago
github.com/ocrmypdf/OCRmyPDF - yasoob opened this pull request about 2 years ago
Draw/Blanking on wrong spot
github.com/ocrmypdf/OCRmyPDF - emre1e opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - emre1e opened this issue about 2 years ago
read_params_file: Can't open pdf/txt -- new issue -- help!
github.com/ocrmypdf/OCRmyPDF - yosamsimiti opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - yosamsimiti opened this issue about 2 years ago
--redo-ocr does all the work but doesn't save the new OCR text layer in the output pdf, leaving the old OCR text
github.com/ocrmypdf/OCRmyPDF - Shoresh613 opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - Shoresh613 opened this issue about 2 years ago
Garbled order of OCR'ed contents
github.com/ocrmypdf/OCRmyPDF - rkevk opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - rkevk opened this issue about 2 years ago
ocrmypdf cannot convert pages with watermarks.
github.com/ocrmypdf/OCRmyPDF - marlarius opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - marlarius opened this issue about 2 years ago
Pyinstaller OCRmyPDF pikepdf packagenotfound error and failed to determine version
github.com/ocrmypdf/OCRmyPDF - DerDoktorFaust opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - DerDoktorFaust opened this issue about 2 years ago
Remove blank page without recognizable characters of the ocr
github.com/ocrmypdf/OCRmyPDF - gitmors opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - gitmors opened this issue about 2 years ago
Question: multiple import folders possible?
github.com/ocrmypdf/OCRmyPDF - Maximus48p opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - Maximus48p opened this issue about 2 years ago
Ghostscript error + OCRmyPDF puts a space after every letter in the output.pdf file
github.com/ocrmypdf/OCRmyPDF - moritz1000 opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - moritz1000 opened this issue about 2 years ago
Issue packaging with pyinstaller
github.com/ocrmypdf/OCRmyPDF - kiyros opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - kiyros opened this issue about 2 years ago
Debian maintainer requested for OCRmyPDF and pikepdf
github.com/ocrmypdf/OCRmyPDF - jbarlow83 opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - jbarlow83 opened this issue about 2 years ago
present OCRmyPDF at normconf
github.com/ocrmypdf/OCRmyPDF - mu22le opened this issue about 2 years ago
github.com/ocrmypdf/OCRmyPDF - mu22le opened this issue about 2 years ago
Inverted black and white from optimization
github.com/ocrmypdf/OCRmyPDF - Jmuccigr opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - Jmuccigr opened this issue over 2 years ago
How to use a timeout for gs?
github.com/ocrmypdf/OCRmyPDF - svenha opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - svenha opened this issue over 2 years ago
OCR picks up all the text, but alignment is off
github.com/ocrmypdf/OCRmyPDF - nchammas opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - nchammas opened this issue over 2 years ago
OCRmyPDF assumes really large DPI for native PDF when rasterizing as image
github.com/ocrmypdf/OCRmyPDF - fabiante opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - fabiante opened this issue over 2 years ago
How to keep source file time, date, metadata.... etc for Target File?
github.com/ocrmypdf/OCRmyPDF - limopc opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - limopc opened this issue over 2 years ago
optimize.py doesn't process images with subtype Form
github.com/ocrmypdf/OCRmyPDF - imz opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - imz opened this issue over 2 years ago
ocrmypdf --tesseract-timeout=0 --deskew blocks deskewing - was: Using ocrmypdf to correct the deviation has no effect No errors were reported
github.com/ocrmypdf/OCRmyPDF - wss1801 opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - wss1801 opened this issue over 2 years ago
"--force-ocr" switch increases size of pdf by factor 25
github.com/ocrmypdf/OCRmyPDF - wildgruber opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - wildgruber opened this issue over 2 years ago
Double to quadruple file size and worse quality with --deskew --clean-final (due to mask?)
github.com/ocrmypdf/OCRmyPDF - bllngr opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - bllngr opened this issue over 2 years ago
"remove-background not implemented"
github.com/ocrmypdf/OCRmyPDF - bouboulov opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - bouboulov opened this issue over 2 years ago
Creating txt file without an output pdf. Examples missing for correct syntax.
github.com/ocrmypdf/OCRmyPDF - gevezex opened this issue over 2 years ago
github.com/ocrmypdf/OCRmyPDF - gevezex opened this issue over 2 years ago
`--redo-ocr` adds extra text to the PDF
github.com/ocrmypdf/OCRmyPDF - DUOLabs333 opened this issue almost 3 years ago
github.com/ocrmypdf/OCRmyPDF - DUOLabs333 opened this issue almost 3 years ago
support monochromatic conversion
github.com/ocrmypdf/OCRmyPDF - jknockaert opened this issue almost 3 years ago
github.com/ocrmypdf/OCRmyPDF - jknockaert opened this issue almost 3 years ago
--redo-ocr doesn't remove previous ocr-text layer made by ocrmypdf
github.com/ocrmypdf/OCRmyPDF - Mark-Joy opened this issue almost 3 years ago
github.com/ocrmypdf/OCRmyPDF - Mark-Joy opened this issue almost 3 years ago
--remove-background options currently not implemented?
github.com/ocrmypdf/OCRmyPDF - Perangelot opened this issue almost 3 years ago
github.com/ocrmypdf/OCRmyPDF - Perangelot opened this issue almost 3 years ago
cannot run under python 3.10
github.com/ocrmypdf/OCRmyPDF - starsareintherose opened this issue about 3 years ago
github.com/ocrmypdf/OCRmyPDF - starsareintherose opened this issue about 3 years ago
Blank pages cause the process to crash due to tesseract
github.com/ocrmypdf/OCRmyPDF - philayres opened this issue about 3 years ago
github.com/ocrmypdf/OCRmyPDF - philayres opened this issue about 3 years ago
ValueError: integer out of range converting 10585497845 from a 8-byte signed type to a 4-byte signed type
github.com/ocrmypdf/OCRmyPDF - gumbolastima opened this issue over 3 years ago
github.com/ocrmypdf/OCRmyPDF - gumbolastima opened this issue over 3 years ago
ocrmypdf --redo-ocr fails with DecompressionBombError on small PDF
github.com/ocrmypdf/OCRmyPDF - nicolasguinot opened this issue over 3 years ago
github.com/ocrmypdf/OCRmyPDF - nicolasguinot opened this issue over 3 years ago
Hebrew text seems to be reversed(whole line) on OCR-ed pdf
github.com/ocrmypdf/OCRmyPDF - Kors1981 opened this issue over 3 years ago
github.com/ocrmypdf/OCRmyPDF - Kors1981 opened this issue over 3 years ago
Correcting recognition errors - possible with sidecar option?
github.com/ocrmypdf/OCRmyPDF - jdescelliers opened this issue over 3 years ago
github.com/ocrmypdf/OCRmyPDF - jdescelliers opened this issue over 3 years ago
Some input metadata could not be copied because it is not permitted in PDF/A. You may wish to examine the output PDF's XMP metadata
github.com/ocrmypdf/OCRmyPDF - alicanidas opened this issue over 3 years ago
github.com/ocrmypdf/OCRmyPDF - alicanidas opened this issue over 3 years ago
[ENHANCEMENT] Google Colab notebook
github.com/ocrmypdf/OCRmyPDF - louispaulet opened this issue over 3 years ago
github.com/ocrmypdf/OCRmyPDF - louispaulet opened this issue over 3 years ago
Use Multi-threading instead of multi-processing on platforms not supporting it
github.com/ocrmypdf/OCRmyPDF - MrAdityaAlok opened this issue over 3 years ago
github.com/ocrmypdf/OCRmyPDF - MrAdityaAlok opened this issue over 3 years ago
Jbig2 dependency on windows
github.com/ocrmypdf/OCRmyPDF - mortang2410 opened this issue almost 4 years ago
github.com/ocrmypdf/OCRmyPDF - mortang2410 opened this issue almost 4 years ago
--force-ocr converts JBIG2 images to 24-bit
github.com/ocrmypdf/OCRmyPDF - alawvt opened this issue almost 4 years ago
github.com/ocrmypdf/OCRmyPDF - alawvt opened this issue almost 4 years ago
extra space in the result pdf when the input pdf is in Chinese
github.com/ocrmypdf/OCRmyPDF - Eyxxxxx opened this issue almost 4 years ago
github.com/ocrmypdf/OCRmyPDF - Eyxxxxx opened this issue almost 4 years ago
Improving Windows with PyInstaller - Ocrmypdf Distribution Not Found
github.com/ocrmypdf/OCRmyPDF - gabemorris12 opened this issue about 4 years ago
github.com/ocrmypdf/OCRmyPDF - gabemorris12 opened this issue about 4 years ago
liblept-5.dll load fails on Windows 10 (OSError 0x7F)
github.com/ocrmypdf/OCRmyPDF - Suyash458 opened this issue over 4 years ago
github.com/ocrmypdf/OCRmyPDF - Suyash458 opened this issue over 4 years ago
Can you tell me what docker command I should run in order to make the docker image work?
github.com/ocrmypdf/OCRmyPDF - 5aumy4 opened this issue over 4 years ago
github.com/ocrmypdf/OCRmyPDF - 5aumy4 opened this issue over 4 years ago