Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Collective -
Host: opensource -
https://opencollective.com/ocrmypdf
- Code: https://github.com/jbarlow83/OCRmyPDF
Doc: new infix for temp files; snap temp files folder
github.com/ocrmypdf/OCRmyPDF - mayeulk opened this pull request 3 days ago
github.com/ocrmypdf/OCRmyPDF - mayeulk opened this pull request 3 days ago
[Bug]: Refuses to process old book with existing OCR
github.com/ocrmypdf/OCRmyPDF - themaster567 opened this issue 7 days ago
github.com/ocrmypdf/OCRmyPDF - themaster567 opened this issue 7 days ago
[Bug]: File generated by OCRmyPDF doesn't open in all PDF editors
github.com/ocrmypdf/OCRmyPDF - sklart opened this issue 17 days ago
github.com/ocrmypdf/OCRmyPDF - sklart opened this issue 17 days ago
[Bug]: Highlights/annotations repeated on all pages
github.com/ocrmypdf/OCRmyPDF - Jmuccigr opened this issue 24 days ago
github.com/ocrmypdf/OCRmyPDF - Jmuccigr opened this issue 24 days ago
[Bug]: pikepdf cropbox/mediabox/trimbox as list can return strings in the list
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 25 days ago
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 25 days ago
[Bug]: Cannot create a file when that file already exists
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 30 days ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 30 days ago
[Bug]: Tesseract fails on Alpine 3.20.3
github.com/ocrmypdf/OCRmyPDF - pschichtel opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - pschichtel opened this issue about 1 month ago
[Feature]: Align pages to text baseline
github.com/ocrmypdf/OCRmyPDF - swxxii opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - swxxii opened this issue about 1 month ago
How to remove the image-with-text from the PDF
github.com/ocrmypdf/OCRmyPDF - SurinameClubcard opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - SurinameClubcard opened this issue about 1 month ago
Bump sigstore/gh-action-sigstore-python from 2.1.1 to 3.0.0
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request about 2 months ago
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request about 2 months ago
当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格
github.com/ocrmypdf/OCRmyPDF - deict opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - deict opened this issue about 2 months ago
[3rdparty]: 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格
github.com/ocrmypdf/OCRmyPDF - deict opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - deict opened this issue about 2 months ago
[Feature]: Add a flag to enable ocrmypdf to write "last-modified attribute" to the OCR'ed file.
github.com/ocrmypdf/OCRmyPDF - ashrockd opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - ashrockd opened this issue about 2 months ago
[Feature]: decrypt file if qpdf is installed (EncryptedPdfError: Input PDF is encrypted. The encryption must be removed to perform OCR.)
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue about 2 months ago
[Bug]: "AttributeError: module 'numpy.typing' has no attribute 'NDArray'" after Homebrew installation
github.com/ocrmypdf/OCRmyPDF - tillboehringer opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - tillboehringer opened this issue about 2 months ago
Recommended way of running ocrmypdf with memory limits
github.com/ocrmypdf/OCRmyPDF - andersfylling opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - andersfylling opened this issue about 2 months ago
Add mdate preservation
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this pull request about 2 months ago
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this pull request about 2 months ago
Fix broken test_rotate_page_level
github.com/ocrmypdf/OCRmyPDF - QuLogic opened this pull request about 2 months ago
github.com/ocrmypdf/OCRmyPDF - QuLogic opened this pull request about 2 months ago
[Bug]: Scan time regression in 16.4.3 with `--redo-ocr`
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 2 months ago
[Bug/Feature]: a way to disable Ghostscript requirement & broken plugin_manager option
github.com/ocrmypdf/OCRmyPDF - nikitar opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - nikitar opened this issue 2 months ago
[Bug]: Scan time increases quadratically with page count
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 2 months ago
[Bug]: NotImplementedError in colorspace
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 2 months ago
[Bug]: ocrmypdf: error: unrecognized arguments: input.pdf output.pdf
github.com/ocrmypdf/OCRmyPDF - KNDaniel opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - KNDaniel opened this issue 2 months ago
[Feature]: Result Improvement with OpenCV + Pillow Preprocessing
github.com/ocrmypdf/OCRmyPDF - vishaldwdi opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - vishaldwdi opened this issue 2 months ago
[Bug]: Output file is okay but is not PDF/A
github.com/ocrmypdf/OCRmyPDF - tcurdt opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - tcurdt opened this issue 2 months ago
[Query]: docker watched folder environment variables, optimize how?
github.com/ocrmypdf/OCRmyPDF - jaxjexjox opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - jaxjexjox opened this issue 2 months ago
[Bug]: Large file size increases due to PDF/A font substitution
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 3 months ago
[Bug]: maximum recursion depth exceeded
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 3 months ago
[Bug]: The generated PDF is INVALID
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
[Bug]: Output PDF is too large
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
[Bug]: The width is not correct for detected words
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 3 months ago
[Bug]: cannot add non-opaque RGBA color to RGB palette
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 3 months ago
[Bug]: subprocess.CalledProcessError: Command '['D:\\latex\\texlive\\2020\\bin\\win32\\jbig2.EXE', '--version']' returned non-zero exit status 3.
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 3 months ago
[Bug]: Ghostscript rasterizing failed
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
[Bug]: pdfminer.pdfexceptions.PDFTypeError: invalid length: 6
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
ocrmypdf produces wrong page size
github.com/ocrmypdf/OCRmyPDF - femifrak opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - femifrak opened this issue 3 months ago
[Bug]: with the latest version of Ghostscript 10.03.1, ocrmypdf is passing file names to Ghostscript in the wrong order
github.com/ocrmypdf/OCRmyPDF - alan-sandollar opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - alan-sandollar opened this issue 3 months ago
[Bug]: FileNotFoundError: [Errno 2] No such file or directory: 'gs'
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 3 months ago
Update installation.rst "python -m venv .venv"
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this pull request 3 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this pull request 3 months ago
Add '--needed' flag to arch base-devel install command
github.com/ocrmypdf/OCRmyPDF - mersenne-twister opened this pull request 3 months ago
github.com/ocrmypdf/OCRmyPDF - mersenne-twister opened this pull request 3 months ago
--sidecar writes text content and messages to file
github.com/ocrmypdf/OCRmyPDF - gerritgriebel opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - gerritgriebel opened this issue 3 months ago
[Bug]: files signed with a-trust are not recognised as digitally signed and hence processed
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 3 months ago
[Bug]: Ghostscript rasterizing failed
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 3 months ago
[Bug]: Ghostscript can't create a PDF/A-file (Page object was reserved for an Annotation destination)
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 4 months ago
[Bug]: problem with tif "DPI is not credible". Estimate dpi
github.com/ocrmypdf/OCRmyPDF - drnicolas opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - drnicolas opened this issue 4 months ago
[Bug]: OSError: [Errno 28] No space left on device
github.com/ocrmypdf/OCRmyPDF - Salvodif opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - Salvodif opened this issue 4 months ago
Output file images are corrupted
github.com/ocrmypdf/OCRmyPDF - robmclear opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - robmclear opened this issue 4 months ago
[Bug]: doesn't always parse Latin with diacritics
github.com/ocrmypdf/OCRmyPDF - arsinclair opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - arsinclair opened this issue 4 months ago
[Feature]: Enable execution on GPU
github.com/ocrmypdf/OCRmyPDF - danielfcastro opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - danielfcastro opened this issue 4 months ago
[Request]: Please make rich logging library an optional dependency
github.com/ocrmypdf/OCRmyPDF - lucasgadams opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - lucasgadams opened this issue 4 months ago
[Bug]: Existing text is completely replaced with other characters
github.com/ocrmypdf/OCRmyPDF - david-sledge opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - david-sledge opened this issue 4 months ago
[Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 4 months ago
[Bug]: `lots of diacritics - possibly poor OCR` but using standalone tesseract works perfectly
github.com/ocrmypdf/OCRmyPDF - KAGEYAM4 opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - KAGEYAM4 opened this issue 4 months ago
[Bug]: No errors and no output for large DPI files
github.com/ocrmypdf/OCRmyPDF - dan-ryan opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - dan-ryan opened this issue 4 months ago
[Bug]: MetadataProgress does not respect progress_bar=False argument
github.com/ocrmypdf/OCRmyPDF - DavidMChan opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - DavidMChan opened this issue 4 months ago
[Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 4 months ago
[Feature]: Alternative AI OCR "surya" as opposed to EasyOCR, Just found it today and it dominated the accuracy and speed of Tesseract & EasyOCR
github.com/ocrmypdf/OCRmyPDF - abclution opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - abclution opened this issue 4 months ago
[Bug]: ocrmypdf 16.3.1 fails on a file on Arch that 13.4.0 on Ubuntu handles well
github.com/ocrmypdf/OCRmyPDF - Fifis opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - Fifis opened this issue 4 months ago
[Bug]: crashes with tesseract 5.4.0
github.com/ocrmypdf/OCRmyPDF - mplx opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - mplx opened this issue 4 months ago
Incorrect behavior of text color setting in hocrtransform
github.com/ocrmypdf/OCRmyPDF - ep0p opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - ep0p opened this issue 4 months ago
[Bug]: --tesseract-pagesegmode is not sufficiently documented
github.com/ocrmypdf/OCRmyPDF - thomas2net opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - thomas2net opened this issue 4 months ago
Error occurred while consuming document out1.pdf: SubprocessOutputError: Ghostscript rasterizing failed.
github.com/ocrmypdf/OCRmyPDF - dekoenpi opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - dekoenpi opened this issue 5 months ago
[Bug]: OCR not complete. Parts of all pages are ignored
github.com/ocrmypdf/OCRmyPDF - 0lm opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - 0lm opened this issue 5 months ago
[Bug]: multiple spaces not supported for delimitation of bbox parameters
github.com/ocrmypdf/OCRmyPDF - Tehgg opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - Tehgg opened this issue 5 months ago
[Bug]: Flood of "Recursion depth exceeded in _find_image_xrefs_page"
github.com/ocrmypdf/OCRmyPDF - user1584 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - user1584 opened this issue 5 months ago
Pushed docker image is always Ubuntu instead of alpine
github.com/ocrmypdf/OCRmyPDF - vihtap opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - vihtap opened this issue 5 months ago
[Bug]: test_semfree fails with ghostscript 10.03.0+
github.com/ocrmypdf/OCRmyPDF - gringus opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - gringus opened this issue 5 months ago
[Bug]: NotImplementedError: not sure how to get colorspace
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 5 months ago
[Feature]: If page has text, force OCR and rasterize page
github.com/ocrmypdf/OCRmyPDF - mikejokic opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - mikejokic opened this issue 5 months ago
Show progress during postprocessing
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
[Bug]: Crash on multiple .pdf files
github.com/ocrmypdf/OCRmyPDF - olafure opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - olafure opened this issue 5 months ago
Indian Numbers on Arabic text
github.com/ocrmypdf/OCRmyPDF - MedoHamdani opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - MedoHamdani opened this issue 5 months ago
Make usage of --rotate-pages-threshold clearer
github.com/ocrmypdf/OCRmyPDF - stegl83 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - stegl83 opened this issue 5 months ago
[Bug]: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice'
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
[Bug]: No longer works - macos-11.7 x86_64 Python 3.10
github.com/ocrmypdf/OCRmyPDF - atanasj opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - atanasj opened this issue 5 months ago
[Bug]: ValueError: ObjectList must have 6 elements
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 6 months ago
Fix wrong env var for GS path in Snap
github.com/ocrmypdf/OCRmyPDF - helkaluin opened this pull request 6 months ago
github.com/ocrmypdf/OCRmyPDF - helkaluin opened this pull request 6 months ago
[Feature]: Change demo format to VHS
github.com/ocrmypdf/OCRmyPDF - jbarlow83 opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - jbarlow83 opened this issue 6 months ago
[Bug]: real text replaced by � � (visually unchanged, only by copying)
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 6 months ago
Adding language install docs for archlinux
github.com/ocrmypdf/OCRmyPDF - ahmedsbytes opened this pull request 6 months ago
github.com/ocrmypdf/OCRmyPDF - ahmedsbytes opened this pull request 6 months ago
Release notes don't include the latest versions
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 6 months ago
[Bug]: watcher.py requires the "ARCHIVE" folder to be assigned, even if the option is disabled
github.com/ocrmypdf/OCRmyPDF - clodobox opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - clodobox opened this issue 6 months ago
[Bug]: Warning: "xref 473: While extracting this image, an error occurred"
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 6 months ago
[Bug]: DecompressionBombWarning
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 6 months ago
Update the typer[all] dependency to typer-slim[standard]
github.com/ocrmypdf/OCRmyPDF - musicinmybrain opened this pull request 7 months ago
github.com/ocrmypdf/OCRmyPDF - musicinmybrain opened this pull request 7 months ago
added Macports install information
github.com/ocrmypdf/OCRmyPDF - akierig opened this pull request 7 months ago
github.com/ocrmypdf/OCRmyPDF - akierig opened this pull request 7 months ago
[Feature]: Could watcher.py be enhanced to support the conversion of single or multi TIF and JPG files to PDF?
github.com/ocrmypdf/OCRmyPDF - EvilQoo opened this issue 7 months ago
github.com/ocrmypdf/OCRmyPDF - EvilQoo opened this issue 7 months ago
max_workers must be greater than 0
github.com/ocrmypdf/OCRmyPDF - nope999 opened this issue 7 months ago
github.com/ocrmypdf/OCRmyPDF - nope999 opened this issue 7 months ago
[Feature]: Choose between NFKC and NFC normalization for Unicode characters so copy-pasting works
github.com/ocrmypdf/OCRmyPDF - sfllaw opened this issue 7 months ago
github.com/ocrmypdf/OCRmyPDF - sfllaw opened this issue 7 months ago