Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Collective -
Host: opensource -
https://opencollective.com/ocrmypdf
- Code: https://github.com/jbarlow83/OCRmyPDF
graft: fix invisible text appearing after strip_invisible_text
github.com/ocrmypdf/OCRmyPDF - pajowu opened this pull request 11 days ago
github.com/ocrmypdf/OCRmyPDF - pajowu opened this pull request 11 days ago
[Feature]: Aggressive image optimization without color quantization
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 11 days ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 11 days ago
hocr: only add space if boxwidth is positive
github.com/ocrmypdf/OCRmyPDF - pajowu opened this pull request 11 days ago
github.com/ocrmypdf/OCRmyPDF - pajowu opened this pull request 11 days ago
[Bug]: scanned pdf containig electronics schematic
github.com/ocrmypdf/OCRmyPDF - saadb opened this issue 14 days ago
github.com/ocrmypdf/OCRmyPDF - saadb opened this issue 14 days ago
ocrmypdf -v 2 fails with log messages interpreted as tags
github.com/ocrmypdf/OCRmyPDF - fernandoherreradelasheras opened this issue 14 days ago
github.com/ocrmypdf/OCRmyPDF - fernandoherreradelasheras opened this issue 14 days ago
Update intersphinx mapping to current format
github.com/ocrmypdf/OCRmyPDF - QuLogic opened this pull request 16 days ago
github.com/ocrmypdf/OCRmyPDF - QuLogic opened this pull request 16 days ago
Fix "Scanning contents" progress bar with --redo-ocr
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this pull request 17 days ago
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this pull request 17 days ago
fix minor grammar mistake
github.com/ocrmypdf/OCRmyPDF - joskezelensky opened this pull request 18 days ago
github.com/ocrmypdf/OCRmyPDF - joskezelensky opened this pull request 18 days ago
[Bug]: OCR Output Quality Regression on Ubuntu 24.04
github.com/ocrmypdf/OCRmyPDF - guilhermebferreira opened this issue 20 days ago
github.com/ocrmypdf/OCRmyPDF - guilhermebferreira opened this issue 20 days ago
[Bug]: deskew results in "empty" output file
github.com/ocrmypdf/OCRmyPDF - hatl opened this issue 23 days ago
github.com/ocrmypdf/OCRmyPDF - hatl opened this issue 23 days ago
Documentation for ''ocrmypdf.ocr()" not found
github.com/ocrmypdf/OCRmyPDF - fatsciock opened this issue 27 days ago
github.com/ocrmypdf/OCRmyPDF - fatsciock opened this issue 27 days ago
Bump astral-sh/setup-uv from 3 to 4
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request 27 days ago
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request 27 days ago
[Feature]: Option to remove OCR
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 28 days ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 28 days ago
[Feature]: Feature Request - Use Google Document AI or VIsion AI instead of Tesseract
github.com/ocrmypdf/OCRmyPDF - epatels opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - epatels opened this issue about 1 month ago
Bump codecov/codecov-action from 4 to 5
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request about 1 month ago
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request about 1 month ago
[Bug]: pikepdf PdfMatrix module unavailale
github.com/ocrmypdf/OCRmyPDF - IsaacSugden opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - IsaacSugden opened this issue about 1 month ago
Facing issue while applying ocrmypdf to document which different layouts at each page
github.com/ocrmypdf/OCRmyPDF - prashanthkolaneru opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - prashanthkolaneru opened this issue about 1 month ago
[Feature]: Add drop caps support
github.com/ocrmypdf/OCRmyPDF - 4F2E4A2E opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - 4F2E4A2E opened this issue about 1 month ago
ocrmypdf isn't installing on termux
github.com/ocrmypdf/OCRmyPDF - eelalzep opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - eelalzep opened this issue about 1 month ago
[Bug]: HOCRResult.from_json() not unpickling correctly
github.com/ocrmypdf/OCRmyPDF - hoblins opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - hoblins opened this issue about 1 month ago
[Bug]: Docker container entry point
github.com/ocrmypdf/OCRmyPDF - sneakpodbob opened this issue about 1 month ago
github.com/ocrmypdf/OCRmyPDF - sneakpodbob opened this issue about 1 month ago
[3rdparty]: paperless-ngx
github.com/ocrmypdf/OCRmyPDF - Checole opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - Checole opened this issue about 2 months ago
[Bug]: test_malformed_docinfo fails with spectacular INTERNALERROR
github.com/ocrmypdf/OCRmyPDF - mcepl opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - mcepl opened this issue about 2 months ago
[Feature]: Show page numbers when detecting rotation
github.com/ocrmypdf/OCRmyPDF - tsoernes opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - tsoernes opened this issue about 2 months ago
[Feature]: Show page number in PriorOcrFoundError
github.com/ocrmypdf/OCRmyPDF - tsoernes opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - tsoernes opened this issue about 2 months ago
[Bug]: '_idat' object has no attribute 'fileno' // No space left on device
github.com/ocrmypdf/OCRmyPDF - kkduke opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - kkduke opened this issue about 2 months ago
[Bug]: Example docker-compose.yml not working anymore
github.com/ocrmypdf/OCRmyPDF - ckagerer opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - ckagerer opened this issue about 2 months ago
[Bug]: There was an error in an annotation | Setting Overprint Mode to 1 not permitted in PDF/A-2, overprint mode not set
github.com/ocrmypdf/OCRmyPDF - tsoernes opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - tsoernes opened this issue about 2 months ago
[3rdparty]: paperless-ngx PDF Fails to Process with InputFileError: PDF content stream is corrupt
github.com/ocrmypdf/OCRmyPDF - singlatushar07 opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - singlatushar07 opened this issue about 2 months ago
[Bug]: "remove-background is temporarily not implemented" error on linux
github.com/ocrmypdf/OCRmyPDF - dimyself opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - dimyself opened this issue about 2 months ago
[Bug]: Unable to proceed with a custom language lacking a dictionary
github.com/ocrmypdf/OCRmyPDF - vchgan opened this issue about 2 months ago
github.com/ocrmypdf/OCRmyPDF - vchgan opened this issue about 2 months ago
[Bug]: Unpaper Not Found: "Warning: using insecure memory!"
github.com/ocrmypdf/OCRmyPDF - vfilby opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - vfilby opened this issue 2 months ago
Data privacy when using OCRmyPDF
github.com/ocrmypdf/OCRmyPDF - etroci opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - etroci opened this issue 2 months ago
[Bug]: cannot import name 'PdfMatrix' from 'pikepdf'
github.com/ocrmypdf/OCRmyPDF - kdbreck opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - kdbreck opened this issue 2 months ago
[Feature]: support for Apple vision framework
github.com/ocrmypdf/OCRmyPDF - santiagozky opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - santiagozky opened this issue 2 months ago
Doc: new infix for temp files; snap temp files folder
github.com/ocrmypdf/OCRmyPDF - mayeulk opened this pull request 2 months ago
github.com/ocrmypdf/OCRmyPDF - mayeulk opened this pull request 2 months ago
[Bug]: Refuses to process old book with existing OCR
github.com/ocrmypdf/OCRmyPDF - themaster567 opened this issue 2 months ago
github.com/ocrmypdf/OCRmyPDF - themaster567 opened this issue 2 months ago
[Bug]: File generated by OCRmyPDF doesn't open in all PDF editors
github.com/ocrmypdf/OCRmyPDF - sklart opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - sklart opened this issue 3 months ago
[Bug]: Highlights/annotations repeated on all pages
github.com/ocrmypdf/OCRmyPDF - Jmuccigr opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - Jmuccigr opened this issue 3 months ago
[Bug]: pikepdf cropbox/mediabox/trimbox as list can return strings in the list
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 3 months ago
[Bug]: Cannot create a file when that file already exists
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 3 months ago
[Bug]: Tesseract fails on Alpine 3.20.3
github.com/ocrmypdf/OCRmyPDF - pschichtel opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - pschichtel opened this issue 3 months ago
[Feature]: Align pages to text baseline
github.com/ocrmypdf/OCRmyPDF - swxxii opened this issue 3 months ago
github.com/ocrmypdf/OCRmyPDF - swxxii opened this issue 3 months ago
How to remove the image-with-text from the PDF
github.com/ocrmypdf/OCRmyPDF - SurinameClubcard opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - SurinameClubcard opened this issue 4 months ago
Bump sigstore/gh-action-sigstore-python from 2.1.1 to 3.0.0
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request 4 months ago
github.com/ocrmypdf/OCRmyPDF - dependabot[bot] opened this pull request 4 months ago
当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格
github.com/ocrmypdf/OCRmyPDF - deict opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - deict opened this issue 4 months ago
[3rdparty]: 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格
github.com/ocrmypdf/OCRmyPDF - deict opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - deict opened this issue 4 months ago
[Feature]: Add a flag to enable ocrmypdf to write "last-modified attribute" to the OCR'ed file.
github.com/ocrmypdf/OCRmyPDF - ashrockd opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - ashrockd opened this issue 4 months ago
[Feature]: decrypt file if qpdf is installed (EncryptedPdfError: Input PDF is encrypted. The encryption must be removed to perform OCR.)
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 4 months ago
[Bug]: "AttributeError: module 'numpy.typing' has no attribute 'NDArray'" after Homebrew installation
github.com/ocrmypdf/OCRmyPDF - tillboehringer opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - tillboehringer opened this issue 4 months ago
Recommended way of running ocrmypdf with memory limits
github.com/ocrmypdf/OCRmyPDF - andersfylling opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - andersfylling opened this issue 4 months ago
Fix broken test_rotate_page_level
github.com/ocrmypdf/OCRmyPDF - QuLogic opened this pull request 4 months ago
github.com/ocrmypdf/OCRmyPDF - QuLogic opened this pull request 4 months ago
[Bug]: Scan time regression in 16.4.3 with `--redo-ocr`
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 4 months ago
[Bug/Feature]: a way to disable Ghostscript requirement & broken plugin_manager option
github.com/ocrmypdf/OCRmyPDF - nikitar opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - nikitar opened this issue 4 months ago
[Bug]: Scan time increases quadratically with page count
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - aliemjay opened this issue 4 months ago
[Bug]: NotImplementedError in colorspace
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - macdeport opened this issue 4 months ago
[Bug]: ocrmypdf: error: unrecognized arguments: input.pdf output.pdf
github.com/ocrmypdf/OCRmyPDF - KNDaniel opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - KNDaniel opened this issue 4 months ago
[Feature]: Result Improvement with OpenCV + Pillow Preprocessing
github.com/ocrmypdf/OCRmyPDF - vishaldwdi opened this issue 4 months ago
github.com/ocrmypdf/OCRmyPDF - vishaldwdi opened this issue 4 months ago
[Bug]: Output file is okay but is not PDF/A
github.com/ocrmypdf/OCRmyPDF - tcurdt opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - tcurdt opened this issue 5 months ago
[Query]: docker watched folder environment variables, optimize how?
github.com/ocrmypdf/OCRmyPDF - jaxjexjox opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - jaxjexjox opened this issue 5 months ago
[Bug]: Large file size increases due to PDF/A font substitution
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 5 months ago
[Bug]: maximum recursion depth exceeded
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 5 months ago
[Bug]: The generated PDF is INVALID
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
[Bug]: Output PDF is too large
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
[Bug]: The width is not correct for detected words
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - you-healthtap opened this issue 5 months ago
[Bug]: cannot add non-opaque RGBA color to RGB palette
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - jozuas opened this issue 5 months ago
[Bug]: subprocess.CalledProcessError: Command '['D:\\latex\\texlive\\2020\\bin\\win32\\jbig2.EXE', '--version']' returned non-zero exit status 3.
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 5 months ago
[Bug]: Ghostscript rasterizing failed
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
[Bug]: pdfminer.pdfexceptions.PDFTypeError: invalid length: 6
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - user1823 opened this issue 5 months ago
ocrmypdf produces wrong page size
github.com/ocrmypdf/OCRmyPDF - femifrak opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - femifrak opened this issue 5 months ago
[Bug]: with the latest version of Ghostscript 10.03.1, ocrmypdf is passing file names to Ghostscript in the wrong order
github.com/ocrmypdf/OCRmyPDF - alan-sandollar opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - alan-sandollar opened this issue 5 months ago
[Bug]: FileNotFoundError: [Errno 2] No such file or directory: 'gs'
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - 459737087 opened this issue 5 months ago
Update installation.rst "python -m venv .venv"
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this pull request 5 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this pull request 5 months ago
Add '--needed' flag to arch base-devel install command
github.com/ocrmypdf/OCRmyPDF - mersenne-twister opened this pull request 5 months ago
github.com/ocrmypdf/OCRmyPDF - mersenne-twister opened this pull request 5 months ago
--sidecar writes text content and messages to file
github.com/ocrmypdf/OCRmyPDF - gerritgriebel opened this issue 5 months ago
github.com/ocrmypdf/OCRmyPDF - gerritgriebel opened this issue 5 months ago
[Bug]: files signed with a-trust are not recognised as digitally signed and hence processed
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - ferdiga opened this issue 6 months ago
[Bug]: Ghostscript rasterizing failed
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 6 months ago
[Bug]: Ghostscript can't create a PDF/A-file (Page object was reserved for an Annotation destination)
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - JoKalliauer opened this issue 6 months ago
[Bug]: problem with tif "DPI is not credible". Estimate dpi
github.com/ocrmypdf/OCRmyPDF - drnicolas opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - drnicolas opened this issue 6 months ago
[Bug]: OSError: [Errno 28] No space left on device
github.com/ocrmypdf/OCRmyPDF - Salvodif opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - Salvodif opened this issue 6 months ago
Output file images are corrupted
github.com/ocrmypdf/OCRmyPDF - robmclear opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - robmclear opened this issue 6 months ago
[Bug]: doesn't always parse Latin with diacritics
github.com/ocrmypdf/OCRmyPDF - arsinclair opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - arsinclair opened this issue 6 months ago
[Feature]: Enable execution on GPU
github.com/ocrmypdf/OCRmyPDF - danielfcastro opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - danielfcastro opened this issue 6 months ago
[Request]: Please make rich logging library an optional dependency
github.com/ocrmypdf/OCRmyPDF - lucasgadams opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - lucasgadams opened this issue 6 months ago
[Bug]: Existing text is completely replaced with other characters
github.com/ocrmypdf/OCRmyPDF - david-sledge opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - david-sledge opened this issue 6 months ago
[Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 6 months ago
[Bug]: `lots of diacritics - possibly poor OCR` but using standalone tesseract works perfectly
github.com/ocrmypdf/OCRmyPDF - KAGEYAM4 opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - KAGEYAM4 opened this issue 6 months ago
[Bug]: No errors and no output for large DPI files
github.com/ocrmypdf/OCRmyPDF - dan-ryan opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - dan-ryan opened this issue 6 months ago
[Bug]: MetadataProgress does not respect progress_bar=False argument
github.com/ocrmypdf/OCRmyPDF - DavidMChan opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - DavidMChan opened this issue 6 months ago
[Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - Johnnie390 opened this issue 6 months ago
[Feature]: Alternative AI OCR "surya" as opposed to EasyOCR, Just found it today and it dominated the accuracy and speed of Tesseract & EasyOCR
github.com/ocrmypdf/OCRmyPDF - abclution opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - abclution opened this issue 6 months ago
[Bug]: ocrmypdf 16.3.1 fails on a file on Arch that 13.4.0 on Ubuntu handles well
github.com/ocrmypdf/OCRmyPDF - Fifis opened this issue 6 months ago
github.com/ocrmypdf/OCRmyPDF - Fifis opened this issue 6 months ago
[Bug]: crashes with tesseract 5.4.0
github.com/ocrmypdf/OCRmyPDF - mplx opened this issue 7 months ago
github.com/ocrmypdf/OCRmyPDF - mplx opened this issue 7 months ago
Incorrect behavior of text color setting in hocrtransform
github.com/ocrmypdf/OCRmyPDF - ep0p opened this issue 7 months ago
github.com/ocrmypdf/OCRmyPDF - ep0p opened this issue 7 months ago
[Bug]: --tesseract-pagesegmode is not sufficiently documented
github.com/ocrmypdf/OCRmyPDF - thomas2net opened this issue 7 months ago
github.com/ocrmypdf/OCRmyPDF - thomas2net opened this issue 7 months ago