Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
https://github.com/ocrmypdf/OCRmyPDF

Bump astral-sh/setup-uv from 4 to 5

dependabot[bot] opened this pull request 11 days ago
graft: fix invisible text appearing after strip_invisible_text

pajowu opened this pull request 23 days ago
hocr: only add space if boxwidth is positive

pajowu opened this pull request 23 days ago
[Bug]: scanned pdf containig electronics schematic

saadb opened this issue 26 days ago
ocrmypdf -v 2 fails with log messages interpreted as tags

fernandoherreradelasheras opened this issue 26 days ago
Update intersphinx mapping to current format

QuLogic opened this pull request 29 days ago
Fix "Scanning contents" progress bar with --redo-ocr

aliemjay opened this pull request 29 days ago
fix minor grammar mistake

joskezelensky opened this pull request about 1 month ago
[Bug]: OCR Output Quality Regression on Ubuntu 24.04

guilhermebferreira opened this issue about 1 month ago
[Bug]: deskew results in "empty" output file

hatl opened this issue about 1 month ago
Documentation for ''ocrmypdf.ocr()" not found

fatsciock opened this issue about 1 month ago
Bump astral-sh/setup-uv from 3 to 4

dependabot[bot] opened this pull request about 1 month ago
[Feature]: Option to remove OCR

user1823 opened this issue about 1 month ago
Bump codecov/codecov-action from 4 to 5

dependabot[bot] opened this pull request about 2 months ago
[Bug]: pikepdf PdfMatrix module unavailale

IsaacSugden opened this issue about 2 months ago
Facing issue while applying ocrmypdf to document which different layouts at each page

prashanthkolaneru opened this issue about 2 months ago
[Feature]: Add drop caps support

4F2E4A2E opened this issue about 2 months ago
ocrmypdf isn't installing on termux

eelalzep opened this issue about 2 months ago
[Bug]: HOCRResult.from_json() not unpickling correctly

hoblins opened this issue about 2 months ago
[Bug]: Docker container entry point

sneakpodbob opened this issue about 2 months ago
[3rdparty]: paperless-ngx

Checole opened this issue about 2 months ago
[Bug]: test_malformed_docinfo fails with spectacular INTERNALERROR

mcepl opened this issue about 2 months ago
[Feature]: Show page numbers when detecting rotation

tsoernes opened this issue about 2 months ago
[Feature]: Show page number in PriorOcrFoundError

tsoernes opened this issue about 2 months ago
[Bug]: Example docker-compose.yml not working anymore

ckagerer opened this issue 2 months ago
[Bug]: Unpaper Not Found: "Warning: using insecure memory!"

vfilby opened this issue 2 months ago
Data privacy when using OCRmyPDF

etroci opened this issue 3 months ago
[Bug]: cannot import name 'PdfMatrix' from 'pikepdf'

kdbreck opened this issue 3 months ago
[Feature]: support for Apple vision framework

santiagozky opened this issue 3 months ago
Doc: new infix for temp files; snap temp files folder

mayeulk opened this pull request 3 months ago
[Bug]: Refuses to process old book with existing OCR

themaster567 opened this issue 3 months ago
[Bug]: Highlights/annotations repeated on all pages

Jmuccigr opened this issue 3 months ago
[Bug]: Cannot create a file when that file already exists

user1823 opened this issue 4 months ago
[Bug]: Tesseract fails on Alpine 3.20.3

pschichtel opened this issue 4 months ago
[Feature]: Align pages to text baseline

swxxii opened this issue 4 months ago
How to remove the image-with-text from the PDF

SurinameClubcard opened this issue 4 months ago
Bump sigstore/gh-action-sigstore-python from 2.1.1 to 3.0.0

dependabot[bot] opened this pull request 4 months ago
Recommended way of running ocrmypdf with memory limits

andersfylling opened this issue 4 months ago
Add mdate preservation

ferdiga opened this pull request 4 months ago
Fix broken test_rotate_page_level

QuLogic opened this pull request 5 months ago
[Bug]: Scan time regression in 16.4.3 with `--redo-ocr`

aliemjay opened this issue 5 months ago
[Bug]: Scan time increases quadratically with page count

aliemjay opened this issue 5 months ago
[Bug]: Regression in 16.4

gringus opened this issue 5 months ago
[Bug]: NotImplementedError in colorspace

macdeport opened this issue 5 months ago
does not ocr 90° rotated texts

stfnx opened this issue 5 months ago
[Bug]: Output file is okay but is not PDF/A

tcurdt opened this issue 5 months ago
[Bug]: maximum recursion depth exceeded

you-healthtap opened this issue 5 months ago
[Bug]: The generated PDF is INVALID

user1823 opened this issue 5 months ago
[Bug]: Output PDF is too large

user1823 opened this issue 5 months ago
[Bug]: The width is not correct for detected words

you-healthtap opened this issue 5 months ago
[Bug]: cannot add non-opaque RGBA color to RGB palette

jozuas opened this issue 5 months ago
[Bug]: Ghostscript rasterizing failed

user1823 opened this issue 5 months ago
[Bug]: pdfminer.pdfexceptions.PDFTypeError: invalid length: 6

user1823 opened this issue 5 months ago
ocrmypdf produces wrong page size

femifrak opened this issue 5 months ago
Update installation.rst "python -m venv .venv"

JoKalliauer opened this pull request 6 months ago
Add '--needed' flag to arch base-devel install command

mersenne-twister opened this pull request 6 months ago
--sidecar writes text content and messages to file

gerritgriebel opened this issue 6 months ago
[Bug]: Ghostscript rasterizing failed

JoKalliauer opened this issue 6 months ago
[Bug]: KeyError: '/Subtype'

user1823 opened this issue 6 months ago
[Bug]: problem with tif "DPI is not credible". Estimate dpi

drnicolas opened this issue 6 months ago
[Bug]: OSError: [Errno 28] No space left on device

Salvodif opened this issue 6 months ago
Output file images are corrupted

robmclear opened this issue 6 months ago
[Bug]: doesn't always parse Latin with diacritics

arsinclair opened this issue 6 months ago
[Feature]: Enable execution on GPU

danielfcastro opened this issue 6 months ago
[Request]: Please make rich logging library an optional dependency

lucasgadams opened this issue 6 months ago
[Bug]: Existing text is completely replaced with other characters

david-sledge opened this issue 7 months ago
[Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1

Johnnie390 opened this issue 7 months ago
[Bug]: No errors and no output for large DPI files

dan-ryan opened this issue 7 months ago
[Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed

Johnnie390 opened this issue 7 months ago
[Bug]: crashes with tesseract 5.4.0

mplx opened this issue 7 months ago
Update docker.rst

omidraha opened this pull request 7 months ago
Incorrect behavior of text color setting in hocrtransform

ep0p opened this issue 7 months ago