Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
https://github.com/ocrmypdf/OCRmyPDF

Bump astral-sh/setup-uv from 4 to 5

dependabot[bot] opened this pull request 15 days ago
graft: fix invisible text appearing after strip_invisible_text

pajowu opened this pull request 27 days ago
hocr: only add space if boxwidth is positive

pajowu opened this pull request 27 days ago
[Bug]: scanned pdf containig electronics schematic

saadb opened this issue 29 days ago
ocrmypdf -v 2 fails with log messages interpreted as tags

fernandoherreradelasheras opened this issue 30 days ago
Update intersphinx mapping to current format

QuLogic opened this pull request about 1 month ago
Fix "Scanning contents" progress bar with --redo-ocr

aliemjay opened this pull request about 1 month ago
fix minor grammar mistake

joskezelensky opened this pull request about 1 month ago
[Bug]: OCR Output Quality Regression on Ubuntu 24.04

guilhermebferreira opened this issue about 1 month ago
[Bug]: deskew results in "empty" output file

hatl opened this issue about 1 month ago
Documentation for ''ocrmypdf.ocr()" not found

fatsciock opened this issue about 1 month ago
Bump astral-sh/setup-uv from 3 to 4

dependabot[bot] opened this pull request about 1 month ago
[Feature]: Option to remove OCR

user1823 opened this issue about 1 month ago
Bump codecov/codecov-action from 4 to 5

dependabot[bot] opened this pull request about 2 months ago
[Bug]: pikepdf PdfMatrix module unavailale

IsaacSugden opened this issue about 2 months ago
Facing issue while applying ocrmypdf to document which different layouts at each page

prashanthkolaneru opened this issue about 2 months ago
[Feature]: Add drop caps support

4F2E4A2E opened this issue about 2 months ago
ocrmypdf isn't installing on termux

eelalzep opened this issue about 2 months ago
[Bug]: HOCRResult.from_json() not unpickling correctly

hoblins opened this issue about 2 months ago
[Bug]: Docker container entry point

sneakpodbob opened this issue about 2 months ago
[3rdparty]: paperless-ngx

Checole opened this issue 2 months ago
[Feature]: Show page numbers when detecting rotation

tsoernes opened this issue 2 months ago
[Feature]: Show page number in PriorOcrFoundError

tsoernes opened this issue 2 months ago
[Bug]: Example docker-compose.yml not working anymore

ckagerer opened this issue 2 months ago
[Bug]: Unpaper Not Found: "Warning: using insecure memory!"

vfilby opened this issue 3 months ago
Data privacy when using OCRmyPDF

etroci opened this issue 3 months ago
[Bug]: cannot import name 'PdfMatrix' from 'pikepdf'

kdbreck opened this issue 3 months ago
[Feature]: support for Apple vision framework

santiagozky opened this issue 3 months ago
Doc: new infix for temp files; snap temp files folder

mayeulk opened this pull request 3 months ago
[Bug]: Refuses to process old book with existing OCR

themaster567 opened this issue 3 months ago
[Bug]: Highlights/annotations repeated on all pages

Jmuccigr opened this issue 3 months ago
[Bug]: Cannot create a file when that file already exists

user1823 opened this issue 4 months ago
[Bug]: Tesseract fails on Alpine 3.20.3

pschichtel opened this issue 4 months ago
[Feature]: Align pages to text baseline

swxxii opened this issue 4 months ago
How to remove the image-with-text from the PDF

SurinameClubcard opened this issue 4 months ago
Bump sigstore/gh-action-sigstore-python from 2.1.1 to 3.0.0

dependabot[bot] opened this pull request 4 months ago
Recommended way of running ocrmypdf with memory limits

andersfylling opened this issue 4 months ago
Add mdate preservation

ferdiga opened this pull request 4 months ago
Fix broken test_rotate_page_level

QuLogic opened this pull request 5 months ago
[Bug]: Scan time regression in 16.4.3 with `--redo-ocr`

aliemjay opened this issue 5 months ago
[Bug]: Scan time increases quadratically with page count

aliemjay opened this issue 5 months ago
[Bug]: Regression in 16.4

gringus opened this issue 5 months ago
[Bug]: NotImplementedError in colorspace

macdeport opened this issue 5 months ago
does not ocr 90° rotated texts

stfnx opened this issue 5 months ago
[Bug]: Output file is okay but is not PDF/A

tcurdt opened this issue 5 months ago
[Bug]: maximum recursion depth exceeded

you-healthtap opened this issue 5 months ago
[Bug]: The generated PDF is INVALID

user1823 opened this issue 5 months ago
[Bug]: Output PDF is too large

user1823 opened this issue 5 months ago
[Bug]: The width is not correct for detected words

you-healthtap opened this issue 5 months ago
[Bug]: cannot add non-opaque RGBA color to RGB palette

jozuas opened this issue 5 months ago
[Bug]: Ghostscript rasterizing failed

user1823 opened this issue 6 months ago
[Bug]: pdfminer.pdfexceptions.PDFTypeError: invalid length: 6

user1823 opened this issue 6 months ago
ocrmypdf produces wrong page size

femifrak opened this issue 6 months ago
Update installation.rst "python -m venv .venv"

JoKalliauer opened this pull request 6 months ago
Add '--needed' flag to arch base-devel install command

mersenne-twister opened this pull request 6 months ago
--sidecar writes text content and messages to file

gerritgriebel opened this issue 6 months ago
[Bug]: Ghostscript rasterizing failed

JoKalliauer opened this issue 6 months ago
[Bug]: KeyError: '/Subtype'

user1823 opened this issue 6 months ago
[Bug]: problem with tif "DPI is not credible". Estimate dpi

drnicolas opened this issue 6 months ago
[Bug]: OSError: [Errno 28] No space left on device

Salvodif opened this issue 6 months ago
Output file images are corrupted

robmclear opened this issue 6 months ago
[Bug]: doesn't always parse Latin with diacritics

arsinclair opened this issue 6 months ago
[Feature]: Enable execution on GPU

danielfcastro opened this issue 7 months ago
[Request]: Please make rich logging library an optional dependency

lucasgadams opened this issue 7 months ago
[Bug]: Existing text is completely replaced with other characters

david-sledge opened this issue 7 months ago
[Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1

Johnnie390 opened this issue 7 months ago
[Bug]: No errors and no output for large DPI files

dan-ryan opened this issue 7 months ago
[Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed

Johnnie390 opened this issue 7 months ago
[Bug]: crashes with tesseract 5.4.0

mplx opened this issue 7 months ago
Update docker.rst

omidraha opened this pull request 7 months ago
Incorrect behavior of text color setting in hocrtransform

ep0p opened this issue 7 months ago