Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Collective - Host: opensource - https://opencollective.com/ocrmypdf - Code: https://github.com/jbarlow83/OCRmyPDF

Fix error message about missing Python exec

Message is
unable to start container process: exec: "python": executable file not found in $PATH...

github.com/ocrmypdf/OCRmyPDF - 16ab4a8b4ec82175880f235953d99e9c5265b634 authored 10 months ago by James R. Barlow <[email protected]>
Undo future annotations from watcher.py till Typer fixes its issue

Fixes #1258

github.com/ocrmypdf/OCRmyPDF - 8d30cff4ef3e555f41cd11b079f5419a042e185c authored 10 months ago by James R. Barlow <[email protected]>
v16.1.1 release notes

github.com/ocrmypdf/OCRmyPDF - 59d5b0d1bd781f21be0fdea02ff0d2b841bc067e authored 10 months ago by James R. Barlow <[email protected]>
Try pypy3.10

github.com/ocrmypdf/OCRmyPDF - 9ec0745ab8ca688706fb201e1e9b53b86a399664 authored 10 months ago by James R. Barlow <[email protected]>
Python 3.10 cleanup, manual fixes

github.com/ocrmypdf/OCRmyPDF - 3a3635f7f911428cff093a27d9fe77a9be6a71f3 authored 10 months ago by James R. Barlow <[email protected]>
ruff linting/Python 3.10 cleanup

github.com/ocrmypdf/OCRmyPDF - 6a746a1cbb98911eeaabff4ae3aa9b6dec0916a7 authored 10 months ago by James R. Barlow <[email protected]>
Update rust toml settings

github.com/ocrmypdf/OCRmyPDF - 906c130f96dfa91d41b907dd02d70b174e188bc5 authored 10 months ago by James R. Barlow <[email protected]>
v16.1.0 release notes

github.com/ocrmypdf/OCRmyPDF - 4a784588216f919843dababa053fa88a01173533 authored 10 months ago by James R. Barlow <[email protected]>
Clarify warnings filter

github.com/ocrmypdf/OCRmyPDF - fddf3ce2f46059ca6099f336cc0025dd237fc8ee authored 10 months ago by James R. Barlow <[email protected]>
Merge branch 'feature/pageboxes'

github.com/ocrmypdf/OCRmyPDF - 353b34e695893fb74026d0e9f1e9686d9f7eef5d authored 10 months ago by James R. Barlow <[email protected]>
Use hocr renderer for LTR languages

github.com/ocrmypdf/OCRmyPDF - 7d63355c3c5efa139a4cbebe118517a58212b20b authored 10 months ago by James R. Barlow <[email protected]>
Fix handling of pages that are restored to correct orientation with /Rotate

Appears inversion of CTM was incorrect, introduced in commit 9898904

github.com/ocrmypdf/OCRmyPDF - 42ff7fc84289a885c86d4629ebcf8dd9a65cef2d authored 10 months ago by James R. Barlow <[email protected]>
Suppress reportlab deprecation warning

github.com/ocrmypdf/OCRmyPDF - 26470fe16a257c665a260eb509db5edabd189288 authored 10 months ago by James R. Barlow <[email protected]>
First cut at propagating page boxes

This would fix the immediate issue, but does not address an offset mediabox.

github.com/ocrmypdf/OCRmyPDF - 11f53fe9a96daf12938495521914fa2215a9bc20 authored 11 months ago by James R. Barlow <[email protected]>
Attempt to deal with oddball mediaboxes

github.com/ocrmypdf/OCRmyPDF - 3b9d4b7f0aa20f704448a6222590f4f2cb85f689 authored 11 months ago by James R. Barlow <[email protected]>
Mention pipx, install --user --upgrade

Closes #1249

github.com/ocrmypdf/OCRmyPDF - 123c0c766f9ac9b46c824391a383cefc137acbce authored 11 months ago by James R. Barlow <[email protected]>
Advise Homebrew on Linux for Ubuntu 20.04

github.com/ocrmypdf/OCRmyPDF - 6a9be2142e3dcc5a0af0e4e0f0a0dbffb4827274 authored 11 months ago by James R. Barlow <[email protected]>
Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF

github.com/ocrmypdf/OCRmyPDF - 0bc350f55e49baedfaaefd70e8ffc094e5ca2f53 authored 11 months ago by James R. Barlow <[email protected]>
Bump codecov/codecov-action from 3 to 4 (#1247)

Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 3 to 4.
- [Relea...

github.com/ocrmypdf/OCRmyPDF - 7a6edf62ba8b10469ae93316bfd60de81092db54 authored 11 months ago by dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
optimize: log images with unclear decode tables

github.com/ocrmypdf/OCRmyPDF - 07b6f06f11a82009d43b410b0e849c8fd7773895 authored 11 months ago by James R. Barlow <[email protected]>
Update gs dependency & instructions for RHEL (#1228)

* Update gs depedency to 9.54

* Add instructions for RHEL

* Update instruction for using p...

github.com/ocrmypdf/OCRmyPDF - 2005f622bbb4ec61cc35eac32634c8d1960f2d78 authored 11 months ago by nisbet-hubbard <[email protected]>
Document ability to remove all OCR

github.com/ocrmypdf/OCRmyPDF - cca04fd7998858db2b31c165104157ac01de480b authored 11 months ago by James R. Barlow <[email protected]>
Add autotools automake libtool and leptonica requirements (#1230)

These tools are required for a successful install of JBIG2 on Ubuntu

github.com/ocrmypdf/OCRmyPDF - 75bf8e4ba2d4d7e437d6cb208e981a2407c963a7 authored 12 months ago by Maxi <[email protected]>
Minor english correction in Docs (#1229)

github.com/ocrmypdf/OCRmyPDF - daabb5b1003301e6b35f82d528fa3a00f5554dfc authored 12 months ago by Anish Sapkota <[email protected]>
Change Windows build to use older Ghostscript

github.com/ocrmypdf/OCRmyPDF - 035ebea72fc2644215b38b2e1a6fa7133b4ccab3 authored 12 months ago by James R. Barlow <[email protected]>
v16.0.4 release notes

github.com/ocrmypdf/OCRmyPDF - a4999564625f23a0439a9cf5b6a7ef8000b4d0c0 authored 12 months ago by James R. Barlow <[email protected]>
Update cache

github.com/ocrmypdf/OCRmyPDF - 74d2a156c44630cc525bcbfee1753efd561a0ed4 authored 12 months ago by James R. Barlow <[email protected]>
gs: check for text-corrupting versions

github.com/ocrmypdf/OCRmyPDF - f87fc7b12d579b0bba840e29517bfcc9ebceadf1 authored 12 months ago by James R. Barlow <[email protected]>
hocr: fix unbound local

github.com/ocrmypdf/OCRmyPDF - 602f5632cb5c97896ab3f98459aff18d11fe90b8 authored 12 months ago by James R. Barlow <[email protected]>
Merge branch 'feature/hocrdebug'

github.com/ocrmypdf/OCRmyPDF - 9fbbcf75996bc98228ff2a40c387a5a73747eba0 authored 12 months ago by James R. Barlow <[email protected]>
hocr: Pass text direction for standalone words

github.com/ocrmypdf/OCRmyPDF - 9498f01f597566b17fc068dd6b1a516ad480b515 authored 12 months ago by James R. Barlow <[email protected]>
Fix render_mode ON_TOP when previous content stream does not restore CTM

github.com/ocrmypdf/OCRmyPDF - 2c59aca5a184ae503e3ff3cad64b48b59411ffe2 authored 12 months ago by James R. Barlow <[email protected]>
Change default graft render mode back to underneath

It was mistakenly changed to on top at some point

github.com/ocrmypdf/OCRmyPDF - 51301d69c9b7a3b7fa2974cb5342e075c5814a37 authored 12 months ago by James R. Barlow <[email protected]>
Use Courier as debug font

github.com/ocrmypdf/OCRmyPDF - 7e608fd1dfa19ae5c71df74ddbe7c908022b8583 authored 12 months ago by James R. Barlow <[email protected]>
Adjust issue template files description

github.com/ocrmypdf/OCRmyPDF - ecc79315df322fc3229ec27808a34487974d4802 authored 12 months ago by James R. Barlow <[email protected]>
Skip testing oom killer on Python 3.12

Need to investigate further if there's a safe way to do this test.

github.com/ocrmypdf/OCRmyPDF - 14365d10b87590be51a952867b2565b1f3f25119 authored 12 months ago by James R. Barlow <[email protected]>
v16.0.3 release notes

github.com/ocrmypdf/OCRmyPDF - 5e5320020fe04bdbcf71ee701ce1c4553f04699a authored 12 months ago by James R. Barlow <[email protected]>
Drop Ghostscript CVE warning and adjust version down to 9.54

Most distros have probably now backported the CVE fix. Dropped version to 9.54 so RHEL 9 can use...

github.com/ocrmypdf/OCRmyPDF - 103c3e0cd67eac44fc91cf7f0c6d45f56dd182e7 authored 12 months ago by James R. Barlow <[email protected]>
fixed a spelling mistake (#1222)

github.com/ocrmypdf/OCRmyPDF - 7a1c89edd9c72165ba03e264c9da006862e91f56 authored 12 months ago by Anthony Nabil <[email protected]>
Update instructions about custom language packages

Closes #1218

github.com/ocrmypdf/OCRmyPDF - a5ff3d2f427f87b154ecb63173f11435eab93863 authored 12 months ago by James R. Barlow <[email protected]>
Draw RTL text with reversed matrix instead of reversing characters

github.com/ocrmypdf/OCRmyPDF - b71d16dd9627a4b3c0adc317acc594b9041dbd22 authored 12 months ago by James R. Barlow <[email protected]>
Reversing character order for RTL helps output

github.com/ocrmypdf/OCRmyPDF - fd593eb5e9bbd8d8e129a4ca7d95b8753ddd2cf0 authored 12 months ago by James R. Barlow <[email protected]>
Improve and externalize debug rendering settings

github.com/ocrmypdf/OCRmyPDF - a0b98abb945aa344958e1268401db750499f44a7 authored 12 months ago by James R. Barlow <[email protected]>
Also fix space rendering

github.com/ocrmypdf/OCRmyPDF - 18353e1e94023890089c39f1bed732ca3eedc794 authored 12 months ago by James R. Barlow <[email protected]>
Overhaul debug rendering to support possible use of Helvetica for testing

github.com/ocrmypdf/OCRmyPDF - 9adcad84da99bc5295991fa90b6409c02d3a620b authored 12 months ago by James R. Barlow <[email protected]>
Fix upside down glyphless font

github.com/ocrmypdf/OCRmyPDF - f2714586d80e22ae984925752c197c8f4577ebd6 authored 12 months ago by James R. Barlow <[email protected]>
Update issue template

github.com/ocrmypdf/OCRmyPDF - 0b6fb62967b1e79f2f754437dcb99ed0d72df042 authored almost 1 year ago by James R. Barlow <[email protected]>
Temporarily revert default renderer to sandwich while investigating regressions in macOS Preview

github.com/ocrmypdf/OCRmyPDF - 1db8b0b94308f64ef9f348402e94f78345c1999e authored almost 1 year ago by James R. Barlow <[email protected]>
Update issue template

github.com/ocrmypdf/OCRmyPDF - f38aebb3d5b57ba43f82f46dab3580453d9a6364 authored almost 1 year ago by James R. Barlow <[email protected]>
Name artifacts explicitly

github.com/ocrmypdf/OCRmyPDF - 7162c36d371d6fb2f81b78dbca3eff8f3442e229 authored about 1 year ago by James R. Barlow <[email protected]>
Update artifact actions

github.com/ocrmypdf/OCRmyPDF - f4d4ea46c8e5dddc416399ad30bfb3ccbd6f10c5 authored about 1 year ago by James R. Barlow <[email protected]>
v16.0.1 release notes

github.com/ocrmypdf/OCRmyPDF - 2fd1a0f178474a690c32c4c2db4f733147642070 authored about 1 year ago by James R. Barlow <[email protected]>
Tighten dependencies

github.com/ocrmypdf/OCRmyPDF - 73ed33a086c8b2cfe2a896ace016da62c8bf1730 authored about 1 year ago by James R. Barlow <[email protected]>
Fix text rendering issue with new hOCR text renderer

github.com/ocrmypdf/OCRmyPDF - e6095a9949e92b200d4999f543502644e656d150 authored about 1 year ago by James R. Barlow <[email protected]>
Fix release notes - drop rc from version

github.com/ocrmypdf/OCRmyPDF - 16f05af4014dd5a29281406d80e5a10a2576dec4 authored about 1 year ago by James R. Barlow <[email protected]>
Bump actions/setup-python from 4 to 5 (#1205)

Bumps [actions/setup-python](https://github.com/actions/setup-python) from 4 to 5.
- [Release n...

github.com/ocrmypdf/OCRmyPDF - 1631afc878b0e2b3329ba418e78d462b42ae2b35 authored about 1 year ago by dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Fix --fast-web-view documentation (#1206)

github.com/ocrmypdf/OCRmyPDF - 63d87fc44001d3eaff59b396ff4665e3a43af066 authored about 1 year ago by Robin Richtsfeld <[email protected]>
Skip test_encrypted on Py3.12 + macOS

github.com/ocrmypdf/OCRmyPDF - 9489c012594f569754d9d31128e0131f5c914282 authored about 1 year ago by James R. Barlow <[email protected]>
Fix build settings to adjust for dropping py39

github.com/ocrmypdf/OCRmyPDF - 30d92ad83ff879aea01c382b3d2e2dd1cd2d3294 authored about 1 year ago by James R. Barlow <[email protected]>
Filter rl_safe_eval deprecation warning

Full message
eportlab/lib/rl_safe_eval.py:11: DeprecationWarning: ast.NameConstant is deprecated...

github.com/ocrmypdf/OCRmyPDF - a4987733c4f1fe0b4a88dad0839837eeda20a1f3 authored about 1 year ago by James R. Barlow <[email protected]>
v16.0.0rc1 release notes

Fixes #1009, #1191, #1157

github.com/ocrmypdf/OCRmyPDF - 39eee05230d37aeb4b3f9ad4da9cea3a0e043449 authored about 1 year ago by James R. Barlow <[email protected]>
Merge branch 'feature/modernhocr'

github.com/ocrmypdf/OCRmyPDF - 5b2f2e62900777d25176bb7d5769baf3d30360d0 authored about 1 year ago by James R. Barlow <[email protected]>
Rebuild cache for hocr default case

github.com/ocrmypdf/OCRmyPDF - 445617a1a55bc104525ec12998d0c336dcac1596 authored about 1 year ago by James R. Barlow <[email protected]>
hOCR renderer is now default

github.com/ocrmypdf/OCRmyPDF - f6e90a593409787857eb82b493f875254ddf6daa authored about 1 year ago by James R. Barlow <[email protected]>
Move canvas API to pikepdf and import it

github.com/ocrmypdf/OCRmyPDF - 43618e6b3f7fe0881a4d8435f2229e81d5b726e2 authored about 1 year ago by James R. Barlow <[email protected]>
Refactor font so glyphless isn't as hard coded

github.com/ocrmypdf/OCRmyPDF - e97f89de3beb6b993513cde011ccfa7ec6a2a3e3 authored about 1 year ago by James R. Barlow <[email protected]>
Fix hocrtransform CLI

github.com/ocrmypdf/OCRmyPDF - 11d3e32f1e2411459b1d0e247738a75909a6905d authored about 1 year ago by James R. Barlow <[email protected]>
Remove code that attempted to manage xattrs out of output file

Feature requested in issue #1179, but caused #1195. On further review,
there is no platform inde...

github.com/ocrmypdf/OCRmyPDF - 2affa83efe280e5ec40045978d633986929daeb4 authored about 1 year ago by James R. Barlow <[email protected]>
Fix Ghostscript installation instructions and add warning

github.com/ocrmypdf/OCRmyPDF - c90d5cd84b0e27906310c28f0d78c3ded9106d65 authored about 1 year ago by James R. Barlow <[email protected]>
Ignore pypy for now

github.com/ocrmypdf/OCRmyPDF - aacaba3d267b8e390c49fd1dec03666bde31fd18 authored about 1 year ago by James R. Barlow <[email protected]>
Remove next major release deprecations

github.com/ocrmypdf/OCRmyPDF - fec53be84124cae94d65dcd556b71df43422b303 authored about 1 year ago by James R. Barlow <[email protected]>
Drop Python 3.9 support

github.com/ocrmypdf/OCRmyPDF - 3f7b540f76cdfa90eadd356faa6978e5f2493db5 authored about 1 year ago by James R. Barlow <[email protected]>
Make hocrdebug work, and try to handle CJK spacing better

github.com/ocrmypdf/OCRmyPDF - d217856166adb30e3219f728ba9b160382dbd273 authored about 1 year ago by James R. Barlow <[email protected]>
Avoid divzero

github.com/ocrmypdf/OCRmyPDF - e2be457e9b92788c2a042d5353c569b056672003 authored about 1 year ago by James R. Barlow <[email protected]>
Make text API more like an accessor

github.com/ocrmypdf/OCRmyPDF - 4850f486d2de6c3a3376f20e06afdefe56631868 authored about 1 year ago by James R. Barlow <[email protected]>
Fix placement of spaces in debug mode

github.com/ocrmypdf/OCRmyPDF - 729c7febd97eafa25256dbc195f3570ffd43f8a4 authored about 1 year ago by James R. Barlow <[email protected]>
Refactor save_state

github.com/ocrmypdf/OCRmyPDF - 6c6aca2f1ec322f4654059547eba299d771f3558 authored about 1 year ago by James R. Barlow <[email protected]>
Refactor; accumulate content stream as bytes rather than discrete pikepdf objects

github.com/ocrmypdf/OCRmyPDF - c69823f4963616c0992e2ab14d4402b66089fad5 authored about 1 year ago by James R. Barlow <[email protected]>
Add RTL output - seems to work, but debug does not

github.com/ocrmypdf/OCRmyPDF - 73f8f6aac8ae37c628f2c0b12b081ecc46438bcc authored about 1 year ago by James R. Barlow <[email protected]>
hocr: typing cont'd

github.com/ocrmypdf/OCRmyPDF - d944254e4564c7db93d1b3eae61dcd4d78086c9b authored about 1 year ago by James R. Barlow <[email protected]>
hocr: typing

github.com/ocrmypdf/OCRmyPDF - f7ddffe55479fab456be2c37d9d6dfc70fcf0ee9 authored about 1 year ago by James R. Barlow <[email protected]>
Fix JBIG2 not updating progress bar

github.com/ocrmypdf/OCRmyPDF - 8a73ed5d5a347097159cfb2feddf747dc8000304 authored about 1 year ago by James R. Barlow <[email protected]>
Rationalize canvas interface

github.com/ocrmypdf/OCRmyPDF - 03669183d79d6e51a69916fb271f360398db2241 authored about 1 year ago by James R. Barlow <[email protected]>
Improve canvas interface with chaining

github.com/ocrmypdf/OCRmyPDF - 74e101a2faa79033f691a5fa725776243ecf8afa authored about 1 year ago by James R. Barlow <[email protected]>
Restructure hocrtransform submodule to avoid having everything in __init__

github.com/ocrmypdf/OCRmyPDF - 532cf18ad3762cbe1c2f4ea27f0bedd7dedcbd42 authored about 1 year ago by James R. Barlow <[email protected]>
More tidying

github.com/ocrmypdf/OCRmyPDF - 0b90b697e2376ee3f2a39cd1b47e8e52d3cb87c1 authored about 1 year ago by James R. Barlow <[email protected]>
Fix colors and space box rendering

github.com/ocrmypdf/OCRmyPDF - 6be7c5f7c857293a8d64a075a183f2a0bbad85d3 authored about 1 year ago by James R. Barlow <[email protected]>
Remove some obsolete parameters

github.com/ocrmypdf/OCRmyPDF - db2e5132e64c53893ec12cd2efd959d4f09ebf35 authored about 1 year ago by James R. Barlow <[email protected]>
Fix private accessors, rename pdf to canvas

github.com/ocrmypdf/OCRmyPDF - 2ca6e110ca15f291bfd1136691e96830a9d0f550 authored about 1 year ago by James R. Barlow <[email protected]>
Refactor debug printing

github.com/ocrmypdf/OCRmyPDF - 334a07c8390cfc76842ccea93dbca66fe69789d9 authored about 1 year ago by James R. Barlow <[email protected]>
Refactor: extract methods

github.com/ocrmypdf/OCRmyPDF - a57c39358d73168c69b8648cd61a329e34990b37 authored about 1 year ago by James R. Barlow <[email protected]>
Further exploratory improvements

github.com/ocrmypdf/OCRmyPDF - 30a0c315fba50354e6d963813de083c31b51efc3 authored about 1 year ago by James R. Barlow <[email protected]>
Make coordinate system more consistent

github.com/ocrmypdf/OCRmyPDF - b860f0d94c7760c492f2649ffb4d362211d6cde4 authored about 1 year ago by James R. Barlow <[email protected]>
WIP improve text positioning (not there yet)

github.com/ocrmypdf/OCRmyPDF - 14f4c19f5a3be1f55b26a841bb9e8056b16fe869 authored about 1 year ago by James R. Barlow <[email protected]>
More colors

github.com/ocrmypdf/OCRmyPDF - 7ab5c55d4678423a983c0a50dbb61bb558eb2464 authored about 1 year ago by James R. Barlow <[email protected]>
Fix line and rect drawing

github.com/ocrmypdf/OCRmyPDF - 8b6ecd5971b255fe283fe12107ca33e13dd8a6c9 authored about 1 year ago by James R. Barlow <[email protected]>
Fix position errors; ignore non-glyphless font

github.com/ocrmypdf/OCRmyPDF - 7b0871ae4c95fe8c12372d0c7e503b74185fa2c8 authored about 1 year ago by James R. Barlow <[email protected]>
Tidying new hOCR renderer

github.com/ocrmypdf/OCRmyPDF - b14f6f778abac2cbf406664128442f5e7c134111 authored about 1 year ago by James R. Barlow <[email protected]>
Fix more things

github.com/ocrmypdf/OCRmyPDF - 0a07c0a44edc21cb4103c1a91740c37a5bf52d33 authored about 1 year ago by James R. Barlow <[email protected]>
Render interword spaces separately and avoid box overlap

github.com/ocrmypdf/OCRmyPDF - 2633716bb71d6da407e763994ad2c77bd966ebf2 authored about 1 year ago by James R. Barlow <[email protected]>