Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
https://github.com/ocrmypdf/OCRmyPDF

Remove tesseract_badutf8.py

Should have been removed in 9db01c7

b840b16c82c3845bcece3c5f8e360857e5f40571 authored almost 5 years ago
Remove last vestiges of command line usage of qpdf - change to check_pdf

8f5c95f0f4aee6d50ec0c82d75133cd5621cd6cf authored almost 5 years ago
Update release notes with v10 changes

168fc6077478c5aadfbdd00e612ab8c8e7642f68 authored almost 5 years ago
ghostscript: remove deprecated argument from generate_pdfa

c84d0f606d5558b491d1ab7c00434c63f932aef0 authored almost 5 years ago
setup: remove deprecated message about removal of --force parameter

8b54ce338f1ba0880ae770d944700bd424984fb3 authored almost 5 years ago
Adjust number of workers for concurrent page scanning

18c4aa10bf524b864763879e9091f5da7787035f authored almost 5 years ago
Remove Ghostscript-based text extraction

While faster than Python based methods, we've outgrown the limited
amount of information Ghostsc...

991db17fdeb212f524e3be499c542b904b3e97af authored almost 5 years ago
macOS - use spawn for multiprocessing

See bpo-33725. This is the default for 3.8, opt-in for 3.7 and older.

2c07515907da1b071c4a5e64e66249d9ff8ba7aa authored almost 5 years ago
Fix some broken tests

7513f5425c7fbd7c8ef9bcdca7016d5d1cdcd055 authored almost 5 years ago
Use once-per-worker pikepdf init

27a3b80376533ea39c3c17e8152c89d03caaba27 authored almost 5 years ago
Replace task_initargs with use of partial()

8c381a022729e41c04ef38e74230e350d538e127 authored almost 5 years ago
Further refactoring of concurrency concerns

af3c3c646606993d3fa6f16486844f0ae2a544e0 authored almost 5 years ago
Something wrong with forking worker_pdf, just open it once per page for now

86145a8c76c714f8d690f53e709b2024cf6526b5 authored almost 5 years ago
Refactor multiprocessing pool

db3e75e33ef88471ce11acfbe801e633bf0fabce authored almost 5 years ago
First cut at concurrent page scan

Improvement appears on 168 page file. Needs refactoring

d0d0a98dca15e188d8967e725846028158be8898 authored almost 5 years ago
Do pikepdf.open() once instead of per worker

ce49fc26dd2a8bd5cf19c82fa446d850e51bae20 authored almost 5 years ago
azure: use brew python instead

3834d1a0bf39bb900b9b933051fd7117e09c9a5e authored almost 5 years ago
azure: add certifi, openssl for macOS

33e982b3fdebb6163dc3b09f4e7b2f246b765962 authored almost 5 years ago
Fix issue where only first PNG-style image would be optimized

43d650e78c47ea67a42442ad54d459ba1da8c823 authored almost 5 years ago
Update requirements

b4c65c57816b9c9434a2e7f2eaf7a3855e46b6fe authored almost 5 years ago
watcher: add polling and log level adjustment

d96867e6ab9a53e72a01c7926aaba1e0aebab8bb authored almost 5 years ago
install: clarify that old ocrmypdf should be removed from Ubuntu 18.04

Closes #526

0a5108e704aa760d3dbc8746d9c709ee3514f402 authored almost 5 years ago
Refactor 'xyres' into Resolution

94c52a6fa3d92f7a5f85af6f1fecdd7ae1e76310 authored almost 5 years ago
Refactor xy-pair for resolution to tuple

57771f06a32f4d540590956e20c6a540f8719ecc authored almost 5 years ago
pytest picky about list vs tuple

58abb5785cf55d0cfddeee017e81ca4a8250a94c authored almost 5 years ago
v9.7.2 release notes

509e75eaffffe129d3dff5aef8dce6bdfd55c403 authored almost 5 years ago
Support pdfminer.six 20200402

0c50eedb2a9bb6674700ebc35771d75403e6fa7c authored almost 5 years ago
Drop support for pdfminer.six 20181108

This version required a patch that has since been mainlined, and also did not
declare its depend...

4581027246bcf76a0a06fc9d9d1fe8230af24f3d authored almost 5 years ago
hocrtransform: cleanup/PEP8

Some API breaking changes.

31b5f63f85029944269e6bfda0b6028b03e534f4 authored almost 5 years ago
pytest picky about list vs tuple

957fb1494e4724686ad14e03abb440cfd6e01776 authored almost 5 years ago
Improve help text about aborting due to text

9e3e4f2687690cc18e96c48afb7e91672bb09b96 authored almost 5 years ago
Loosen test language requirements - eng/deu

2155bcacb46ba34bab7d62a9497dfd516e2a2c42 authored almost 5 years ago
Suppress loglevel since we have color now

346da95899e1227d5e04a2d7225b5812086af9c5 authored almost 5 years ago
Add colored logs

f4f7946a0c2363a76524090221031df569e788fc authored almost 5 years ago
Reinstate logging of page numbers

c2919f2e1ca16b838ab1247e79aaff04697b6a16 authored almost 5 years ago
Improve logging of subprocess output

a63d624052fe66b067ce62c4a67c2bc95a46f67a authored almost 5 years ago
Refactor Windows executable shims

4ff4ed24a80a4d4c5b84aba8412849993e5fb43c authored almost 5 years ago
Remove safe_symlink log= warning

af914893763fb8620be0cee7bc8b3c700f5025b8 authored almost 5 years ago
The Great Logging Refactor

Remove all instances of logger object being passed as parameters.
This was a holdover from ruffu...

d146d2b65c7cd99b6fffa4a5aac5247786b4686d authored almost 5 years ago
Merge branch 'master' of github.com:jbarlow83/OCRmyPDF

c38ff90081331ce3ab2db1f82e84fc3fc7a57951 authored almost 5 years ago
Fix isinstance(..,str)

4c029e973faaf24e933540ce63221c3b42980730 authored almost 5 years ago
docs: Set ownership when using docker image (#518)

21cf9029e83b2061257ec01af234c4e6c91fbad9 authored almost 5 years ago
Fix language argument not working as list

Fixes #523

4a640b8dcd9eb0cd2e74eaf748fd65064cc639e3 authored almost 5 years ago
Fix versions with leading v, e.g. v5.0

9471bc8921054a58685fcc3317da966cae21e4fb authored almost 5 years ago
v9.7.1 release notes

7fe06c64fc75996e9a6c01e1033bf0af8e149d2c authored almost 5 years ago
docs: warn that AWS Lambda doesn't work

32a88f1bad23fedc555284523c234d953f507ac1 authored almost 5 years ago
docs: warn that Windows users should use an ifmain guard

99ef42940c8f81b31eca1c4bd4904fb79dea5850 authored almost 5 years ago
Fix version checker failing for qpdf 10.0.0

Fixes #527

d13d70fd56419b12786226af0067ef66a2904031 authored almost 5 years ago
Add a few more type annotations to public APIs

58ec56180a751c335589661d133494d94da85e5a authored almost 5 years ago
Update issue templates

c152710617ffb8e1808886505e74a205dd696525 authored almost 5 years ago
v9.7.0 release notes

8de0f9b86f830395aa40a5df428e4c847eed6a15 authored almost 5 years ago
tests: workaround for Ghostscript 9.52 txtwrite problem

23bc3d3a29439a05ab510c734881563638c6b9eb authored almost 5 years ago
tests: add force OCR to a file with text that Ghostscript doesn't see

For gs 9.52 support.

Also refactor use of pikepdf.open() to use with blocks.

8307832ce9675d3154c0af04bcf9099908e863fb authored almost 5 years ago
watcher: Fix JSONDecodeError if OCR_JSON_SETTINGS not set

Fixes #516

dd1cf567dbebb7d62a3b9ff6d5b85d69a294533e authored almost 5 years ago
Fix debug.log not being deleted on Windows (probably)

Fixes #515

2490be849045d361ba5c9913a42949027e361f6c authored almost 5 years ago
docs: Add username to WSL instructions

Fixes #519

85e6c6669a53ff439982914539752b19e6215e7a authored almost 5 years ago
validation: blacklist Ghostscript 9.51 too

00498282f53b77be7820198fafe0df2765cf7ce2 authored almost 5 years ago
Wrong number of threads to use shown when OMP_THREAD_LIMIT is defined

e4cc9fcba73b1614781d398e30ef1aa098fa6841 authored almost 5 years ago
Add halftone mask to leptonica

a4555b1daed4576badd06f92a35f8938516ae9fe authored almost 5 years ago
info.py: linearize O(n^2) search for used images on a page

f35a2303bb9709d62d97b8939d5b7059ebba270f authored almost 5 years ago
Merge branch 'master' of github.com:jbarlow83/OCRmyPDF

82142fe5efd4e1e74cfa7117b31e6dad5633e490 authored almost 5 years ago
watcher: allow all parameters to ocrmypdf.pdf to be passed by JSON

9be533b5f4a53c9a735a218fe914bdfd0b6bc88e authored almost 5 years ago
optimize: consider ICCBased 1 bit for optimization

99653fcd32445a4e37a3ba30f2cb8704098ab05d authored almost 5 years ago
Consult ICC profile when determining image colorspace

5442c97ed82ead9599bc1f275c9ff52a8416b11b authored almost 5 years ago
fix install instructions for Ubuntu 16.04 (#507)

`pip3` defaults to the system's outdated version which downloads wrong qpdf package.

0165255bd97e4f743d59f70484b780dc906f92fe authored almost 5 years ago
Expand documentation for subprocess.run() from test

378e4dae3ba77ef3f0387fbc6e6a885c1db7bf6b authored almost 5 years ago
reqs: update pikepdf version

cdf5afa75311937a1b06911f58ed4f77f1b8cf97 authored almost 5 years ago
v9.6.1 release notes

a2deee49209a389ca2ce8eb3cc4886d979e73e13 authored almost 5 years ago
Remove potentially non-free file logo.afdesign

1efa79cce274e8e813b33e54e4be045a77746137 authored almost 5 years ago
Handle malformed DocumentInfo (#497)

User submitted a PDF in which /Trailer /Info pointed to the XMP metadata
block instead of a Docu...

b3b61c152cc65d74cac054bda1f3a57e11e2da90 authored almost 5 years ago
docs: install cleanup

e429c3d7298a8a380cde32ed42f07bfce95732fe authored almost 5 years ago
Merge branch 'master' of github.com:jbarlow83/OCRmyPDF

8308b20096ec8fa977a41e459f9f2b17a4af3c6d authored almost 5 years ago
docs: docker prefers .yml not .yaml

8b41f60b6e4f6e5cfb35b7eba5a3fbd1326b7beb authored almost 5 years ago
Improve ocrmypdf.bash completions on macOS (#504)

Fixes #502

d56f7490172999db993f7a954713ab1eb71064c0 authored almost 5 years ago
docs: document --pages

9f31774aa99ef5c1f988ecd8b8366dc86a0597f5 authored almost 5 years ago
docs: extract example files from batch.rst

7d55f6e01fa35378a82888bcbbf1856e78f17292 authored almost 5 years ago
docs: fix Docker syntax to use stdin/stdout properly

c3bd2f296d27b13628238f3ade250cfc56e8b03e authored almost 5 years ago
watcher: add self to copyright

e40c60d4d86ec847e92c3cce77a25734a514a305 authored almost 5 years ago
docs: more clarifications

3960232ae054cae0fc5bcbd8b5e02f85f1fea85d authored almost 5 years ago
Merge branch 'master' of github.com:jbarlow83/OCRmyPDF

5fbb3fc6acaa56bd4425d3b7f8edc8a5d054f4f9 authored almost 5 years ago
Fixes docker-compose.yaml file (#499)

Fixes https://github.com/jbarlow83/OCRmyPDF/issues/498

0b1db8fccd62421501f648d6dc5dcdd9373d2899 authored almost 5 years ago
docs: some mild improvements

0417610f9bdef647751a037c5320ce0d28baaf09 authored almost 5 years ago
Disable Travis

43a23e3695583344a18858387b610e5911e9f5b0 authored almost 5 years ago
Merge branch 'pigmonkey-aur-manual'

c4ca572afd22e51bb9c7cc743ad2664b2f60d066 authored almost 5 years ago
Demonstrate installing the AUR package without a helper

This describes how to use the AUR package on a minimal install, as per
the discussion in #494.

...

e04e4565a9f535c34ad73a2fb8553478d53a438b authored almost 5 years ago
Update installation instructions for FreeBSD (#493)

Python 3.7 is the new default version since 2020Q1 which is reflected in
the new prefix (= py37...

2391fb0be0ad4913c05f8c536c9a52f6c557cfa1 authored almost 5 years ago
docs: add Docker compose configuration for watchdog

c16f79d51b9a825dd1eb85f2d77a24bd30b5c328 authored almost 5 years ago
Readme: Add another heise article

e3e888efdeb0cbc6e1d5a21c0ee741b65093b567 authored almost 5 years ago
Merge branch 'master' of github.com:jbarlow83/OCRmyPDF

84f7e34ace8e4a4225ef1c144320dca10678af02 authored almost 5 years ago
Docker image also includes French, Portuguese and Spanish (#491)

32e2175891743c6610e041b6ecbacb39e226a930 authored almost 5 years ago
docs: archlinux install - yaourt is gone

975abfde9a81ca8a2bace7e172d68131542fda82 authored almost 5 years ago
docs: typo

d929ed76c294e57f3366d11e5d340a2a1a62fdaa authored about 5 years ago
Simplify metadata for invalid xml in output

Removes possibly non-free resource enron1.pdf.

4a27124eab1d87466bb5fd9818c61377f88e97c4 authored about 5 years ago
Update reqs

683ffb84e88322b574a7c67f8aeda508b985720c authored about 5 years ago
v9.6.0 notes updated

2f2602357bf0eec88f2f8bf792ae16486ed73ee8 authored about 5 years ago
Merge branch 'pr479'

a5f429f4994b755ba72d9157aa0b582409a1e771 authored about 5 years ago
ifmain -> main()

bdb7f92131acd7f16ce5649e564a89924f32f1a6 authored about 5 years ago
v9.6.0 notes

09f15ac4c0e21fe3c4e4fbfa0041c592acd44d3d authored about 5 years ago
setup: approve pdfminer.six 20200124

4fdbf55c11bbf95b61c87670a24fd50074aae3af authored about 5 years ago
docs: simplify/fix Ubuntu 18.04 install instructions

fe2b07652bca3f7c4c1c77cc1ff433ae8ea13cd7 authored about 5 years ago
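
Two of the commits listed above concern version checking ("Fix versions with leading v, e.g. v5.0" and "Fix version checker failing for qpdf 10.0.0"). The log does not say how OCRmyPDF implements the check or what exactly failed, so the following is only a minimal sketch of one way to handle both cases. It assumes the failures came from a leading "v" and from comparing version strings lexically. The names `parse_version` and `version_at_least` are hypothetical and are not taken from the project.

```python
import re
from typing import Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Hypothetical helper: turn 'v5.0' or '10.0.0' into a tuple of ints."""
    # Accept an optional leading 'v' or 'V', then dot-separated integers.
    match = re.match(r"[vV]?(\d+(?:\.\d+)*)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def version_at_least(found: str, required: str) -> bool:
    """Hypothetical helper: numeric comparison of two version strings."""
    a, b = parse_version(found), parse_version(required)
    # Pad the shorter tuple with zeros so that 5.0 and 5.0.0 compare equal.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b
```

Comparing tuples of integers avoids the pitfall where the string "10.0.0" sorts before "9.52" because "1" is less than "9".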