Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/camelot-dev/camelot

A Python library to extract tabular data from PDFs
https://github.com/camelot-dev/camelot

It seems tables with only one line are not identified as such?

pboettch opened this issue over 2 years ago
Runtime error~import cv2

paulfruitful opened this issue over 2 years ago
Failed to install cryptography

Divyansh-Gemini opened this issue over 2 years ago
Encoding problem with accented letters / diacritic

josephernest opened this issue over 2 years ago
Unknown Flavor, excaliber json incompatible with camelot

bosd opened this issue over 2 years ago
Multi-column page layout processing

endovitskayaV opened this issue over 2 years ago
updated install-deps doc

phacic opened this pull request over 2 years ago
improving table extraction for html output

Alexandre-bitoun opened this pull request over 2 years ago
Bug: Hardcoded value of '10' limits number of tables in page

ottohirr opened this issue over 2 years ago
ZeroDivisionError in text_in_bbox for some tables

peterhvoth opened this issue over 2 years ago
Can't set mode using to_* methods

captn3m0 opened this issue over 2 years ago
pdf not parsing properly

SagarPrasad22 opened this issue over 2 years ago
changed storage of configuration data

Gladioluss opened this pull request over 2 years ago
Black background color problem

MinMagar opened this issue over 2 years ago
Fix: unwanted data leaks into the last cell

power-zhy opened this pull request over 2 years ago
Input of different table areas on different pdf pages.

RyosukeSakaguchi opened this issue over 2 years ago
No documentation to '.parsing_report' or '.df'

LuizMosciaro opened this issue over 2 years ago
updated ghostscript_backend to work with windows machines

isayahc opened this pull request over 2 years ago
MAINT: Replace PyPDF2 by pypdf

MartinThoma opened this pull request over 2 years ago
[Question] Detect table from PDFs (Return bool values only)

teohsinyee opened this issue over 2 years ago
ZeroDivisionError: float division by zero

pratheesh-prakash opened this issue over 2 years ago
[MRG] Utils: optimise get_page_layout

karlowich opened this pull request over 2 years ago
show() not showing anything

SteffRainville opened this issue over 2 years ago
[MRG] Fixed ZeroDivisionError in text_in_bbox

const-FC opened this pull request almost 3 years ago
IndexError while using split_text

ramSeraph opened this issue almost 3 years ago
No module named 'cv2' when importing camelot

scotscotmcc opened this issue almost 3 years ago
fix float division by zero

tuyenta opened this pull request almost 3 years ago
Optimised and cleaned the code [MRG]

python3-dev opened this pull request about 3 years ago
Camelot without X

HoffmannP opened this issue about 3 years ago
DOC: Update broken links

alissonsv opened this pull request over 3 years ago
[MRG] add support for file_bytes argument with managed_file_context()

cscanlin opened this pull request over 3 years ago
Call pdftopng in python instead of subprocess

orent opened this pull request over 3 years ago
Minimum table area supported by Camelot

ZafarShadman09 opened this issue over 3 years ago
ZeroDivisionError when reading PDF in text_in_bbox

stefanw opened this issue over 3 years ago
added a new cli test for importerror

smitharajesh opened this pull request over 3 years ago
test for invalid url

mohanqxf2 opened this pull request over 3 years ago
[MRG]added test to validate when plot_type is None

rohinigopalqxf2 opened this pull request over 3 years ago
[MRG]Improve test coverage

akkuldn opened this pull request over 3 years ago
Support Reading directly from BytesIO

omarsumadi opened this issue over 3 years ago
[WIP] Add support for parsing PDF pages in parallel

phoewass opened this pull request over 3 years ago
Specify layout dimensions of page

talha298 opened this issue almost 4 years ago
[MRG] fix list index out of range and float division by zero bug

guet3401 opened this pull request about 4 years ago
Is there any plan to remove dependency of PyPDF2?

kiyo-matsu opened this issue about 4 years ago
Installation on linux with pip misses the python3-xlwt dependency

D-W-L opened this issue about 4 years ago
hey, how to get the coordinates joints in the table

liugj101 opened this issue about 4 years ago
[MRG] Add saturation threshold option for low contrast tables

NoReflex opened this pull request about 4 years ago
IndexError: list index out of range when specifying the area of table

astariul opened this issue about 4 years ago
Multiple line-breaks removed

leventebo opened this issue about 4 years ago
Dark tables not detected.

Semnodime opened this issue over 4 years ago
PermissionError at __exit__ in utils.py

palask opened this issue over 4 years ago
Replace ghostscript with pdf2image (fixes multithreading)

rawsh-bt opened this pull request over 4 years ago
Network and Hybrid parsers

FrancoisHuet opened this pull request over 4 years ago
[REF] add DataFrame to_excel params, requirements.txt XlsxWriter>=0.9.8

xubiuit opened this pull request over 4 years ago
Output shows '...' instead of all columns

gunnervino opened this issue over 4 years ago
Fix unit tests, lint, drop Python 2 support

FrancoisHuet opened this pull request over 4 years ago
[ TextEdges ] allow single non-empty char textline

pushkarnimkar opened this pull request almost 5 years ago
Add row support in stream

idan-david opened this pull request almost 5 years ago
Other index error in lattice parser

anakin87 opened this issue about 5 years ago
Allow read_pdf to accept a file-like object

Lnk2past opened this issue about 5 years ago
Index out of range latt

bikashgupta11 opened this pull request about 5 years ago
Activating Open Collective

monkeywithacupcake opened this pull request about 5 years ago
[MRG] Reduce file reads in handlers

vinayak-mehta opened this pull request about 5 years ago
Added a help line to install dependencies

sidntrivedi012 opened this pull request about 5 years ago
ZeroDivisionError: float division by zero - table_regions

vinayak-mehta opened this issue about 5 years ago
why module 'camelot' has no attribute 'read_pdf'

weijiuyang opened this issue over 5 years ago
Great library, but dependencies ??!!

akshowhini opened this issue over 5 years ago
Added multi parameter for page level parameters

sverma25 opened this pull request over 5 years ago
TableList error in Camelot and in Excalibur

njss opened this issue over 5 years ago
Fixed encoding trouble

KOLANICH opened this pull request over 5 years ago
Add more pdf-to-image engines?

vinayak-mehta opened this issue over 5 years ago
Use multiprocessing to parallely process PDF pages

vinayak-mehta opened this issue over 5 years ago
Hybrid flavor combining lattice and stream

vinayak-mehta opened this issue over 5 years ago