Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/camelot-dev/camelot
A Python library to extract tabular data from PDFs
https://github.com/camelot-dev/camelot
It seems tables with only one line are not identified as such?
pboettch opened this issue over 2 years ago
Runtime error~import cv2
paulfruitful opened this issue over 2 years ago
Failed to install cryptography
Divyansh-Gemini opened this issue over 2 years ago
Encoding problem with accented letters / diacritic
josephernest opened this issue over 2 years ago
Unknown Flavor, excaliber json incompatible with camelot
bosd opened this issue over 2 years ago
Multi-column page layout processing
endovitskayaV opened this issue over 2 years ago
updated install-deps doc
phacic opened this pull request over 2 years ago
improving table extraction for html output
Alexandre-bitoun opened this pull request over 2 years ago
Bug: Hardcoded value of '10' limits number of tables in page
ottohirr opened this issue over 2 years ago
ZeroDivisionError in text_in_bbox for some tables
peterhvoth opened this issue over 2 years ago
Can't set mode using to_* methods
captn3m0 opened this issue over 2 years ago
pdf not parsing properly
SagarPrasad22 opened this issue over 2 years ago
changed storage of configuration data
Gladioluss opened this pull request over 2 years ago
Black background color problem
MinMagar opened this issue over 2 years ago
Fix: unwanted data leaks into the last cell
power-zhy opened this pull request over 2 years ago
Input of different table areas on different pdf pages.
RyosukeSakaguchi opened this issue over 2 years ago
No documentation to '.parsing_report' or '.df'
LuizMosciaro opened this issue over 2 years ago
updated ghostscript_backend to work with windows machines
isayahc opened this pull request over 2 years ago
MAINT: Replace PyPDF2 by pypdf
MartinThoma opened this pull request over 2 years ago
try to read "Rechnungsdetails" from an Amazon invoice from Germany
InLaw opened this issue over 2 years ago
[Question] Detect table from PDFs (Return bool values only)
teohsinyee opened this issue over 2 years ago
ZeroDivisionError: float division by zero
pratheesh-prakash opened this issue over 2 years ago
[MRG] Utils: optimise get_page_layout
karlowich opened this pull request over 2 years ago
show() not showing anything
SteffRainville opened this issue over 2 years ago
[MRG] Fixed ZeroDivisionError in text_in_bbox
const-FC opened this pull request almost 3 years ago
IndexError while using split_text
ramSeraph opened this issue almost 3 years ago
No module named 'cv2' when importing camelot
scotscotmcc opened this issue almost 3 years ago
fix float division by zero
tuyenta opened this pull request almost 3 years ago
Optimised and cleaned the code [MRG]
python3-dev opened this pull request about 3 years ago
Camelot without X
HoffmannP opened this issue about 3 years ago
DOC: Update broken links
alissonsv opened this pull request over 3 years ago
[MRG] add support for file_bytes argument with managed_file_context()
cscanlin opened this pull request over 3 years ago
Call pdftopng in python instead of subprocess
orent opened this pull request over 3 years ago
two single row tables in two separate pdfs don't bet read by camelot as tables
myrhillion opened this issue over 3 years ago
Minimum table area supported by Camelot
ZafarShadman09 opened this issue over 3 years ago
ZeroDivisionError when reading PDF in text_in_bbox
stefanw opened this issue over 3 years ago
added a new cli test for importerror
smitharajesh opened this pull request over 3 years ago
[MRG]Added test case for method bbox_no_intersection from utils to check no intersection logic
rahul-bhave opened this pull request over 3 years ago
test for invalid url
mohanqxf2 opened this pull request over 3 years ago
[MRG]added test to validate when plot_type is None
rohinigopalqxf2 opened this pull request over 3 years ago
[MRG]Improve test coverage
akkuldn opened this pull request over 3 years ago
Support Reading directly from BytesIO
omarsumadi opened this issue over 3 years ago
[WIP] Add support for parsing PDF pages in parallel
phoewass opened this pull request over 3 years ago
Specify layout dimensions of page
talha298 opened this issue almost 4 years ago
[MRG] fix list index out of range and float division by zero bug
guet3401 opened this pull request about 4 years ago
Is there any plan to remove dependency of PyPDF2?
kiyo-matsu opened this issue about 4 years ago
Installation on linux with pip misses the python3-xlwt dependency
D-W-L opened this issue about 4 years ago
hey, how to get the coordinates joints in the table
liugj101 opened this issue about 4 years ago
[MRG] Add saturation threshold option for low contrast tables
NoReflex opened this pull request about 4 years ago
IndexError: list index out of range when specifying the area of table
astariul opened this issue about 4 years ago
Multiple line-breaks removed
leventebo opened this issue about 4 years ago
Dark tables not detected.
Semnodime opened this issue over 4 years ago
Doc enhancement: Note dependency on libgs.so (libgs.dylib on Mac) for ghostscript
jimhall opened this issue over 4 years ago
PermissionError at __exit__ in utils.py
palask opened this issue over 4 years ago
Replace ghostscript with pdf2image (fixes multithreading)
rawsh-bt opened this pull request over 4 years ago
Network and Hybrid parsers
FrancoisHuet opened this pull request over 4 years ago
Error in Ubuntu 20.04 (libgs.so: undefined symbol: FT_Set_MM_WeightVector)
arcruz0 opened this issue over 4 years ago
[REF] add DataFrame to_excel params, requirements.txt XlsxWriter>=0.9.8
xubiuit opened this pull request over 4 years ago
Output shows '...' instead of all columns
gunnervino opened this issue over 4 years ago
Fix unit tests, lint, drop Python 2 support
FrancoisHuet opened this pull request over 4 years ago
[ TextEdges ] allow single non-empty char textline
pushkarnimkar opened this pull request almost 5 years ago
Add row support in stream
idan-david opened this pull request almost 5 years ago
[parsers.stream] - Fix IndexError when extracting more tables than there are columns
JosePVB opened this pull request almost 5 years ago
Other index error in lattice parser
anakin87 opened this issue about 5 years ago
Allow read_pdf to accept a file-like object
Lnk2past opened this issue about 5 years ago
Index out of range latt
bikashgupta11 opened this pull request about 5 years ago
Activating Open Collective
monkeywithacupcake opened this pull request about 5 years ago
[MRG] Reduce file reads in handlers
vinayak-mehta opened this pull request about 5 years ago
Added a help line to install dependencies
sidntrivedi012 opened this pull request about 5 years ago
ZeroDivisionError: float division by zero - table_regions
vinayak-mehta opened this issue about 5 years ago
why module 'camelot' has no attribute 'read_pdf'
weijiuyang opened this issue over 5 years ago
Great library, but dependencies ??!!
akshowhini opened this issue over 5 years ago
Added multi parameter for page level parameters
sverma25 opened this pull request over 5 years ago
TableList error in Camelot and in Excalibur
njss opened this issue over 5 years ago
Fixed encoding trouble
KOLANICH opened this pull request over 5 years ago
Add more pdf-to-image engines?
vinayak-mehta opened this issue over 5 years ago
Use multiprocessing to parallely process PDF pages
vinayak-mehta opened this issue over 5 years ago
Hybrid flavor combining lattice and stream
vinayak-mehta opened this issue over 5 years ago