Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/webrecorder/py-wacz
https://github.com/webrecorder/py-wacz
Timezone support
rien333 opened this issue 3 months ago
rien333 opened this issue 3 months ago
py-wacz fails without a `index.html` file
rien333 opened this issue 3 months ago
rien333 opened this issue 3 months ago
Add --copy-pages option to copy pages.jsonl/extraPages.jsonl directly to WACZ
tw4l opened this pull request 10 months ago
tw4l opened this pull request 10 months ago
Add --copy-pages option to copy pages.jsonl/extraPages.jsonl as-is into WACZ
tw4l opened this issue 10 months ago
tw4l opened this issue 10 months ago
Python in the read me file
jburnford opened this issue about 1 year ago
jburnford opened this issue about 1 year ago
AttributeError: 'NoneType' object has no attribute 'lower'
nvanderperren opened this issue about 1 year ago
nvanderperren opened this issue about 1 year ago
Windows 10 truncates read path and prevents validation
sbshep opened this issue about 1 year ago
sbshep opened this issue about 1 year ago
Add index generation system that uses offsets into the WACZ itself.
anjackson opened this pull request over 1 year ago
anjackson opened this pull request over 1 year ago
better documentation via `wacz --help`
nvanderperren opened this issue over 1 year ago
nvanderperren opened this issue over 1 year ago
Detecting pages
Natkeeran opened this issue over 1 year ago
Natkeeran opened this issue over 1 year ago
Ignore hashtag on pages
ikreymer opened this pull request over 1 year ago
ikreymer opened this pull request over 1 year ago
Update README to fix --verifier-url param
vbanos opened this pull request almost 2 years ago
vbanos opened this pull request almost 2 years ago
Canonical method for converting multiple WARC files to WACZ
jackdos opened this issue almost 2 years ago
jackdos opened this issue almost 2 years ago
Rename compressed WARC files without .gz extension when creating WACZ
ibnesayeed opened this issue almost 2 years ago
ibnesayeed opened this issue almost 2 years ago
Fix typos
stavares843 opened this pull request almost 2 years ago
stavares843 opened this pull request almost 2 years ago
Add 0.4.7 changes
tw4l opened this pull request almost 2 years ago
tw4l opened this pull request almost 2 years ago
Add -l/--log-directory option to add logs directory to WACZ
tw4l opened this pull request almost 2 years ago
tw4l opened this pull request almost 2 years ago
[FEATURE] Add logs to WACZ
tw4l opened this issue almost 2 years ago
tw4l opened this issue almost 2 years ago
include request cookie
ikreymer opened this pull request about 2 years ago
ikreymer opened this pull request about 2 years ago
[FEATURE] Add a WARC Record Iterator
ibnesayeed opened this issue over 2 years ago
ibnesayeed opened this issue over 2 years ago
Tweak README for consistency
machawk1 opened this pull request over 2 years ago
machawk1 opened this pull request over 2 years ago
Fix URL on PyPI
edsu opened this pull request over 2 years ago
edsu opened this pull request over 2 years ago
Some commands documented to interact with WACZ files are invalid
machawk1 opened this issue over 2 years ago
machawk1 opened this issue over 2 years ago
Support single seed, detect pages with extra pages
ikreymer opened this pull request almost 3 years ago
ikreymer opened this pull request almost 3 years ago
Close ZIP once finished
edsu opened this pull request almost 3 years ago
edsu opened this pull request almost 3 years ago
Test failure under Python 3.10
edsu opened this issue almost 3 years ago
edsu opened this issue almost 3 years ago
Instructions how to create wacz from browsertrix crawl
despens opened this issue almost 3 years ago
despens opened this issue almost 3 years ago
zipfile.BadZipFile error during wacz creation from warc file - Windows only
ivbeg opened this issue almost 3 years ago
ivbeg opened this issue almost 3 years ago
Dev dependencies should be separated from normal dependencies
phiresky opened this issue almost 3 years ago
phiresky opened this issue almost 3 years ago
More tolerant parsing
ikreymer opened this pull request almost 3 years ago
ikreymer opened this pull request almost 3 years ago
Hash Computation Fix
ikreymer opened this pull request about 3 years ago
ikreymer opened this pull request about 3 years ago
Signing/Verification Support
ikreymer opened this pull request about 3 years ago
ikreymer opened this pull request about 3 years ago
py-wacz: when adding pages from specified page list, check for https versions.
ikreymer opened this issue over 3 years ago
ikreymer opened this issue over 3 years ago
`datapackage.json` does not pass frictionless data default profile validation
DiegoPino opened this issue over 3 years ago
DiegoPino opened this issue over 3 years ago
py-wacz: Add a way to list pages in the WACZ
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Allow MD5 as datapackage hash
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Support premade page lists from a crawler
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Command Line Return Code should be 0
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Ability to specify main page via --url / --ts flags
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Use psf/black for python code formatting
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Combine the text and page index
emmadickson opened this issue over 4 years ago
emmadickson opened this issue over 4 years ago
Improve testing suite
emmadickson opened this issue over 4 years ago
emmadickson opened this issue over 4 years ago
Validation of WACZ Format
ikreymer opened this issue over 4 years ago
ikreymer opened this issue over 4 years ago
py-wacz: Implement test suite for py-wacz
ikreymer opened this issue over 4 years ago
ikreymer opened this issue over 4 years ago
Error "File size unexpectedly exceeded ZIP64 limit" occurs when using py-wacz on a large WACZ file
whalehub opened this issue over 4 years ago
whalehub opened this issue over 4 years ago