Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/webrecorder/py-wacz
https://github.com/webrecorder/py-wacz
Timezone support
rien333 opened this issue 2 months ago
rien333 opened this issue 2 months ago
py-wacz fails without a `index.html` file
rien333 opened this issue 2 months ago
rien333 opened this issue 2 months ago
Add --copy-pages option to copy pages.jsonl/extraPages.jsonl directly to WACZ
tw4l opened this pull request 9 months ago
tw4l opened this pull request 9 months ago
Add --copy-pages option to copy pages.jsonl/extraPages.jsonl as-is into WACZ
tw4l opened this issue 9 months ago
tw4l opened this issue 9 months ago
Python in the read me file
jburnford opened this issue 11 months ago
jburnford opened this issue 11 months ago
AttributeError: 'NoneType' object has no attribute 'lower'
nvanderperren opened this issue 11 months ago
nvanderperren opened this issue 11 months ago
Windows 10 truncates read path and prevents validation
sbshep opened this issue about 1 year ago
sbshep opened this issue about 1 year ago
Add index generation system that uses offsets into the WACZ itself.
anjackson opened this pull request over 1 year ago
anjackson opened this pull request over 1 year ago
better documentation via `wacz --help`
nvanderperren opened this issue over 1 year ago
nvanderperren opened this issue over 1 year ago
Detecting pages
Natkeeran opened this issue over 1 year ago
Natkeeran opened this issue over 1 year ago
Ignore hashtag on pages
ikreymer opened this pull request over 1 year ago
ikreymer opened this pull request over 1 year ago
Update README to fix --verifier-url param
vbanos opened this pull request over 1 year ago
vbanos opened this pull request over 1 year ago
Canonical method for converting multiple WARC files to WACZ
jackdos opened this issue almost 2 years ago
jackdos opened this issue almost 2 years ago
Rename compressed WARC files without .gz extension when creating WACZ
ibnesayeed opened this issue almost 2 years ago
ibnesayeed opened this issue almost 2 years ago
Fix typos
stavares843 opened this pull request almost 2 years ago
stavares843 opened this pull request almost 2 years ago
Add 0.4.7 changes
tw4l opened this pull request almost 2 years ago
tw4l opened this pull request almost 2 years ago
Add -l/--log-directory option to add logs directory to WACZ
tw4l opened this pull request almost 2 years ago
tw4l opened this pull request almost 2 years ago
[FEATURE] Add logs to WACZ
tw4l opened this issue almost 2 years ago
tw4l opened this issue almost 2 years ago
include request cookie
ikreymer opened this pull request about 2 years ago
ikreymer opened this pull request about 2 years ago
[FEATURE] Add a WARC Record Iterator
ibnesayeed opened this issue over 2 years ago
ibnesayeed opened this issue over 2 years ago
Tweak README for consistency
machawk1 opened this pull request over 2 years ago
machawk1 opened this pull request over 2 years ago
Fix URL on PyPI
edsu opened this pull request over 2 years ago
edsu opened this pull request over 2 years ago
Some commands documented to interact with WACZ files are invalid
machawk1 opened this issue over 2 years ago
machawk1 opened this issue over 2 years ago
Support single seed, detect pages with extra pages
ikreymer opened this pull request over 2 years ago
ikreymer opened this pull request over 2 years ago
Close ZIP once finished
edsu opened this pull request over 2 years ago
edsu opened this pull request over 2 years ago
Test failure under Python 3.10
edsu opened this issue over 2 years ago
edsu opened this issue over 2 years ago
Instructions how to create wacz from browsertrix crawl
despens opened this issue over 2 years ago
despens opened this issue over 2 years ago
zipfile.BadZipFile error during wacz creation from warc file - Windows only
ivbeg opened this issue almost 3 years ago
ivbeg opened this issue almost 3 years ago
Dev dependencies should be separated from normal dependencies
phiresky opened this issue almost 3 years ago
phiresky opened this issue almost 3 years ago
More tolerant parsing
ikreymer opened this pull request almost 3 years ago
ikreymer opened this pull request almost 3 years ago
Hash Computation Fix
ikreymer opened this pull request almost 3 years ago
ikreymer opened this pull request almost 3 years ago
Signing/Verification Support
ikreymer opened this pull request almost 3 years ago
ikreymer opened this pull request almost 3 years ago
py-wacz: when adding pages from specified page list, check for https versions.
ikreymer opened this issue over 3 years ago
ikreymer opened this issue over 3 years ago
`datapackage.json` does not pass frictionless data default profile validation
DiegoPino opened this issue over 3 years ago
DiegoPino opened this issue over 3 years ago
py-wacz: Add a way to list pages in the WACZ
ikreymer opened this issue almost 4 years ago
ikreymer opened this issue almost 4 years ago
Allow MD5 as datapackage hash
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Support premade page lists from a crawler
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Command Line Return Code should be 0
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Ability to specify main page via --url / --ts flags
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Use psf/black for python code formatting
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
Combine the text and page index
emmadickson opened this issue about 4 years ago
emmadickson opened this issue about 4 years ago
Improve testing suite
emmadickson opened this issue about 4 years ago
emmadickson opened this issue about 4 years ago
Validation of WACZ Format
ikreymer opened this issue about 4 years ago
ikreymer opened this issue about 4 years ago
py-wacz: Implement test suite for py-wacz
ikreymer opened this issue over 4 years ago
ikreymer opened this issue over 4 years ago
Error "File size unexpectedly exceeded ZIP64 limit" occurs when using py-wacz on a large WACZ file
whalehub opened this issue over 4 years ago
whalehub opened this issue over 4 years ago