Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
Webrecorder
Creating open-source web archiving software for all!
Collective -
Host: webrecorder -
https://opencollective.com/webrecorder
- Website: https://webrecorder.net/
- Code: https://github.com/webrecorder
Chrome 112 + new headless mode + consistent viewport tweaks
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Entire site is crawled, but no output warcs are generated.
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue over 1 year ago
stopping: if crawl is marked as stopping, and no warcs found, mark st…
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Record unbound Range requests only, don't convert bounded range requests to unbounded
github.com/webrecorder/pywb - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/pywb - ikreymer opened this pull request over 1 year ago
Can't create intranet profile
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue over 1 year ago
Feature Request: capture social media posts (e.g. tweets) as seperate pages
github.com/webrecorder/archiveweb.page - nvanderperren opened this issue over 1 year ago
github.com/webrecorder/archiveweb.page - nvanderperren opened this issue over 1 year ago
Adds newline type clarification
github.com/webrecorder/specs - Shrinks99 opened this pull request over 1 year ago
github.com/webrecorder/specs - Shrinks99 opened this pull request over 1 year ago
file chooser: relax extension pattern to allow query/hash after exten…
github.com/webrecorder/replayweb.page - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/replayweb.page - ikreymer opened this pull request almost 2 years ago
Spec diagram font doesn't load properly in all browsers
github.com/webrecorder/specs - Shrinks99 opened this issue almost 2 years ago
github.com/webrecorder/specs - Shrinks99 opened this issue almost 2 years ago
Deimos/add https type
github.com/webrecorder/warcio - Deimos4Flare opened this pull request almost 2 years ago
github.com/webrecorder/warcio - Deimos4Flare opened this pull request almost 2 years ago
Disable Chrome optimization logic
github.com/webrecorder/browsertrix-crawler - malemburg opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - malemburg opened this pull request almost 2 years ago
Enable Expression Statement and Loop Control extensions for Jinja templates
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
Crawler often downloads 40-50MB worth of unnecessary Chrome model files
github.com/webrecorder/browsertrix-crawler - malemburg opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - malemburg opened this issue almost 2 years ago
black screen on interactive profile creation
github.com/webrecorder/browsertrix-crawler - jswrenn opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - jswrenn opened this issue almost 2 years ago
state: adjust redis keys to be more consistent
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
Replay of an archive of a page of 4xx and 5xx HTTP response status has a 200 response status
github.com/webrecorder/pywb - notevenaperson opened this issue almost 2 years ago
github.com/webrecorder/pywb - notevenaperson opened this issue almost 2 years ago
Disk utilization threshold
github.com/webrecorder/browsertrix-crawler - atomotic opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - atomotic opened this issue almost 2 years ago
Handling of CRAWL_ARGS cannot cope with quoted strings
github.com/webrecorder/browsertrix-crawler - anjackson opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - anjackson opened this issue almost 2 years ago
Facebook recording using the extension fails with ServerJS based data-sjs payload content length mismatch
github.com/webrecorder/archiveweb.page - tsemachh opened this issue almost 2 years ago
github.com/webrecorder/archiveweb.page - tsemachh opened this issue almost 2 years ago
Consolidate wacz error loglines
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
Log fatal messages to redis errors
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
ReSpec generation from Markdown
github.com/webrecorder/specs - edsu opened this issue almost 2 years ago
github.com/webrecorder/specs - edsu opened this issue almost 2 years ago
Update README to fix --verifier-url param
github.com/webrecorder/py-wacz - vbanos opened this pull request almost 2 years ago
github.com/webrecorder/py-wacz - vbanos opened this pull request almost 2 years ago
Improve thumbnails with sharp
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
crawl stopping / additional states:
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
pywb users and developers survey to inform future development roadmap
github.com/webrecorder/pywb - ikreymer opened this issue almost 2 years ago
github.com/webrecorder/pywb - ikreymer opened this issue almost 2 years ago
Improve thumbnail creation
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
Switch back to Puppeteer from Playwright
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
Catch 4xx and 5xx page.goto() responses to mark invalid URLs as failed
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
Wombat incorrectly rewriting URL
github.com/webrecorder/wombat - calbon2702 opened this issue almost 2 years ago
github.com/webrecorder/wombat - calbon2702 opened this issue almost 2 years ago
Full-page screenshots missing content
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue almost 2 years ago
Playwright persistent browser context causing memory issues
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
Fixes from 0.9.1
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
Fix full page screenshot
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
Documentation: Expand instructions for Certificate Authority setup
github.com/webrecorder/pywb - notevenaperson opened this pull request almost 2 years ago
github.com/webrecorder/pywb - notevenaperson opened this pull request almost 2 years ago
Browsertrix can't fetch articles to crawl list (only menu items)
github.com/webrecorder/browsertrix-crawler - gitreich opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - gitreich opened this issue almost 2 years ago
How do I view my fully created warc files on my chromebook computer without installing any warc file tools.
github.com/webrecorder/archiveweb.page - KaineRecycler opened this issue almost 2 years ago
github.com/webrecorder/archiveweb.page - KaineRecycler opened this issue almost 2 years ago
Black background in full screen view for websites that do not set the body background colour explicitly
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue almost 2 years ago
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue almost 2 years ago
The saved page is not loading after update
github.com/webrecorder/replayweb.page - ZheniaZuser opened this issue almost 2 years ago
github.com/webrecorder/replayweb.page - ZheniaZuser opened this issue almost 2 years ago
Allow switching capturing backend from pywb to warcprox
github.com/webrecorder/browsertrix-crawler - Sanqui opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - Sanqui opened this issue almost 2 years ago
WACZ Signing and Verification spec: Support for timestamped anonynous signatures
github.com/webrecorder/specs - matteocargnelutti opened this issue almost 2 years ago
github.com/webrecorder/specs - matteocargnelutti opened this issue almost 2 years ago
Post request with specials characters in the payload fail to replay
github.com/webrecorder/pywb - JSarif opened this issue almost 2 years ago
github.com/webrecorder/pywb - JSarif opened this issue almost 2 years ago
New spec: Request body canonicalization
github.com/webrecorder/specs - tw4l opened this issue almost 2 years ago
github.com/webrecorder/specs - tw4l opened this issue almost 2 years ago
Allow spaces in userAgentSuffix command line option
github.com/webrecorder/browsertrix-crawler - anjackson opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - anjackson opened this issue almost 2 years ago
Quick exit on redis connection error after interrupt
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
Feature: Instagram stories
github.com/webrecorder/browsertrix-behaviors - bitknox opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-behaviors - bitknox opened this pull request almost 2 years ago
Store archive dir size in Redis
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
Introduce new Limit Parameter crawl-size
github.com/webrecorder/browsertrix-crawler - gitreich opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - gitreich opened this issue almost 2 years ago
Feature Request: Give replayweb.page a tabbed UI in the recorder capture multiple pages simulatenously.
github.com/webrecorder/archiveweb.page - YousufSSyed opened this issue almost 2 years ago
github.com/webrecorder/archiveweb.page - YousufSSyed opened this issue almost 2 years ago
worker: lower wait time, in case where no additional pages remain and…
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
Can't archive a page - 2 different environments, 2 different results
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ArtHoff opened this issue almost 2 years ago
Store crawl size in Redis while crawl is running
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue almost 2 years ago
Crawler doesn't mark invalid URL as failed
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
Issues with loading resources from Cloudflare bucket in Firefox 112
github.com/webrecorder/replayweb.page - despens opened this issue almost 2 years ago
github.com/webrecorder/replayweb.page - despens opened this issue almost 2 years ago
feat: Add custom behavior injection
github.com/webrecorder/browsertrix-crawler - lambdahands opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - lambdahands opened this pull request almost 2 years ago
Set file format correctly in WARC info
github.com/webrecorder/awp-sw - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/awp-sw - tw4l opened this pull request almost 2 years ago
Store done in redis as integer and only save full json in redis for failed pages
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
Archives sometimes becoming broken after saving more pages
github.com/webrecorder/archiveweb.page - ZheniaZuser opened this issue almost 2 years ago
github.com/webrecorder/archiveweb.page - ZheniaZuser opened this issue almost 2 years ago
Support importing behaviors from the new Chrome dev tools Recorder panel JSON export format
github.com/webrecorder/browsertrix-crawler - pirate opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - pirate opened this issue almost 2 years ago
New Behavior Request For: https://www.twitch.tv/
github.com/webrecorder/browsertrix-behaviors - Klindten opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-behaviors - Klindten opened this issue almost 2 years ago
revisit dupe check: optimized revisit filtering
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
WACZ loading refactor + support nested WACZ + MultiWACZ updating
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
is it possible to output regular files
github.com/webrecorder/browsertrix-crawler - ftc2 opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ftc2 opened this issue almost 2 years ago
rewriting: handle new blob:<id>/<base url> scheme to support setting …
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
More extensive document.write -> Blob rewrite fix for service worker-based replay
github.com/webrecorder/wombat - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/wombat - ikreymer opened this pull request almost 2 years ago
Logging into reddit.com isn't working. Would adding a cookies.txt fix this?
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
How does custom filtering for the recorder work? Could I use it to filter out MP4 files? (and other video file extensions)?
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
Add support for the 1995 NCSA 1.5.1 webserver
github.com/webrecorder/warcio - omgoo opened this pull request almost 2 years ago
github.com/webrecorder/warcio - omgoo opened this pull request almost 2 years ago
Ensure revisits can not override non-revisit resources
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/wabac.js - ikreymer opened this pull request almost 2 years ago
origin override: add --originOverride source=dest to allow routing wh…
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
Make updates to dynamic injections
github.com/webrecorder/browsertrix-behaviors - lambdahands opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-behaviors - lambdahands opened this pull request almost 2 years ago
Investigate removing done from Redis
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
Add option to log errors to redis
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
Add option to log crawl errors to Redis
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
Some images not loading on page
github.com/webrecorder/replayweb.page - Ecdldaiiere opened this issue almost 2 years ago
github.com/webrecorder/replayweb.page - Ecdldaiiere opened this issue almost 2 years ago
Is there a way to dedup WARCs after recording them?
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
Initial implementation
github.com/webrecorder/wacz2car - RangerMauve opened this pull request almost 2 years ago
github.com/webrecorder/wacz2car - RangerMauve opened this pull request almost 2 years ago
Error when restarting crawl with config via stdin
github.com/webrecorder/browsertrix-crawler - darcyparksliu opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - darcyparksliu opened this issue almost 2 years ago
Add cookie test to serializer
github.com/webrecorder/warcio.js - patrickheeney opened this pull request almost 2 years ago
github.com/webrecorder/warcio.js - patrickheeney opened this pull request almost 2 years ago
Multiple set-cookie header support
github.com/webrecorder/warcio.js - patrickheeney opened this issue almost 2 years ago
github.com/webrecorder/warcio.js - patrickheeney opened this issue almost 2 years ago
Add --title and --description CLI args to write metadata into datapackage.json
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
[Use Case]: timestamping WARCs inside a WACZ
github.com/webrecorder/specs - mikevandijk opened this issue almost 2 years ago
github.com/webrecorder/specs - mikevandijk opened this issue almost 2 years ago
Add --maxPageLimit override
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
blockrules/logger: use global logger var
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request almost 2 years ago
Pywb fails to install on 3.11 & 3.12 (at least on M1 Macs)
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
Getting [Errno 2] No such file or directory after deleting a WARC and reindexing.
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
github.com/webrecorder/pywb - YousufSSyed opened this issue almost 2 years ago
wabac.js + wombat version bump
github.com/webrecorder/replayweb.page - ikreymer opened this pull request almost 2 years ago
github.com/webrecorder/replayweb.page - ikreymer opened this pull request almost 2 years ago
Add unit test for sizeLimit
github.com/webrecorder/browsertrix-crawler - stavares843 opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - stavares843 opened this pull request almost 2 years ago
Skip creating draft release if non-draft release exists
github.com/webrecorder/browsertrix-browser-base - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-browser-base - tw4l opened this pull request almost 2 years ago
Update README for 0.9.0
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
wget warc status code?
github.com/webrecorder/warcio - JohnMaguire opened this issue almost 2 years ago
github.com/webrecorder/warcio - JohnMaguire opened this issue almost 2 years ago
wbrequest object not available in not_found.html template
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
Provide clean way to inject data into replayed pages
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
Add options to filter logs by --logLevel and --context
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request almost 2 years ago
Add CLI options to filter logs by logLevel and/or context
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue almost 2 years ago
Make metadata dictionary available in all templates
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
Dependencies of per-collection templates
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
github.com/webrecorder/pywb - despens opened this issue almost 2 years ago
Network error when using --config and config file
github.com/webrecorder/browsertrix-crawler - darcyparksliu opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - darcyparksliu opened this issue almost 2 years ago
Support Contextual Information in datapackage.json for WACZ
github.com/webrecorder/browsertrix-crawler - markpbaggett opened this issue almost 2 years ago
github.com/webrecorder/browsertrix-crawler - markpbaggett opened this issue almost 2 years ago