Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
Webrecorder
Creating open-source web archiving software for all!
Collective -
Host: webrecorder -
https://opencollective.com/webrecorder
- Website: https://webrecorder.net/
- Code: https://github.com/webrecorder
Fixes switch_locale not adding locale if missing from URL
github.com/webrecorder/pywb - Quirinus opened this pull request over 1 year ago
github.com/webrecorder/pywb - Quirinus opened this pull request over 1 year ago
switch_locale not adding locale if missing from URL
github.com/webrecorder/pywb - Quirinus opened this issue over 1 year ago
github.com/webrecorder/pywb - Quirinus opened this issue over 1 year ago
Add Chrome version 118
github.com/webrecorder/browsertrix-browser-base - jasper-s opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-browser-base - jasper-s opened this pull request over 1 year ago
More flexible multi value arg parsing + README update for 0.12.0
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Crawling wayback machine snapshots
github.com/webrecorder/browsertrix-crawler - FronkMau opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - FronkMau opened this issue over 1 year ago
MongoDB Backups
github.com/webrecorder/browsertrix - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix - ikreymer opened this pull request over 1 year ago
Indexing Errors with YouTube JSON in POST Request Payload
github.com/webrecorder/pywb - mona-ul opened this issue over 1 year ago
github.com/webrecorder/pywb - mona-ul opened this issue over 1 year ago
Sets "Pywb Error" string as translatable in error template
github.com/webrecorder/pywb - Quirinus opened this pull request over 1 year ago
github.com/webrecorder/pywb - Quirinus opened this pull request over 1 year ago
String not set as translatable in template
github.com/webrecorder/pywb - Quirinus opened this issue over 1 year ago
github.com/webrecorder/pywb - Quirinus opened this issue over 1 year ago
New feature/idea for the desktop app
github.com/webrecorder/archiveweb.page - yacylover opened this issue over 1 year ago
github.com/webrecorder/archiveweb.page - yacylover opened this issue over 1 year ago
Return User-Agent on all code path to set headers appropriately
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
When passed as CLI argument, User-Agent is not always set
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
Exclusion rules for browser behaviors
github.com/webrecorder/browsertrix-crawler - pato-pan opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - pato-pan opened this issue over 1 year ago
output, generate, or concatenate into a single wacz file?
github.com/webrecorder/browsertrix-crawler - pato-pan opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - pato-pan opened this issue over 1 year ago
[Docs] More exclusion examples?
github.com/webrecorder/browsertrix-crawler - pato-pan opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - pato-pan opened this issue over 1 year ago
load saved state fixes + redis tests
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
storage: also compute crc32 as part of storage webhook when uploading…
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
disable component updates by setting --component-updater to invalid URL
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Add crc32 computation
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue over 1 year ago
Bump @babel/traverse from 7.3.4 to 7.23.2
github.com/webrecorder/wombat - dependabot[bot] opened this pull request over 1 year ago
github.com/webrecorder/wombat - dependabot[bot] opened this pull request over 1 year ago
Bump @babel/traverse from 7.19.4 to 7.23.2
github.com/webrecorder/warcio.js - dependabot[bot] opened this pull request over 1 year ago
github.com/webrecorder/warcio.js - dependabot[bot] opened this pull request over 1 year ago
Document `liveRedirectOnNotFound`
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
Cannot restart crawl from state file
github.com/webrecorder/browsertrix-crawler - darcyparksliu opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - darcyparksliu opened this issue over 1 year ago
infinite loop caused by /
github.com/webrecorder/browsertrix-crawler - wsdookadr opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - wsdookadr opened this issue over 1 year ago
fixed nginx CORS config
github.com/webrecorder/replayweb.page - renevoorburg opened this pull request over 1 year ago
github.com/webrecorder/replayweb.page - renevoorburg opened this pull request over 1 year ago
trouble with running in cron
github.com/webrecorder/browsertrix-crawler - eleaner opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - eleaner opened this issue over 1 year ago
WACZ range request error
github.com/webrecorder/replayweb.page - preetamsinghvi opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - preetamsinghvi opened this issue over 1 year ago
Support adding/removing exclusions without restarting the crawler
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
tests: disable ad-block tests: seeing inconsistent ci behavior
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
0.11.3 Fixes
github.com/webrecorder/archiveweb.page - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/archiveweb.page - ikreymer opened this pull request over 1 year ago
Add initial set of Playwright integration tests, for testing embeds
github.com/webrecorder/replayweb.page - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/replayweb.page - ikreymer opened this pull request over 1 year ago
fix: ensure default index.html is served for app,
github.com/webrecorder/replayweb.page - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/replayweb.page - ikreymer opened this pull request over 1 year ago
misc fuzzy matching + rewriting fixes
github.com/webrecorder/wabac.js - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/wabac.js - ikreymer opened this pull request over 1 year ago
ReplayWebpage V2 Docs Content Reorganization
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
PWA Manifest Not Available on Deployed Site
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
ReplayWebpage V2 Documentation Update
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
build(deps): bump postcss from 8.3.6 to 8.4.31
github.com/webrecorder/replayweb.page - dependabot[bot] opened this pull request over 1 year ago
github.com/webrecorder/replayweb.page - dependabot[bot] opened this pull request over 1 year ago
Fast cancelation + remove time counter
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
build(deps-dev): bump electron from 25.1.1 to 25.8.4
github.com/webrecorder/replayweb.page - dependabot[bot] opened this pull request over 1 year ago
github.com/webrecorder/replayweb.page - dependabot[bot] opened this pull request over 1 year ago
Execution Time Follow-Up Work
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
improved text extraction: (addresses #403)
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Improved Text Extraction, stored to WARC
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue over 1 year ago
Add trailing slash info to `url_prefix` help text
github.com/webrecorder/warcit - Shrinks99 opened this pull request over 1 year ago
github.com/webrecorder/warcit - Shrinks99 opened this pull request over 1 year ago
Atomic conversion outputs
github.com/webrecorder/warcit - anjackson opened this pull request over 1 year ago
github.com/webrecorder/warcit - anjackson opened this pull request over 1 year ago
additional failure logic:
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
10GB wacz file - how to split?
github.com/webrecorder/browsertrix-crawler - eleaner opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - eleaner opened this issue over 1 year ago
Switch to Brave Base Image
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
CVE-2023-4863: update chrome browser version
github.com/webrecorder/browsertrix-crawler - DriesVanbilloen opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - DriesVanbilloen opened this issue over 1 year ago
Document loading from replay.json
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
POST request not handled correctly
github.com/webrecorder/archiveweb.page - bricas opened this issue over 1 year ago
github.com/webrecorder/archiveweb.page - bricas opened this issue over 1 year ago
Improve UI display of URL list errors
github.com/webrecorder/browsertrix - SuaYoo opened this issue over 1 year ago
github.com/webrecorder/browsertrix - SuaYoo opened this issue over 1 year ago
alternative ways of implementing browser behaviors
github.com/webrecorder/browsertrix-crawler - wsdookadr opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - wsdookadr opened this issue over 1 year ago
Store crawler start and end times in Redis lists
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request over 1 year ago
additional fixes for worker getting stuck
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Set new logic for invalid seeds
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this pull request over 1 year ago
[docs] recrawl and excludes
github.com/webrecorder/browsertrix-crawler - wsdookadr opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - wsdookadr opened this issue over 1 year ago
Bump gevent from 21.12.0 to 23.9.1
github.com/webrecorder/pywb - dependabot[bot] opened this pull request over 1 year ago
github.com/webrecorder/pywb - dependabot[bot] opened this pull request over 1 year ago
Handle HTTP 429 errors + add failure limit
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
Slow down + retry on HTTP 429 errors
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
Crawler getting stuck on Page Crashed
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
HEAD Fallback Mechanism to GET 0-0
github.com/webrecorder/replayweb.page - robertvanloenhout opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - robertvanloenhout opened this issue over 1 year ago
Player keeps loading on a 404 page
github.com/webrecorder/replayweb.page - borsboomm opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - borsboomm opened this issue over 1 year ago
Pywb failing to handle self-redirects from OutbackCDX
github.com/webrecorder/pywb - obrienben opened this issue over 1 year ago
github.com/webrecorder/pywb - obrienben opened this issue over 1 year ago
docs: Update Behaviors Tutorial
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this pull request over 1 year ago
Update README.md
github.com/webrecorder/browsertrix-crawler - gitreich opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - gitreich opened this pull request over 1 year ago
Question: Searchable URL (SURT) algorithm differences for CDXJ
github.com/webrecorder/specs - tfmorris opened this issue over 1 year ago
github.com/webrecorder/specs - tfmorris opened this issue over 1 year ago
cannot download amboss website
github.com/webrecorder/archiveweb.page - 925670849 opened this issue over 1 year ago
github.com/webrecorder/archiveweb.page - 925670849 opened this issue over 1 year ago
feat(CI): add build step to lint CI
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this pull request over 1 year ago
more logging improvements
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Some fonts not showing on screenshot - fix
github.com/webrecorder/browsertrix-crawler - djhmateer opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - djhmateer opened this issue over 1 year ago
Cloudflare security page is saved instead of real content
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this issue over 1 year ago
Update CI Release Action
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Error handling fixes to avoid crawler getting stuck.
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Fails to install on Python 3.11 because cchardet can't build
github.com/webrecorder/warcit - aquatix opened this issue over 1 year ago
github.com/webrecorder/warcit - aquatix opened this issue over 1 year ago
build(deps-dev): bump electron from 25.1.1 to 25.8.1
github.com/webrecorder/replayweb.page - dependabot[bot] opened this pull request over 1 year ago
github.com/webrecorder/replayweb.page - dependabot[bot] opened this pull request over 1 year ago
Currently cannot fully capture oembed of tweets on oembed.link
github.com/webrecorder/archiveweb.page - despens opened this issue over 1 year ago
github.com/webrecorder/archiveweb.page - despens opened this issue over 1 year ago
[Bug]: WARC files are very slow to load in Firefox
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - Shrinks99 opened this issue over 1 year ago
favicon: use 127.0.0.1 instead of localhost
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Update tldextract cache for pywb during build
github.com/webrecorder/browsertrix-crawler - vnznznz opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - vnznznz opened this pull request over 1 year ago
Enhance file stats test to detect file modification
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
behavior logging tweaks, add netIdle
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
add 'startEarly' option to init opts, which will enable autoplay / autofetch to detect URLs as soon as possible.
github.com/webrecorder/browsertrix-behaviors - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-behaviors - ikreymer opened this pull request over 1 year ago
start running autoplay/autofetch behaviors only in response to run()/start()
github.com/webrecorder/browsertrix-behaviors - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-behaviors - ikreymer opened this pull request over 1 year ago
Start all behaviors only when run() is called.
github.com/webrecorder/browsertrix-behaviors - ikreymer opened this issue over 1 year ago
github.com/webrecorder/browsertrix-behaviors - ikreymer opened this issue over 1 year ago
Automated CI build
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this issue over 1 year ago
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this issue over 1 year ago
optimize link extraction: (fixes #376)
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
status: fix typo setting status to log message
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
fix: facebook photos page scroll
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-behaviors - Chickensoupwithrice opened this pull request over 1 year ago
Track start and end time of each crawler session in Redis
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - tw4l opened this issue over 1 year ago
Add index generation system that uses offsets into the WACZ itself.
github.com/webrecorder/py-wacz - anjackson opened this pull request over 1 year ago
github.com/webrecorder/py-wacz - anjackson opened this pull request over 1 year ago
fix rendering in readme shell commands
github.com/webrecorder/pywb - flozi00 opened this pull request over 1 year ago
github.com/webrecorder/pywb - flozi00 opened this pull request over 1 year ago
logging fixes: avoid duplicate logging for same error
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
More efficient link extraction / link extraction behaviors.
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this issue over 1 year ago
loading: keep 'serveIndex' query arg on all sw.js loads, not just in …
github.com/webrecorder/replayweb.page - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/replayweb.page - ikreymer opened this pull request over 1 year ago
logging: resolve confusion with 'crawl done' not being written to log…
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - ikreymer opened this pull request over 1 year ago
Add option to output stats file live, i.e. after each page crawled
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - benoit74 opened this pull request over 1 year ago
Add ability to load behaviours from URL
github.com/webrecorder/browsertrix-crawler - Chickensoupwithrice opened this pull request over 1 year ago
github.com/webrecorder/browsertrix-crawler - Chickensoupwithrice opened this pull request over 1 year ago
New endpoint for info per snapshot
github.com/webrecorder/wabac.js - SuaYoo opened this issue over 1 year ago
github.com/webrecorder/wabac.js - SuaYoo opened this issue over 1 year ago
Update resource (URLs) browser
github.com/webrecorder/replayweb.page - SuaYoo opened this issue over 1 year ago
github.com/webrecorder/replayweb.page - SuaYoo opened this issue over 1 year ago