Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
Archive Team
We are going to rescue your shit
Collective -
Host: opensource -
https://opencollective.com/archiveteam
- Website: https://archiveteam.org/
- Code: https://github.com/ArchiveTeam
github.com/ArchiveTeam/grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: 1,408 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/wpull
Wget-compatible web downloader and crawler.
Stars: 556 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/ArchiveBot
ArchiveBot, an IRC bot for archiving websites
Stars: 357 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/warrior-dockerfile
A Dockerfile for the ArchiveTeam Warrior
Stars: 307 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/Ubuntu-Warrior
Scripts to build and boot warrior virtual machine containing Docker
Stars: 114 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: 106 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/IA.BAK
We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.
Stars: 87 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/terroroftinytown
URLTeam's second generation of URL shortener archiving tools
Stars: 71 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/seesaw-kit
Making a reusable toolkit for writing seesaw scripts
Stars: 69 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/reddit-grab
Grabbing everything from reddit.
Stars: 59 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/yahooanswers-grab
Saving all questions and answers from Yahoo! Answers.
Stars: 50 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/tumblr-grab
Archiving all to-be-deleted NSFW tumblr blogs.
Stars: 49 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/universal-tracker
A configurable, reusable tracker with dashboard
Stars: 34 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/terroroftinytown-client-grab
The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project
Stars: 27 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/ludios_wpull
wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved
Stars: 26 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/warrior-code2
Boot scripts for the ArchiveTeam Warrior 2
Stars: 24 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/youtube-grab
Archiving all metadata from YouTube (everything except videos themselves due to size)
Stars: 23 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/warrior4-vm
Warrior virtual machine appliance (version 4)
Stars: 21 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/gamemaker-sandbox-items
Gamemaker Sandbox Tracker items
Stars: 18 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/500px-grab
Archiving https://500px.com/creativecommons
Stars: 17 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/urls-grab
Archiving URLs (outlinks) from a variety of sources.
Stars: 17 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/youtube-dislikes-grab
Archiving general youtube video metadata through innertube for dislikes removal.
Stars: 14 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/telegram-grab
Archiving public telegram messages.
Stars: 12 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/VideoBot
Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.
Stars: 11 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/urlteam-stuff
Urlteam website, code, ... also, PONIES
Stars: 10 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/youtube-dislikes-items
Managing items for youtube-dislikes-grab.
Stars: 10 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/google-sites-grab
Archiving Google Sites Classic.
Stars: 8 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/mediafire-items
Managing items for mediafire-grab.
Stars: 7 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/greader-grab
http://www.archiveteam.org/index.php?title=Google_Reader
Stars: 6 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/ftp-queue
Create queue items for ftp-grab.
Stars: 6 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/mediafire-grab
Archiving mediafire.com URLs.
Stars: 6 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/wget-lua-forum-scripts
Downloading forums posts with Wget+Lua
Stars: 6 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/citeseerxpdf-grab
Grabbing all sources of CiteSeerX.
Stars: 6 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/youtube-items
Managing items for youtube-grab
Stars: 6 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/tinyarchive
Software behind tracker.tinyarchive.org - Warning: Very hacky code
Stars: 5 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/coursera-grab
Saving courses from Coursera.
Stars: 5 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/yahoomessages-grab
Archiving Yahoo Messages
Stars: 5 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/warrior-preseed
Constructing a new warrior VM
Stars: 5 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/gamemaker-sandbox-grab
Grabbing sandbox.yoyogames.com
Stars: 5 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/archiveteam-megawarc-factory
Some scripts to process ArchiveTeam uploads
Stars: 5 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/reddit-items
Managing items for reddit-grab.
Stars: 4 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/tumblr-grab-test
Archiving Tumblr blogs (an ArchiveTeam Warrior testing project)
Stars: 4 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/flashdomains-grab
Copy of domains-grab for Flash sites.
Stars: 4 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/furaffinity-grab
Grabbing all images and other stuff from Fur Affinity.
Stars: 4 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/blingee-grab
Saving all images and content from Blingee.
Stars: 4 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/google-newspapers-grab
Archiving the Google New Archive at https://news.google.com/newspapers.
Stars: 4 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/Universal-tracker-2
A better tracker with more features for ArchiveTeam
Stars: 4 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/heroku-buildpack-archiveteam-warrior
Heroku buildpack with the Archive Team Warrior
Stars: 4 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/grab-base-df
Base Dockerfile for warrior project grab scripts
Stars: 4 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/justintv-grab
Grabbing as much of justin.tv's archives as possible
Stars: 4 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/parler-items
Managing items for parler-grab.
Stars: 3 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/standalone-readme-template
Readme instructions template for manually running pipeline grab scripts outside the warrior
Stars: 3 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/fortunecity
Want to help? See the readme at the bottom.
Stars: 3 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/yahoogroups-grab
Archiving Yahoo! Groups.
Stars: 3 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/tencent-weibo-grab
Archiving Tencent Weibo (t.qq.com), 腾讯微博
Stars: 3 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/zstd-dictionary-trainer
Training ZSTD dictionaries for use in ZST WARCs.
Stars: 3 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/livejournal-discovery
Discovering items for livejournal-grab.
Stars: 3 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/NewsGrabber-Services
The services for NewsGrabber.
Stars: 3 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/ArchiveBot-agents
Site-specific agents that work with ArchiveBot
Stars: 3 - Last synced: 22 Dec 2024
github.com/ArchiveTeam/panoramio-grab
Grabbing everything from panoramio
Stars: 3 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/vidme-grab
Archiving all videos from vid.me.
Stars: 3 - Last synced: 21 Dec 2024
github.com/ArchiveTeam/twitchtv-discovery-grab
Discovering twitch.tv content
Stars: 3 - Last synced: 21 Dec 2024