Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

Archive Team

We are going to rescue your shit
Collective - Host: opensource - https://opencollective.com/archiveteam - Website: https://archiveteam.org/ - Code: https://github.com/ArchiveTeam

github.com/ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

Stars: 1,408 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/wpull

Wget-compatible web downloader and crawler.

Stars: 556 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/ArchiveBot

ArchiveBot, an IRC bot for archiving websites

Stars: 357 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/warrior-dockerfile

A Dockerfile for the ArchiveTeam Warrior

Stars: 307 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/parler-grab

Archiving Parler.

Stars: 228 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/Ubuntu-Warrior

Scripts to build and boot a Warrior virtual machine containing Docker

Stars: 114 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: 106 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/IA.BAK

We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.

Stars: 87 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/terroroftinytown

URLTeam's second generation of URL shortener archiving tools

Stars: 71 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/seesaw-kit

Making a reusable toolkit for writing seesaw scripts

Stars: 69 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/imgur-grab

Archiving imgur.

Stars: 65 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/NewsGrabber

Grabbing all news.

Stars: 62 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/reddit-grab

Grabbing everything from reddit.

Stars: 59 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/yahooanswers-grab

Saving all questions and answers from Yahoo! Answers.

Stars: 50 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/tumblr-grab

Archiving all to-be-deleted NSFW tumblr blogs.

Stars: 49 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/universal-tracker

A configurable, reusable tracker with dashboard

Stars: 34 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/terroroftinytown-client-grab

The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project

Stars: 27 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/ludios_wpull

wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved

Stars: 26 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/googleplus-grab

Archiving Google+.

Stars: 24 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/warrior-code2

Boot scripts for the ArchiveTeam Warrior 2

Stars: 24 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/youtube-grab

Archiving all metadata from YouTube (everything except videos themselves due to size)

Stars: 23 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/ftp-gov-grab

Archiving government FTPs.

Stars: 22 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/warrior4-vm

Warrior virtual machine appliance (version 4)

Stars: 21 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/warrior-code

Stars: 20 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/soundcloud-grab

Stars: 19 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/WebArchiver

Decentralized web archiving

Stars: 19 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/tinyback

A tiny web scraper

Stars: 18 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/gamemaker-sandbox-items

Gamemaker Sandbox Tracker items

Stars: 18 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/500px-grab

Archiving https://500px.com/creativecommons

Stars: 17 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/urls-grab

Archiving URLs (outlinks) from a variety of sources.

Stars: 17 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/youtube-dislikes-grab

Archiving general YouTube video metadata through innertube for dislikes removal.

Stars: 14 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/telegram-grab

Archiving public Telegram messages.

Stars: 12 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/VideoBot

Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.

Stars: 11 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/urlteam-stuff

Urlteam website, code, ... also, PONIES

Stars: 10 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/youtube-dislikes-items

Managing items for youtube-dislikes-grab.

Stars: 10 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/subscene-grab

Archiving Subscene.

Stars: 9 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/pastebin-grab

Archiving pastebin

Stars: 9 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/NewsGrabber-Warrior

Stars: 8 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/blogger-grab

Archiving Blogger/Blogspot.

Stars: 8 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/google-sites-grab

Archiving Google Sites Classic.

Stars: 8 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/urls-sources

Sources for urls-grab.

Stars: 7 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/ftp-grab

Save all FTP sites!

Stars: 7 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/flickr-grab

Grabbing Flickr images.

Stars: 7 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/mediafire-items

Managing items for mediafire-grab.

Stars: 7 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/greader-grab

http://www.archiveteam.org/index.php?title=Google_Reader

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/ftp-queue

Create queue items for ftp-grab.

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/mediafire-grab

Archiving mediafire.com URLs.

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/wget-lua-forum-scripts

Downloading forum posts with Wget+Lua

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/citeseerxpdf-grab

Grabbing all sources of CiteSeerX.

Stars: 6 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/github-grab

Archiving GitHub

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/ftp-nab

Thinger to download FTP sites

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/mobileme-grab

Downloading MobileMe

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/youtube-items

Managing items for youtube-grab

Stars: 6 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/twitchtv-grab

Grabbing twitch.tv videos

Stars: 6 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/splinder-grab

Stars: 5 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/tinyarchive

Software behind tracker.tinyarchive.org - Warning: Very hacky code

Stars: 5 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/coursera-grab

Saving courses from Coursera.

Stars: 5 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/yahoomessages-grab

Archiving Yahoo Messages

Stars: 5 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/warrior-preseed

Constructing a new warrior VM

Stars: 5 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/ffnet-grab

Fanfictioning

Stars: 5 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/gamemaker-sandbox-grab

Grabbing sandbox.yoyogames.com

Stars: 5 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/archiveteam-megawarc-factory

Some scripts to process ArchiveTeam uploads

Stars: 5 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/formspring-grab

Downloading Formspring

Stars: 5 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/wikis-grab

Grabbing all wikis.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/reddit-items

Managing items for reddit-grab.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/tumblr-grab-test

Archiving Tumblr blogs (an ArchiveTeam Warrior testing project)

Stars: 4 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/roblox-grab

Archiving Roblox forums.

Stars: 4 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/gfycat-grab

Archiving gfycat.com.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/flashdomains-grab

Copy of domains-grab for Flash sites.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/furaffinity-grab

Grabbing all images and other stuff from Fur Affinity.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/blingee-grab

Saving all images and content from Blingee.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/google-newspapers-grab

Archiving the Google News Archive at https://news.google.com/newspapers.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/Universal-tracker-2

A better tracker with more features for ArchiveTeam

Stars: 4 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/heroku-buildpack-archiveteam-warrior

Heroku buildpack with the Archive Team Warrior

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/liveleak-grab

Archiving liveleak.com

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/imdb-grab

Archiving IMDb.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/grab-base-df

Base Dockerfile for warrior project grab scripts

Stars: 4 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/justintv-grab

Grabbing as much of justin.tv's archives as possible

Stars: 4 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/sourceforge-grab

Archiving SourceForge.

Stars: 4 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/wikidot-grab

Archiving wikidot.

Stars: 4 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/parler-items

Managing items for parler-grab.

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/standalone-readme-template

Readme instructions template for manually running pipeline grab scripts outside the warrior

Stars: 3 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/fortunecity

Want to help? See the readme at the bottom.

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/enjin-grab

Archiving Enjin.

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/yahoogroups-grab

Archiving Yahoo! Groups.

Stars: 3 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/orkut-grab

Download all of Orkut

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/vlive-grab

Archiving vlive.tv.

Stars: 3 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/tencent-weibo-grab

Archiving Tencent Weibo (t.qq.com), 腾讯微博

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/zstd-dictionary-trainer

Training ZSTD dictionaries for use in ZST WARCs.

Stars: 3 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/furaffinity-items

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/livejournal-discovery

Discovering items for livejournal-grab.

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/dpreview-grab

Archiving DPReview

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/NewsGrabber-Services

The services for NewsGrabber.

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/webs-grab

Archiving webs.com

Stars: 3 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/ArchiveBot-agents

Site-specific agents that work with ArchiveBot

Stars: 3 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/pixiv-grab

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/puush-grab

Stars: 3 - Last synced: 22 Dec 2024

github.com/ArchiveTeam/panoramio-grab

Grabbing everything from panoramio

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/vidme-grab

Archiving all videos from vid.me.

Stars: 3 - Last synced: 21 Dec 2024

github.com/ArchiveTeam/twitchtv-discovery-grab

Discovering twitch.tv content

Stars: 3 - Last synced: 21 Dec 2024