Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/ArchiveTeam/hyves-grab


https://github.com/ArchiveTeam/hyves-grab

Support grabbing from not domain sites.

Supports urls such as http://www.hyves.nl/club/1022532/Albertus_Magnus/.
Configure the tracker t...

442902b87a500e1ec79b17147a24d7a01d753038 authored about 11 years ago by Christopher Foo <[email protected]>
Zlib development headers are needed, too.

f349a1767e9a7b222f79626c3c9004cfd84a652c authored about 11 years ago by yipdw <[email protected]>
Add message about limiting to README.md

001111752ffd8c4679fb25122ab2a81a19382430 authored about 11 years ago by Terry Wrist <[email protected]>
Bumps version 20131120.01, closes #2.

76211bd0a319b066b5a2ea912109ddd4c6b99d7d authored about 11 years ago by Christopher Foo <[email protected]>
Merge remote-tracking branch 'origin/master' into develop

4a7632a307a7d6173dc4c7474403c502cc825f14 authored about 11 years ago by Christopher Foo <[email protected]>
Merge branch 'move-wait' of https://github.com/yipdw/hyves-grab into yipdw-move-wait

cda97e5c721298ebebecdda46e58e78af49e1276 authored about 11 years ago by Christopher Foo <[email protected]>
Update README.md

zlib-devel is needed; otherwise, wget-lua won't produce warc.gzs.

8982305245c976c9891776a0b4ae30e43a9df15c authored about 11 years ago by yipdw <[email protected]>
Fix false anger trying to GET pager form urls. Closes #1.

d5472c38b583682a98c99402d40bb33910db330e authored about 11 years ago by Christopher Foo <[email protected]>
Move wait time to httploop_result.

httploop_result is the Lua hook that's guaranteed to be called after
every request. download_ch...

daa17e419d3bad414b74cbea7ef0e9892c42a7ab authored about 11 years ago by David Yip <[email protected]>
Update README.md

python-dev is needed in the Debian/Ubuntu setup instructions.

9e61b8911972fceef62810dd865c011dd7031980 authored about 11 years ago by yipdw <[email protected]>
Use python from the environment instead of hard-code path.

29cc2c1d3052324382dfa0e840be398b3b7e6110 authored about 11 years ago by Christopher Foo <[email protected]>
Supports grabbing the music.

28d53f5f2653011b7e641dd945974ac7725dce91 authored about 11 years ago by Christopher Foo <[email protected]>
Adds aplayer.swf decrypter

2353899b9a436125a05ab7922e1213c5d52291bb authored about 11 years ago by Christopher Foo <[email protected]>
Fixes pager regex to not pick song names.

Faulty pager names with song titles causes false angers.

5b08c6d6d6137b3cd98580dc67295088e44d4a18 authored about 11 years ago by Christopher Foo <[email protected]>
Include --disable-web-address in bind address example in readme

3ce2154e9e50a78747bd9201c2b17f2586228d62 authored about 11 years ago by Christopher Foo <[email protected]>
Merge branch 'master' into develop

009e3f6aba3f0ea9fd4c952184e2ba8ab69efbfd authored about 11 years ago by Christopher Foo <[email protected]>
Unescape html entities from scraped urls.

Passing faulty urls to a pager url causes 500 which we assume it's angry
but it's not actually a...

0ca1c705ab55381f656d1ca18ae863abe92b3bea authored about 11 years ago by Christopher Foo <[email protected]>
add wget-lua-warrior

0b019145a91e315cfad1f97d14bc76a2ef41717b authored about 11 years ago by Terry Wrist <[email protected]>
Stop infinite loop on showMemberDetails

20c3f1a4d22a0b8233176a4ab6cc596fd1d7def9 authored about 11 years ago by Christopher Foo <[email protected]>
Change sleep time between requests to ~1.0seconds

d220aeb171aed012b90f05f7c1e9c35e35c1c214 authored about 11 years ago by Christopher Foo <[email protected]>
Page everything. Up min delay to 20sec.

66f705d8307e8cfdce28e20e3899dcda3eb5e28e authored about 11 years ago by Christopher Foo <[email protected]>
Use the joepie91-style readme format

9d2d64d78bd3cea718620a2728d9aa82d0b1f72e authored about 11 years ago by Christopher Foo <[email protected]>
Take out wget random-wait. Take out abort after many tries.

48413f42443db2f8baf5d2d9e185bf4e0e820753 authored about 11 years ago by Christopher Foo <[email protected]>
Support bind_address available in seesaw 0.0.16

ecc3cafe07db655432f08a64e2b6a312837b9135 authored about 11 years ago by Christopher Foo <[email protected]>
Use tries=inf and check for 500. Abort after 9000 tries.

27f34a02c7592c5fd10ac6eb8e60e060ccf902ad authored about 11 years ago by Christopher Foo <[email protected]>
Paginate the album contents

ed47b0954d19069c165db264dfc3c012660e75d8 authored about 11 years ago by Christopher Foo <[email protected]>
Fix pattern escapes. Crawl normal links in pager html fragments.

19005c3872472ce1a65fa4076fa30bd9bb4df748 authored about 11 years ago by Christopher Foo <[email protected]>
Page the forums and whowhatwhere

9b06f01c98ecef3b4c88333542d2d42ce47beda8 authored about 11 years ago by Christopher Foo <[email protected]>
Add get-wget-lua.sh

a9e18629af0c9afc90e03c7ba96770eb24d03af8 authored about 11 years ago by Christopher Foo <[email protected]>
Fill the readme with the generic instructions

5eb44ef9611201aded7c5770ab0d9e593d637ecc authored about 11 years ago by Christopher Foo <[email protected]>
Username scrape pipeline moved to archiveteam/hyves-username-grab

3f2b8a415b088f0a660965d91c4ccef519602ada authored about 11 years ago by Christopher Foo <[email protected]>
Grab videos from album

b8e4bf4dbed08d20ed38927e585282f2284d0cef authored about 11 years ago by Christopher Foo <[email protected]>
username: Check for throttle and retry on error

c176ac96c3c7393fbf3c470210a888fbce90fde5 authored about 11 years ago by Christopher Foo <[email protected]>
Implement paginating hyves from the user homepage.

0a228322ab86b56d100a291f9d9556275bb02993 authored about 11 years ago by Christopher Foo <[email protected]>
username scraper: scrape groups as well

9e82b7d890b2ec0d3a6c6e993c62ed3809f0ae4c authored about 11 years ago by Christopher Foo <[email protected]>
Add username scraper

ca2f8d17f44691ab98a0124b499ef20c10c8c83b authored about 11 years ago by Christopher Foo <[email protected]>
Move example scripts to seperate directory

b5fa9866855e1fc6c78d23db37bc5fb5736cf9b3 authored about 11 years ago by Christopher Foo <[email protected]>
Clarify comment about postman secret

8e1233e26edb415cbdceceefdf236a8816647699 authored about 11 years ago by Christopher Foo <[email protected]>
Take out the debug tracker address

11ea850dbc59ddcf4d012481f6658d04fb046725 authored about 11 years ago by Christopher Foo <[email protected]>
Whitelist the photo URLs to be downloaded

6fff95bf0f4df28b8c08161f438970e57f11e498 authored about 11 years ago by Christopher Foo <[email protected]>
Make photo comment fetch more faithful to original behaviour

Pass username to lua script

c67f799a3016c2877e5a4fa8d0dde3d8d24b93ec authored about 11 years ago by Christopher Foo <[email protected]>
Fix up the grammar in the comments

14667693040f939ea91645e69d0bcff7fe7ecf7e authored about 11 years ago by Christopher Foo <[email protected]>
Offload wget wait onto lua script

87f7822cdf43586a09af48afe590539a8c31216c authored about 11 years ago by Christopher Foo <[email protected]>
Implement grabbing of album photos and comments

051840c0e83d850b90f364c8d7469d96584a8630 authored about 11 years ago by Christopher Foo <[email protected]>
Scrape urls from pagination

b4625ad0df12285f7fc31f22a61034342a36d48c authored about 11 years ago by Christopher Foo <[email protected]>
Add missing urls return

1cd81c5b0def53dccf03586f1c63617d4f054778 authored about 11 years ago by Christopher Foo <[email protected]>
Fix typos and add lua urlcode from xanga-grab

64b00bde2d6f1fce05af5e98acc9b58832137c6f authored about 11 years ago by Christopher Foo <[email protected]>
Implement throttle check

2818a5c06a71df68012ff0b6be5e5a19b92614eb authored about 11 years ago by Christopher Foo <[email protected]>
Fix the multilang urls

888f3ffb299518cdeefd06273b7fb78c9bd19ef9 authored about 11 years ago by Christopher Foo <[email protected]>
Add more pagination urls

c44fd91b11daf71dea2312ee4e1dd76d1054905a authored about 11 years ago by Christopher Foo <[email protected]>
Stub in pagination

c4ff1c9453a532fef18d96d79bb067b450cb0edf authored about 11 years ago by Christopher Foo <[email protected]>
Add pager scripts

465957b68deea642a08b5342040bbd90a147b41b authored about 11 years ago by Christopher Foo <[email protected]>
Add WIP pipeline

9a1e1dd16bd4d3422ba09643afe53e50fa386e12 authored about 11 years ago by Christopher Foo <[email protected]>
Add empty readme file

f8ee12a854682d9fead37486d50992f9f7bcaca7 authored about 11 years ago by Christopher Foo <[email protected]>