Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/ArchiveTeam/skyblog-grab

Archiving all of Skyblog.
https://github.com/ArchiveTeam/skyblog-grab

Version 20230820.01. Ignore URLs containing ////.

8a0a6e3992ac2928706bc821439bf5696b970bd5 authored over 1 year ago by arkiver <[email protected]>
Version 20230810.01. Fix nil problem for URLs queuing.

396930c1af93dab786c8e65c80aaa0a15605a387 authored over 1 year ago by arkiver <[email protected]>
Version 20230809.04. Allow up to 4 doubles of URL.

bd6a2a084d4c6e079bfaf024db659a790ccbc17c authored over 1 year ago by arkiver <[email protected]>
Version 20230809.03. Correctly crawl non-blog items.

8045eee82635ea6e5fee4b6c0f481b4847d8e726 authored over 1 year ago by arkiver <[email protected]>
Version 20230809.02. Improved 404 handling. Fix bug causing abort on retry on status code < 400. Only check subdomain string on blog.

abf700f04ce6a1649af9289dad4bc28ebb599200 authored over 1 year ago by arkiver <[email protected]>
Version 20230809.01. Do not abort item on 404 on likely pagination URL treated as post.

ce8a35cec5eee74eff592c78e5e03bb8a73e64e9 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.22. Prevent loop archiving URL with good status code.

345e81d09712a0225228a000b8bf422d09ccbece authored over 1 year ago by arkiver <[email protected]>
Version 20230807.21. Abort fast on 404 on article_*.html page.

5e7bcce7c0ad84576ee76849a9a791da04cab8c0 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.20. Do not accept status code 404.

06211c4021286e7289650581709d12dc84cacbcd authored over 1 year ago by arkiver <[email protected]>
Version 20230807.19. Connect timeout 1 second.

e57345d20390d67801fa2ea0f26de709068f83db authored over 1 year ago by arkiver <[email protected]>
Version 20230807.18. Connect timeout 2 seconds. Multi item size 100.

b1eacdec0725ceee9bc509818250d28eb4b4e7bb authored over 1 year ago by arkiver <[email protected]>
Version 20230807.17. Prevent assets from being queued to #//.

48fb115be828121a89e5e0ed5a17f54787b1758e authored over 1 year ago by arkiver <[email protected]>
Version 20230807.16. Do not rotate DNS (IPs). Connect timeout 5 seconds.

aca8c69c860f7f7972b70bee4e9f1d57bd15d423 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.15. Connect timeout 0.3.

6bc5b6f54e9fee3333b648fa943086f4de579828 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.14. Update user-agents. Check status code on first URL of item. Reduce connect timeout to 0.2. Do not reject skyblog IPs.

1653585d0aae2c504ef1799928405116448a9458 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.13. Ignore _comments.xml URL. Relax the check for whether something is a remix.

ade01786ab1625768b6336c42345ececd7312fe8 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.12. Do not connect with the IPs that are unresponsive.

02966fc607e39af228f4f9c839413a5699ea8f99 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.11. Do not get _comments.xml URL if not explicitly mentioned.

fa93a1d050979b1dc47a976ab483a1d8d1ec9e99 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.10. Ignore paginated comments from comments.xml URL.

0ecd35f7f4c7b1ae45bb199d821b42692b669f82 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.09. Ignore certain unimportant comments page.

66f2c782d0249a6a14472ff248d4a845db42b7c6 authored over 1 year ago by arkiver <[email protected]>
Version 20230807.08. Prevent loop. Ignore ADD_COMMENT URL.

888b9b5c66a6eeb7568b5c246c04f631cd6ab88e authored over 1 year ago by arkiver <[email protected]>
Version 20230807.07. Skip tag items.

980f29a0cff3c4e950cb22810b9f498ec2db654e authored over 1 year ago by arkiver <[email protected]>
Version 20230807.06. Ignore individual photo and video blog pages.

3e55beb5eb23b651f4e038d3e2561cd670a5d95d authored over 1 year ago by arkiver <[email protected]>
Version 20230807.05. Connect timeout 0.3. Support mg.skyrock.com for tracks.

4b7344b4298953aa5331dfc43c920ce34a4db51e authored over 1 year ago by arkiver <[email protected]>
Version 20230807.04. Ignore URLs. Ensure only HTTPS is archived.

fb2defb1e6024764a6611e1606ab6ef07f875efa authored over 1 year ago by arkiver <[email protected]>
Version 20230807.03. Only 0.5 second connect timeout.

81e59c3cec3733ea2a8939507e8805e833472fbc authored over 1 year ago by arkiver <[email protected]>
Version 20230807.02. Timeout 1 second.

f3c9d66a3eda38d46981d2c8b48b952ac973dd1a authored over 1 year ago by arkiver <[email protected]>
Version 20230807.01. Skip post URLs with action parameter if post is remix.

43f6dce3258e361d1c985aa6139adfa0fb85bd5d authored over 1 year ago by arkiver <[email protected]>
Version 20230806.04. Multi item size 30. Make this actually get items.

d126f901860b78dca8cde04ddf0e4172e08e9714 authored over 1 year ago by arkiver <[email protected]>
Version 20230806.03. General improvements.

65914d4218619a8161a2243f928bc5ac54f04cc6 authored over 1 year ago by arkiver <[email protected]>
Version 20230806.02. Fix tracker. Remove \ from HTML.

eb4ca0d7e8c3fc8bb1ae48e3a2762ddd7280b204 authored over 1 year ago by arkiver <[email protected]>
Version 20230806.01. Initial.

820c1debc444ade2446b2838004d0854391aa527 authored over 1 year ago by arkiver <[email protected]>
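
Several of the commits above describe URL filtering rules (archive only HTTPS, ignore URLs containing ////, ignore _comments.xml URLs). As a rough illustration only, such rules might be combined as in the Python sketch below. This is not the project's code: the function name `should_skip`, the example hostnames, and the exact matching logic are assumptions made for this example.

```python
from urllib.parse import urlsplit


def should_skip(url: str) -> bool:
    """Return True if the URL matches one of the ignore rules named in the log.

    Hypothetical helper; the matching details are assumptions, not the
    project's actual implementation.
    """
    parts = urlsplit(url)
    # Version 20230807.04: ensure only HTTPS is archived.
    if parts.scheme != "https":
        return True
    # Version 20230820.01: ignore URLs containing "////".
    if "////" in url:
        return True
    # Version 20230807.13: ignore _comments.xml URLs
    # (assumed here to mean the substring appears in the path).
    if "_comments.xml" in parts.path:
        return True
    return False


if __name__ == "__main__":
    # Hostnames below are placeholders chosen for the example.
    for candidate in (
        "https://example.skyrock.com/article_1.html",
        "http://example.skyrock.com/article_1.html",
        "https://example.skyrock.com/a////b",
        "https://example.skyrock.com/1_comments.xml",
    ):
        print(candidate, "skip" if should_skip(candidate) else "keep")
```

Keeping every rule in one predicate means each new ignore rule is a single added condition, which fits the many small, incremental versions in the log.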