Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/ArchiveTeam/grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
https://github.com/ArchiveTeam/grab-site
Make log window animation a little less glitchy
4ee4d964424a72d2898ed1ff069377de4f318514 authored over 9 years ago by Ivan Kozik <[email protected]>
4ee4d964424a72d2898ed1ff069377de4f318514 authored over 9 years ago by Ivan Kozik <[email protected]>
Add Clear button for filter; always show scrollbar
32722ad9d376873eaee91d3ca50b48bc8b52d0cf authored over 9 years ago by Ivan Kozik <[email protected]>
32722ad9d376873eaee91d3ca50b48bc8b52d0cf authored over 9 years ago by Ivan Kozik <[email protected]>
Align the nick as well; fix padding when aligned
b93b799a11e4cc9fbb6f435d62df7585cb573ca5 authored over 9 years ago by Ivan Kozik <[email protected]>
b93b799a11e4cc9fbb6f435d62df7585cb573ca5 authored over 9 years ago by Ivan Kozik <[email protected]>
Remove link to dashboard2
b261297c1d82b6e64efe999a1361d66ecae695ba authored over 9 years ago by Ivan Kozik <[email protected]>
b261297c1d82b6e64efe999a1361d66ecae695ba authored over 9 years ago by Ivan Kozik <[email protected]>
dashboard: Remove extraneous space when note not present.
1766d9f65389614b048585441b84662ea93e5581 authored over 9 years ago by David Yip <[email protected]>
1766d9f65389614b048585441b84662ea93e5581 authored over 9 years ago by David Yip <[email protected]>
Fix erroneous rendering of border after filtering
a1611d453e3f4eeb216ddc4c05923c692e2694d4 authored over 9 years ago by Ivan Kozik <[email protected]>
a1611d453e3f4eeb216ddc4c05923c692e2694d4 authored over 9 years ago by Ivan Kozik <[email protected]>
Implement filter box
953f6d3bd58da8e5a44eeac18a6d18d1f661f2cd authored over 9 years ago by Ivan Kozik <[email protected]>
953f6d3bd58da8e5a44eeac18a6d18d1f661f2cd authored over 9 years ago by Ivan Kozik <[email protected]>
Show taller log window when there is only one visible
605824fd14f11d1ba67d6f781133cf8716c7e7b4 authored over 9 years ago by Ivan Kozik <[email protected]>
605824fd14f11d1ba67d6f781133cf8716c7e7b4 authored over 9 years ago by Ivan Kozik <[email protected]>
Preliminary implementation of 'Align!' feature
c669035f5d95c234dea670b8b30f0d0cd0e7732c authored over 9 years ago by Ivan Kozik <[email protected]>
c669035f5d95c234dea670b8b30f0d0cd0e7732c authored over 9 years ago by Ivan Kozik <[email protected]>
dashboard: Add "try 3.0 beta" link
12c351e47f3b5b42bbf0583239baba6acd521c95 authored over 9 years ago by Christopher Foo <[email protected]>
12c351e47f3b5b42bbf0583239baba6acd521c95 authored over 9 years ago by Christopher Foo <[email protected]>
Use wider dashboard margins on big screens
27c446e3349c3201131f241cbc62da287586d81d authored over 9 years ago by Ivan Kozik <[email protected]>
27c446e3349c3201131f241cbc62da287586d81d authored over 9 years ago by Ivan Kozik <[email protected]>
Tweak # queued tooltip, refactor
aeb8addab5d563fce03e5640df1e3094077f84f1 authored over 9 years ago by Ivan Kozik <[email protected]>
aeb8addab5d563fce03e5640df1e3094077f84f1 authored over 9 years ago by Ivan Kozik <[email protected]>
dashboard: Don't show blank notes.
773f5887e7e326f68edc3ec03c67c6494f7970b7 authored over 9 years ago by David Yip <[email protected]>
773f5887e7e326f68edc3ec03c67c6494f7970b7 authored over 9 years ago by David Yip <[email protected]>
dashboard: Add links to pipeline and job reports.
Better wording and layout is totally up for grabs.
70d3f24957b7ef16f2bb8ed482e29f75a53be73d authored over 9 years ago by David Yip <[email protected]>
s/Filter:/Show:/
bcd9b30da226c0504d90f0ef8b71c2c5b04b7222 authored over 9 years ago by Ivan Kozik <[email protected]>
bcd9b30da226c0504d90f0ef8b71c2c5b04b7222 authored over 9 years ago by Ivan Kozik <[email protected]>
Show/hide one log window when its stats line is clicked
b38a703e7df4198287aa19365ad1f78f1bd8f761 authored over 9 years ago by Ivan Kozik <[email protected]>
b38a703e7df4198287aa19365ad1f78f1bd8f761 authored over 9 years ago by Ivan Kozik <[email protected]>
Animate showing/hiding of log windows
3006ba9637bee56bfb397d10db2f6432709209a6 authored over 9 years ago by Ivan Kozik <[email protected]>
3006ba9637bee56bfb397d10db2f6432709209a6 authored over 9 years ago by Ivan Kozik <[email protected]>
Fix ws:// and /logs/recent URLs
ddafec8c27daa6c0f94474ce6457a923eb985cde authored over 9 years ago by Ivan Kozik <[email protected]>
ddafec8c27daa6c0f94474ce6457a923eb985cde authored over 9 years ago by Ivan Kozik <[email protected]>
Autofocus the filter box
ccb57691b7d4196bb6998aadd20693911f9d3db4 authored over 9 years ago by Ivan Kozik <[email protected]>
ccb57691b7d4196bb6998aadd20693911f9d3db4 authored over 9 years ago by Ivan Kozik <[email protected]>
Import the first revision of dashboard 2.0 in the ArchiveBot repo
fa4f193af9f1f70007d2d55a1c66a8c665ac78f2 authored over 9 years ago by Ivan Kozik <[email protected]>
fa4f193af9f1f70007d2d55a1c66a8c665ac78f2 authored over 9 years ago by Ivan Kozik <[email protected]>
Start work on websocket server for future dashboard integration
5229ddf5dc41acdf4d24aa5b8472b945fbf8558d authored over 9 years ago by Ivan Kozik <[email protected]>
5229ddf5dc41acdf4d24aa5b8472b945fbf8558d authored over 9 years ago by Ivan Kozik <[email protected]>
Clarify argument order requirement
03d1efc2cee1d498324adf5a2c0fce83683742bd authored over 9 years ago by Ivan Kozik <[email protected]>
03d1efc2cee1d498324adf5a2c0fce83683742bd authored over 9 years ago by Ivan Kozik <[email protected]>
Update UA
62cba3a0e7f53a94d65ecd5ec5f73de1759c061a authored over 9 years ago by Ivan Kozik <[email protected]>
62cba3a0e7f53a94d65ecd5ec5f73de1759c061a authored over 9 years ago by Ivan Kozik <[email protected]>
Put the date into the DIR name and WARC name
2d9b1395f151bd381c3d0f156103b60ff674e22b authored over 9 years ago by Ivan Kozik <[email protected]>
2d9b1395f151bd381c3d0f156103b60ff674e22b authored over 9 years ago by Ivan Kozik <[email protected]>
Update UA
66d22b155680d62525b597485d8c2f477ecf00a7 authored over 9 years ago by Ivan Kozik <[email protected]>
66d22b155680d62525b597485d8c2f477ecf00a7 authored over 9 years ago by Ivan Kozik <[email protected]>
Remove --no-skip-getaddrinfo to match ArchiveBot
c83c89b0cf189352632985d8f80f06c788f4e3bf authored over 9 years ago by Ivan Kozik <[email protected]>
c83c89b0cf189352632985d8f80f06c788f4e3bf authored over 9 years ago by Ivan Kozik <[email protected]>
Send Accept-Language to avoid 500 Internal Server Error when sending Firefox UA to reddit.com
2983615d50d6a1ea45df735cfdf6bf7d11c9d93f authored over 9 years ago by Ivan Kozik <[email protected]>
2983615d50d6a1ea45df735cfdf6bf7d11c9d93f authored over 9 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
8bf1b00c46e64fd0d26d4c707d2a5230a26dfb5c authored almost 10 years ago by Ivan Kozik <[email protected]>
8bf1b00c46e64fd0d26d4c707d2a5230a26dfb5c authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
040afe0d92e706c84fbdafeb241895db8710ca48 authored almost 10 years ago by Ivan Kozik <[email protected]>
040afe0d92e706c84fbdafeb241895db8710ca48 authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
9fbe0bdb6f93c60fa9e6f4fd59b0124b34405bf4 authored almost 10 years ago by Ivan Kozik <[email protected]>
9fbe0bdb6f93c60fa9e6f4fd59b0124b34405bf4 authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
d14d8135e62fa535c3bd0117d34053b2b7e2504b authored almost 10 years ago by Ivan Kozik <[email protected]>
d14d8135e62fa535c3bd0117d34053b2b7e2504b authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
c140f1caf0f3293d52272743e1b18d8a8f6ffab5 authored almost 10 years ago by Ivan Kozik <[email protected]>
c140f1caf0f3293d52272743e1b18d8a8f6ffab5 authored almost 10 years ago by Ivan Kozik <[email protected]>
Pause the crawl when running low on disk or memory
b785e9b1b709e5ace477d4f42dbe125a438b20dd authored almost 10 years ago by Ivan Kozik <[email protected]>
b785e9b1b709e5ace477d4f42dbe125a438b20dd authored almost 10 years ago by Ivan Kozik <[email protected]>
Avoid creating directories with ? or & in the filename, which breaks
sqlalchemy when it tries to parse arguments from the filename.
Fixes https://github.com/ludios/g...
cbaccd9e024a1506392377171aa66cf68be266f2 authored almost 10 years ago by Ivan Kozik <[email protected]>
Describe arguments more
f80df6944f9922c8dc42d544347a23592c4d8101 authored almost 10 years ago by Ivan Kozik <[email protected]>
f80df6944f9922c8dc42d544347a23592c4d8101 authored almost 10 years ago by Ivan Kozik <[email protected]>
Cleanup
611a0be845364e4594778842081b82578df2db1d authored almost 10 years ago by Ivan Kozik <[email protected]>
611a0be845364e4594778842081b82578df2db1d authored almost 10 years ago by Ivan Kozik <[email protected]>
Mention WARC files; clarify
820e2aeef4ad477f273b33ec47400ecda1f89e63 authored almost 10 years ago by Ivan Kozik <[email protected]>
820e2aeef4ad477f273b33ec47400ecda1f89e63 authored almost 10 years ago by Ivan Kozik <[email protected]>
Describe what this is
a1cbcb9ea93de6d76faea6eff808a4f7ce15b64a authored almost 10 years ago by Ivan Kozik <[email protected]>
a1cbcb9ea93de6d76faea6eff808a4f7ce15b64a authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
3fe4774c2cc93bc74f35def70d047d822654296d authored almost 10 years ago by Ivan Kozik <[email protected]>
3fe4774c2cc93bc74f35def70d047d822654296d authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
1f9f80dff0837e1bcce8b8097890b22aa2b31916 authored almost 10 years ago by Ivan Kozik <[email protected]>
1f9f80dff0837e1bcce8b8097890b22aa2b31916 authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
fbbfa3c0b4b7ece2ad762006fce9d1f24ce0d68f authored almost 10 years ago by Ivan Kozik <[email protected]>
fbbfa3c0b4b7ece2ad762006fce9d1f24ce0d68f authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
5b2b68061d08270b0fb0b0bc4eeef50232617267 authored almost 10 years ago by Ivan Kozik <[email protected]>
5b2b68061d08270b0fb0b0bc4eeef50232617267 authored almost 10 years ago by Ivan Kozik <[email protected]>
Include path/query components in directory name
85f02d20555e852b0f7b8f458248dfa7e73b2851 authored almost 10 years ago by Ivan Kozik <[email protected]>
85f02d20555e852b0f7b8f458248dfa7e73b2851 authored almost 10 years ago by Ivan Kozik <[email protected]>
Copy in latest dupespotter
62866f53364e172ec27418b4525505e1839842f6 authored almost 10 years ago by Ivan Kozik <[email protected]>
62866f53364e172ec27418b4525505e1839842f6 authored almost 10 years ago by Ivan Kozik <[email protected]>
Link to global ignore set
ccaee25497e1defae4c385385e08e78111ebfa53 authored almost 10 years ago by Ivan Kozik <[email protected]>
ccaee25497e1defae4c385385e08e78111ebfa53 authored almost 10 years ago by Ivan Kozik <[email protected]>
Clarify
e2118bbea4a8d4921aaeee762dc81f36ccf38bad authored almost 10 years ago by Ivan Kozik <[email protected]>
e2118bbea4a8d4921aaeee762dc81f36ccf38bad authored almost 10 years ago by Ivan Kozik <[email protected]>
Tell user to install git as well
4a22b4d59331e4f705027c4eef40b285f53ecfa5 authored almost 10 years ago by Ivan Kozik <[email protected]>
4a22b4d59331e4f705027c4eef40b285f53ecfa5 authored almost 10 years ago by Ivan Kozik <[email protected]>
Support --ignore-sets= instead of the space-separated version
65e096a0356e0a42db57e5e33c6dc0939932f4b2 authored almost 10 years ago by Ivan Kozik <[email protected]>
65e096a0356e0a42db57e5e33c6dc0939932f4b2 authored almost 10 years ago by Ivan Kozik <[email protected]>
Link to pythex
2d7125951f00f8d6eaf526a6e94af4b521684cb1 authored almost 10 years ago by Ivan Kozik <[email protected]>
2d7125951f00f8d6eaf526a6e94af4b521684cb1 authored almost 10 years ago by Ivan Kozik <[email protected]>
Document file formats
f815920a83da6a61c81acaac4254cc33325c664e authored almost 10 years ago by Ivan Kozik <[email protected]>
f815920a83da6a61c81acaac4254cc33325c664e authored almost 10 years ago by Ivan Kozik <[email protected]>
Make it real obvious
d73ee5ba27c774c303345c6ba730329473848fd1 authored almost 10 years ago by Ivan Kozik <[email protected]>
d73ee5ba27c774c303345c6ba730329473848fd1 authored almost 10 years ago by Ivan Kozik <[email protected]>
Add ArchiveBot LICENSE
2f7ae834bb98757b7c87fa55bc545a1613381a32 authored almost 10 years ago by Ivan Kozik <[email protected]>
2f7ae834bb98757b7c87fa55bc545a1613381a32 authored almost 10 years ago by Ivan Kozik <[email protected]>
Add igoff feature
0699689a1418c42bf55a6c74fef9fd73363bc598 authored almost 10 years ago by Ivan Kozik <[email protected]>
0699689a1418c42bf55a6c74fef9fd73363bc598 authored almost 10 years ago by Ivan Kozik <[email protected]>
Add support for --no-offsite-links
2ccb8b4d6f32176c95193429074d02374c496fa3 authored almost 10 years ago by Ivan Kozik <[email protected]>
2ccb8b4d6f32176c95193429074d02374c496fa3 authored almost 10 years ago by Ivan Kozik <[email protected]>
Another html5lib comment
52d0acc3b52cc91d9113d39026b8e732f8fbb85b authored almost 10 years ago by Ivan Kozik <[email protected]>
52d0acc3b52cc91d9113d39026b8e732f8fbb85b authored almost 10 years ago by Ivan Kozik <[email protected]>
Fix comment
64d027da2cff2e2d4691b41c06621c6b1d6ba2d6 authored almost 10 years ago by Ivan Kozik <[email protected]>
64d027da2cff2e2d4691b41c06621c6b1d6ba2d6 authored almost 10 years ago by Ivan Kozik <[email protected]>
Rename script
6f8ef82efbd1acd47c361066dc9aa2ce8c563292 authored almost 10 years ago by Ivan Kozik <[email protected]>
6f8ef82efbd1acd47c361066dc9aa2ce8c563292 authored almost 10 years ago by Ivan Kozik <[email protected]>
Load changes from DIR/ignores and DIR/ignore_sets while the crawl is running
979b843458afb52a8ba251c54c2b271cce99c1ce authored almost 10 years ago by Ivan Kozik <[email protected]>
979b843458afb52a8ba251c54c2b271cce99c1ce authored almost 10 years ago by Ivan Kozik <[email protected]>
Refactor
5f7593fda207c28bcba2e63ebf441b953211ff9f authored almost 10 years ago by Ivan Kozik <[email protected]>
5f7593fda207c28bcba2e63ebf441b953211ff9f authored almost 10 years ago by Ivan Kozik <[email protected]>
Improve README
429b2032ffe34e060ab9ba68436bd8216a762d95 authored almost 10 years ago by Ivan Kozik <[email protected]>
429b2032ffe34e060ab9ba68436bd8216a762d95 authored almost 10 years ago by Ivan Kozik <[email protected]>
CRLF -> LF
1705174fb2d1b8e8401c7da2b99b35690659910f authored almost 10 years ago by Ivan Kozik <[email protected]>
1705174fb2d1b8e8401c7da2b99b35690659910f authored almost 10 years ago by Ivan Kozik <[email protected]>
Allow specifying --ignore-sets NAME1,NAME2,...
eea440422dd071286c003c7362990226a47cc42e authored almost 10 years ago by Ivan Kozik <[email protected]>
eea440422dd071286c003c7362990226a47cc42e authored almost 10 years ago by Ivan Kozik <[email protected]>
Use global ignore set and also ignore Icecast sites like ArchiveBot
a61ed949ca85d916847094ef0cfbf0f44f723cf6 authored almost 10 years ago by Ivan Kozik <[email protected]>
a61ed949ca85d916847094ef0cfbf0f44f723cf6 authored almost 10 years ago by Ivan Kozik <[email protected]>
Use cookies.txt
2986ae8a3143830bfec6bdf7e4e17ee248ac0880 authored almost 10 years ago by Ivan Kozik <[email protected]>
2986ae8a3143830bfec6bdf7e4e17ee248ac0880 authored almost 10 years ago by Ivan Kozik <[email protected]>
Add a site-grabber based on ArchiveBot's use of wpull
91fd89be5d61423449814518261bddb85e814d70 authored almost 10 years ago by Ivan Kozik <[email protected]>
91fd89be5d61423449814518261bddb85e814d70 authored almost 10 years ago by Ivan Kozik <[email protected]>