Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/ArchiveTeam/ludios_wpull
wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved
https://github.com/ArchiveTeam/ludios_wpull
5d06b020e72937131042de71d2a1620f7a0bd73d authored 11 months ago by HeliosLHC <[email protected]>
572d73bc61f6bfe24d6be834dc333a030c75313f authored 11 months ago by HeliosLHC <[email protected]>
ad069c7671aa2f6cfa8928400388ae9d0492dffb authored 11 months ago by HeliosLHC <[email protected]>
8ccba07cfbbe6950814493e776c33a5f9729bf74 authored 11 months ago by HeliosLHC <[email protected]>
1b2a21f98512b1f7f96262117db8e3804b2840ab authored 11 months ago by HeliosLHC <[email protected]>
c8be866d17beb1165850118df096c39e3746e93d authored 11 months ago by HeliosLHC <[email protected]>
53d5339866ef9380ff478336b284d4dbf9e4c262 authored 11 months ago by HeliosLHC <[email protected]>
* update testing job to always generate coverage map
* restrict scan path of coverage
* ad...
348460287b7ff84284528589c3a73202e99b87cf authored 11 months ago by HeliosLHC <[email protected]>
c3ae6ca584fdde2be318b275d2bae7f2d5b6c406 authored 11 months ago by HeliosLHC <[email protected]>
66457b8a4438ad7d153f887195ff9d68baf7d646 authored 11 months ago by HeliosLHC <[email protected]>
d3a9517b4b305574acadfc22aeeac923f96eb2bb authored 11 months ago by HeliosLHC <[email protected]>
e3fcc2eddbd61ba60d8de37b33c38326185088bc authored 11 months ago by HeliosLHC <[email protected]>
022faff67c179957eae7c7090e0f30a0103796b2 authored 11 months ago by HeliosLHC <[email protected]>
5d6e7f02184146131d6c333da734ed15eedebb11 authored 12 months ago by HeliosLHC <[email protected]>
f77a22160ff9c95122ce6e8bf947a9be6063d6ad authored 12 months ago by HeliosLHC <[email protected]>
d4b7e485ade958945993ca0a43a76d3c7e11ace0 authored 12 months ago by HeliosLHC <[email protected]>
2171753b8d20a208d024e7fd3c1690a6b4553c3d authored 12 months ago by HeliosLHC <[email protected]>
93f52df34f1ec8392802cfa61b2a4660134edd3e authored 12 months ago by HeliosLHC <[email protected]>
c7319799e3ebe6cc6bb4a1f0b55966ba6f240bc2 authored 12 months ago by HeliosLHC <[email protected]>
* Bump psutil from 4.2.0 to 5.9.X
04d900dbbb0e2f29838ad0ac8c3a1deb76a1b41f authored 12 months ago by HeliosLHC <[email protected]>
Add initial Github Actions support for publishing to PyPI
225434999d1f97ecbb1564d5749c71eb456261a2 authored 12 months ago by HeliosLHC <[email protected]>
252da259f142a96a0f1351d643c4869fdd537a2a authored 12 months ago by HeliosLHC <[email protected]>
542cd533e92830a7701860014314138a3724eb1a authored 12 months ago by HeliosLHC <[email protected]>
Modernize ludios_wpull
7b4ed99df1a56f60f82cb5f8c0bf7cd274fc9da4 authored 12 months ago by HeliosLHC <[email protected]>
fb6049b5f1740241a9a44f01daee48b693ab77ae authored 12 months ago by HeliosLHC <[email protected]>
310aa0ccd380f8d21a207291253c9d92a8679fbd authored 12 months ago by HeliosLHC <[email protected]>
97c20fc8566a6accbb8932aaf27899f007f26c16 authored 12 months ago by HeliosLHC <[email protected]>
1d667193960984269f023ce09f69804dff9e3916 authored 12 months ago by HeliosLHC <[email protected]>
d741c66e763667eed5ea902cef87ee1488089e53 authored 12 months ago by HeliosLHC <[email protected]>
f4c009a8752492b75ac7848bf6bdc1a5d1d36d6c authored 12 months ago by HeliosLHC <[email protected]>
d40e8a69157d67b20fccbe0df7e85effd94e9dae authored 12 months ago by HeliosLHC <[email protected]>
89fee9731c5b2412725961496fc0ad46aaf58648 authored 12 months ago by HeliosLHC <[email protected]>
61e6dfdc124695aa24159fd8964caf44fa7b2006 authored 12 months ago by HeliosLHC <[email protected]>
888da23ae0e452321d530deb928cb15106cd905c authored 12 months ago by HeliosLHC <[email protected]>
997cdb49ce24bc5021c827678f445d6f922971e7 authored 12 months ago by HeliosLHC <[email protected]>
b8c02ee12ca2c72d83c3e9b47289637f1d0c3c3d authored 12 months ago by HeliosLHC <[email protected]>
8b5cc94f41b91018bb767d44b9cf35cfa1bf7bd9 authored 12 months ago by HeliosLHC <[email protected]>
f46081bd72ef8bd6158ea1273892e927cfd08bc9 authored 12 months ago by HeliosLHC <[email protected]>
86771c8338dcfdf304e2422595f680f098ea1fc4 authored 12 months ago by HeliosLHC <[email protected]>
47e11b056eac6138b36438b8afe6c57522ed7279 authored 12 months ago by HeliosLHC <[email protected]>
dd5567cff7f8808a8481f8ee1ddf191b9967fca1 authored 12 months ago by HeliosLHC <[email protected]>
1e975681254c812a2b3947828729c3d6db4fa56d authored 12 months ago by HeliosLHC <[email protected]>
dfd7687630a83da5da0ab0f747bb08a4c4f03b7e authored 12 months ago by HeliosLHC <[email protected]>
744d870673d048d786ab04566dd2b818807cddf8 authored 12 months ago by HeliosLHC <[email protected]>
176145fc693ae2d230fe405b0a21f235733fe431 authored 12 months ago by HeliosLHC <[email protected]>
f3098312e45e803c832eb84ec119b7a7e82cdb29 authored 12 months ago by HeliosLHC <[email protected]>
bd087bdf18063f3091c6e3f98aaad1c9003fb719 authored 12 months ago by HeliosLHC <[email protected]>
bfd518a68a154dbd49e5da26b617b70f15b0cacf authored 12 months ago by HeliosLHC <[email protected]>
0c2fc043c5eb2461583b15838554cd01d314c479 authored 12 months ago by HeliosLHC <[email protected]>
8c8b1ebd389aaef49cb823a597ffc8526542703e authored about 1 year ago by HeliosLHC <[email protected]>
c28d684ebd4396321b8fb303345e2d2d23d42f73 authored about 1 year ago by HeliosLHC <[email protected]>
bb32132d7d39c0f85de2baed242f05177a021476 authored about 1 year ago by HeliosLHC <[email protected]>
c2ef79ffd0fcdfa8ab169ba2e88c4c4b221c1165 authored about 1 year ago by HeliosLHC <[email protected]>
7e0e32378f8edf0f5230ea5b19fb48531edc472e authored about 1 year ago by HeliosLHC <[email protected]>
7f81a38e3841a1238d7414ce32b6d0e9ef7df05e authored about 1 year ago by HeliosLHC <[email protected]>
0d75a5acb58b2559cc2eda9282c1104c4d0bced9 authored about 1 year ago by Nyx <[email protected]>
78e8fdd5b7abe195c2ebea7114e5feea7be19150 authored about 1 year ago by Nyx <[email protected]>
19a0640ec0a1d54a070ca2bfc50e2a2e61d91baf authored about 1 year ago by Nyx <[email protected]>
f63acfc567ee057cec6d9235a6aa05154351ee8c authored about 1 year ago by Nyx <[email protected]>
7f1e6ef3c88ee765ffcb9d54e24234b23ba8bb05 authored about 1 year ago by Nyx <[email protected]>
f0f59807c4f9f5049153e1ec9199430173a9d359 authored about 1 year ago by Nyx <[email protected]>
9ef03cf2bbff4fec4bba417d4066c2cfa2a16a6f authored about 1 year ago by Nyx <[email protected]>
replaced namedlist and ordereddefaultdict with stdlib implementations
17bc682cff16428585b39ed118006d61658c1ca1 authored about 1 year ago by HeliosLHC <[email protected]>
a42a5154a40200164fa33428ac16d351a71175ce authored about 1 year ago by HeliosLHC <[email protected]>
f884cd1740a546a78af6e87c90762948d01575ee authored about 1 year ago by HeliosLHC <[email protected]>
ab1f8a9aacc0092a05d39999f5ee6e85ed6fae06 authored about 1 year ago by HeliosLHC <[email protected]>
25076b5192616d0c7a203f4c480db4dbf185697a authored about 1 year ago by HeliosLHC <[email protected]>
eee42fd2d5f89ab2fca6f052b9608cf34e125492 authored about 1 year ago by HeliosLHC <[email protected]>
07a6340610158cd6f85f7ff4ff84f1fa187d8130 authored over 1 year ago by HeliosLHC <[email protected]>
d228c01f108e42022aceacdbbd9feb61d884a2c0 authored over 1 year ago by HeliosLHC <[email protected]>
7d1db472d848d116255dbf683a94e3954c6397e5 authored over 1 year ago by HeliosLHC <[email protected]>
80c6f7d7327b0c4c4f9534d8a8f1dcef9f31badf authored over 3 years ago by Ivan Kozik <[email protected]>
```
File "/usr/local/lib/python3.7/site-packages/wpull/scraper/html.py", line 257, in _is_acce...
```
9e9a017872dfda7e2d844623458ac381efcd97e5 authored over 3 years ago by Ivan Kozik <[email protected]>
8e989f1547c68f2c5266a55d79da801eaefb149a authored over 3 years ago by Ivan Kozik <[email protected]>
1cb56bd93f252f2915754b3311865fabd5589f86 authored almost 5 years ago by Ivan Kozik <[email protected]>
Note: currently only tested with Python 3.7
475efb75ee0e929fe5fda0514a87935db7c7a991 authored about 6 years ago by Ivan Kozik <[email protected]>
a7fc92b7b98d1e90ee1c8ce5d581966398cd855a authored about 6 years ago by Ivan Kozik <[email protected]>
File "/root/gs-venv/lib/python3.7/site-packages/wpull/scraper/base.py", line 186, in scrape_in...
db73106bfeedb401a4a848a38e39149e1572aff8 authored about 6 years ago by Ivan Kozik <[email protected]>
77627232e16095f258e486aa6cd7a7ec3df0aa9f authored about 6 years ago by Ivan Kozik <[email protected]>
aa1a077b2da3967fa9da9ba2bf3a8677e9363cb0 authored about 6 years ago by DoomTay <[email protected]>
82417d31e640dad6521cad3bd9a836f99e70b081 authored about 6 years ago by DoomTay <[email protected]>
416df7a3d09f734c456fa98e92d9ae944436d024 authored about 6 years ago by flan <[email protected]>
cede7d4e0f0e843e4cbd055be712a1d6b5797ee5 authored about 6 years ago by flan <[email protected]>
Port of wget commit 6a2d67b5836a6f1b9c989968a5392ff3511bc1f9.
9706d31ad6c1f455f6df61ef6b0733a268d4acec authored about 6 years ago by flan <[email protected]>
b787821c5bac1bda78629ae08dd21b91f284631b authored about 6 years ago by Ivan Kozik <[email protected]>
Fixes https://github.com/ludios/wpull/issues/14
cc19b2531e1de74de583c84342724c8bb14e5311 authored about 6 years ago by Ivan Kozik <[email protected]>
dd3cecc5182f7b9107ba23b14d5db9cb6ffd07ee authored about 6 years ago by Ivan Kozik <[email protected]>
b538f07ca1339d2dbaf473ba09f4cec9e2b0dbbc authored about 6 years ago by Ivan Kozik <[email protected]>
c76c34963e58f729fd6341468ad328122074e7a8 authored about 6 years ago by Ivan Kozik <[email protected]>
9844e92994cc96f6d2765753f183c84ef34aa376 authored about 6 years ago by Ivan Kozik <[email protected]>
2968734ee68699e57350f39c55971b62749813b8 authored about 6 years ago by Ivan Kozik <[email protected]>
8fabca4e60c38bd1ddb328731e01728c7bc066a6 authored about 6 years ago by Ivan Kozik <[email protected]>
bd5f00036703d352f2f652c18718fd77ee9c2b9d authored about 6 years ago by Ivan Kozik <[email protected]>
1fad32f8be5155ea81feed0af12c937c8fb9fbfd authored about 6 years ago by Ivan Kozik <[email protected]>
4c043dfe9180c4976eb0c1836ecc43a66e6a734a authored about 6 years ago by Ivan Kozik <[email protected]>
I didn't want to fix these for lxml trees, and neither grab-site nor ArchiveBot
use these options...
html5-parser is much faster than html5lib.
Bump minimum Python version to 3.7.0.
Monkeypatch u...
71ec52061e24c4893200240933939d5ce0dacd39 authored about 6 years ago by Ivan Kozik <[email protected]>
ac77c5cbcbd9e4e9f65ca8fdff375a56135bcd8a authored about 6 years ago by Ivan Kozik <[email protected]>
e1c35bf62cdc704b7144f9e65029668c6fd754fd authored about 6 years ago by Ivan Kozik <[email protected]>