Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/ArchiveTeam/ludios_wpull

wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved
https://github.com/ArchiveTeam/ludios_wpull

Bump version to 5.0.3 from 5.0.2a

5d06b020e72937131042de71d2a1620f7a0bd73d authored 11 months ago by HeliosLHC <[email protected]>
Remove version lock for chardet

572d73bc61f6bfe24d6be834dc333a030c75313f authored 11 months ago by HeliosLHC <[email protected]>
Bump dependency chardet to 5.X.X

ad069c7671aa2f6cfa8928400388ae9d0492dffb authored 11 months ago by HeliosLHC <[email protected]>
Move permissions block up 1 level

8ccba07cfbbe6950814493e776c33a5f9729bf74 authored 11 months ago by HeliosLHC <[email protected]>
grant GH actions permission to update pull requests

1b2a21f98512b1f7f96262117db8e3804b2840ab authored 11 months ago by HeliosLHC <[email protected]>
disable failing socket reuse and proxy unittest (#30)

c8be866d17beb1165850118df096c39e3746e93d authored 11 months ago by HeliosLHC <[email protected]>
Delete deprecated .travis.yml

53d5339866ef9380ff478336b284d4dbf9e4c262 authored 11 months ago by HeliosLHC <[email protected]>
Fix GH actions unittesting (#29)

* update testing job to always generate coverage map

* restrict scan path of coverage

* ad...

348460287b7ff84284528589c3a73202e99b87cf authored 11 months ago by HeliosLHC <[email protected]>
re-add removed unittest step

c3ae6ca584fdde2be318b275d2bae7f2d5b6c406 authored 11 months ago by HeliosLHC <[email protected]>
Fix test action name and move coverage gen to separate step

66457b8a4438ad7d153f887195ff9d68baf7d646 authored 11 months ago by HeliosLHC <[email protected]>
Allow non-zero exit code for unittest

d3a9517b4b305574acadfc22aeeac923f96eb2bb authored 11 months ago by HeliosLHC <[email protected]>
Add missing sudo for apt commands

e3fcc2eddbd61ba60d8de37b33c38326185088bc authored 11 months ago by HeliosLHC <[email protected]>
Add Github Action for unittesting and coverage report

022faff67c179957eae7c7090e0f30a0103796b2 authored 11 months ago by HeliosLHC <[email protected]>
Bump version to 5.0.2a

5d6e7f02184146131d6c333da734ed15eedebb11 authored 12 months ago by HeliosLHC <[email protected]>
Pin PyPI actions to v1

f77a22160ff9c95122ce6e8bf947a9be6063d6ad authored 12 months ago by HeliosLHC <[email protected]>
Change source repository URL

d4b7e485ade958945993ca0a43a76d3c7e11ace0 authored 12 months ago by HeliosLHC <[email protected]>
Remove top-level permission grant for id-token

2171753b8d20a208d024e7fd3c1690a6b4553c3d authored 12 months ago by HeliosLHC <[email protected]>
Add id-token write permission for GH Actions

93f52df34f1ec8392802cfa61b2a4660134edd3e authored 12 months ago by HeliosLHC <[email protected]>
Update version to 5.0.1a due to psutil upgrade

c7319799e3ebe6cc6bb4a1f0b55966ba6f240bc2 authored 12 months ago by HeliosLHC <[email protected]>
Upgrade psutil from 4.2.0 to 5.9.X (#27)

* Bump psutil from 4.2.0 to 5.9.X

04d900dbbb0e2f29838ad0ac8c3a1deb76a1b41f authored 12 months ago by HeliosLHC <[email protected]>
Merge pull request #26 from ArchiveTeam/github-actions

Add initial Github Actions support for publishing to PyPI

225434999d1f97ecbb1564d5749c71eb456261a2 authored 12 months ago by HeliosLHC <[email protected]>
Update python-publish.yml

252da259f142a96a0f1351d643c4869fdd537a2a authored 12 months ago by HeliosLHC <[email protected]>
Create python-publish.yml

542cd533e92830a7701860014314138a3724eb1a authored 12 months ago by HeliosLHC <[email protected]>
Merge pull request #25 from ArchiveTeam/python-3.12

Modernize ludios_wpull

7b4ed99df1a56f60f82cb5f8c0bf7cd274fc9da4 authored 12 months ago by HeliosLHC <[email protected]>
reformatting

fb6049b5f1740241a9a44f01daee48b693ab77ae authored 12 months ago by HeliosLHC <[email protected]>
set version type to alpha

310aa0ccd380f8d21a207291253c9d92a8679fbd authored 12 months ago by HeliosLHC <[email protected]>
fix additional unit tests

97c20fc8566a6accbb8932aaf27899f007f26c16 authored 12 months ago by HeliosLHC <[email protected]>
remove assertion that doesn't apply due to lack of --timestamping flag

1d667193960984269f023ce09f69804dff9e3916 authored 12 months ago by HeliosLHC <[email protected]>
replace localhost with 127.0.0.1 that tornado.testing now returns for get_url

d741c66e763667eed5ea902cef87ee1488089e53 authored 12 months ago by HeliosLHC <[email protected]>
amend fix for plugin.resolve_dns method test

f4c009a8752492b75ac7848bf6bdc1a5d1d36d6c authored 12 months ago by HeliosLHC <[email protected]>
fix test assertion due to tornado changing default from localhost to 127.0.0.1

d40e8a69157d67b20fccbe0df7e85effd94e9dae authored 12 months ago by HeliosLHC <[email protected]>
replace deprecated tornado code

89fee9731c5b2412725961496fc0ad46aaf58648 authored 12 months ago by HeliosLHC <[email protected]>
remove handling for older sphinx versions

61e6dfdc124695aa24159fd8964caf44fa7b2006 authored 12 months ago by HeliosLHC <[email protected]>
move packaging back as primary dependency

888da23ae0e452321d530deb928cb15106cd905c authored 12 months ago by HeliosLHC <[email protected]>
bump version to 5 and change written program name from Wpull to ludios_wpull

997cdb49ce24bc5021c827678f445d6f922971e7 authored 12 months ago by HeliosLHC <[email protected]>
bump minimum Python version from 3.11 to 3.12

b8c02ee12ca2c72d83c3e9b47289637f1d0c3c3d authored 12 months ago by HeliosLHC <[email protected]>
upgrade unit tests

8b5cc94f41b91018bb767d44b9cf35cfa1bf7bd9 authored 12 months ago by HeliosLHC <[email protected]>
upgrade yapsy to use master branch

f46081bd72ef8bd6158ea1273892e927cfd08bc9 authored 12 months ago by HeliosLHC <[email protected]>
fix html scraper test

86771c8338dcfdf304e2422595f680f098ea1fc4 authored 12 months ago by HeliosLHC <[email protected]>
replace deprecated mimetype from test (obsoleted in IETF RFC 9239)

47e11b056eac6138b36438b8afe6c57522ed7279 authored 12 months ago by HeliosLHC <[email protected]>
replace deprecated datetime.utcnow dns.resolver.Resolver.query methods

dd5567cff7f8808a8481f8ee1ddf191b9967fca1 authored 12 months ago by HeliosLHC <[email protected]>
replace deprecated assertEquals with assertEqual

1e975681254c812a2b3947828729c3d6db4fa56d authored 12 months ago by HeliosLHC <[email protected]>
use scalar_subquery fix SQLAlchemy implicit casting warnings

dfd7687630a83da5da0ab0f747bb08a4c4f03b7e authored 12 months ago by HeliosLHC <[email protected]>
remove legacy compat support for older OpenSSL versions

744d870673d048d786ab04566dd2b818807cddf8 authored 12 months ago by HeliosLHC <[email protected]>
replace yapsy release version with master branch version

176145fc693ae2d230fe405b0a21f235733fe431 authored 12 months ago by HeliosLHC <[email protected]>
replace custom ConcurrentHTTPServer with now native ThreadingHTTPServer

f3098312e45e803c832eb84ec119b7a7e82cdb29 authored 12 months ago by HeliosLHC <[email protected]>
add warcat as testing dependency

bd087bdf18063f3091c6e3f98aaad1c9003fb719 authored 12 months ago by HeliosLHC <[email protected]>
replace tornado SSL error with stdlib SSL error

bfd518a68a154dbd49e5da26b617b70f15b0cacf authored 12 months ago by HeliosLHC <[email protected]>
bump tornado to v6

0c2fc043c5eb2461583b15838554cd01d314c479 authored 12 months ago by HeliosLHC <[email protected]>
upgrade sqlalchemy to version 2

8c8b1ebd389aaef49cb823a597ffc8526542703e authored about 1 year ago by HeliosLHC <[email protected]>
replace legacy setup.py and requirements.txt with pyproject.toml

c28d684ebd4396321b8fb303345e2d2d23d42f73 authored about 1 year ago by HeliosLHC <[email protected]>
ignore ruff formatter

bb32132d7d39c0f85de2baed242f05177a021476 authored about 1 year ago by HeliosLHC <[email protected]>
Remove Python 2 style subclassing from "object"

c2ef79ffd0fcdfa8ab169ba2e88c4c4b221c1165 authored about 1 year ago by HeliosLHC <[email protected]>
replace "yield from" with "await" in tests

7e0e32378f8edf0f5230ea5b19fb48531edc472e authored about 1 year ago by HeliosLHC <[email protected]>
Merge branch 'python-3.11' of https://github.com/ArchiveTeam/ludios_wpull into python-3.11

7f81a38e3841a1238d7414ce32b6d0e9ef7df05e authored about 1 year ago by HeliosLHC <[email protected]>
imports cleanup

0d75a5acb58b2559cc2eda9282c1104c4d0bced9 authored about 1 year ago by Nyx <[email protected]>
cleanup unused imports

78e8fdd5b7abe195c2ebea7114e5feea7be19150 authored about 1 year ago by Nyx <[email protected]>
remove unnecessary await

19a0640ec0a1d54a070ca2bfc50e2a2e61d91baf authored about 1 year ago by Nyx <[email protected]>
Change LinkContext to frozen dataclass to enable hashing

f63acfc567ee057cec6d9235a6aa05154351ee8c authored about 1 year ago by Nyx <[email protected]>
fix incorrect typing for "link_type"

7f1e6ef3c88ee765ffcb9d54e24234b23ba8bb05 authored about 1 year ago by Nyx <[email protected]>
move LinkInfo and convert to dataclass

f0f59807c4f9f5049153e1ec9199430173a9d359 authored about 1 year ago by Nyx <[email protected]>
Upgrade to Python 3.11 asyncio syntax

9ef03cf2bbff4fec4bba417d4066c2cfa2a16a6f authored about 1 year ago by Nyx <[email protected]>
Merge pull request #24 from HeliosLHC/modernizations

replaced namedlist and ordereddefaultdict with stdlib implementations

17bc682cff16428585b39ed118006d61658c1ca1 authored about 1 year ago by HeliosLHC <[email protected]>
autopep8 reformatting

a42a5154a40200164fa33428ac16d351a71175ce authored about 1 year ago by HeliosLHC <[email protected]>
ignore venv directory

f884cd1740a546a78af6e87c90762948d01575ee authored about 1 year ago by HeliosLHC <[email protected]>
replace namedtuples with dataclasses

ab1f8a9aacc0092a05d39999f5ee6e85ed6fae06 authored about 1 year ago by HeliosLHC <[email protected]>
remove unused imports

25076b5192616d0c7a203f4c480db4dbf185697a authored about 1 year ago by HeliosLHC <[email protected]>
swap namedtuple with dataclass

eee42fd2d5f89ab2fca6f052b9608cf34e125492 authored about 1 year ago by HeliosLHC <[email protected]>
remove logic for deprecated Python versions

07a6340610158cd6f85f7ff4ff84f1fa187d8130 authored over 1 year ago by HeliosLHC <[email protected]>
update references to use new collections.abc module

d228c01f108e42022aceacdbbd9feb61d884a2c0 authored over 1 year ago by HeliosLHC <[email protected]>
replaced namedlist and ordereddefaultdict with stdlib implementations

7d1db472d848d116255dbf683a94e3954c6397e5 authored over 1 year ago by HeliosLHC <[email protected]>
Bump version 3.0.9

80c6f7d7327b0c4c4f9534d8a8f1dcef9f31badf authored over 3 years ago by Ivan Kozik <[email protected]>
Fix `AttributeError: 'cython_function_or_method' object has no attribute 'lower'` when parsing an XML declaration.

```
File "/usr/local/lib/python3.7/site-packages/wpull/scraper/html.py", line 257, in _is_acce...

8461910d54d7efbec705d1651c0ea0a7bdb7f322 authored over 3 years ago by Ivan Kozik <[email protected]>
Bump version 3.0.8

9e9a017872dfda7e2d844623458ac381efcd97e5 authored over 3 years ago by Ivan Kozik <[email protected]>
Pin sqlalchemy to 1.3.24 to fix https://github.com/ArchiveTeam/grab-site/issues/181

8e989f1547c68f2c5266a55d79da801eaefb149a authored over 3 years ago by Ivan Kozik <[email protected]>
Move README to avoid showing deceptive install instructions on GitHub

1cb56bd93f252f2915754b3311865fabd5589f86 authored almost 5 years ago by Ivan Kozik <[email protected]>
setup.py: remove specific Python versions from trove classifiers

Note: currently only tested with Python 3.7

475efb75ee0e929fe5fda0514a87935db7c7a991 authored about 6 years ago by Ivan Kozik <[email protected]>
Bump version 3.0.7

a7fc92b7b98d1e90ee1c8ce5d581966398cd855a authored about 6 years ago by Ivan Kozik <[email protected]>
Fix a crash when parsing sitemaps

File "/root/gs-venv/lib/python3.7/site-packages/wpull/scraper/base.py", line 186, in scrape_in...

db73106bfeedb401a4a848a38e39149e1572aff8 authored about 6 years ago by Ivan Kozik <[email protected]>
Bump version 3.0.6

77627232e16095f258e486aa6cd7a7ec3df0aa9f authored about 6 years ago by Ivan Kozik <[email protected]>
Appended unit tests

aa1a077b2da3967fa9da9ba2bf3a8677e9363cb0 authored about 6 years ago by DoomTay <[email protected]>
Tweaked background URL regex

82417d31e640dad6521cad3bd9a836f99e70b081 authored about 6 years ago by DoomTay <[email protected]>
Add test for <track>

416df7a3d09f734c456fa98e92d9ae944436d024 authored about 6 years ago by flan <[email protected]>
Add tests for 98c8a987e8c9884584cee5dc7ed8697d255db220

cede7d4e0f0e843e4cbd055be712a1d6b5797ee5 authored about 6 years ago by flan <[email protected]>
Extract links from HTML5 media tags

Port of wget commit 6a2d67b5836a6f1b9c989968a5392ff3511bc1f9.

9706d31ad6c1f455f6df61ef6b0733a268d4acec authored about 6 years ago by flan <[email protected]>
Bump version 3.0.5

b787821c5bac1bda78629ae08dd21b91f284631b authored about 6 years ago by Ivan Kozik <[email protected]>
scraper.html: fix ValueError crash when parsing an image as HTML

Fixes https://github.com/ludios/wpull/issues/14

cc19b2531e1de74de583c84342724c8bb14e5311 authored about 6 years ago by Ivan Kozik <[email protected]>
Fix segfault on parsing <html>...<html /> in xhtml mode: https://github.com/ludios/wpull/issues/15

dd3cecc5182f7b9107ba23b14d5db9cb6ffd07ee authored about 6 years ago by Ivan Kozik <[email protected]>
Bump version 3.0.3

b538f07ca1339d2dbaf473ba09f4cec9e2b0dbbc authored about 6 years ago by Ivan Kozik <[email protected]>
Remove "Please report this problem [...] to Wpull's issue tracker" message because we're a fork

c76c34963e58f729fd6341468ad328122074e7a8 authored about 6 years ago by Ivan Kozik <[email protected]>
Fix docstring: status_reason -> reason

9844e92994cc96f6d2765753f183c84ef34aa376 authored about 6 years ago by Ivan Kozik <[email protected]>
Bump version 3.0.2

2968734ee68699e57350f39c55971b62749813b8 authored about 6 years ago by Ivan Kozik <[email protected]>
Require last-working tornado version

8fabca4e60c38bd1ddb328731e01728c7bc066a6 authored about 6 years ago by Ivan Kozik <[email protected]>
Bump version 3.0.1

bd5f00036703d352f2f652c18718fd77ee9c2b9d authored about 6 years ago by Ivan Kozik <[email protected]>
Use newer dnspython

1fad32f8be5155ea81feed0af12c937c8fb9fbfd authored about 6 years ago by Ivan Kozik <[email protected]>
Bump version 3.0.0

4c043dfe9180c4976eb0c1836ecc43a66e6a734a authored about 6 years ago by Ivan Kozik <[email protected]>
Remove support for --convert-links and --backup-converter

I didn't want to fix these for lxml trees, and neither grab-site or ArchiveBot
use these options...

1303c3c24b2d5d5996cc096b78aa7434a53df5fa authored about 6 years ago by Ivan Kozik <[email protected]>
Use html5-parser to parse HTML/XHTML and lxml to parse XML; remove support for html5lib and --html-parser

html5-parser is much faster than html5lib.

Bump minimum Python version to 3.7.0.

Monkeypatch u...

71ec52061e24c4893200240933939d5ce0dacd39 authored about 6 years ago by Ivan Kozik <[email protected]>
Remove phantomjs support; suggested alternative is to use a browser-based crawler

ac77c5cbcbd9e4e9f65ca8fdff375a56135bcd8a authored about 6 years ago by Ivan Kozik <[email protected]>
Rename wpull.testing.async module for Python 3.7

e1c35bf62cdc704b7144f9e65029668c6fd754fd authored about 6 years ago by Ivan Kozik <[email protected]>