An open API service for software projects hosted on Open Collective.

github.com/stashapp/CommunityScrapers

This is a public repository containing scrapers created by the Stash Community.
https://github.com/stashapp/CommunityScrapers

Fix image selector for PinupDollars

971cfe8d168aa9c5ad680e7f9d009c6e44de24f7 authored over 1 year ago
Fix URL selectors for ManyVids performers

402048049edeceee4e1a10cc9043c7e08d530284 authored over 1 year ago
Scrape full old-style performer URL from ManyVids

d7b3eec9f69455854d5b2c0fa38e5f1aa945bbdd authored over 1 year ago
Fix image scraping for older scenes on naughtyalysha.com

They hide some of these video elements in Javascript but we can solve anything with regexp

0e493f3ed87e6e099876e29590f6ba82834ae726 authored over 1 year ago
Add scraper for naughtyalysha.com

c9ddd800d189d92d7e533057341d82bcd446ae2c authored over 1 year ago
Remove duplicate scraper from KimHolland

Also move the file to the base directory as it has no dependencies

49c6d704768951b0e48303f5351fee03b1cb43fc authored over 1 year ago
Add sceneByFragment scraper for KimHolland (#1679)

* Add sceneQueryScraper

* Rename scraper

* Search based on filename instead of title

--------...

d34b2f76b8ff3dfbf5e26168c92089d8236c5e71 authored over 1 year ago
Update Mylf.yml (#1677)

Expanded site mapping to include more sub studios and to better match StashDB naming.

24ba667312f19f7cbe5b8e22534d99ab0d96f2e8 authored over 1 year ago
Name was including "identifies as ...", updated regex to remove. (#1676)

6d37c8800a3fabe88551c7c84d279f8346d07b7a authored over 1 year ago
Add Kim Holland scraper (#1675)

Co-authored-by: Philippe.Cambien <philippe.cambien@macadam.eu>

239e61f2bc4f7cb7501d25ef8b0080e0ae16b61c authored over 1 year ago
Scrape high-res images from TeamSkeet

eff9f1a5e4ab83eec832195e896d6e62a819f782 authored over 1 year ago
Adult empire single scene rather than entire movie (#1664)

24e992f509adaf60f6b006c212a78dfdb91bb31f authored over 1 year ago
fix/improve RealityLovers (#1674)

* fix/improve RealityLovers

* fetch largest cover image

* use ordered logic for cover qual...

e531e878df5595b76debceb26bd7648d51fd39b2 authored over 1 year ago
Update GIGA.yml (#1673)

Akiba-web new release layout differs from very old ones which breaks image matching during manua...

86bbf3bd7797c801cc705539c507b36b324231ca authored over 1 year ago
fix: adjust Arx scraper to updated graphql API (#1672)

* adjust to updated graphql API

* scrape studio code

* move introspection to pycommon

1bd91304008341fc8ebd2c772aa17afe0aad5e3e authored over 1 year ago
Fix title scraping for Pornworld

6d7901e005d7239bec7b5fc1321fd9c97517af15 authored over 1 year ago
Improve fragment scraping with ManyVids

Instead of requiring that the filename is _exactly_ the video ID and nothing else
we will now lo...

711adda4eccfc018f5f83656c09b2594fa07965b authored over 1 year ago
Adding a scraper for my site timestamp.trade (#1670)

Co-authored-by: Tweeticoats <Tweeticoats@github.com>

2d93c14a0cbc859b72e7bd346b5382158ac5350d authored over 1 year ago
Add drdaddypov.com, hotwifefun.com & hotwivescheating.com (#1669)

b160ae6a815c01ec35c609aef4043d5dec2e7136 authored over 1 year ago
Make URL patterns less strict for ManyVids

62bbd65f954c9d40d11139196425ead1c9c5c629 authored over 1 year ago
Fix date parsing for AdultPrime

3d8edf6af88de1d753d70a5687584583a2a858f5 authored over 1 year ago
Update AMAMultimedia for new site layout

5a0a37a54a366b80959fdc1836bc50efe09af38a authored over 1 year ago
add scrape of numerical codes from Wtfpass sites (#1667)

d214b2a4899911e4f3f3d7c66b5259460bae3901 authored over 1 year ago
Move Blowpass Sites to Algolia + Add usepov.com to PaperStreetMedia.yml (#1666)

f98f91bdbd20f8099062d1de6d39b95af6d07a46 authored over 1 year ago
Add URLs to ModelCentro (#1665)

Add Gina Gerson and Lonely Meow to ModelCentro URL list.

1c0b7adaebfb2846cb35e9551219325f810041a6 authored over 1 year ago
Clarify Python installation instructions in README

87a608da6a9ac5948b666c9d476dc75ef7d7a9bb authored over 1 year ago
Update README with info on the scraper manager

Also expands on how to set up Python for scrapers

ac97191a849ddf5dc46331a980007ef391a55308 authored over 1 year ago
Fix Date selector for AdultPrime

f945397b775b39108703051e1fe323d94b0846cb authored over 1 year ago
Fix TheScoreGroup

They removed the 'Download trailer' button but we can still get the URL from
the preview video e...

c83688396840e1a0ad0cba8981730cd34cba2f47 authored over 1 year ago
Use model name as studio name from ManyVids

d510a7fd24d0955462eed6ffbbf3369a7d831305 authored over 1 year ago
Canonicalize scraped URL in ManyVids

09c54403a0f7cdd0bd06f1965ae9b255af7202f5 authored over 1 year ago
Update ManyVids Scraper (#1660)

Replace Python-based ManyVids scraper with a pure JSON scraper

---------

Co-authored-by: M...

161aadd9165d9b6adab33cd1c4230be1eb1dcb70 authored over 1 year ago
Fix marker scraping in AyloAPI, part 2

0a8b75bd0d7dc5a3c0210c073a603d383707afa2 authored over 1 year ago
Fix marker scraping in AyloAPI

cb87d4864c7c5cf65cf4b8a3bcec95ff72eca871 authored over 1 year ago
Improved Tag extraction specifically for pornmegaload.com but did not see detriment to other sites. (#1663)

712b9670888b8ad65e88c0376821a9e7229b9b16 authored over 1 year ago
Add scraper for InterracialVision.com

c352ffd61d279c90c47874fa3245edd70472a76a authored over 1 year ago
Add helper util for Python scrapers to guess performer nationality

This functionality showed up separately in IAFD, KBProductions and AyloAPI

2c495b192a221cfb0b8fa72adbdafe7acbe056fb authored over 1 year ago
fixup: forgot the TLD for Guy Selector

5f64fd849891c86735cc810fbd97e27f364bffc3 authored over 1 year ago
Add Guy Selector to GayWire

d42900e8578f32da28ed873f4d110173f07a200e authored over 1 year ago
Also accept old-style URLs for Rule34Video

This is for compatability only, users will get the new canonical URL as part of the scrape

04bb84eddc8611b532392831626697ace3c6a019 authored over 1 year ago
Update URL pattern for Rule34Video

bc1258a368164584161f080d258ce13b401084ce authored over 1 year ago
Scrape performer URLs when scraping scenes

3a232b5bf1d1ea4d00fb3ef76eaea25788b2c3d5 authored over 1 year ago
Logging fix and fix for www-prefix URL's (#1659)

caf5a3b0442e898ff4d821d69e81b0889197ec7e authored over 1 year ago
Update MissaX to compensate for broken HTML

Their backend generates errors when parsing scene duration which affected our parsing of the dat...

423b1c085fcefbce685ccb7c516a56f93f06f71c authored over 1 year ago
Add sceneByName for xhamster (#1655)

4a92a4fde2dac5af22af7765d02e8d1c2b55ad25 authored over 1 year ago
Update Giga scraper (#1656)

Added fixed studio name
Sorted field names alphabetically

8d909017f8c11038b39053d76bea06b5dda45039 authored over 1 year ago
Improve czechav scraper (#1654)

* Improved czechav scraper: fixed title fetching, added date and performer scraping

* Updated...

a0657ddf9cd788f10ec7b6fbd7b57c3bdbab77d5 authored over 1 year ago
Fix studio name for FreakMob Media

7a1647ed5821437a63bc2a667857349c856293d1 authored over 1 year ago
Add CreampieThais to KBProductions

e272cdc86dff9e29ac96d6d9ad0babe4cfd86341 authored over 1 year ago
Add AsianStreetMeat

0edd16eb3b0591fda4789434ec6ff9bc5e7e115f authored over 1 year ago
Move WhisparrWDTV to its own folder

03687fb0c48fa9e9f66147afdaf2d38c10c5b42e authored over 1 year ago
Scraper for WDTV xml files generated by Whisparr (#1357)

* Add whispar wdtv metadata scraper

* Remove redundant code

* Update comments

* Update ...

8ca577c609aae641c72dc25f19e0744bfeb18835 authored over 1 year ago
Update MetroHD

Scrape Family Hookups instead of Family Hook Ups

64a5a6664e8286681768414aa2ba588eb460b05e authored over 1 year ago
Add InTheCrack: thanks to mustash for this

e7bb8ae19efdae5c3e3244954fac1095de113944 authored over 1 year ago
Update XSinsVR: thanks to mustash for this

7653d40a0aa177621de024af41d349e45a20c492 authored over 1 year ago
Update VRHush: thanks to mustash for this

02f1bd20b97e476692891a86839bbceb33dd0c1b authored over 1 year ago
Add Nick Marxx to KBProductions

41e0bfb828832ffa79f15e23b11cd5f4f4934f29 authored over 1 year ago
Added GIGA scraper (#1651)

* Added GIGA scraper

Added scraper for GIGA studio site ( www.akiba-web.com )

* Added miss...

6b0a66cb29ac230746ff94c916834a414d2bba8e authored over 1 year ago
Fix image scraping for SexLikeReal again

2c7bbd3388aeb4db8f03e2bae69278ea648ff278 authored over 1 year ago
Tiny fixes to GEVI

b09cca5dc76677e38a5bcec539b1fda0b07fb031 authored over 1 year ago
Add GEVI scraper

Handles performers, movies and episodes (scenes)

93365541bc7da19dea60d06147da14fe86561dbe authored over 1 year ago
Added sceneByFragment to PMVHaven scraper (#1649)

* Extended PMVHaven.py functionality

- Added main function
- Added function for sceneByFragm...

e76e43e5db933ef82ca5266235ecfb0f396fc055 authored over 1 year ago
Use Director as Photographer when scraping gallery with SARJ-LLC

155530c222384ae03d236127a15580310cd311fe authored over 1 year ago
Slightly improve error handling in py_common config parsing and graphql

ff3807ade11b023d679ff72b7b5789b12dbade42 authored over 1 year ago
Fix inverted condition in Men scraper that caused most scenes to have TwinkPop URLs

68b9096c88d2e003f665d21e8b851a2eb134f32c authored over 1 year ago
Adding initial support for a wikidata based scraper. (#673)

* Adding initial support for a wikidata based scraper.
wikidata is the database behind wikipedi...

96698c06cdc0b7bf9b27d02475d6cc2e6635629d authored over 1 year ago
fix path finding in json response (#1647)

Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>

b360649e4f5f5433dc309d3cd8735d716c05ec81 authored over 1 year ago
Add Mars Media network URL scene scraper (#1648)

* Add Mars Media sites

Add sites for Mars Media network scene scraper

* Update and rename ...

0dcef087517806b168b88498c7a5fe6eeac20882 authored over 1 year ago
Fix theassfactory.com

They've moved over to the new site format as seen on julesjordan.com

6232721725df96c7bb8eec54bebbbfa815f39a94 authored over 1 year ago
Update SCRAPERS-LIST for KBProductions

e02c2ba0bbd469db26aed546750146e040e05b94 authored over 1 year ago
Consolidate even more sites into KBProductions

06afc9009a026b7f13ee2b20cdee2d26db426299 authored over 1 year ago
Add URL scene scraper for Cutler's Den (cutlersden.com) (#1645)

* Add Cutler's Den scraper

Adding URL scene scraper for Cutler's Den (cutlersden.com).

* N...

855ccb9f9ba2940a5d69a7f816b2d54bd18e2c66 authored over 1 year ago
Updated Pure Media scraper (#1646)

f007987600b255cfd61a71288755277319b7cf6b authored over 1 year ago
Remove redundant SCRAPERS-LIST

809551f35a8c7a634772f6581deee62a52fea7d2 authored over 1 year ago
Rewrite KBProductions

c693970441715af5f71fc53d6f69f0690f30aa1b authored over 1 year ago
Merge TopWebModels scraper into KBProductions scraper. (#1637)

* Merge TopWebModels to KBProductions

Modified URL regex to include those used by TopWebModel...

16f38d9ea2e8cfa7c185fbc4e4ffda481516bdd9 authored over 1 year ago
Add tag mapping for 'Athletic Woman' to AyloAPI

0be6d71b788b4cc71b80f405d265a562222b7db5 authored over 1 year ago
Better handling for deleted ManyVids scenes

3cebfb77be70ee2fe9ad3237a99cf1ffc1b4eddb authored over 1 year ago
URL scene scraper for Bring Me A Boy (bringmeaboy.com) (#1644)

* Add URL for BringMeABoy scraper

Adding URL for BringMeABoy URL scene scraper

* Add Bring...

679ca236928d8ede8970765c3868f9fbb43d3a10 authored over 1 year ago
Scrape Series as part of scraping Scenes in Aylo API

7af00cf7e49a722e6716c44b47cf30ec972b9b8d authored over 1 year ago
Fallback to use poster as cover for Aylo API series

675e0e236c536a0b99b8a1262035c0f7fd817233 authored over 1 year ago
Fix scraping Series as Movies from Aylo API

b5587f36d8aa20517e0b643f93b1039d5e171845 authored over 1 year ago
Aylo API scrapers (#1641)

* Add new AyloAPI scrapers
* Remove old MindGeek scrapers deprecated by AyloAPI
* Update SCRAP...

c2abfdf6d877e06743e38980a954879745918c90 authored over 1 year ago
Add scraper for spritzz.com (#1642)

* Add spritzz.com to URL list

Added spritzz.com to URL list for new scraper.

* Add Spritzz...

4ff3ed60154eb94a246b9c6c34482be2581217ae authored over 1 year ago
Updated YesGirlz scraper (#1640)

9e5a039971f605399eb7006311ea341039fa0bda authored over 1 year ago
Fix filenames for Jellyfin scraper

c76b223fb82cd8b1d209878cabcd2505ec9e38d5 authored over 1 year ago
Update Boobpedia to fix search

They updated their MediaWiki and search layout changed: this uses the API search instead which s...

363500851ef7cf907099124eeb62a9d5cbbd5e1a authored over 1 year ago
Add lezbebad to Adultime (#1639)

URL lezbebad.com/en/video/ added.

e2b4ebbbbed15065dfde3a93e529bebc0281327d authored over 1 year ago
Rewrite Jellyfin scraper

0d4d844c2cc235fcd79e514c510ec1e74ca32a79 authored over 1 year ago
Fix renamed tag key (#1638)

27bbd11453ecec7c7f12c7d7c0ae7ba39063bbf8 authored over 1 year ago
Gallery to image Gallery scraper (#1636)

* copy to images scraper

* Add studio code and photographer to py_common

* Change default ...

f3a4555d1084c3710fda37b4fc7444f093fa8f09 authored over 1 year ago
Updated Nympho scraper (#1634)

5cafac0328e1978fc9a46c2524edbe7da838b22c authored over 1 year ago
Add Photographer to gallery scraper in We Are Hairy: thanks @echo6ix

86bbabcafa6aec8eba59bfadf4a6ec84930d7b3f authored over 1 year ago
Issue #1632 - Plugin CopyToGallery (#1633)

As described in Issue #1632 plugin CopyToGallery as unexpected behavior when scenes do not have ...

96649fb84200061cc50dafa98adbddf280ad9d7f authored over 1 year ago
Updated Wake Up n Fuck Scraper (#1630)

04a0254b2b265c23a073ec0dbad807ae6a65ae99 authored over 1 year ago
Update KBProductions scraper to include LucidFlix (#1631)

* Update SCRAPERS-LIST.md for lucidflix.com

Adding lucidflix.com to KBProductions scraper.

...

f3f94fa94ded15cb31438e8adaa1a107afd64931 authored over 1 year ago
Scrape slightly higher res image from Woodman

e67d76bd0a7f40a7e0e3a44a64106165d4b0f70c authored over 1 year ago
[EmilyBloom.com] Add URL scraper (#1628)

* [EmilyBloom.com] Add URL scraper

* Add it to the SCRAPERS-LIST.md

02f73a6fd5dfbe0eedfa0b1a8bb1a8f4e9f5afd2 authored over 1 year ago
Update Hentaied.yml (#1627)

*Another* subsite added - Plants Vs Cunts

66badbccabbb4cc554519f5316c2c0beecc1cbe9 authored over 1 year ago
Small Fix for TopWebModels (#1626)

* CommunityScrapers

* CommunityScrapers

b2b413cca7344567f5d507f1de2a836f2e22bb86 authored over 1 year ago