Alternative Scraping tools and utilities for Python
transistor
Github stargazers
213
Github forks
21
Commits
198
Code contributors Contributors
2
Transistor, a Python web scraping framework for intelligent use cases.
Created
Nov. 12, 2018
Updated
Aug. 16, 2020
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
246
SouqScraper
Github stargazers
212
Github forks
166
Commits
36
Code contributors Contributors
4
Simple scripts for Level UP your scraping Skills, and source code for Level UP playlist on Youtube
Created
April 11, 2019
Updated
March 17, 2024
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
4
Homepage
scrapelib
Github stargazers
208
Github forks
43
Commits
522
Code contributors Contributors
14
⛏ a library for scraping unreliable pages
Created
July 6, 2010
Updated
Aug. 20, 2024
License
bsd-2-clause
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
7
first-web-scraper
Github stargazers
205
Github forks
166
Commits
220
Code contributors Contributors
11
A step-by-step guide to writing a web scraper with Python
Created
Oct. 2, 2013
Updated
July 15, 2024
License
gpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
3
Homepage
first-web-scraper
Github stargazers
205
Github forks
166
Commits
220
Code contributors Contributors
11
A step-by-step guide to writing a web scraper with Python
Created
Oct. 2, 2013
Updated
July 15, 2024
License
gpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
3
Homepage
ebayMarketAnalyzer
Github stargazers
200
Github forks
26
Commits
147
Code contributors Contributors
2
Scrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and extract to excel
Created
Dec. 2, 2020
Updated
Aug. 22, 2021
License
gpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
31
ScrapPY
Github stargazers
196
Github forks
19
Commits
37
Code contributors Contributors
3
ScrapPY is a Python utility for scraping manuals, documents, and other sensitive PDFs to generate wordlists that can be utilized by offensive security tools to perform brute force, forced browsing, and dictionary attacks against targets. The tool dives deep to discover keywords and phrases leading to potential passwords or hidden directories.
Created
Nov. 4, 2022
Updated
Aug. 7, 2024
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
1
atp-world-tour-tennis-data
Github stargazers
192
Github forks
107
Commits
798
Code contributors Contributors
6
Using Python to scrape ATP World Tour tennis data
Created
April 9, 2013
Updated
Aug. 28, 2023
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Python
Issues
6
blinkist-scraper
Github stargazers
191
Github forks
36
Commits
121
Code contributors Contributors
9
📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
Created
March 30, 2020
Updated
May 8, 2021
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
16
FBMessageScraper
Read-only repository, archived by owner Archived
Github stargazers
189
Github forks
42
Commits
5
Code contributors Contributors
1
A python script to download facebook chats
Created
Sept. 29, 2014
Updated
Sept. 29, 2014
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Python
Issues
11
linkedin-jobs-scraper
Github stargazers
187
Github forks
78
Commits
50
Code contributors Contributors
2
Scrape LinkedIn job postings using Selenium WebDriver with python bindings
Created
July 29, 2016
Updated
Dec. 10, 2016
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
2
funda-scraper
Github stargazers
186
Github forks
85
Commits
11
Code contributors Contributors
1
Scraper of the Dutch real estate website www.funda.nl, implemented in Python with Scrapy
Created
July 18, 2016
Updated
April 7, 2017
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
9
crawley
Github stargazers
186
Github forks
33
Commits
595
Code contributors Contributors
5
Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.
Created
Sept. 7, 2011
Updated
June 12, 2015
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
10
tweeds
Read-only repository, archived by owner Archived
Github stargazers
181
Github forks
32
Commits
21
Code contributors Contributors
1
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a Tweets and more while evading most API limitations.
Created
Dec. 10, 2022
Updated
April 26, 2023
License
mit
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
2
Homepage
news-fetch
Github stargazers
179
Github forks
110
Commits
31
Code contributors Contributors
4
A Python Package which helps to scrape all news details from any news websites
Created
June 29, 2019
Updated
Jan. 29, 2021
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
9
AO3Scraper
Github stargazers
173
Github forks
55
Commits
116
Code contributors Contributors
8
A Python scraper for getting fan fiction content and metadata from Archive of Our Own.
Created
Sept. 23, 2016
Updated
Sept. 20, 2022
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
6
scraperwiki-python
Github stargazers
160
Github forks
69
Commits
389
Code contributors Contributors
18
ScraperWiki Python library for scraping and saving data
Created
April 17, 2012
Updated
Aug. 8, 2022
License
bsd-2-clause
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
13
languagepod101-scraper
Github stargazers
153
Github forks
24
Commits
34
Code contributors Contributors
6
Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
Created
Oct. 31, 2020
Updated
Oct. 14, 2022
License
mit
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Python
Issues
5
Hockey-Scraper
Github stargazers
141
Github forks
45
Commits
135
Code contributors Contributors
8
Python Package for scraping NHL Play-by-Play and Shift data
Created
June 23, 2017
Updated
June 26, 2024
License
gpl-3.0
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
5
Instagram-Bot-Scrape-DM-Users
Github stargazers
136
Github forks
19
Commits
13
Code contributors Contributors
1
This Python Bot Scrape Instagram User Followers & Send DM to the Scraped Users with Account Switcher Feature.
Created
Sept. 17, 2022
Updated
Feb. 3, 2023
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
12