Alternative Scraping tools and utilities for Python
Updated :
April 15, 2024
TheScrapper
Github stargazers
204
Github forks
44
Commits
43
Code contributors Contributors
5
Scrape emails, phone numbers and social media accounts from a website.
Created
May 7, 2021
Updated
Sept. 19, 2023
License
gpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
2
instagram-follower-scraper
Github stargazers
202
Github forks
81
Commits
25
Code contributors Contributors
7
A python script that can automatically scrape other people followers on instagram and save them in a txt file.
Created
Jan. 6, 2021
Updated
March 5, 2024
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
19
first-web-scraper
Github stargazers
202
Github forks
164
Commits
220
Code contributors Contributors
10
A step-by-step guide to writing a web scraper with Python
Created
Oct. 2, 2013
Updated
Feb. 10, 2024
License
gpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
4
Homepage
scrapelib
Github stargazers
202
Github forks
43
Commits
522
Code contributors Contributors
14
⛏ a library for scraping unreliable pages
Created
July 6, 2010
Updated
March 2, 2024
License
bsd-2-clause
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
3
first-web-scraper
Github stargazers
202
Github forks
164
Commits
220
Code contributors Contributors
10
A step-by-step guide to writing a web scraper with Python
Created
Oct. 2, 2013
Updated
Feb. 10, 2024
License
gpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
4
Homepage
scrapeOP
Github stargazers
191
Github forks
74
Commits
169
Code contributors Contributors
1
A python package for scraping oddsportal.com
Created
July 9, 2020
Updated
Feb. 26, 2023
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
11
FBMessageScraper
Read-only repository, archived by owner Archived
Github stargazers
186
Github forks
44
Commits
5
Code contributors Contributors
1
A python script to download facebook chats
Created
Sept. 29, 2014
Updated
Sept. 29, 2014
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Python
Issues
11
atp-world-tour-tennis-data
Github stargazers
184
Github forks
103
Commits
798
Code contributors Contributors
6
Using Python to scrape ATP World Tour tennis data
Created
April 9, 2013
Updated
Aug. 28, 2023
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Python
Issues
5
ebayMarketAnalyzer
Github stargazers
183
Github forks
24
Commits
147
Code contributors Contributors
2
Scrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and extract to excel
Created
Dec. 2, 2020
Updated
Aug. 22, 2021
License
gpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
31
blinkist-scraper
Github stargazers
182
Github forks
33
Commits
121
Code contributors Contributors
9
📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
Created
March 30, 2020
Updated
May 8, 2021
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
16
crawley
Github stargazers
182
Github forks
33
Commits
595
Code contributors Contributors
5
Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.
Created
Sept. 7, 2011
Updated
June 12, 2015
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
10
ScrapPY
Github stargazers
182
Github forks
19
Commits
37
Code contributors Contributors
3
ScrapPY is a Python utility for scraping manuals, documents, and other sensitive PDFs to generate wordlists that can be utilized by offensive security tools to perform brute force, forced browsing, and dictionary attacks against targets. The tool dives deep to discover keywords and phrases leading to potential passwords or hidden directories.
Created
Nov. 4, 2022
Updated
April 12, 2024
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
1
linkedin-jobs-scraper
Github stargazers
181
Github forks
77
Commits
50
Code contributors Contributors
2
Scrape LinkedIn job postings using Selenium WebDriver with python bindings
Created
July 29, 2016
Updated
Dec. 10, 2016
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
2
funda-scraper
Github stargazers
176
Github forks
84
Commits
11
Code contributors Contributors
1
Scraper of the Dutch real estate website www.funda.nl, implemented in Python with Scrapy
Created
July 18, 2016
Updated
April 7, 2017
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
9
tweeds
Read-only repository, archived by owner Archived
Github stargazers
175
Github forks
33
Commits
21
Code contributors Contributors
1
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a Tweets and more while evading most API limitations.
Created
Dec. 10, 2022
Updated
April 26, 2023
License
mit
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
2
Homepage
news-fetch
Github stargazers
166
Github forks
109
Commits
31
Code contributors Contributors
4
A Python Package which helps to scrape all news details from any news websites
Created
June 29, 2019
Updated
Jan. 29, 2021
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
9
AO3Scraper
Github stargazers
162
Github forks
53
Commits
116
Code contributors Contributors
8
A Python scraper for getting fan fiction content and metadata from Archive of Our Own.
Created
Sept. 23, 2016
Updated
Sept. 20, 2022
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
6
scraperwiki-python
Github stargazers
160
Github forks
69
Commits
389
Code contributors Contributors
18
ScraperWiki Python library for scraping and saving data
Created
April 17, 2012
Updated
Aug. 8, 2022
License
bsd-2-clause
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
13
languagepod101-scraper
Github stargazers
146
Github forks
24
Commits
34
Code contributors Contributors
6
Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
Created
Oct. 31, 2020
Updated
Oct. 14, 2022
License
mit
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Python
Issues
3
Hockey-Scraper
Github stargazers
135
Github forks
41
Commits
135
Code contributors Contributors
7
Python Package for scraping NHL Play-by-Play and Shift data
Created
June 23, 2017
Updated
Jan. 3, 2024
License
gpl-3.0
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
5