Top alternative scraping utilities for Nodejs
Updated :
June 3, 2023
cheerio
Github stargazers
26444
Github forks
1
Commits
2584
Code contributors Contributors
133
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
Created
Oct. 9, 2011
Updated
June 2, 2023
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
TypeScript
Issues
14
jsdom
Github stargazers
18854
Github forks
1
Commits
3616
Code contributors Contributors
311
A JavaScript implementation of various web standards, for use with Node.js
Created
Jan. 19, 2010
Updated
May 27, 2023
License
mit
Github repo
Type
Module/library
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
453
apify-js
Github stargazers
8384
Github forks
369
Commits
3668
Code contributors Contributors
67
Crawleeโ€”A web scraping and browser automation library for Node.js that helps you build reliable crawlers. Fast.
Created
Aug. 26, 2016
Updated
June 2, 2023
License
apache-2.0
Github repo
Type
Cli
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
TypeScript
Issues
104
Homepage
scrape-it
Github stargazers
3922
Github forks
234
Commits
197
Code contributors Contributors
19
๐Ÿ”ฎ A Node.js scraper for humans.
Created
April 28, 2016
Updated
March 19, 2023
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
JavaScript
Issues
1
google-play-scraper
Github stargazers
1976
Github forks
585
Commits
454
Code contributors Contributors
61
Node.js scraper to get data from Google Play
Created
April 7, 2015
Updated
April 12, 2023
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
JavaScript
Issues
91
node-website-scraper
Github stargazers
1372
Github forks
255
Commits
474
Code contributors Contributors
16
Download website to local directory (including all css, images, js, etc.)
Created
Sept. 4, 2014
Updated
May 20, 2023
License
mit
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
6
Homepage
noodle
Github stargazers
740
Github forks
72
Commits
531
Code contributors Contributors
15
A node server and module which allows for cross-domain page scraping on web documents with JSONP or POST.
Created
June 29, 2012
Updated
June 13, 2018
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
14
Homepage
node-scraper
Github stargazers
519
Github forks
66
Commits
13
Code contributors Contributors
1
Easier web scraping using node.js and jQuery
Created
Dec. 5, 2010
Updated
May 31, 2011
License
mit
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
22
openGraphScraper
Github stargazers
481
Github forks
87
Commits
918
Code contributors Contributors
31
Node.js scraper service for Open Graph Info and More!
Created
Sept. 1, 2013
Updated
May 8, 2023
License
mit
Github repo
Type
Module/library
Platform
Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
3
webster
Github stargazers
456
Github forks
58
Commits
118
Code contributors Contributors
2
a reliable high-level web crawling & scraping framework for Node.js.
Created
Nov. 4, 2017
Updated
Feb. 11, 2023
License
gpl-3.0
Github repo
Type
App
Primary Language, based on Github DataLanguage
JavaScript
Issues
1
node-google
Github stargazers
448
Github forks
124
Commits
96
Code contributors Contributors
15
A Node.js module to search and scrape Google.
Created
July 10, 2012
Updated
Sept. 20, 2016
License
mit
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
30
micro-open-graph
Read-only repository, archived by owner Archived
Github stargazers
381
Github forks
51
Commits
44
Code contributors Contributors
7
A tiny Node.js microservice to scrape open graph data with joy.
Created
Feb. 21, 2017
Updated
Feb. 11, 2019
License
mit
Github repo
Primary Language, based on Github DataLanguage
JavaScript
Issues
3
node-readability
Github stargazers
337
Github forks
35
Commits
261
Code contributors Contributors
3
Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.
Created
May 10, 2014
Updated
Aug. 1, 2018
Github repo
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
7
yakuza
Github stargazers
295
Github forks
29
Commits
383
Code contributors Contributors
6
Highly scalable Node.js scraping framework for mobsters
Created
Sept. 30, 2014
Updated
Sept. 30, 2015
Github repo
Type
Tool/utility
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
10
synth-secrets
Github stargazers
282
Github forks
35
Commits
5
Code contributors Contributors
1
Screen-scraped articles on subtractive synthesis (using Node.js)
Created
Dec. 10, 2014
Updated
Dec. 10, 2014
Github repo
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
1
node-web-scraper
Github stargazers
271
Github forks
200
Commits
13
Code contributors Contributors
4
Code for the tutorial: Scraping the Web With Node.js by @kukicado
Created
March 13, 2014
Updated
Jan. 10, 2018
Github repo
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
6
Homepage
link-preview-generator
Github stargazers
231
Github forks
63
Commits
32
Code contributors Contributors
8
Get preview data (a title, description, image, domain name) from a url. Library uses puppeteer headless browser to scrape the web site.
Created
Nov. 12, 2019
Updated
March 3, 2022
License
mit
Github repo
Primary Language, based on Github DataLanguage
JavaScript
Issues
7
nutella-scrape
Github stargazers
209
Github forks
12
Commits
20
Code contributors Contributors
2
:chocolate_bar: learn to scrape the web with Node.js -- it tastes like chocolate
Created
Aug. 14, 2015
Updated
Sept. 11, 2015
Github repo
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
1
siphon
Github stargazers
96
Github forks
8
Commits
280
Code contributors Contributors
3
First distributed web scraping library for Node.js
Created
Nov. 18, 2016
Updated
July 31, 2017
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
1
kijiji-scraper
Github stargazers
84
Github forks
40
Commits
77
Code contributors Contributors
9
A lightweight node.js module for retrieving and scraping ads from Kijiji
Created
Feb. 27, 2015
Updated
Feb. 12, 2023
License
mit
Github repo
Type
Module/library
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
TypeScript
Issues
5