Top alternative scraping utilities for Nodejs
Updated :
May 25, 2022
cheerio
Github stargazers
25079
Github forks
1
Commits
1971
Code contributors Contributors
126
Fast, flexible, and lean implementation of core jQuery designed specifically for the server.
Created
Oct. 9, 2011
Updated
May 25, 2022
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
TypeScript
Issues
12
jsdom
Github stargazers
17409
Github forks
1496
Commits
3544
Code contributors Contributors
291
A JavaScript implementation of various web standards, for use with Node.js
Created
Jan. 19, 2010
Updated
April 24, 2022
License
mit
Github repo
Type
Module/library
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
445
scrape-it
Github stargazers
3835
Github forks
225
Commits
183
Code contributors Contributors
18
🔮 A Node.js scraper for humans.
Created
April 28, 2016
Updated
Feb. 21, 2022
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
JavaScript
Issues
9
apify-js
Github stargazers
3338
Github forks
236
Commits
3067
Code contributors Contributors
44
Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Created
Aug. 26, 2016
Updated
May 17, 2022
License
apache-2.0
Github repo
Type
Cli
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
72
google-play-scraper
Github stargazers
1699
Github forks
493
Commits
427
Code contributors Contributors
49
Node.js scraper to get data from Google Play
Created
April 7, 2015
Updated
April 15, 2022
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
JavaScript
Issues
62
node-website-scraper
Github stargazers
1125
Github forks
228
Commits
442
Code contributors Contributors
14
Download website to local directory (including all css, images, js, etc.)
Created
Sept. 4, 2014
Updated
May 10, 2022
License
mit
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
10
Homepage
noodle
Github stargazers
739
Github forks
72
Commits
531
Code contributors Contributors
15
A node server and module which allows for cross-domain page scraping on web documents with JSONP or POST.
Created
June 29, 2012
Updated
June 13, 2018
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
14
Homepage
node-scraper
Github stargazers
519
Github forks
66
Commits
13
Code contributors Contributors
1
Easier web scraping using node.js and jQuery
Created
Dec. 5, 2010
Updated
May 31, 2011
License
mit
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
22
node-google
Github stargazers
444
Github forks
123
Commits
96
Code contributors Contributors
15
A Node.js module to search and scrape Google.
Created
July 10, 2012
Updated
Sept. 20, 2016
License
mit
Github repo
Type
Module/library
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
30
webster
Github stargazers
386
Github forks
49
Commits
113
Code contributors Contributors
2
a reliable high-level web crawling & scraping framework for Node.js.
Created
Nov. 4, 2017
Updated
Feb. 11, 2022
License
gpl-3.0
Github repo
Type
App
Primary Language, based on Github DataLanguage
JavaScript
Issues
3
micro-open-graph
Github stargazers
377
Github forks
52
Commits
44
Code contributors Contributors
7
A tiny Node.js microservice to scrape open graph data with joy.
Created
Feb. 21, 2017
Updated
Feb. 11, 2019
License
mit
Github repo
Primary Language, based on Github DataLanguage
JavaScript
Issues
3
openGraphScraper
Github stargazers
362
Github forks
74
Commits
625
Code contributors Contributors
29
Node.js scraper service for Open Graph Info and More!
Created
Sept. 1, 2013
Updated
Nov. 15, 2021
License
mit
Github repo
Type
Module/library
Platform
Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
2
node-readability
Github stargazers
327
Github forks
33
Commits
261
Code contributors Contributors
3
Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.
Created
May 10, 2014
Updated
Aug. 1, 2018
Github repo
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
7
yakuza
Github stargazers
295
Github forks
27
Commits
383
Code contributors Contributors
6
Highly scalable Node.js scraping framework for mobsters
Created
Sept. 30, 2014
Updated
Sept. 30, 2015
Github repo
Type
Tool/utility
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
10
node-web-scraper
Github stargazers
270
Github forks
206
Commits
13
Code contributors Contributors
4
Code for the tutorial: Scraping the Web With Node.js by @kukicado
Created
March 13, 2014
Updated
Jan. 10, 2018
Github repo
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
6
Homepage
synth-secrets
Github stargazers
267
Github forks
33
Commits
5
Code contributors Contributors
1
Screen-scraped articles on subtractive synthesis (using Node.js)
Created
Dec. 10, 2014
Updated
Dec. 10, 2014
Github repo
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
1
nutella-scrape
Github stargazers
209
Github forks
12
Commits
20
Code contributors Contributors
2
:chocolate_bar: learn to scrape the web with Node.js -- it tastes like chocolate
Created
Aug. 14, 2015
Updated
Sept. 11, 2015
Github repo
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
1
link-preview-generator
Github stargazers
184
Github forks
58
Commits
32
Code contributors Contributors
8
Get preview data (a title, description, image, domain name) from a url. Library uses puppeteer headless browser to scrape the web site.
Created
Nov. 12, 2019
Updated
March 3, 2022
License
mit
Github repo
Primary Language, based on Github DataLanguage
JavaScript
Issues
4
node-scrapy
Github stargazers
144
Github forks
25
Commits
185
Code contributors Contributors
5
Simple, lightweight and expressive web scraping with Node.js
Created
June 4, 2014
Updated
Aug. 24, 2020
License
mit
Github repo
Platform
Node.js
Primary Language, based on Github DataLanguage
JavaScript
Issues
3
Humanoid
Github stargazers
136
Github forks
18
Commits
29
Code contributors Contributors
1
Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Created
Oct. 20, 2018
Updated
Aug. 11, 2020
License
mit
Github repo
Type
Module/library
Platform
Node.js, Browser
Primary Language, based on Github DataLanguage
JavaScript
Issues
8