Top alternative scraping utilities for Node.js
Updated: March 25, 2024
node-metainspector
Github stargazers
129
Github forks
52
Commits
74
Code contributors
13
Node.js npm package for web scraping. Given a URL, it returns the page's title, meta description, meta keywords, an array of all its links, all its images, and more. Inspired by the metainspector Ruby gem.
Created
April 23, 2013
Updated
Feb. 16, 2019
License
MIT
Type
Module/library
Platform
Node.js
Primary language, based on GitHub data
JavaScript
Issues
20
micro-scraper
Github stargazers
109
Github forks
67
Commits
4
Code contributors
1
Node.js crawler example (for Baidu Baike)
Created
June 6, 2013
Updated
Aug. 8, 2013
Platform
Node.js
Primary language, based on GitHub data
JavaScript
Issues
2
node-web-crawler
Github stargazers
107
Github forks
40
Commits
54
Code contributors
4
A web scraper with a web user interface that shows scraping stats in real time. Uses Node.js, jQuery, socket.io and Express.
Created
Feb. 3, 2013
Updated
Oct. 15, 2014
License
MIT
Platform
Node.js
Primary language, based on GitHub data
JavaScript
Issues
4
librus-api
Github stargazers
100
Github forks
25
Commits
97
Code contributors
14
Advanced Node.js scraping API for Librus (http://synergia.librus.pl/)
Created
Dec. 10, 2015
Updated
Feb. 6, 2024
Primary language, based on GitHub data
JavaScript
Issues
10
scraper
Github stargazers
99
Github forks
16
Commits
595
Code contributors
1
Node.js web scraper. Includes a command-line interface, Docker container, Terraform module and Ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, jsdom.
Created
Dec. 5, 2020
Updated
June 13, 2022
License
MIT
Type
Module/library
Platform
Node.js, Browser
Primary language, based on GitHub data
TypeScript
Issues
12
duck-duck-scrape
Github stargazers
99
Github forks
13
Commits
215
Code contributors
4
🔎 Search from DuckDuckGo and utilize its spice APIs in Node
Created
April 7, 2018
Updated
Dec. 4, 2023
License
MIT
Primary language, based on GitHub data
TypeScript
Issues
1
siphon
Github stargazers
96
Github forks
8
Commits
280
Code contributors
3
First distributed web scraping library for Node.js
Created
Nov. 18, 2016
Updated
July 31, 2017
Type
Module/library
Platform
Node.js
Primary language, based on GitHub data
JavaScript
Issues
1
lambda-phantom-scraper
Github stargazers
96
Github forks
15
Commits
3
Code contributors
1
PhantomJS/Node.js web scraper for AWS Lambda
Created
May 25, 2016
Updated
May 29, 2016
Primary language, based on GitHub data
JavaScript
kijiji-scraper
Github stargazers
90
Github forks
42
Commits
79
Code contributors
9
A lightweight Node.js module for retrieving and scraping ads from Kijiji
Created
Feb. 27, 2015
Updated
Dec. 18, 2023
License
MIT
Type
Module/library
Platform
Node.js, Browser
Primary language, based on GitHub data
TypeScript
Issues
5
node-google-search-scraper
Github stargazers
86
Github forks
40
Commits
20
Code contributors
3
Google search scraper with captcha solving support
Created
May 14, 2014
Updated
June 20, 2018
License
MIT
Type
Module/library
Platform
Node.js
Primary language, based on GitHub data
JavaScript
Issues
13
nodejs-web-scraper
Github stargazers
80
Github forks
26
Commits
201
Code contributors
7
--
Created
Aug. 20, 2018
Updated
Jan. 7, 2023
Type
Tool/utility
Platform
Node.js, Browser
Primary language, based on GitHub data
JavaScript
Issues
6
browser
Github stargazers
74
Github forks
9
Commits
20
Code contributors
1
Browses URLs with cookies, so pages that require authentication can be scraped (Node.js)
Created
Oct. 17, 2011
Updated
Feb. 23, 2014
License
MIT
Platform
Browser, Node.js
Primary language, based on GitHub data
JavaScript
Issues
4
scraping-service
Github stargazers
61
Github forks
12
Commits
107
Code contributors
1
REST API for scraping dynamic websites using Node.js, headless Chrome and Cheerio.
Created
Aug. 14, 2017
Updated
Feb. 17, 2024
License
MIT
Platform
Node.js, Browser
Primary language, based on GitHub data
JavaScript
Issues
1
node-website-scraper-phantom
Archived (read-only repository, archived by owner)
Github stargazers
58
Github forks
13
Commits
15
Code contributors
2
Plugin for website-scraper that returns HTML for dynamic websites using PhantomJS.
Created
Feb. 27, 2017
Updated
Dec. 29, 2021
License
MIT
Type
Module/library
Platform
Node.js
Primary language, based on GitHub data
JavaScript
node-anime-scraper
Github stargazers
58
Github forks
17
Commits
19
Code contributors
3
Scrapes Gogoanime for anime, episode and video information and URLs.
Created
Feb. 7, 2015
Updated
Dec. 22, 2020
Type
Module/library
Platform
Node.js
Primary language, based on GitHub data
JavaScript
Issues
4
torrent-indexer
Github stargazers
54
Github forks
12
Commits
82
Code contributors
2
Yet another Node.js torrent scraper made especially for movies, series, anime and music (scrapes 1337x, eztv, limetorrents, rarbg, skytorrents, thepiratebay, torrentproject, yts and zooqle)
Created
Jan. 13, 2020
Updated
May 14, 2020
License
MIT
Primary language, based on GitHub data
JavaScript
Issues
13
gs-scraper
Github stargazers
54
Github forks
14
Commits
84
Code contributors
1
A little app that searches multiple sources for items and garage sales (Craigslist, LetGo, OfferUp, VarageSale, Oodle, EstateSales.net). It also allows you to use eBay's finding API to search prices on eBay and compare.
Created
Aug. 12, 2018
Updated
May 18, 2020
Type
Module/library
Platform
Node.js
Primary language, based on GitHub data
JavaScript
Issues
1
n8n-nodes-puppeteer
Github stargazers
54
Github forks
7
Commits
28
Code contributors
2
n8n node for requesting webpages using Puppeteer
Created
May 7, 2022
Updated
Aug. 29, 2022
License
MIT
Primary language, based on GitHub data
TypeScript
Issues
4
indeed-scraper
Github stargazers
53
Github forks
42
Commits
57
Code contributors
10
A Node.js package for getting job listings from Indeed.com.
Created
Dec. 24, 2016
Updated
Nov. 20, 2021
License
MIT
Type
Module/library
Primary language, based on GitHub data
JavaScript
Issues
7
node-image-scraper
Github stargazers
53
Github forks
13
Commits
16
Code contributors
3
Node.js module for scraping images from the web.
Created
June 26, 2013
Updated
Jan. 27, 2018
License
MIT
Type
Module/library
Platform
Node.js
Primary language, based on GitHub data
JavaScript