Code contributors
95
Crawlee: A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Created
Aug. 26, 2016
Updated
Sept. 27, 2024
License
apache-2.0
GitHub repo
Primary language (based on GitHub data)
TypeScript
Issues
131