Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Stars
22.6k
Forks
1.3k
Watchers
22.6k
Open Issues
174
Overall repository health assessment
^1.0.0^2.4.0^0.1.0^2.3.11^20.0.01.58.11.58.11.58.1^5.0.0^1.1.5^1.0.1^2.4.2^5.0.0^11.0.0^8.2.1^3.0.1^4.5.8^4.6.7^2.1.1^24.0.0^4.1.2^1.1.2^4.0.0^1.0.0^7.3.12^1.7.2^17.0.26^4.0.16*^1.0.15^0.0.2^2.0.0^0.9.3^20.0.0^10.0.0^2.0.5^9.23.0^10.1.1^4.18.1^11.0.0^1.1.3^17.0.0^15.0.0^13.0.0^9.0.11^4.0.0^9.0.0^16.0.0^13.4.01.58.1^1.0.1^1.0.224.36.1^6.0.0^4.4.0^2.1.0^6.0.0^8.28.0^4.0.161.1k
commits
863
commits
711
commits
603
commits
444
commits
242
commits
175
commits
107
commits
60
commits
47
commits
feat: Modularize SystemStatus to allow custom backpressure mechanisms (#3529)
7159a13View on GitHubci: use standalone apify-cli binary in E2E tests (#3543)
be6dbc4View on GitHubdocs: serve markdown via Accept header in nginx (#3542)
9586a6aView on GitHubchore(deps): bump handlebars from 4.7.8 to 4.7.9 (#3539)
a2d34a4View on GitHubchore(deps): update dependency typescript to v6 (#3516)
1d48644View on GitHubchore(deps): bump brace-expansion from 1.1.12 to 1.1.13 in /website/versioned_docs/version-3.16 (#3536)
c7c8d30View on GitHubchore(deps): bump path-to-regexp from 0.1.12 to 0.1.13 in /website (#3537)
6d30b47View on GitHubchore(deps): update dependency impit to ^0.13.0 (#3534)
516908aView on GitHubchore(deps): update actions/configure-pages action to v6 (#3535)
5637fd2View on GitHubfeat: add abortDownload context helper to FileDownload (#3528)
3b3d8f6View on GitHubfix(e2e): resolve flaky Camoufox tests due to GitHub rate limits (#3527)
08c98cfView on GitHubfix: Correctly track the number of requests handled by a crawler (#3410)
b2296ceView on GitHubchore(deps): bump yaml from 2.8.2 to 2.8.3 in /website (#3522)
abece11View on GitHubchore(deps): update actions/deploy-pages action to v5 (#3523)
364bf03View on GitHub