Found 130,086 repositories(showing 30)
firecrawl
๐ฅ The Web Data API for AI - Power AI agents with clean web data
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Mintplex-Labs
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
soimort
:arrow_double_down: Dumb downloader that scrapes the web
D4Vinci
๐ท๏ธ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
dgtlmoon
Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monitoringโall for free or enjoy our SaaS plan!
ScrapeGraphAI
Python scraper based on AI
apify
CrawleeโA web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
dzhng
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the simplest implementation of a deep research agent - e.g. an agent that can refine its research direction overtime and deep dive into a topic.
Evil0ctal
๐ใDouyin_TikTok_Download_APIใๆฏไธไธชๅผ็ฎฑๅณ็จ็้ซๆง่ฝๅผๆญฅๆ้ณใๅฟซๆใTikTokใBilibiliๆฐๆฎ็ฌๅๅทฅๅ ท๏ผๆฏๆAPI่ฐ็จ๏ผๅจ็บฟๆน้่งฃๆๅไธ่ฝฝใ
getmaxun
๐ฅ The open-source no-code platform for web scraping, crawling, search and AI data extraction โข Turn websites into structured APIs in minutes ๐ฅ
yusufkaraaslan
Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection
seleniumbase
APIs for browser automation, testing, and bypassing bot-detection.
JCodesMore
Clone any website with one command using AI coding agents
clips
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
apify
CrawleeโA web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
pinchtab
High-performance browser automation bridge and multi-instance orchestrator with advanced stealth injection and real-time dashboard.
mherrmann
Lighter web automation with Python
lorien
List of libraries, tools and APIs for web scraping and data processing.
alirezamika
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
go-rod
A Chrome DevTools Protocol driver for web automation and scraping.
autoscrape-labs
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.
firecrawl
๐ฅ Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.
MontFerret
Declarative web scraping
adbar
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
lexiforest
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser tls/ja3/http2 fingerprints.
niespodd
Analysis of Bot Protection systems with available countermeasures ๐ฟ. How to defeat anti-bot system ๐ป and get around browser fingerprinting scripts ๐ต๏ธโโ๏ธ when scraping the web?
jaypyles
Self-hosted webscraper.
REMitchell
Code samples from the book Web Scraping with Python http://shop.oreilly.com/product/0636920034391.do
dotnetcore
DotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework