Found 17,806 repositories(showing 30)
jaypyles
Self-hosted webscraper.
any4ai
AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.
anaskhan96
Web Scraper in Go, similar to BeautifulSoup
scrapfly
Scalable Python web scraping scripts for +40 popular domains
benibela
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
rootVIII
a class that uses scraped proxies to make http GET/POST requests (Python requests)
salimk
An R web crawler and scraper
onepointAI
An AI assistant tool that integrates coding, writing, and reading functions. For better alternatives see https://monica.im/desktop
intergalacticalvariable
📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simple prefix http://127.0.0.1:3000/https://website-to-scrape.com/
toby-p
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
TBosak
RSS feed builder created with Bun🥖 and Hono🔥- builds from webpages, email folders, and REST API calls.
zzzgydi
Scrape the webpage convert it into Markdown, and enhance AI search applications.
AliAkhtari78
Spotify Scraper to extract all the information from spotify, download mp3 with cover of the song
serpapi
Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.
MichaelYochpaz
A Python command-line tool for scraping and downloading subtitles from AppleTV and iTunes movie pages.
3xploitGuy
A web scraper to scrape email's and phone numbers from Websites.
s-r-e-e-r-a-j
WebExtractor is a powerful OSINT and ethical hacking tool developed in Python. It is used to extract email addresses, phone numbers, and links from a target website
mehmetozkaya
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
kubowania
Demo for showing how to get backend data to the frontend
bitsummation
SQL Based DSL Web Scraper/Screen Scraper
dwallach1
Financial Web Scraper & Sentiment Classifier
chuanenlin
Web scrapper for Shutterstock
nmcassa
A letterboxd webscraper
CuriousLearner
Scrapes g4g and creates PDF
JesseVent
Cryptocurrency Historical Market Data R Package
EchterAlsFake
A lightweight API for Pornhub
hedii
A php crawler that finds emails on the internets
No description available
JonathanVusich
This is an unofficial API for the website pcpartpicker.com.
brandonrobertz
An automated, programming-free web scraper for interactive sites