Search Results

Found 76 repositories (showing 30)

crawler-user-agents

monperrus

🧡69

Syntactic patterns of HTTP user-agents used by bots / robots / crawlers / scrapers / spiders. pull-request welcome :star:

1.4k

279

MIT

Updated 2 days ago

slither

kaelscion

🧡50

A simple, easy to use framework for adding randomized, anonymous IP addresses and user-agents to web scrapers, crawlers, and penetration testing solutions

MIT

Python

Updated 2 months ago

user-agent_strings_db

binkybear

❤️35

Python scraper that grabs user agent strings and populates them into sqlite/JSON files

Python

Updated 3 years ago

User-Agent_Scraper

Floorp-Projects

🧡50

This tool automatically retrieves the user agent from each browser.

MPL-2.0

JavaScript

Updated 2 months ago

YP_scraper

ZaxR

❤️20

Yellow Pages scraper webapp made with Flask, requests, and beautifulsoup. Outputs results to users as csvs. Uses random proxy and user-agent for each request.

Python

Updated 4 years ago

goodrxscraper

snackattas

❤️35

Selenium scraper of goodrx.com (for IE, Chrome, and mobile Safari) with and useragentstring.com user-agents

Python

Updated 5 months ago

Agentic-AI

AGENTIC AI BOILERPLATE: a starter kit for building AI agents with Agno AGI + Google Gemini + Qdrant (RAG), with a "clean" web scraper, Google Search, and user memory; ready to search, read, and answer from trusted sources ⚡🤖

Python

Updated 7 months ago

crawle

toveons

❤️25

Syntactic patterns of HTTP user-agents used by bots / robots / crawlers / scrapers / spiders. pull-request welcome :star:

MIT

Updated 3 months ago

ScrapedIn-Agentic-LinkedIn-Scraper-

sohom2004

❤️35

Built an agentic LinkedIn lead scraper that performs query-based Google search, fetches profiles, and scrapes necessary information like user name, position, about section, email, and the profile URL

Python

Updated 6 months ago

linkedin-lead-scraper-python

abq2904

❤️35

🤖 AI-Powered LinkedIn Scraper | Extract Emails & Profiles Ethically with Python + Selenium ✅ Bypasses LinkedIn restrictions with human-like behavior ✅ Enriches data using OpenAI API (job titles, company info) ✅ Exports to CSV/Google Sheets for CRM integration ✅ Anti-detection techniques (random delays, user-agent rotation)

Python

Updated 3 months ago

webscrapper

Pushparaj13811

❤️35

Python-based web scraper using Selenium with user agent rotation, error handling, and structured JSON storage. Supports batch URL processing from CSV files.

Python

Updated 11 months ago

last-defense-system

maon-git

❤️40

A list of User-Agent strings and IPv4 addresses for blocking unwanted crawlers, bad robots, suspicious spiders, junk web scrapers, malicious spammers, and unauthorized access, including DDoS attacks.

MIT

PHP

Updated 2 years ago

API-Header-Spoofer

DevArqf

❤️40

Practical examples of how header spoofing works. Shows you the tricks scrapers use like changing user agents and mimicking human behavior. Code works in browser console or Python. Built this while learning about bot detection.

MIT

Python

Updated 3 months ago

api-testing · automation · dev-tools · +8

BikeScraper

hunterness-dev

❤️40

An asynchronous web scraper built with Python that collects and analyzes bike price data from Amazon, featuring proxy rotation, user-agent spoofing, and robust error handling. Built with aiohttp for high-performance concurrent requests and pandas for data processing, with comprehensive logging and anti-detection measures.

MIT

Python

Updated 6 months ago

medicine-data-scraper-with-proxy-rotation

sujoy-kr

❤️35

This project is a web scraper using Puppeteer, Cheerio, and Axios. It scrapes product details (generic name, strength, brand, price) from medex.com.bd and updates them to an API. It includes proxy rotation, stealth browsing, randomized user agents, human-like scrolling, and auto-restart on Cloudflare blocks.

JavaScript

Updated 11 months ago

User-Agent_Scraper

goastian

❤️40

This tool automatically retrieves the user agent from each browser.

MPL-2.0

JavaScript

Updated 2 years ago

user-agent-scraper

i-redspector

🧡50

A Python tool to generate and save random user-agent strings for web scraping and automation purposes. Useful for projects like YouTube automation, scraping websites, or bypassing anti-bot measures. The user-agent list is saved in a text file for later use.

MIT

Batchfile

Updated 1 month ago

user_agent_scraper

gabrielmajosi

❤️40

Easy Python module to get a list of user agents

GPL-3.0

Python

Updated 4 years ago

simple-scraper

userforce

❤️40

Web scraper with dynamic proxy and user agent.

MIT

Python

Updated 4 years ago

real-estate-platform

open-estate-ai

❤️35

User-facing platform integrating scrapers, data lake, MCP server, and Agentic AI into a unified advisory system.

Updated 6 months ago

scala-scraper

amal-qokka

❤️40

fork of https://github.com/ruippeixotog/scala-scraper to set user agent and referrer in connection header

MIT

Scala

Updated 2 years ago

hpricotscape

crowdmob

❤️35

A browser for master web scrapers. It saves cookies, has a decent user agent, sends referer URLs, and lets you (the scraper) easily submit forms on the page. It *doesn't* run the page's JavaScript yet, however.

Ruby

Updated 12 years ago

AI_WebScraper

andredisa

❤️40

🕵️‍♂️ Welcome to AI Web Scraper Agent, a powerful and user-friendly app to scrape websites using natural language via OpenAI or Llama 3.2 (Ollama).

MIT

Python

Updated 5 months ago

llama · openai-api · python · +2

PythonWebscraper

Blzs2005

❤️40

A simple Python web scraper to crawl a specific domain (like documentation sites), extract text content, and save it to a file. Configurable crawl depth, delay, and user agent.

MIT

Python

Updated 8 months ago

mubawab-real-estate-scraper

TheCodeNoodle

❤️40

A Python web scraper for extracting real estate listings from Mubawab.ma using BeautifulSoup. Supports multiple cities and languages (French/English) with user-agent rotation and flexible CSS selectors to handle dynamic page structures.

MIT

Python

Updated 7 months ago

AI-JobRecommender-MCP

shivzz-creator

❤️35

A smart, agent-powered job recommendation system built using Model Context Protocol (MCP), Apify job scrapers, and OpenAI. This project fetches real-time job listings from LinkedIn and Naukri, processes user queries, and returns personalized job recommendations with LLM-powered reasoning. ✨ Ideal for learning MCP tooling, agentic workflows, API .

Python

Updated 3 months ago

ai-realestateagent

bala-ceg

❤️35

AI Real Estate Agent is an Apify Actor that searches for real estate listings on Zillow based on user queries. It extracts ZIP codes using an LLM (GPT-3.5-Turbo) and fetches property listings using the Zillow Scraper.

Python

Updated 1 year ago

ShadowScrape

HenryElliott

❤️35

ShadowScrape is an advanced asynchronous Python web scraper with a sleek GUI. It features rate limiting, ban detection, user-agent rotation, optional image scraping, and export to JSON or CSV. Built with Playwright and CustomTkinter for cross-platform ease and performance.

Python

Updated 9 months ago

parser_JS_website

GligoryKun

❤️35

Dynamic JS Website Parser with Selenium. Selenium-based scraper for dynamic JavaScript-driven websites. Handles user-agent rotation and other anti-bot measures to reliably extract required product attributes. Scraped data is processed and saved in Excel, JSON, and CSV formats for easy use.

Python

Updated 3 months ago

Web-Scraper-Pro

matbanik

❤️40

A robust, multi-process web scraper with a Tkinter GUI. It uses Scrapy and Trafilatura for intelligent data extraction from a URL queue. Features configurable PDF/ZIP packaging, request delays, user-agent rotation, exclusion lists, and granular logging control. The application is packaged for Windows using PyInstaller.

MIT

Python

Updated 3 months ago
