Found 21 repositories(showing 21)
Multithreading web scraping with real-time-changing proxy-servers
Kushal334
No description available
Jonathancui123
A project to learn about multithreading in python and web scraping with Selenium + BeautifulSoup.
avirajs
a multithreaded python script that scrapes customized tax information on millions of properties accounts in Dallas and consolidates them into a formatted Excel file based on different queries.
ScrapingRobert
A no-nonsense, multithreaded Python tool to scrape Reddit post JSON directly. Built for low-latency signal extraction and high-concurrency discovery. Perfect for feeding NLP pipelines or intent-mapping as seen in the mention.broker case studies. Pure JSON, zero DOM overhead, and fast as hell. Export structured data to CSV or JSON.
kielquitain
A web scraping script that utilizes multithreading features in Python for faster scraping.
sam-ezz
A fast, concurrent web scraper built in Python. Uses multithreading to run scraping tasks in parallel, significantly speeding up data extraction from multiple URLs. A simple tool for efficient web scraping
HetalBagal
A practice-based implementation of Multithreading and Multiprocessing concepts in Python including ThreadPoolExecutor, ProcessPoolExecutor, and real-world examples like web scraping and factorial computation.
DiabSaeed
This is a Python project that scrapes detailed product data from the Sephora website. It demonstrates advanced web scraping techniques using Selenium, image processing, multithreading, and structured data storage in SQLite and Excel.
arvindh-manian
Python/Tkinter/BeautifulSoup/JSON/Multithreading. Scrapes the top games from each year at https://metacritic.com and displays them in a Tkinter GUI
SarkarPriyanshu
The Web Scraping and Data Storage Application is a Python-based project that leverages multiple technologies and design principles to scrape data from websites, store it in a MySQL database, and provide efficient data retrieval using multithreading.
SHAIMOOM251283
"SCRAPE THE BOOKS" is a Python project that scrapes book data from Books to Scrape. It extracts book details such as title, price, and rating, handles multi-page scraping with multithreading, and provides interactive visualizations using Plotly to analyze the data. Results are saved in CSV or JSON formats.
OleksiyM
A Python project that scrapes quotes and author data from https://quotes.toscrape.com using requests and BeautifulSoup4, storing the data in quotes.json and authors.json. It uses multithreading for faster scraping and ensures data integrity with error handling
farazkhancodes
This project includes Web scraping of posts in parallel using multithreading off of Reddit using python library Beautiful Soup, processing and doing Sentiment Analysis on it using NLP and further analyzing the data using SQL.
LeaDer-E
A Python script to scrape gold prices in Egypt from Arraf.app for a specified date range, featuring multithreading for high-speed data collection and automatic Excel export.
AnasAlmakhamreh
A Python GUI app that scrapes and displays announcements from Medipol University’s website. It uses multithreading and BeautifulSoup to fetch and filter content, showing headlines, dates, and details in a clean Tkinter interface.
Manojkuamrbg0210
The source code is written in python scraps the job details data from the cermati website(sample) and stores the details in the solution file JSON format, the multithreading concept is used to scrape job posting data concurrently.
andviktor
This project provides a universal system in Python for building multithreaded web scrapers. It offers a set of classes and utilities to simplify the process of web scraping, allowing you to efficiently fetch data from multiple websites concurrently.
shreyjain1099
In this project we scraped data for multiple customers using multithreading in python. The values of username, password, startdate and enddate were directly fetched from a csv file and passed into a function which was called using the thread. All the scapped data for every particular date was saved in different folers on the basis of client names.
SHIVAM200669
A fast, multithreaded proxy checker script built in Python. This tool validates large lists of proxies (HTTP, HTTPS, SOCKS4, SOCKS5), checking their status, latency, anonymity, and location. It takes a raw proxy file as input and outputs a clean list of only the working proxies, making it perfect for web scraping and security tasks
Did Web Scraping Using Beautiful Soup library for scrapping seven publications (Towards Data Science, UX design, The Startup, The Writing Cooperative, Data Driven Investor, Better Human, Better Marketing) from an American Online Publishing platform i.e. Medium, Optimized the code using Multithreading and Multiprocessing for fast scrapping, Cleaned the data using Python Pandas, Transformed the data in Power Query Editor, Displayed meaning full insights in Power BI dashboard like Monthly analysis of number of Articles published and the amount of appreciation, responses and reading time articles have for each publication, Analyzing the best article published for each publication, Comparing parameters for knowing the trends for each publication
All 21 repositories loaded