Found 265,452 repositories (showing 30)
asgeirtj
Extracted system prompts from ChatGPT (GPT-5.4, GPT-5.3, Codex), Claude (Opus 4.6, Sonnet 4.6, Claude Code), Gemini (3.1 Pro, 3 Flash, CLI), Grok (4.2, 4), Perplexity, and more. Updated regularly.
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
apify
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Perfare
AssetStudio is a tool for exploring, extracting and exporting assets and assetbundles.
coderamp-labs
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
interagent
HTTP API design guide extracted from work on the Heroku Platform API
moonD4rk
Extract and decrypt browser data, supporting multiple data types, runnable on various operating systems (macOS, Windows, Linux).
addyosmani
Extract & Inline Critical-path CSS in HTML pages
jsvine
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
apify
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
YaoFANGUK
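A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction.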
kreuzberg-dev
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno), or use via CLI, REST API, or MCP server.
peazip
Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.
briangonzalez
🦎 A jQuery plugin for extracting the dominant color from images and applying the color to their parent.
cloudquery
Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.
DerekYRC
mini-spring is a simplified version of the Spring framework that helps you quickly familiarize yourself with Spring source code and master Spring's core principles. It extracts Spring's core logic with extremely simplified code while preserving Spring's core functionality.
nickscamara
An open source deep research clone. AI Agent that reasons over large amounts of web data extracted with Firecrawl
postlight
📜 Extract meaningful content from the chaos of a web page
vi3k6i5
Extract keywords from sentences or replace keywords in sentences.
allure-framework
Allure Report is a flexible, lightweight multi-language test reporting tool. It provides clear graphical reports and allows everyone involved in the development process to extract the maximum of information from the everyday testing process
grobidOrg
Machine learning software for extracting information from scholarly documents
bjesus
Swiss-army tool for scraping and extracting data from online assets, made for hackers
jariz
Extract prominent colors from an image. JS port of Android's Palette.
activemerchant
Active Merchant is a simple payment abstraction library extracted from Shopify. The aim of the project is to feel natural to Ruby users and to abstract as many parts as possible away from the user to offer a consistent interface across all supported gateways.
deanmalmgren
extract text from any document. no muss. no fuss.
blueimp
Load images provided as File or Blob objects or via URL. Retrieve an optionally scaled, cropped or rotated HTML img or canvas element. Use methods to parse image metadata to extract IPTC and Exif tags as well as embedded thumbnail images, to overwrite the Exif Orientation value and to restore the complete image header after resizing.
PiotrMachowski
This tool retrieves tokens for all devices connected to Xiaomi cloud and encryption keys for BLE devices.
torakiki
PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages
jlegewie
Zotero plugin to manage your attachments: automatically rename, move, and attach PDFs (or other files) to Zotero items, sync PDFs from your Zotero library to your (mobile) PDF reader (e.g. an iPad, Android tablet, etc.), and extract PDF annotations.
Bioruebe
Universal Extractor 2 is a tool to extract files from any type of archive or installer.