Found 9,925 repositories(showing 30)
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Unstructured-IO
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
Zipstack
LLM-Driven Extraction of Unstructured Data โ Built for API Deployments & ETL Pipeline Workflows
neo4j-labs
Neo4j graph construction from unstructured data using LLMs
varunshenoy
Extrapolating knowledge graphs from unstructured text using GPT-3 ๐ต๏ธโโ๏ธ
microsoft
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
milvus-io
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
syslog-ng
syslog-ng is an enhanced log daemon, supporting a wide range of input and output methods: syslog, unstructured text, queueing, SQL & NoSQL.
NanoNets
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
yobix-ai
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
dingodb
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
datamade
:us: a python library for parsing unstructured United States address strings into address components
D-Star-AI
High-performance retrieval engine for unstructured data
Renumics
Interactively explore unstructured datasets from your dataframe.
Oxen-AI
Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets as easy as versioning code.
brettkromkamp
Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
fstab
Export Prometheus metrics from arbitrary unstructured log data.
Unstructured-IO
No description available
rust-fuzz
Generating structured data from arbitrary, unstructured input.
jerlendds
Node graphs, OSINT data mining, and plugins. Connect unstructured and public data for transformative insights. The rewrite can be found @ osintbuddy/osintbuddy
Renumics
Curated list of open source tooling for data-centric AI on unstructured data.
ICLRandD
:black_circle: A spaCy pipeline and model for NLP on unstructured legal text.
gretelai
Synthetic data generators for structured and unstructured text, featuring differentially private learning.
whyhow-ai
Knowledge Table is an open-source package designed to simplify extracting and exploring structured data from unstructured documents.
datamade
:family: a python library for parsing unstructured western names into name components.
aryn-ai
๐ Sycamore is an LLM-powered search and analytics platform for unstructured data.
towhee-io
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
SPECFEM
SPECFEM3D Cartesian simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, or poroelastic seismic wave propagation in any type of conforming mesh of hexahedra (structured & unstructured).
WeBankFinTech
Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources
aniskoubaa
ROSGPT is a pioneering approach that combines the power of ChatGPT and ROS (Robot Operating System) to redefine human-robot interaction. By leveraging large language models like ChatGPT, ROSGPT enables the conversion of unstructured human language into actionable robotic commands. T