Found 466 repositories(showing 30)
Zipstack
LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows
Unstructured-IO
No description available
Unstructured-IO
A Python client for the Unstructured Platform API
DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access
medelman17
Blackstone is a spaCy model and library for processing long-form, unstructured legal text. Here, we wrap Blackstone with a performant API layer written in Go.
Unstructured-IO
A JavaScript/Typescript client for the Unstructured Platform API
aws-solutions-library-samples
This Guidance shows how Amazon Bedrock Data Automation streamlines the generation of valuable insights from unstructured multimodal content such as documents, images, audio, and videos through a unified multi-modal inference API.
l1m-io
The easiest way to get structured data from unstructured text or images using LLMs. No prompt engineering, no chat history, just a simple API to extract structured JSON from text or images.
gurtejrehal
Falcon Search has been created to aid the National Crime Records Bureau keeping in mind the need for an efficient AI data crawler that collects classified data from the web based on given keywords. It is a SaaS web data integration (WDI) platform which converts unstructured web data into structured format by extracting, preparing and integrating web data in areas of crime for consumption in criminal investigation agencies. Falcon provides a visual environment for automating the workflow of extracting and transforming web data. After specifying the target website url, the web data extraction module provides a visual environment for designing automated workflows for harvesting data, going beyond HTML/XML parsing of static content to automate end user interactions yielding data that would otherwise not be immediately visible. Once extracted, the software provides full data preparation capabilities that are used for harmonizing and cleansing the web data. For consuming the results, Falcon provides several options. It has its own visualization and dashboarding module to help criminal investigators gain the insights that they need. It also provides APIs that offer full access to everything that can be done on our platform, allowing web data to be integrated directly. FALCON is capable of crawling ten million links and scrape one million links per month using Celery Worker. It moreover has the potential of outperforming this number if tested under standard cloud platforms.
Alignment-Lab-AI
a pipeline for using api calls to agnostically convert unstructured data into structured training data
Unstructured-IO
No description available
covalenthq
Decode unstructured, raw event logs into structured data with a simple API.
We will process unstructured data from web (obtained by crawling some sample websites) by maybe: having a Apache SolR installation locally and manually feeding it web pages. We can use Stanford NLP API or Metamind API to extract semantics from the unstructured text. After we extract some semantics, we can construct a structured data format, probably RDF/XML/OWL and also have a visual representation of the graph of the data using Gruff
j-fu
VTK based C++ graphics library for plotting and for data on rectilinear and unstructured grids with an flexible and easy to use API.
bethune
Unstructured.io API GUI
lxchtan
Code for DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access.
pyoceans
API for interpolation on regular grid, curvilinear orthoganal grid, unstructured grid
box-community
The OpenAI Responses API will soon allow developers to inject additional data into the response. This includes web search, computer use, and other types of data. This repository shows how to use this new capability and a Box Agent to add proprietary unstructured data from your Box instance into your agentic work.
redhat-na-ssa
prepare patient data for LLM ingestion using unstructured ai apis
Traves-Theberge
A CLI tool for extracting unstructured data from websites using customizable schemas and Google's Gemini API and outputing them into structured schemas.
CodeOne45
anyToJSON is a powerful SaaS tool designed to automatically convert any unstructured or structured data into clean, standardized JSON format. Whether your data comes from text files, spreadsheets, or other sources, anyToJSON simplifies the process of transforming it into a format that can be easily used in modern applications and APIs.
thevpc
Unstructured Persistence API (Java-C# ORM and ODM)
ASRagab
Unstructured4s: An API wrapper for Unstructured.io
msshahata
This trail APP make you create Unstructured Supplementary Service Data (USSD) service through whatsapp using Twillo API
This is a RAG Microservice Backend built using Redis (to cache conversations), Postgres/pgvector (as the vector store), the Unstructured library (to aid in table/image extraction in PDFs), LangChain, OpenAI API, and FastAPI.
ralstonraphael
Stone Scraper is an AI-powered tool for automated web data extraction. Built with Streamlit, Langchain, and Selenium to transform unstructured web content into clean, actionable insights. CURRENTLY WORKING ON OPEN AI API CLOUD DEPLOYMENT. Try it out ⬇️⬇️
naimkatiman
This project uses the CrewAI framework to automate stock analysis, enabling AI agents to collaborate and execute complex tasks efficiently. Example stock: Nvidia. Technologies include Python, CrewAI, Unstructured, PyOWM, Tools, Wikipedia, yFinance, SEC-API, tiktoken, faiss-cpu, python-dotenv, langchain-community, langchain-core, and OpenAI.
malvag
ClaimLinker is a Web service and API that links arbitrary text to fact-checked claims, offering a novel kind of semantic annotation of unstructured content. The system is based on a scalable, fully unsupervised and modular approach that does not require training or tuning and which can serve high quality results at real time.
brown2020
No description available
kkacsh321
Unofficial Unstructured-API Helm Chart for Kubernetes