Found 252 repositories(showing 30)
grobidOrg
A machine learning software for extracting information from scholarly documents
titipata
Python PDF parser for scientific publications: content and figures
grobidOrg
Python client for GROBID Web services
eLifePathways
A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools together to generate a full XML document.
ScienciaLAB
Streamlit PDF viewer
shauryr
This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs
YS0meone
Multi-agent AI research system — finds academic papers via semantic search & citation snowballing, then answers questions over them using agentic RAG with self-reflection. Built with LangGraph, FastAPI, Celery, and Qdrant.
lfoppiano
GROBID extension for identifying and normalizing physical quantities.
ScienciaLAB
Viewer for the structure extracted by Grobid on PDF documents
grobidOrg
A Named-Entity Recogniser based on Grobid.
papercast-dev
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.
MedKhem
No description available
ourresearch
PDF parser powered by grobid
lfoppiano
Grobid module for superconductor material and properties extraction
grobidOrg
Simple node.js client for GROBID REST services
kermitt2
Some examples of usage of Grobid in a third party java project.
alisonmitchell
Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.
lfoppiano
Material parsers and other tools, scripts Initially developed for Grobid Superconductor
nate-russell
Make MP3 albums out of Academic PDFs. Works by gluing together Grobid and TTS offerings.
com3dian
The grobidmonkey package is an open-source package designed for postprocessing GROBID outputs.
Kabongosalomon
This program produces the test data for classification over a set of predefined task#dataset#metrics#software labels. Given input a pdf file, it scrapes the text from the file using the Grobid parser, subsequently generating the test data file for input to the neural network classifier.
kermitt2
A machine learning software for extracting astronomical entities from scholarly documents
tantikristanti
A GROBID module for extracting and structuring medical reports into structured XML/TEI encoded documents
jacksongoode
A tool for the bibliographic analysis of the NIME proceedings archive
grobidOrg
Simple Java client for GROBID REST services
ram02z
Python library for serializing GROBID TEI XML to dataclass
tantikristanti
NERD and wiKIData (NERD KID) is a machine learning application for classifying Wikidata items into 27 classes (as defined by the Grobid-NER project).
miku
A Go (golang) client for GROBID.
howisonlab
No description available
Vi-dot
GROBID GUI to manage training data tasks