From PDFs to Structured XML: How GROBID Simplifies the Process
GROBID is a powerful machine-learning library designed to transform raw documents, like PDFs, into structured XML/TEI documents, with a focus on technical and scientific publications. It started originally a hobby project in 2008, GROBID was open-sourced in 2011 and has steadily evolved as a side project ever since. GROBID