Open-source tools for document processing, form extraction, and workflow automation.
Extract structured text, tables, and metadata from PDF and DOCX files.
Detect and extract form fields from scanned documents and PDFs.
Command-line tool for document processing workflows.
LaTeX and Jupyter templates for document processing research.