Papershift Labs

Open-source tools for document processing, form extraction, and workflow automation.

Projects

docparse

Extract structured text, tables, and metadata from PDF and DOCX files.

Python
formkit

Detect and extract form fields from scanned documents and PDFs.

Python
pshift-cli

Command-line tool for document processing workflows.

Python CLI
research-notes

LaTeX and Jupyter templates for document processing research.

LaTeX