The smart PDF-to-Markdown router. One command, zero config.
$ pip install pdfmux
click to copy
# convert a pdf to markdown $ pdfmux invoice.pdf ✓ invoice.pdf → invoice.md (2 pages, 100% confidence) # json output with metadata $ pdfmux report.pdf -f json # batch convert $ pdfmux ./docs/ -o ./output/ # start mcp server for ai agents $ pdfmux serve
We don't convert PDFs. We route them to whichever tool converts them best.
| PDF Type | Extractor | Speed | Cost |
|---|---|---|---|
| Digital | PyMuPDF | 0.01s/pg | Free |
| Tables | Docling | 0.3-3s/pg | Free |
| Scanned | Surya OCR | 1-5s/pg | Free |
| Complex | Gemini Flash | 2-5s/pg | ~$0.01 |
Give your AI agent the ability to read PDFs:
$ pip install pdfmux[tables] # Docling $ pip install pdfmux[ocr] # Surya OCR $ pip install pdfmux[llm] # Gemini Flash $ pip install pdfmux[all] # everything