PDF to Markdown
Convert a text-based PDF to clean Markdown — headings, lists, and paragraphs detected automatically. Everything runs in your browser. Your file never leaves your device.
Images and complex layouts are simplified — review the output. Scanned (image-based) PDFs require OCR and are not supported in v1. OCR support is planned for v2.
What this tool does
PDF is a presentation format, not a document-structure format. Converting it to Markdown means recovering the structure (headings, lists, paragraphs) from positional clues in the text layer. This tool uses PDF.js to extract position and font-size data for the text on each page, then applies heuristics: text significantly larger than the median body font becomes a heading, lines starting with bullet characters become list items, and unusually large vertical gaps become paragraph breaks.
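The three heuristics described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: the `TextLine` shape, the `linesToMarkdown` name, the 1.3× heading ratio, the 1.5× gap ratio, and the single `##` heading level are all assumptions made for the example. It also assumes the extracted text has already been grouped into lines, top to bottom.

```typescript
// Hypothetical, simplified line record. A real pipeline would build these
// from the text items that PDF.js returns for each page.
interface TextLine {
  text: string;
  fontSize: number; // points
  y: number;        // distance from the top of the page, growing downward
}

function median(values: number[]): number {
  const sorted = [...values].sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2 === 1
    ? sorted[mid]
    : (sorted[mid - 1] + sorted[mid]) / 2;
}

// Common bullet glyphs followed by whitespace (an illustrative set).
const BULLET = /^[•◦▪‣\-*]\s+/;

function linesToMarkdown(
  lines: TextLine[],
  headingRatio = 1.3, // assumed threshold: "significantly larger" than body text
  gapRatio = 1.5      // assumed threshold: "unusual" vertical gap
): string {
  if (lines.length === 0) return "";

  // The median font size stands in for the body font.
  const bodySize = median(lines.map((l) => l.fontSize));
  // The median line-to-line distance stands in for normal line spacing.
  const gaps = lines.slice(1).map((l, i) => l.y - lines[i].y);
  const typicalGap = gaps.length > 0 ? median(gaps) : 0;

  const blocks: string[] = [];
  let paragraph: string[] = [];
  const flush = (): void => {
    if (paragraph.length > 0) {
      blocks.push(paragraph.join(" "));
      paragraph = [];
    }
  };

  lines.forEach((line, i) => {
    const text = line.text.trim();
    if (text === "") return;

    // An unusually large gap above this line ends the current paragraph.
    const gap = i > 0 ? line.y - lines[i - 1].y : 0;
    if (i > 0 && gap > typicalGap * gapRatio) flush();

    if (line.fontSize >= bodySize * headingRatio) {
      flush();
      blocks.push(`## ${text}`); // heading level is fixed in this sketch
    } else if (BULLET.test(text)) {
      flush();
      blocks.push(`- ${text.replace(BULLET, "")}`);
    } else {
      paragraph.push(text); // consecutive body lines merge into one paragraph
    }
  });
  flush();

  return blocks.join("\n\n");
}
```

Using the median rather than the mean keeps a handful of large title lines from skewing the estimate of the body font size. A fuller version would likely map several size tiers to different heading levels and handle numbered lists.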
The output is compatible with GitHub-Flavored Markdown: paste it into any .md file, Notion, Obsidian, or a RAG pipeline. It works well for research papers, reports, and other structured documents. Heavily designed marketing PDFs with multi-column layouts produce messier output that needs review.
Why no server?
Your PDF never leaves your device. The entire conversion runs client-side using PDF.js in your browser. This matters for confidential documents such as salary slips, medical records, NDA drafts, and research data. Converters that process PDFs server-side have to receive a copy of your file, even if they hold it only briefly. We never see it.