PDF to CSV

Extract tabular data from PDFs into CSV. Browser-only — nothing uploaded.

Drag & drop your PDF here

or browse files

Single file · PDF supported

How PDF to CSV extraction works

PDF.js reads the text content stream embedded in each page — the same layer used by search engines and screen readers. Every text item comes with an x/y coordinate, width, and height. The tool groups items with a similar y-position (within ±5 px) into rows, then analyzes x-position gaps wider than 2.5× the median character width to identify column boundaries.

This works well on PDFs exported from Excel, accounting software, bank portals, GST return portals, and financial reporting tools — anywhere the source document was digital. Scanned PDFs (paper fed through a scanner) store page images with no text layer; OCR support for those is on the v2 roadmap.

Use cases

  • Bank statement reconciliation — extract transaction rows to CSV for Tally or Excel import
  • GST return preparation — copy invoice line items directly into your accounting system
  • Financial report analysis — get numbers out of PDF reports into a spreadsheet
  • Vendor invoice processing — extract item, quantity, and amount columns automatically

All processing runs locally in your browser. Your PDF never touches a server.