PDF to CSV
Extract tabular data from PDFs into CSV. Browser-only — nothing uploaded.
Drag & drop your PDF here
or browse files
Single file · PDF supported
How PDF to CSV extraction works
PDF.js reads the text content stream embedded in each page — the same layer used by search engines and screen readers. Every text item comes with an x/y coordinate, width, and height. The tool groups items with a similar y-position (within ±5 px) into rows, then analyzes x-position gaps wider than 2.5× the median character width to identify column boundaries.
This works well on PDFs exported from Excel, accounting software, bank portals, GST return portals, and financial reporting tools — anywhere the source document was digital. Scanned PDFs (paper fed through a scanner) store page images with no text layer; OCR support for those is on the v2 roadmap.
Use cases
- Bank statement reconciliation — extract transaction rows to CSV for Tally or Excel import
- GST return preparation — copy invoice line items directly into your accounting system
- Financial report analysis — get numbers out of PDF reports into a spreadsheet
- Vendor invoice processing — extract item, quantity, and amount columns automatically
All processing runs locally in your browser. Your PDF never touches a server.