Convert PDF to CSV
Extract tables from
any PDF.
Free PDF to CSV converter. Extract text tables from any PDF into an editable spreadsheet — no upload, no signup, no server. Powered by PDF.js, the same technology used by Firefox. Runs entirely in your browser.
Drop your PDF file. CSVShift uses PDF.js — the same open-source library used by Mozilla Firefox — to read the file entirely in your browser. The PDF is parsed locally and the page count is detected automatically.
PDF.js extracts every text element with its exact X and Y coordinates. CSVShift groups elements that share the same vertical position into rows, and sorts each row left to right to reconstruct columns — reverse-engineering the table structure.
The reconstructed rows are serialized as CSV and downloaded. Open in Excel, Google Sheets or any database tool. For multi-page PDFs, each page's content is included sequentially.
A text-based PDF is created digitally — exported from Excel, Word or a reporting tool. The text is stored as actual characters with precise positions. You can click and drag to select text in Chrome. These PDFs convert reliably.
A scanned PDF is a photograph of a document. The "text" is pixels — there is no underlying character data. These require OCR software. How to check: open the PDF in Chrome and try to select a word. If text highlights, it's text-based and will work here. If nothing selects, it's a scan.
Banks often export transaction history as PDF. Converting to CSV lets you import transactions into accounting software, Excel or budgeting tools for analysis and reconciliation.
Business intelligence tools sometimes only offer PDF exports. Extracting data tables to CSV lets you continue processing the data programmatically.
Many public datasets — court records, regulatory filings, official statistics — are published only as PDFs. Converting to CSV is the first step in any data analysis workflow.
Supplier invoices sent as PDFs need to be imported into ERP or accounting systems. Extracting line-item tables to CSV saves manual re-entry.
| Option | Values | When to use it |
|---|---|---|
| Pages to extract | All · Range · List | Leave blank for all pages. Enter 1-3 for a range or 1,4,7 for specific pages. |
| Row tolerance | 3 · 5 · 8 · 12 px | Lower values keep tight rows separate. Higher values merge items at slightly different heights. |
| Output delimiter | Comma · Tab · Semicolon | Comma for standard CSV. Tab for databases. Semicolon for European locale systems. |
Your PDF stays on
your device. Always.
PDFs often contain sensitive data. CSVShift uses PDF.js entirely in your browser. Your file is parsed locally and never transmitted to any server. PDF.js is the same open-source library that powers Mozilla Firefox's built-in PDF viewer — security-audited and maintained by Mozilla.
