Format GuidesMarch 16, 2026
Meidy Baffou·LazyPDF

Convert Scanned Receipts to Excel Spreadsheets

Boxes of paper receipts are the bane of accounting season. Whether you're a small business owner reconciling expenses, a freelancer claiming business deductions, or an accountant processing client records, converting scanned receipts to Excel dramatically speeds up the process. The challenge is that scanned receipts are images — the data is trapped in pixels and can't be extracted without OCR (Optical Character Recognition). This guide walks you through the complete workflow: from scanned receipt PDF to structured Excel data, including how to make the OCR step work reliably even on faded or hand-written receipts.

Why Scanned Receipts Need OCR First

A scanned receipt is fundamentally a photograph of paper. The amounts, dates, and vendor names you see on screen are not machine-readable text — they're pixels arranged to look like text. When you try to select text on a scanned PDF, nothing highlights, because there's no text data to select. Before a PDF-to-Excel converter can extract any data, the document must contain a machine-readable text layer. OCR (Optical Character Recognition) creates this layer by analyzing the image and converting what it 'sees' into actual text characters. The quality of OCR output depends on: - **Scan quality**: Sharp, high-contrast scans (300 DPI+) produce much better OCR results than blurry or dark scans - **Font style**: Printed text OCRs very accurately. Handwritten amounts are less reliable - **Image condition**: Faded ink, coffee stains, crumpled paper — all reduce accuracy - **Language and special characters**: Dollar signs, currency symbols, and decimal points need accurate recognition for financial data

Step 1 — Scan and Prepare Your Receipts

The quality of your scan determines the quality of your OCR output.

  1. 1Use a flatbed scanner at 300 DPI minimum (600 DPI for heavily faded receipts)
  2. 2Ensure the receipt lies completely flat — curved edges produce distorted text
  3. 3Use good lighting if using a phone camera: avoid glare, shadows, and reflections
  4. 4Scan in grayscale or black-and-white for text receipts (smaller files, often better OCR)
  5. 5For thermally printed receipts (grocery, gas station): scan soon after the transaction — thermal paper fades significantly within months
  6. 6Export scanned receipts as PDF (most scanners offer this directly)

Step 2 — Run OCR to Make Text Machine-Readable

With your scanned receipt PDF ready, use LazyPDF's OCR tool to create a searchable text layer.

  1. 1Go to LazyPDF OCR tool at lazy-pdf.com/en/ocr
  2. 2Upload your scanned receipt PDF
  3. 3Select the language of the receipt text (English for US/UK receipts)
  4. 4Wait for OCR processing to complete
  5. 5Download the OCR-processed PDF
  6. 6Open the PDF and try selecting text — if OCR succeeded, you should be able to highlight and copy amounts and vendor names
  7. 7If text selection is still impossible, the scan quality may be too low — re-scan at higher resolution

Step 3 — Convert OCR-Processed PDF to Excel

With a machine-readable text layer in place, the PDF to Excel conversion can extract the tabular data.

  1. 1Go to LazyPDF PDF to Excel tool at lazy-pdf.com/en/pdf-to-excel
  2. 2Upload the OCR-processed receipt PDF
  3. 3Wait for conversion to complete
  4. 4Download the Excel file
  5. 5Open in Excel and review the extracted data
  6. 6Verify that amounts, dates, and vendor names were captured correctly
  7. 7Manually correct any OCR errors (misread digits are the most common issue)

Cleaning Up Receipt Data in Excel

Even with good OCR, receipt data often needs cleanup before it's usable for accounting: **Common OCR errors on receipts:** - `0` vs `O` (zero vs letter O) — particularly common in totals - `1` vs `l` (one vs lowercase L) — especially in amounts like `$11.00` vs `$l1.00` - `$` sign attached to numbers: `$47.50` may OCR as one cell rather than two columns - Decimal points: `$47.50` may become `$4750` if the decimal is misread **Cleanup workflow:** 1. Add a verification column next to the OCR amounts — manually enter the correct values for any suspicious amounts 2. Use Excel's data validation to flag values outside expected ranges (a meal receipt shouldn't be $5,000) 3. Add columns for: Date, Vendor, Category, Amount, Tax, Total — fill from the OCR output 4. Cross-reference totals against credit card or bank statements **For recurring receipt processing:** Create an Excel template with predefined columns matching the data you extract from receipts. Each month's receipts go into new rows in the template. This makes monthly expense tracking much more systematic.

Special Case: Handwritten Receipts

Handwritten receipts — common from small vendors, markets, and informal services — are significantly harder for OCR to process accurately. Modern OCR (including Tesseract, which powers LazyPDF's OCR) handles printed text very well but struggles with handwriting, particularly: - Cursive or stylized handwriting - Numbers written in non-standard ways (crossed 7s, European 1s) - Ambiguous amounts where digits look similar For handwritten receipts, the practical approach is: 1. Run OCR as a first pass — it may capture some data 2. Use the OCR output as a starting point and manually verify every amount 3. For bulk handwritten receipts, manual data entry may be faster than OCR+correction Alternately, photograph the receipt with a business card next to it for scale, and use a dedicated receipt scanning app (Expensify, Microsoft Office Lens, or Google Lens) which have specialized receipt-extraction AI that often outperforms general-purpose OCR for handwritten content.

Frequently Asked Questions

What's the minimum scan quality needed for reliable OCR of receipts?

300 DPI is the practical minimum. For faded thermal receipts or receipts with small print, scan at 600 DPI. Poor lighting, camera blur, and curved paper surfaces degrade OCR quality more than resolution alone — a well-lit, flat 200 DPI scan often outperforms a blurry 300 DPI scan.

Can I batch process many receipts at once?

LazyPDF processes one PDF at a time. For batch OCR and conversion of many receipts, combine all scanned receipts into one multi-page PDF first (using the merge tool), run OCR on the combined file, then convert the whole thing to Excel. You'll get all receipts as rows in one spreadsheet.

My OCR completely failed on a thermally printed receipt. Why?

Thermal paper receipts fade quickly and may have very low contrast text. This is one of the hardest cases for OCR. Try: increasing scan brightness/contrast before scanning, photographing with strong directional lighting to reveal faded ink, or using dedicated receipt scanning apps that use AI enhancement specifically designed for thermal receipts.

Is there a faster way to get receipt data into Excel?

For ongoing expense tracking, a dedicated expense app (Expensify, Rydoo, DEXT) is faster than the OCR+convert workflow. These apps use AI to extract structured data from receipt photos and sync directly to accounting software. The LazyPDF workflow is best for one-time large batches of historical receipts.

The Excel output has correct amounts but they're formatted as text, not numbers. How do I fix it?

Select the amount column, go to Data → Text to Columns, click through to finish (this triggers Excel to re-evaluate the cell format). Alternatively, use the VALUE() function to convert text to numbers: =VALUE(A1). Then remove currency symbols and commas using Find & Replace before converting.

Make your scanned receipts machine-readable with OCR, then extract the data to Excel.

Run OCR on Receipts

Related Articles