PDF Converted to Word Has Extra Spaces — Fix It

You convert a PDF to Word expecting a clean, editable document and get something that looks like it was typed by someone hitting the spacebar randomly. Words split apart, double spaces appear between every character, paragraphs break mid-sentence, and formatting looks nothing like the original. This is one of the most common complaints about PDF-to-Word conversion, and it stems from how PDFs fundamentally store text compared to Word documents. Once you understand why it happens, the fixes become obvious.

Why PDF to Word Conversion Produces Extra Spaces

PDFs and Word documents store text completely differently. A Word document stores text as a continuous, formatted string — a paragraph is literally a paragraph object. A PDF stores text as a collection of positioned text objects, each with explicit X and Y coordinates on the page. When a conversion tool reads a PDF, it sees thousands of tiny positioned text fragments. To reconstruct a Word document, it must infer where words end, where spaces exist, and where paragraphs break — entirely from position data. This inference process is imperfect. **Causes of extra spaces:** - Adjacent text objects with a small gap between them get a space inserted, even when the gap is just kerning or letter-spacing in the original font - Ligatures (combined letter shapes like 'fi', 'fl') get broken apart during extraction and a space appears between the split characters - Variable-width fonts have tracking that the converter interprets as intentional spaces - Centered or right-aligned text uses position-based alignment in PDFs, which translates poorly to Word's paragraph alignment model **Causes of broken words:** - Hyphenated words that broke across lines in the PDF get extracted as two separate words without the hyphen - Multi-column layouts cause the converter to read across columns instead of down them - Text in tables gets extracted out of order when the table structure isn't recognized

Quick Fixes in Word After Conversion

These Word-native fixes handle the most common spacing problems quickly:

1Use Find & Replace to fix double spaces. Press Ctrl+H (Cmd+H on Mac). In the 'Find what' field, type two spaces. In the 'Replace with' field, type one space. Click 'Replace All'. Run this multiple times until it reports 0 replacements — some documents have triple or quadruple spaces that need multiple passes.
2Use Find & Replace to fix space-before-punctuation errors. Search for ' .' (space then period) and replace with '.' (just period). Repeat for ' ,' and ' ;' and ' :'. These appear when punctuation was stored as a separate text object in the PDF.
3Use Word's 'Show Formatting Marks' (Ctrl+*) to see where extra spaces, line breaks, and paragraph marks are hiding. This view makes it easy to spot and delete orphaned spaces that are invisible in normal view.
4For large documents, use a macro to run all replacements at once. Open the VBA editor (Alt+F11), paste a simple macro that runs multiple Find & Replace operations sequentially, and run it on the full document.
5Re-apply paragraph formatting after cleanup. Select all text (Ctrl+A), then re-apply the paragraph style from the Styles pane. This removes any leftover positional formatting and replaces it with clean Word paragraph structure.

Conversion Quality by PDF Type

The type of PDF significantly affects conversion quality: **Native PDFs (created digitally from Word, Excel, etc.):** These convert well. The text data in the PDF is clean, and most converters can reconstruct the Word document accurately with minimal extra spaces. **Scanned PDFs (physical documents photographed or scanned):** These convert poorly without OCR. The converter sees an image, not text. Run OCR first, then convert. Even with OCR, extra spaces are common because OCR introduces character-level errors. **PDFs created from complex layouts (magazines, brochures):** Multi-column layouts, text boxes, and artistic typography convert badly. Expect significant cleanup work. For these, consider retyping the content rather than cleaning up a broken conversion. **Password-protected PDFs:** Some converters cannot process protected PDFs and produce garbage output. Remove protection before converting.

Choosing a Better Converter to Reduce Cleanup

Not all converters are equal. The quality difference comes from how well the tool handles PDF text extraction and layout inference. For the cleanest Word output: - Use conversion tools that explicitly mention 'layout preservation' or 'document reconstruction' - LibreOffice-based converters (including LazyPDF's server-side conversion) handle most standard PDFs well - Adobe Acrobat Pro produces the best Word output for complex documents (paid tool) - Google Docs' built-in PDF import is surprisingly capable for simple documents For scanned PDFs specifically, always run OCR before conversion. Converting an image-only PDF without OCR produces an empty or nearly empty Word document.

When to Give Up on Conversion

Some PDFs are not worth converting — the cleanup time exceeds the time it would take to retype the content: - Heavily formatted brochures or magazines with complex layouts - PDFs with text rendered as curves (outlines) — these have no extractable text at all - Very old or low-quality scans where OCR accuracy is below 80% - Short documents (1–2 pages) where retyping is faster than cleaning 50+ errors For these cases, extract what you can, use the PDF as a visual reference, and type the content fresh in Word.

Frequently Asked Questions

Why does my converted Word document have a space between every single letter?

This indicates the PDF used a custom or embedded font that the converter couldn't map to standard characters. Each letter was stored as an individual positioned glyph, and the converter inserted spaces between each position. Try converting with a different tool or run the document through OCR first to extract the text as a text layer.

The first page converts fine but subsequent pages have bad spacing — why?

Multi-page PDFs sometimes switch fonts or layouts on later pages. The converter may handle the first font well but fail on a different embedded font used later in the document. This is a converter limitation. Try a different tool or process problematic pages separately.

Is there a way to convert PDF to Word with no spacing errors?

For simple, digitally-created PDFs, yes — the errors are usually minimal. For complex or scanned PDFs, some level of cleanup is always expected. The goal is choosing the best converter to minimize errors, not eliminate them entirely. Adobe Acrobat Pro comes closest to perfect for native PDFs.

Can I fix extra spaces in bulk using a script?

Yes. Python's python-docx library can process Word documents programmatically. You can write a script that runs regular expression replacements across all paragraphs in the document. This is efficient for processing many converted documents at once.

Convert your PDF to an editable Word document using our LibreOffice-powered converter — free, no signup, no watermarks.

Convert PDF to Word

Comparisons