PDF Split Output Files Are Too Large — Fix It
You split a large PDF into individual pages or sections expecting to get small, easily shareable files. Instead, each output file is still surprisingly large — sometimes nearly as large as the original. This happens for specific technical reasons related to how PDFs store resources, and understanding them leads directly to the fix. In most cases, you can significantly reduce the size of split PDF pages with a quick additional step.
Why Split PDF Pages Are Unexpectedly Large
When a PDF is split, pages are extracted from the original document. The page content is copied, but there's a complication: PDFs store resources (fonts, images, color profiles, embedded content) in a shared resource dictionary that can be referenced by multiple pages. **Shared resources are duplicated.** A 30-page PDF might use the same embedded font across all pages. That font data is stored once in the original. When you split to 30 individual PDFs, some split tools copy the entire resource dictionary to each output file — meaning each 1-page output contains all the fonts and resources used anywhere in the original, not just the resources used on that specific page. **High-resolution images inflate page size.** If the page contains a large embedded image (photos, scanned content), the image data carries over to the split output at full resolution. **Content streams are not cleaned up.** Some split tools perform a simple page extraction without optimizing the output — unused resources referenced in the copied resource dictionary add size without adding content. **ICC color profiles.** Professionally prepared PDFs often embed ICC color profiles for print accuracy. These can be 1–4MB each and are often copied to every split output. **The original was already an large document.** If you're splitting a 100-page scanned document where every page is a high-resolution scan, each individual page will be proportionally large — there's nothing bloated about it.
How to Reduce the Size of Split PDF Output
After splitting, compress the output files to remove shared resources and reduce image sizes:
- 1Run each split output through a PDF compressor. LazyPDF's compress tool can be used on the individual split files. The compressor removes unused resources (fonts referenced but not used on the extracted page), optimizes image data, and removes unnecessary metadata.
- 2Use a split tool that performs resource cleanup during splitting. Some tools (Ghostscript, Adobe Acrobat Pro) optimize the output during split — extracting only the resources actually used on each extracted page. This produces smaller split files without a separate compression step.
- 3If the files are large due to scanned images, compress with appropriate quality settings. For text-heavy scans, higher compression is acceptable. For photos or charts that need quality, use moderate compression to balance size and clarity.
- 4Remove embedded ICC color profiles if print accuracy isn't required. These profiles add significant size and are only relevant for professional printing workflows. Stripping them from web or email distribution versions is appropriate.
- 5For Ghostscript users, use the correct output settings: 'gs -dBATCH -dNOPAUSE -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/screen -sOutputFile=output.pdf input.pdf'. The '/screen' or '/ebook' preset applies compression suitable for digital distribution.
- 6If splitting for email attachment, consider whether splitting plus compression achieves the size you need, or whether the original document should have been created with lower image resolution from the start.
Using Compression After Split: Batch Processing
If you've split a 50-page PDF into 50 individual files and all need compression, doing them one at a time is impractical. Batch options: **LazyPDF:** Upload and compress files one by one (no batch upload currently). For a small number of files, this is manageable. **Ghostscript command line:** A one-line loop (on Unix/Mac: `for f in *.pdf; do gs -dBATCH -dNOPAUSE -sDEVICE=pdfwrite -dPDFSETTINGS=/ebook -sOutputFile="compressed_$f" "$f"; done`) compresses all PDF files in a directory. **Adobe Acrobat Pro batch processing:** The Action Wizard (Tools > Action Wizard) can apply PDF Optimizer settings to a folder of files automatically. **Python scripting:** PyPDF2 or pikepdf can process batches of PDFs programmatically, applying compression as part of a larger automated workflow.
Choosing the Right Strategy Based on Use Case
The right approach depends on why you're splitting and what you're doing with the output: **For email attachment:** Split + compress. Target under 10MB per file for standard email. Under 5MB for conservative email systems. **For web display:** Split + screen-optimized compression. 72 DPI images are sufficient for screen display. Ghostscript's '/screen' setting is appropriate. **For printing:** Split only, no compression. Maintain full image resolution (300 DPI) and embedded color profiles for print quality. **For archiving:** Keep the original split files at full quality. Storage is cheap; quality loss is permanent.
Frequently Asked Questions
Why is a 1-page split output the same size as the original 20-page PDF?
The split tool likely copied the entire resource dictionary (all fonts, color profiles, shared images) to the output without cleaning up unused resources. All the shared resources from the 20-page document are embedded in the 1-page output even though only one page's worth is used. Run the split output through a compressor to strip unused resources.
I split a scanned 100-page PDF into 100 pages — each page is 2MB. Is that normal?
Yes, that's normal for scanned documents at 300 DPI. A scanned page at 300 DPI in color is inherently large. 2MB per page is reasonable for color scans. If you need smaller files, compress after splitting, accepting some quality reduction. Black-and-white (grayscale) compression can significantly reduce scan sizes.
Does the split tool matter for output file size?
Yes. Split tools that perform resource cleanup during extraction (Ghostscript, Acrobat Pro's Extract Pages) produce smaller outputs. Tools that copy the full resource dictionary without cleanup produce outputs inflated by unused shared resources. The same 1-page extract can differ in size by 5–10x depending on the tool.
Can I split and compress in one step?
Some tools support this. Ghostscript can extract specific pages with compression settings in a single command. Adobe Acrobat Pro's Extract Pages function can optimize the output. Most simple online tools separate splitting and compression into two operations.