Tips & TricksMarch 13, 2026

PDF Metadata Privacy Guide 2026: What's Hidden in Your PDFs

Every PDF file you create, receive, or send contains invisible metadata — structured information about the document that isn't visible when reading but is accessible to anyone with a PDF reader and basic curiosity. This metadata can include the author's full name, the organization that created the document, the software used to create it, when it was created and last modified, revision history tracking, GPS coordinates in embedded photos, comments and tracked changes from collaborators, and even prior document content that was 'deleted' but not properly removed. Before sharing PDFs publicly or sending them to third parties, understanding and managing this hidden data is an important privacy and professional practice. This guide explains what PDF metadata exists, how to view it, and how to remove or sanitize it using free tools.

What Metadata Lurks in Your PDFs

PDF metadata exists at multiple levels. Document-level metadata (the most commonly known) includes title, author name, subject, keywords, creator application (e.g., 'Microsoft Word 2024'), producer application (e.g., 'Adobe PDF Library'), creation date, and modification date. This information is viewable in any PDF reader by checking File > Properties. The author name field often contains the computer account name of the person who created the file — potentially exposing a full name even when you intended to send an 'anonymous' document. XMP (Extensible Metadata Platform) is a more detailed metadata format embedded in PDFs that can include additional fields. Embedded images may contain Exif data including GPS location if taken on a smartphone. Comment and annotation metadata reveals who made which comments and when. Document version history in Microsoft Word documents converted to PDF can retain tracked changes.

  1. 1Step 1: View your PDF's metadata before sending it anywhere. In Adobe Acrobat Reader (free), go to File > Properties > Description tab to see Author, Title, Subject, Keywords, Creator, and Producer fields. In Chrome, open the PDF and check the document info. In Foxit Reader, go to File > Properties.
  2. 2Step 2: Check for embedded comments and annotations by looking at the Comments panel in Adobe Acrobat Reader. Even if comments appear hidden at normal view, they may still be present in the PDF structure and visible to recipients who check.
  3. 3Step 3: To remove all metadata, run the PDF through Ghostscript with metadata-clearing commands. The simplest approach: `gs -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -dPDFSETTINGS=/prepress -dFastWebView=false -c '[ /Title () /Author () /Subject () /Keywords () /Creator () /Producer () /DOCINFO pdfmark' -f input.pdf -sOutputFile=clean.pdf`
  4. 4Step 4: Verify metadata removal by opening the cleaned PDF and checking File > Properties again. All fields should be empty or show only the new Ghostscript producer information. Re-check for any embedded images that might retain Exif data within their image streams.

Tracked Changes and Comment History

One of the most surprising metadata privacy risks in PDFs is tracked changes and comments inherited from the source document. When you create a PDF from a Word document that has Track Changes enabled or contains comments, the resulting PDF may embed that revision history. Even if you 'accept all changes' before export, some Word-to-PDF conversion paths retain the revision data in a form readable with specialized PDF analysis tools. The safe practice: accept all changes, delete all comments, and save the Word file to a new copy without revision history (File > Info > Check for Issues > Inspect Document in Word) before converting to PDF. LibreOffice's export to PDF is generally cleaner regarding tracked changes, but best practice is to clear all revisions before export regardless of the source application.

Removing Metadata with Free Tools

Several free methods exist for stripping PDF metadata. Ghostscript (command-line) is the most thorough — as shown in the steps above, it can clear all standard metadata fields. qpdf offers a simpler approach for basic metadata removal: `qpdf --empty --pages input.pdf -- output.pdf` creates a new PDF with pages copied from the original but without document-level metadata. PDF24 Creator (free Windows app) has a built-in metadata removal option under Tools > Edit PDF Metadata. Running a PDF through LazyPDF's compress tool removes most metadata as a side effect of the Ghostscript processing — the author and revision fields are cleared in the output. For maximum metadata privacy before distributing sensitive documents publicly, combine Ghostscript metadata clearing with a pass through a PDF optimizer that rebuilds the file structure from scratch.

Professional Metadata Management Practices

For organizations concerned with document metadata as a systematic privacy issue, establishing a metadata management policy is practical. Define which metadata fields should be populated (title, keywords for searchability) and which should always be cleared before external distribution (author personal names, organization names, creation software). Set up a pre-distribution checklist that includes metadata review as a standard step. For law firms, consultancies, and agencies where client confidentiality is paramount, consider a PDF sanitization workflow for all outgoing documents — run every PDF through Ghostscript metadata cleaning before it leaves your systems. This is also good protection against accidentally disclosing vendor names, internal author names, or document version histories to opposing counsel or competitive parties.

Frequently Asked Questions

Can someone see my name on a PDF I created?

Yes, unless you explicitly remove the author metadata before sharing. Most PDF creation tools automatically populate the Author field from your computer's user account name or your installed software's registration name. In Word, this comes from your Microsoft account name. Check File > Properties in any PDF reader to see what's stored. Before sharing PDFs professionally or publicly, verify the Author field contains what you intend — or clear it entirely using the tools described in this guide.

Does compressing a PDF remove its metadata?

Partially. Ghostscript-based compression (like LazyPDF's compress tool) removes most standard metadata fields including Author, Creator, and keyword data as part of its file restructuring. However, Ghostscript adds its own Producer field identifying itself, and metadata embedded within image Exif data within the PDF's image streams is not always removed. For thorough metadata removal, explicit metadata-clearing commands (as described above) provide the most complete result.

Is PDF metadata visible to Google when PDFs are published online?

Yes. Google indexes PDF metadata including the document title field, which often appears as the search result headline instead of the filename. Author and keyword fields may also influence indexing. This is relevant both for privacy (check that Author doesn't reveal personal names you'd rather not associate with a public document) and for SEO (descriptive, keyword-rich titles in PDF metadata can improve search discoverability). Set PDF metadata intentionally before public publication.

Strip metadata from your PDF with a clean Ghostscript compression pass — free.

Compress and Clean PDF

Related Articles