How to Compress a Scanned PDF Without Making It Unreadable
Scanned PDFs are harder to compress than text-based PDFs because each page is usually an image. A normal report may shrink by removing unused objects, but a scan usually needs image downsampling or recompression.
The useful rule: compress scanned PDFs by reducing image weight carefully, then check whether names, dates, stamps, signatures, and small print are still readable.
BaseToolbox's PDF compressor helps you reduce PDF size in the browser, which is useful when a scan contains IDs, contracts, invoices, medical forms, or other files you do not want to upload.
Why Scanned PDFs Get So Large
A scanned PDF may contain one full-page image per page. If the scanner used high resolution, color mode, and low compression, even a short document can become large.
Common size drivers include:
- 300-600 DPI color scans
- Photos embedded as full pages
- Empty margins saved as image pixels
- Multiple scans of the same page
- OCR text layers plus original images
- Flatbed scanner defaults meant for archiving
The file is not large because the text is long. It is large because every page behaves like a picture.
Decide What Quality You Need
Before compressing, decide how the PDF will be used.
| Use case | Quality priority |
|---|---|
| Email attachment | Readable on screen, smaller file size |
| Legal or HR record | Preserve signatures, stamps, dates |
| Print later | Keep enough resolution for paper output |
| Upload portal limit | Meet size cap without losing key fields |
| Archive copy | Keep original and make a smaller sharing copy |
Do not overwrite the only high-quality copy until you have checked the compressed result.
What to Check After Compression
Open the compressed file and inspect:
- The smallest text on each page.
- Stamps, signatures, checkboxes, and handwritten notes.
- Page order and rotation.
- Color-coded highlights or annotations.
- OCR search, if the original had searchable text.
- File size compared with the upload limit.
Zooming out is not enough. A scan can look fine at page width and fail when printed or reviewed closely.
Color, Grayscale, and Black-and-White
Color scans are usually larger than grayscale. Grayscale scans are usually larger than black-and-white. But switching modes can remove information.
Use black-and-white only when the document is simple text and the result remains readable. Use grayscale for many forms and contracts. Keep color when stamps, photos, IDs, annotations, or colored marks matter.
For official submissions, keep a copy of the original scan. Some portals accept smaller files, but that does not mean every visual detail survived.
Local Compression and Privacy
Scans often contain personal data. A remote compressor may receive the full document if processing happens on a server. Local browser compression reduces that exposure.
This matters for:
- Passports and IDs
- Signed contracts
- Insurance forms
- Bank statements
- School records
- Medical or payroll documents
If a document would be sensitive as an image, it is still sensitive as a PDF.
When Compression Is Not Enough
If the file is still too large, remove duplicate pages, crop blank borders, rescan at a more appropriate resolution, or split the PDF into sections. A strict 100 KB limit may be unrealistic for multi-page scans with photos.
When a portal rejects the file, check whether it limits file size, page count, file type, or encrypted PDFs. The message "too large" is not always the only constraint.
Common Mistakes
The most common mistake is compressing the only copy and losing detail. Always export a smaller copy and keep the original unchanged.
Another mistake is checking only the first page. Scans often have one unusually faint page, one sideways page, or one page with handwriting that fails after compression.
Finally, do not assume a smaller file is always better. For documents that must be reviewed by a human, a slightly larger readable PDF is better than a tiny file that gets rejected.
FAQ
Why did my scanned PDF stay large?
The pages may still be high-resolution images. Text cleanup alone does little when the image data dominates.
Can compression break OCR?
It can. If the PDF has a searchable text layer, confirm search still works after compression.
Should I delete the original?
No. Keep the original scan until the compressed copy has been accepted and reviewed.
Ready to try it yourself?
Put what you have learned into practice with our free online tool.
Compress a PDF Locally