Last updated
Duplicate File Finder Examples
The Duplicate File Finder identifies files with identical content using cryptographic hashing. Below are examples of what the finder detects and how results are presented.
Duplicate Photo Detection
Uploaded files: 150 photos from a camera import
Scan complete: 150 files analyzed
Duplicates found: 3 groups (8 duplicate files)
Potential space savings: 24.6 MB
Group 1 — 3 identical files (8.2 MB each):
✓ KEEP /Photos/2024/vacation/IMG_4521.jpg (newest, Mar 15 2024)
✗ DUPE /Photos/backup/IMG_4521.jpg (Mar 10 2024)
✗ DUPE /Downloads/IMG_4521.jpg (Feb 28 2024)
Group 2 — 2 identical files (6.1 MB each):
✓ KEEP /Photos/2024/vacation/IMG_4522.jpg (newest)
✗ DUPE /Photos/backup/IMG_4522.jpg
Group 3 — 3 identical files (4.1 MB each):
✓ KEEP /Documents/logo.png (newest)
✗ DUPE /Downloads/logo.png
✗ DUPE /Desktop/logo-copy.png
Document Folder Scan
Scan complete: 320 files analyzed
Duplicates found: 5 groups
Group 1 — report.pdf (2.4 MB):
/Documents/reports/Q4-2024-report.pdf
/Desktop/Q4-2024-report.pdf
/Downloads/Q4-2024-report.pdf
Group 2 — contract.docx (180 KB):
/Documents/contracts/service-agreement.docx
/Documents/archive/service-agreement.docx
Group 3 — config.json (4 KB):
/projects/app-v1/config.json
/projects/app-v2/config.json ← same content, different directories
Hash-Based Comparison
Two files with different names but identical content:
File A: logo-final.png (SHA-256: a3f8c2d1...)
File B: logo-final-copy.png (SHA-256: a3f8c2d1...)
Result: DUPLICATE — byte-for-byte identical content
Size: 45,312 bytes each
Action: Mark "logo-final-copy.png" for deletion
Two files with the same name but different content:
File A: config.json in /project-v1/ (SHA-256: b7e4a1c2...)
File B: config.json in /project-v2/ (SHA-256: d9f2b3e4...)
Result: NOT DUPLICATE — different content despite same filename
Code Repository Scan
Scan: /src directory (1,240 files)
Duplicates found: 4 groups
Group 1 — utils.js (identical in 3 locations):
/src/components/utils.js
/src/helpers/utils.js
/src/lib/utils.js
Recommendation: Consolidate into a single shared module
Group 2 — logo.svg (identical in 2 locations):
/src/assets/logo.svg
/public/images/logo.svg
Recommendation: Keep /public/images/logo.svg, update imports
Group 3 — .eslintrc.json (identical in 2 locations):
/src/.eslintrc.json
/src/components/.eslintrc.json
Recommendation: Remove the nested copy, use root config
Scan Statistics
Files scanned: 1,240
Unique files: 1,228
Duplicate groups: 4
Duplicate files: 12
Space wasted: 8.4 MB
Space recoverable: 6.2 MB (keeping one copy per group)
Hashing algorithm: SHA-256
Scan duration: 2.3 seconds
Pre-filter by size: Eliminated 890 files before hashing
Handling Options
- Mark for deletion — flag duplicates for manual review before removing
- Move to folder — relocate duplicates to a /duplicates/ folder for review
- Generate report — export a CSV or text report of all duplicate groups
- Auto-keep newest — automatically mark the most recently modified copy as the keeper
- Auto-keep by path — keep copies in a specified preferred directory
Common Use Cases
- Cleaning up photo libraries after multiple imports from the same camera
- Removing duplicate downloads from the Downloads folder
- Finding accidental file copies in document folders
- Identifying duplicate assets in code repositories
- Auditing backup folders for redundant copies
- Reclaiming storage space on full drives
Upload files or specify a directory to scan. The finder groups identical files by hash, shows file paths and sizes, and provides safe options for handling duplicates without risking accidental data loss.