DupShelf

DupShelf

Find duplicate photos on an external hard drive

External hard drives are where years of phone backups, wedding folders, and old laptop copies accumulate—often with the same album copied twice before a trip or after a restore. DupShelf scans a folder on the connected drive locally in Chrome or Edge: SHA-256 exact matching, review-first moves, no upload to the cloud. Whether the drive is a 2 TB USB HDD from 2015 or a modern SSD, the workflow is identical: mount, pick folder, hash, review, quarantine, verify, delete.

Why external drives collect duplicates

Exact matching finds byte-identical files even when folder names differ.

  • Copy-paste of the same folder twice “just in case”
  • Old phone backups merged without deduping
  • Sync tools leaving Copy of… next to originals
  • Family members dropping the same ZIP on a shared drive

Before you scan

Plug in the drive, wait until it mounts (Windows letter or Mac volume). Pick the top-level photo library folder—not the entire drive if it also holds movies and installers. USB 2.0 readers are slower; hashing may take hours on multi-TB archives but accuracy is the same.

Speed tips for large HDDs

Copy the hottest photo subtree to an internal SSD, scan there, then apply moves back on the HDD if needed—or move duplicates on the drive directly when write access works. Close other disk-heavy apps during scan.

After duplicates are found

Move non-keepers to dupshelf-duplicate-images on that drive, verify in Explorer or Finder with large icons, delete quarantine when sure. Keepers stay at original paths. Never delete on an only copy of irreplaceable data without a second backup.

Read-only mounts and CSV fallback

Some NAS or reader mounts are read-only in the browser. Export CSV, delete or move manually using the path list DupShelf provides.

Multi-terabyte family archives

When a drive holds decades of phone backups, scan by top-level decade or family member first. Collapse exact copies within each branch, then consider merging branches for a final pass. Document which branch you already cleaned so relatives do not repeat work.

Preparing a drive to sell or hand off

Before selling an old drive, dedup personal photos so you do not hand a stranger duplicate family albums—and so you know what to wipe securely. Scan, move quarantine, verify, then use your preferred secure erase tool on the whole volume. If handing to a relative, dedup first so they inherit a lean library. Label the drive with a simple README.txt: dedup date, which folders were scanned, and whether quarantine was emptied.

Thunderbolt and USB-C enclosures

Modern enclosures are fast enough to scan in place on the drive. Older 2.5" spinning drives benefit from copying active projects to NVMe first. Listen for click-of-death sounds—dedup is not repair; clone failing drives before scan.

Labeling drives after dedup

Rename volume or add a README on the drive noting dedup date. Future you will know the archive was cleaned.

SMR and NAS overnight scans

Start long scans overnight on NAS or SMR drives. Cancel if the network drops; rescan when stable.

Summary and next steps

External drives are long-term memory. Dedup before labeling the drive and handing it to family. Keep two drives until you verify quarantine delete on the first. Scan by major branch on multi-TB archives to manage time. Note scan date on a text file on the drive root. When upgrading from HDD to SSD, dedup before clone to avoid copying garbage bytes faster. NAS users should scan from a machine with stable SMB mount. If the drive is your only copy, clone first—DupShelf moves, it does not magically create backups. Future you will forget what was cleaned; write it down. Scan before handoff to family or before selling a drive. Label drives with dedup date.

Questions

Is it safe for my only backup?
DupShelf moves, it does not delete. Still clone the drive before aggressive cleanup on irreplaceable archives.
NTFS vs exFAT vs APFS?
If Chrome can read the folder, scanning works. exFAT is common for shared family drives.
Can I scan 2 TB at once?
Yes if files are reachable; expect long hash times. Scan largest folders first.
Will sleep mode break the scan?
Keep the laptop awake and plugged in during long jobs.
SMR drives slow?
Yes. SSD staging helps.
Multiple partitions?
Scan each volume’s photo folder separately.
Duplicate scan before selling a drive?
Dedup then securely erase personal data before sale—DupShelf only removes identical extras you approve.

Related guides