
DuplicateFF

Archived reference architecture for a duplicate-file finder

PowerShell · Source-first delivery: browse code, README, and release notes on GitHub.
Updated Apr 26, 2026 · Latest release: no tag yet; the README is the clearest project overview right now.


README

Cached at build time, cleaned up for in-site reading, and linked back to the canonical GitHub source.

DuplicateFF v1.0.0


Professional duplicate file finder with a progressive hashing pipeline for terabyte-scale scanning. Built with PowerShell and WPF, styled with the Catppuccin Mocha dark theme.

[Screenshot of DuplicateFF]

Features

  • Progressive Hashing Pipeline - 5-stage elimination (enumeration, size grouping, prefix hash, suffix hash, full SHA-256) minimizes disk I/O
  • Reference Folders - Mark folders as protected; duplicates will never be selected from these locations
  • File Type Filters - Images, Videos, Audio, Documents, or All Files
  • Image Preview - Inline preview panel for visual verification before deletion
  • Auto-Select Rules - Keep Newest, Oldest, From Reference Folders, Largest, or Shortest Path
  • Safe Deletion - Move to Recycle Bin (default), Permanent Delete, or Replace with Hardlinks
  • CSV Export - Full results export with hash values, groups, and file metadata
  • Async Scanning - Non-blocking UI with real-time progress and cancellation support
  • Dark Theme - Catppuccin Mocha with premium UI styling
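
For context on the Safe Deletion options above: sending a file to the Recycle Bin from PowerShell is commonly done through the Microsoft.VisualBasic assembly. This is a hedged sketch of that technique (Windows-only), not DuplicateFF's actual implementation; the function name is illustrative.

```powershell
# Illustrative Recycle Bin delete; not DuplicateFF's own code.
Add-Type -AssemblyName Microsoft.VisualBasic

function Remove-ToRecycleBin {
    param([Parameter(Mandatory)][string]$Path)
    # SendToRecycleBin makes this recoverable, unlike Remove-Item,
    # which deletes permanently.
    [Microsoft.VisualBasic.FileIO.FileSystem]::DeleteFile(
        $Path,
        [Microsoft.VisualBasic.FileIO.UIOption]::OnlyErrorDialogs,
        [Microsoft.VisualBasic.FileIO.RecycleOption]::SendToRecycleBin)
}
```

The hardlink mode listed above would instead replace a duplicate with a link to the kept file, reclaiming space without removing the path.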

Usage

.\DuplicateFF.ps1
  1. Click Add Folder to add directories to scan
  2. Optionally add Reference Folders (protected from deletion)
  3. Set filters (min size, file type, subfolders)
  4. Click Scan for Duplicates
  5. Review results, use auto-select or manual checkbox selection
  6. Choose delete mode and click Delete Selected
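
As an illustration of step 5, an auto-select rule such as Keep Newest reduces to picking one survivor per duplicate group and marking the rest. A sketch under assumed names ($duplicateGroups is hypothetical, not from the app):

```powershell
# $duplicateGroups is assumed to be a collection of arrays of FileInfo
# objects, one array per group of identical files.
# Keep the newest file in each group; everything else is a delete candidate.
$toDelete = foreach ($group in $duplicateGroups) {
    $group | Sort-Object LastWriteTime -Descending | Select-Object -Skip 1
}
```

Keep Oldest, Largest, or Shortest Path would swap the sort property or direction.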

How It Works

The progressive hashing pipeline avoids reading entire files whenever possible:

Stage | Action                       | Typical elimination
1     | Enumerate files with filters | N/A
2     | Group by file size           | ~70% of files
3     | SHA-256 of first 4 KB        | ~15% more
4     | SHA-256 of last 4 KB         | ~5% more
5     | Full SHA-256 hash            | Final confirmation

Only files that survive every earlier stage are fully hashed, keeping scans fast even on large datasets.
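
The early stages above can be sketched in PowerShell roughly as follows. This is an illustration of the technique, not DuplicateFF's actual code; Get-PartialHash, $scanRoot, and the loop structure are all assumed names.

```powershell
# Hash only the first (or last) $Bytes of a file, avoiding a full read.
function Get-PartialHash {
    param([string]$Path, [int]$Bytes = 4KB, [switch]$FromEnd)
    $stream = [System.IO.File]::OpenRead($Path)
    try {
        if ($FromEnd -and $stream.Length -gt $Bytes) {
            $null = $stream.Seek(-$Bytes, [System.IO.SeekOrigin]::End)
        }
        $buffer = New-Object byte[] $Bytes
        $read   = $stream.Read($buffer, 0, $Bytes)
        $sha    = [System.Security.Cryptography.SHA256]::Create()
        [System.BitConverter]::ToString($sha.ComputeHash($buffer, 0, $read))
    }
    finally { $stream.Dispose() }
}

# Stage 2: group by size; only groups of 2+ files can contain duplicates.
$candidates = Get-ChildItem -Path $scanRoot -Recurse -File |
    Group-Object Length | Where-Object Count -gt 1

# Stage 3: within each size group, regroup by prefix hash.
foreach ($group in $candidates) {
    $prefixGroups = $group.Group |
        Group-Object { Get-PartialHash $_.FullName } |
        Where-Object Count -gt 1
    # Stages 4-5 (suffix hash via -FromEnd, then Get-FileHash for the
    # full SHA-256) repeat the same group-and-filter pattern.
}
```

Because each stage only regroups the survivors of the previous one, most files are dismissed after a cheap size comparison or a single 4 KB read.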

Research

See Building a professional duplicate file finder: A technical guide for the research behind this tool, covering algorithm selection, perceptual hashing for AI upscale detection, and performance architecture.

License

MIT License

Read on GitHub → github.com/SysAdminDoc/DuplicateFF