rclone: ADD deduplicate Dropbox files based on checksum ignoring name

use case:

I use Dropbox (because I’m masochistic). Dropbox likes to append (1), (2) etc to filenames when merging 2 folders with identical files. I can’t just delete everything with (1) etc in their names, because it might not be a duplicate anymore.

I would like to rclone dedup those folders by using the checksums of each files, ignoring names, but keeping the shortest name (so it would prefer file.mp4 over file (1).mp4)

About this issue

  • Original URL
  • State: closed
  • Created 7 years ago
  • Comments: 23 (12 by maintainers)

Commits related to this issue

Most upvoted comments

I’ve merged the --by-hash flag to master now which means it will be in the latest beta in 15-30 mins and released in v1.54

does anyone have a solution?

You could use something like this as a start

rclone lsf -R --files-only --hash DropboxHash --format hp dropbox: | sort | uniq -D -w 64

which will print all files with the same content hash

will look into this as I am also interested into adding some fslint-alike functions

This would be nice implemented either as part of the rclone dedupe command or as a new command.