Brian Foster wrote:
> yes (mostly) & no (just a little bit): md5sum(1) can be
> used to tell you, with a high degree of certainty, which
> files are identical. it will not, however, _prove_ any
> two files with the same MD5 checksum are identical.
For the record, FSlint¹ uses size, md5sum and sha1sum
(in that order) to identify duplicate files.
Hopefully this is robust and fast.
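
For illustration, here is a minimal Python sketch of that staged idea: group by size first, then by MD5, then by SHA-1. It is not FSlint's actual code; the names file_digest, refine and find_duplicates are made up for this example, and it assumes every path is a readable regular file.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, algorithm, chunk_size=1 << 20):
    """Hash a file in chunks so large files need not fit in memory."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def refine(groups, key):
    """Split each candidate group by key(path); drop groups of one file."""
    refined = []
    for group in groups:
        buckets = defaultdict(list)
        for path in group:
            buckets[key(path)].append(path)
        refined.extend(g for g in buckets.values() if len(g) > 1)
    return refined


def find_duplicates(paths):
    """Return lists of paths that agree on size, MD5 and SHA-1 (hypothetical helper)."""
    groups = [list(paths)]
    # Cheapest test first: files of different sizes cannot be identical.
    groups = refine(groups, os.path.getsize)
    # Only files that share a size are ever hashed.
    groups = refine(groups, lambda p: file_digest(p, "md5"))
    # A second, independent hash makes an accidental match far less likely.
    groups = refine(groups, lambda p: file_digest(p, "sha1"))
    return [sorted(g) for g in groups]
```

The ordering is the point: the size check costs one stat per file, so most files are never read at all. As Brian says, matching checksums still do not prove two files are identical; a byte-by-byte comparison of the surviving candidates (for example filecmp.cmp(a, b, shallow=False)) would.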
Pádraig.
¹http://www.pixelbeat.org/fslint/