Request, dup file functions.

Have a suggestion for "Everything"? Please post it here.
Post Reply
Janus
Posts: 84
Joined: Mon Nov 07, 2016 7:33 pm

Request, dup file functions.

Post by Janus »

Requesting something simple sounding, but may be complex to implement, I don't know.

A command such that duplicate files can be detected by various parameters.

dup:name:7 => sets of 7 or more files with matching names.

dup:size:1024:11 => sets of 11 or more files with matching size.

dup:datetime:exact:13 => sets of 13 or more files with exact same date time stamp.

dup:datetime:minute:17 => sets of 17 or more files with the same date time stamp, one minute resolution.

dup:datetime:hour:19 => sets of 19 or more files with the same date time stamp, one hour resolution.

You get the idea.
Not sure how to implement, but would be a tremendous help.


Janus.

Edited to fix my speling.
therube
Posts: 4580
Joined: Thu Sep 03, 2009 6:48 pm

Re: Request, dup file functions.

Post by therube »

What type of use case do you have for wanting something like that?


Possibly, something like that may be of interest.
Like now, I have backups stored on an offline disk, that will indexed "skew" dupe: sizedupe: results - depending on what you're looking to do. (As in, if you wanted dupe: sizedupe: exclusive of "offlinevol", you could set a, in your terminology, :3, so that would account for online, offline, plus one more ... eh, no, that wouldn't really work, would it?

(Yes, I no I can !offlinevol, but still results are skewed, because then you will get 1 lone file on the list that there would be no apparent match for... Or you could do a separate index for offlinevol... Or...
Janus
Posts: 84
Joined: Mon Nov 07, 2016 7:33 pm

Re: Request, dup file functions.

Post by Janus »

First use.
Call everything from an external program with a search.
Parse the list that already has the dups in some order.
Do actual bin/txt compares from this external program.

Second use.
I have a lot of projects on a server, which I do.
I want to see how many files are duplicated between them.

Third use.
I have multiple collections of files I am copying onto a system.
I want to see how many files are the same so I can move them out of the way first.
That lets me concentrate on the files with differences.

Perhaps filter further with !dup:name:2 to list only unique names.

Fourth use.
dup:*.c:2 & dup:size to list all C source files that have the same name and size.
This lets me see how many times I have used or reused a file without changing it.

I could go on, but I believe I have illustrated my point.


Janus.
Post Reply