Duplicate search

Discussion related to "Everything" 1.5 Alpha.
harryray2
Posts: 1056
Joined: Sat Oct 15, 2016 9:56 am

Duplicate search

Post by harryray2 »

Is it possible to find duplicates that aren't exactly the same? Possibly some sort of 'fuzzy' search.
For example:

Freedailymotiondownload.exe 1.2.3.4
Freedailymotiondownload.exe 2.3.4.5
Freedailymotiondownload.exe

or:

test Freedailymotiondownload.exe 1.2.3.4
tester Freedailymotiondownload.exe 1.2.3.4
testing Freedailymotiondownload.exe.1234
void
Developer
Posts: 15675
Joined: Fri Oct 16, 2009 11:31 pm

Re: Duplicate search

Post by void »

No.

I will consider adding support to find duplicates with a fuzzy match.

Thank you for the suggestion.



For now, please try the following search:

regex:\b([a-z0-9_\.\-]+\.exe) addcolumn:a a:=$1: dupe:column-a

This will find files where <word>.exe is duplicated.
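As a rough illustration of what that search groups on, here is a minimal Python sketch run outside Everything. It assumes case-insensitive matching and a case-insensitive comparison of the captured text (neither is confirmed above), and the file list and the `group_by_exe_word` helper are hypothetical.

```python
import re
from collections import defaultdict

# Same pattern as the search above. IGNORECASE is an assumption about
# how the search is evaluated, not something stated in the post.
PATTERN = re.compile(r"\b([a-z0-9_.\-]+\.exe)", re.IGNORECASE)


def group_by_exe_word(names):
    """Group names by the captured <word>.exe and keep groups of 2 or more."""
    groups = defaultdict(list)
    for name in names:
        match = PATTERN.search(name)
        if match:
            # Lower-casing the key is an assumption (case-insensitive dupes).
            groups[match.group(1).lower()].append(name)
    return {key: members for key, members in groups.items() if len(members) > 1}


# Hypothetical file names, partly taken from the question.
names = [
    "Freedailymotiondownload.exe 1.2.3.4",
    "Freedailymotiondownload.exe 2.3.4.5",
    "Freedailymotiondownload.exe",
    "test Freedailymotiondownload.exe 1.2.3.4",
    "notepad.exe",
]
duplicates = group_by_exe_word(names)
```

Every name containing the same `<word>.exe` lands in one group, so exact and near-exact names are not told apart.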
harryray2
Posts: 1056
Joined: Sat Oct 15, 2016 9:56 am

Re: Duplicate search

Post by harryray2 »

Thanks David, a fuzzy search would be brilliant.

Is there a way of refining this so that it only shows results that are not exact duplicates?
void
Developer
Posts: 15675
Joined: Fri Oct 16, 2009 11:31 pm

Re: Duplicate search

Post by void »

No.
harryray2
Posts: 1056
Joined: Sat Oct 15, 2016 9:56 am

Re: Duplicate search

Post by harryray2 »

Is anyone in Everything land aware of a (preferably free) programme with a fuzzy search that can find dupes that aren't exact, please?

Thanks.
therube
Posts: 4684
Joined: Thu Sep 03, 2009 6:48 pm

Re: Duplicate search

Post by therube »

Depends how you define "fuzzy".

regex:freedaily.*\.exe.*\d{0,}

would be sufficient to find:

Freedailymotiondownload.exe 1.2.3.4
Freedailymotiondownload.exe 2.3.4.5
Freedailymotiondownload.exe
test Freedailymotiondownload.exe 1.2.3.4
tester Freedailymotiondownload.exe 1.2.3.4
testing Freedailymotiondownload.exe.1234
If you make your search lax enough to find all permutations of what you want,
but specific enough not to find anything you do not want, your search is effectively "fuzzy".
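A quick way to check the lax-versus-specific balance is to run the pattern over sample names. This is only a sketch in Python rather than in Everything, assuming case-insensitive matching; `FreeDailyNews.exe` is a made-up name added to show an unwanted hit.

```python
import re

# The regex from this post. IGNORECASE is an assumption about the search.
pattern = re.compile(r"freedaily.*\.exe.*\d{0,}", re.IGNORECASE)

names = [
    "Freedailymotiondownload.exe 1.2.3.4",
    "Freedailymotiondownload.exe 2.3.4.5",
    "Freedailymotiondownload.exe",
    "test Freedailymotiondownload.exe 1.2.3.4",
    "tester Freedailymotiondownload.exe 1.2.3.4",
    "testing Freedailymotiondownload.exe.1234",
]

# Names the pattern finds anywhere in the string.
matched = [name for name in names if pattern.search(name)]

# Hypothetical unrelated file: it also matches, so the pattern is too lax
# if such files exist on disk.
unwanted_hit = pattern.search("FreeDailyNews.exe") is not None
```

Note that `\d{0,}` also matches zero digits, so the trailing version number is optional.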


regex:\.exe.*\d{0,} name:seamonkey
or
regex:\.exe.*\d{0,} name:seamonkey dupe:size
or
regex:\.exe.*\d{0,} name:seamonkey dupe:sha1

all return:

seamonkey-2.1b2pre.en-US.win32.installer-0128.exe
seamonkey-2.1b2pre.en-US.win32.installer.exe.ADS
(ADS here means the file came down with an Alternate Data Stream,
whereas the other "version" did not.)
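To show the idea behind a content-hash comparison, where differently named files count as duplicates if their bytes are identical, here is a minimal Python sketch. It is not how Everything implements `dupe:sha1`; the `group_by_sha1` helper and the in-memory contents are hypothetical stand-ins for real files.

```python
import hashlib
from collections import defaultdict


def group_by_sha1(files):
    """Group names whose contents share a SHA-1 digest; keep groups of 2 or more."""
    groups = defaultdict(list)
    for name, content in files.items():
        groups[hashlib.sha1(content).hexdigest()].append(name)
    return [sorted(members) for members in groups.values() if len(members) > 1]


# Hypothetical in-memory contents standing in for the files on disk.
files = {
    "seamonkey-2.1b2pre.en-US.win32.installer-0128.exe": b"installer bytes",
    "seamonkey-2.1b2pre.en-US.win32.installer.exe.ADS": b"installer bytes",
    "unrelated.exe": b"other bytes",
}
same_content = group_by_sha1(files)
```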
kazzybash
Posts: 98
Joined: Mon Mar 02, 2020 9:55 pm

Re: Duplicate search

Post by kazzybash »

harryray2 wrote: Sun Aug 20, 2023 9:15 am Is there anyone in Everything land that's aware of any (preferably free) programme that has a fuzzy search to find dupes that aren't exact please?
Maybe AllDup can do what you are looking for (it has a function for images, but I am not sure whether it has one for other file types).

regards, kazzy