Page 1 of 1

Duplicates in 'content:' searches

Posted: Tue Jul 31, 2018 10:27 am
by Gisle Vanem
First, a million thanks for a super program and the EveryThing SDK which I
use in my https://github.com/gvanem/EnvTool program.

To the issue: I found many forum topics related to Duplicates. But not Duplicates together with content: ...''

Like in this screen-shot:
Image

2 of the exact same syserr.cpp. Any reason for this. Except for (yikes) a bug?
I use EveryThing ver. 1.4.1.895 (x64)

PS. when used in my envtool program as c:\>envtool --evry -s sy*.cpp content:sys_errlist
it reports the duplicate like:
Matches from EveryThing:
22 Sep 2017 - 18:17:38 - 3 KB: c:\Program Files (x86)\Windows Kits\10\Source\10.0.16299.0\ucrt\misc\syserr.cpp
10 Feb 2017 - 18:40:32 - 3 KB: f:\ProgramFiler-x86\WindowsKits\Source\10.0.15063.0\ucrt\misc\syserr.cpp
2 matches found for "sy*.cpp content:sys_errlist" (1 duplicated). Totalling 8 KB (8,192 bytes).

Re: Duplicates in 'content:' searches

Posted: Tue Jul 31, 2018 11:24 am
by void
What are your results if you search for syserr.cpp?

Do you have a folder index which might be duplicating these results from Tools -> Options -> Folders?

Re: Duplicates in 'content:' searches

Posted: Tue Jul 31, 2018 12:02 pm
by Gisle Vanem
void wrote:What are your results if you search for syserr.cpp?
The same.
Do you have a folder index which might be duplicating these results from Tools -> Options -> Folders?
That seemed to be it. I had these:
  • c:\Program Files
    c:\Program Files (x86)
Removing them and selecting Rescan Now fixed it.
But, I'm not sure if it's a good idea to remove them. Isn't it true that a content:foo search is much faster when those folders are indexed?

Re: Duplicates in 'content:' searches

Posted: Tue Jul 31, 2018 1:33 pm
by void
That seemed to be it. I had these:

c:\Program Files
c:\Program Files (x86)
These folders are already indexed as NTFS volumes.
You should see your C: drive listed in Tools -> Options -> NTFS.
Isn't it true that a content:foo search is much faster when those folders are indexed?
content: will only search files that you have indexed.

Re: Duplicates in 'content:' searches

Posted: Tue Jul 31, 2018 2:14 pm
by Gisle Vanem
void wrote:content: will only search files that you have indexed.
I don't get it. Meaning I will get lots of duplicates if I really want the handy content:foo feature?

Re: Duplicates in 'content:' searches

Posted: Tue Jul 31, 2018 9:35 pm
by NotNull
Everything can search content for all the files it has in it's database.

Your files were twice in the database: once through NTFS indexing of your C:-drive (Menu:Tools > Options > NTFS) and once through folder indexing your Program Files folder (Menu:Tools > Options > Folder).
And thus found twice with your search query.

Remove the folder indexing ones and all your files on the C:-drive will still be in the database.
Now your search query will return the file once.