content: which files can be retrieved for their text content?

heijingang
Posts: 15
Joined: Thu Sep 14, 2023 12:52 pm

Post by heijingang »

I found that the content: search function retrieves text content from Word, Excel, Access, and PowerPoint files. What other file types can it retrieve text content from?
horst.epp
Posts: 1344
Joined: Fri Apr 04, 2014 3:24 pm

Post by horst.epp »

heijingang wrote: Wed Nov 15, 2023 3:09 pm
> I found that the content: search function retrieves text content from Word, Excel, Access, and PowerPoint files. What other file types can it retrieve text content from?

That depends on which iFilters are installed on your system.
Some come with Windows and others are added by installed applications.
In Everything 1.5 you can also have a content index which speeds up content searches.
The Nirsoft tool SearchFilterView shows you all installed search filters and their extension assignments.
https://www.nirsoft.net/utils/search_filter_view.html
The result looks like this:
[Image: Screenshot - 15.11.2023 , 17_24_57.png]
tuska
Posts: 933
Joined: Thu Jul 13, 2017 9:14 am

Post by tuska »

@heijingang
You can check this yourself in the Windows indexing options:
Search for: Indexing options

Click the "Advanced" button and open the "File types" tab of the "Advanced options" window. There you can see the file extensions
for which Windows has automatically selected the option "Index properties and file contents".

You can change the selection yourself or add new file extensions there.

In Everything 1.5a you can also query the Windows Search index from within Everything itself, using the si: search function,
e.g. si:searchword

-----------------
In Everything 1.5a (currently Everything 1.5.0.1359a (x64)), file contents can be indexed under Options > Content.
Pay attention to the note shown there and restrict what gets indexed:
"Text content is stored in memory.
Set content exclude filters for the best performance."

For example, I have made the following selection:
☑️ Index file content
Include only folders: D:\;C:\Everything\;C:\totalcmd
Include only files: *.csv;*.doc;*.docx;*.eml;*.ini;*.ion;*.md;*.mdb;*.msg;*.odt;*.pdf;*.pps;*.ppt;*.pptx;*.txt;*.xls;*.xlsm;*.xlsx;*.xlt;*.xltm;*.xltx
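To see which file names a semicolon-separated wildcard list like the one above would cover, here is a minimal Python sketch. It is not how Everything itself evaluates the filter. It only assumes that each entry is a simple wildcard pattern matched case-insensitively against the file name. The function names parse_filter and is_included are made up for this illustration.

```python
from fnmatch import fnmatchcase

# The "Include only files" value from the post above.
INCLUDE_ONLY_FILES = (
    "*.csv;*.doc;*.docx;*.eml;*.ini;*.ion;*.md;*.mdb;*.msg;*.odt;*.pdf;"
    "*.pps;*.ppt;*.pptx;*.txt;*.xls;*.xlsm;*.xlsx;*.xlt;*.xltm;*.xltx"
)


def parse_filter(filter_text):
    """Split a semicolon-separated wildcard list into lowercase patterns."""
    return [part.strip().lower() for part in filter_text.split(";") if part.strip()]


def is_included(filename, patterns):
    """Return True if the file name matches at least one pattern.

    Assumption: matching is case-insensitive, so both sides are lowercased
    and compared with fnmatchcase (which does no platform-specific folding).
    """
    name = filename.lower()
    return any(fnmatchcase(name, pattern) for pattern in patterns)


patterns = parse_filter(INCLUDE_ONLY_FILES)
```

For example, is_included("Report.DOCX", patterns) is true, while is_included("photo.jpg", patterns) is false.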

Keep an eye on Everything's memory usage in the Task Manager (Ctrl+Shift+Esc):
Content indexing in Everything is designed for indexing less than 1GB of text.
Content Indexing in Everything 1.5
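
Before enabling the content index, it can help to get a rough idea of how much data the chosen folders and file types add up to. The Python sketch below sums the on-disk size of matching files. This is only an upper-bound style estimate and an assumption on my part: the size on disk is not the same as the amount of extracted text (a PDF or DOCX usually yields much less text than its file size). The name total_matching_bytes is hypothetical, not part of Everything.

```python
import os
from fnmatch import fnmatchcase

ONE_GB = 1024 ** 3  # the "less than 1GB of text" guideline quoted above


def total_matching_bytes(folders, patterns):
    """Sum the sizes of all files under `folders` whose names match a pattern.

    `patterns` are lowercase wildcards such as "*.txt"; names are lowercased
    before matching. Files that cannot be read are skipped.
    """
    total = 0
    for folder in folders:
        for root, _dirs, files in os.walk(folder):
            for name in files:
                if any(fnmatchcase(name.lower(), p) for p in patterns):
                    try:
                        total += os.path.getsize(os.path.join(root, name))
                    except OSError:
                        pass  # file vanished or access denied
    return total


# Example call (paths taken from the post above):
# size = total_matching_bytes(["D:\\", "C:\\Everything\\"], ["*.txt", "*.md"])
# print(size, "bytes;", "over 1GB" if size >= ONE_GB else "under 1GB")
```

If the total is far above 1GB, narrowing the folder or file-type list is probably a good idea.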

-----------------
Everything 1.4:
"Yes, "Everything" can search file content with the content: search function.
File content is not indexed, searching content is slow".
-----------------
 
2023-11-15_Content Indexing.png
2023-11-15_Content Indexing.png (49.83 KiB) Viewed 1497 times
heijingang
Posts: 15
Joined: Thu Sep 14, 2023 12:52 pm

Post by heijingang »

Thanks, horst.epp and tuska.