Generic way to find slug characters used in Fingerprinting and tracking

aaathemtheyzzz
Posts: 10
Joined: Wed Mar 04, 2020 10:23 pm

Post by aaathemtheyzzz »

While researching the MFT, I came across the zero-width space and possibly other invisible characters in some of my filenames. Is there a general way to find all of those odd files and folders without knowing the hexadecimal Unicode code points?
If not, what is the alternative?
Some interesting articles are:
https://www.zachaysan.com/writing/2017-12-30-zero-width-characters
https://www.researchgate.net/publication/308044170_Content-preserving_Text_Watermarking_through_Unicode_Homoglyph_Substitution
https://github.com/fastforwardlabs/steganos
void
Developer
Posts: 15352
Joined: Fri Oct 16, 2009 11:31 pm

Re: Generic way to find slug characters used in Fingerprinting and tracking

Post by void »

To find filenames with non-ASCII characters, please try searching for:
regex:[^\x00-\x7f]

To allow specific non-ASCII characters, add them after the ^.
For example, to ignore the characters óàê, please try searching for:
regex:[^óàê\x00-\x7f]
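For anyone who wants to check a list of names outside of Everything, the same character class can be sketched in Python. This is only an illustration: the function name and the sample filenames are made up, and the allowed letters are written as escapes so the snippet does not depend on how the source file is encoded.

```python
import re

# Same idea as the search above: match any character outside ASCII,
# except the explicitly allowed letters ó (U+00F3), à (U+00E0), ê (U+00EA).
NON_ASCII = re.compile("[^\u00f3\u00e0\u00ea\\x00-\\x7f]")

def has_unexpected_non_ascii(name: str) -> bool:
    """Return True if name contains a non-ASCII character that is not allowed."""
    return NON_ASCII.search(name) is not None

# Hypothetical sample names, not taken from a real file system.
names = [
    "report.txt",
    "caf\u00e9.txt",              # é is not in the allowed set
    "notes\u200b.txt",            # contains a zero-width space
    "\u00f3\u00e0\u00ea.txt",     # only allowed letters
]
flagged = [n for n in names if has_unexpected_non_ascii(n)]
```

Only the second and third names end up in `flagged`; the last one passes because all of its non-ASCII letters are in the allowed set.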

To search for a zero-width space, please try searching for:
regex:\x{200b}

To search for all four zero-width characters (zero-width space, zero-width non-joiner, zero-width joiner, and word joiner), please try searching for:
regex:[\x{200b}\x{200c}\x{200d}\x{2060}]

https://en.wikipedia.org/wiki/Zero_width
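
If the goal is to spot invisible characters without knowing their code points in advance, one possible approach outside of Everything is to ask the Unicode database for each character's category. The Python sketch below assumes that "invisible" means the Unicode format category (Cf), which includes the four zero-width characters above; the function name is hypothetical, and other invisible characters may fall into different categories.

```python
import unicodedata

def invisible_chars(name: str) -> list[tuple[str, str]]:
    """List (code point, Unicode name) for each format-category (Cf) character."""
    found = []
    for ch in name:
        # "Cf" is the Unicode general category for format characters,
        # which are normally rendered with no visible glyph.
        if unicodedata.category(ch) == "Cf":
            found.append((f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN")))
    return found

# Hypothetical filename with a zero-width space and a word joiner.
hits = invisible_chars("a\u200bb\u2060.txt")
```

The code points reported this way can then be pasted into a regex: search such as the ones above.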
NotNull
Posts: 5261
Joined: Wed May 24, 2017 9:22 pm

Re: Generic way to find slug characters used in Fingerprinting and tracking

Post by NotNull »

void wrote: Wed Mar 04, 2020 11:42 pm
To find filenames with non-ASCII characters, please try searching for:
regex:[^\x00-\x7f]
FWIW: I use
regex:"[^ -~]"
for that (it matches any character outside the range from space to tilde), as I can't remember these codes...
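
The two patterns are not quite identical: [^ -~] also matches ASCII control characters, which [^\x00-\x7f] lets through. The Python sketch below compares them on a few made-up strings. On Windows, filenames normally cannot contain control characters, so in practice both searches are likely to return the same results.

```python
import re

NON_ASCII = re.compile(r"[^\x00-\x7f]")    # anything above U+007F
NON_PRINTABLE_ASCII = re.compile(r"[^ -~]")  # anything outside space..tilde

# Hypothetical sample strings.
samples = {
    "plain": "notes.txt",
    "tab": "a\tb.txt",        # ASCII control character
    "zwsp": "a\u200bb.txt",   # zero-width space
}

# For each sample: (matched by NON_ASCII, matched by NON_PRINTABLE_ASCII)
results = {
    label: (bool(NON_ASCII.search(s)), bool(NON_PRINTABLE_ASCII.search(s)))
    for label, s in samples.items()
}
```

Only the "tab" sample is treated differently by the two patterns.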