SOLVED: How to find: Blank Char / Invisible Character AND Hidde Char

General discussion related to "Everything".
Post Reply
Debugger
Posts: 565
Joined: Thu Jan 26, 2017 11:56 am

SOLVED: How to find: Blank Char / Invisible Character AND Hidde Char

Post by Debugger »

This document contains characters... (Current Encoding: 1250).


How to find/DETECT a strange character?
This is strange for me because it is a simple text in Polish, so the 1250 encoding is 100% correct.
Last edited by Debugger on Tue Mar 26, 2019 12:06 pm, edited 3 times in total.
void
Developer
Posts: 15096
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to find a strange character?

Post by void »

To search for any character with PCRE regex not in code page 1250:

Code: Select all

[^\x{0000}\x{0001}\x{0002}\x{0003}\x{0004}\x{0005}\x{0006}\x{0007}\x{0008}\x{0009}\x{000a}\x{000b}\x{000c}\x{000d}\x{000e}\x{000f}\x{0010}\x{0011}\x{0012}\x{0013}\x{0014}\x{0015}\x{0016}\x{0017}\x{0018}\x{0019}\x{001a}\x{001b}\x{001c}\x{001d}\x{001e}\x{001f}\x{0020}\x{0021}\x{0022}\x{0023}\x{0024}\x{0025}\x{0026}\x{0027}\x{0028}\x{0029}\x{002A}\x{002B}\x{002C}\x{002D}\x{002E}\x{002F}\x{030}\x{0031}\x{0032}\x{0033}\x{0034}\x{0035}\x{0036}\x{0037}\x{0038}\x{0039}\x{003A}\x{003B}\x{003C}\x{003D}\x{003E}\x{003F}\x{040}\x{0041}\x{0042}\x{0043}\x{0044}\x{0045}\x{0046}\x{0047}\x{0048}\x{0049}\x{004A}\x{004B}\x{004C}\x{004D}\x{004E}\x{004F}\x{050}\x{0051}\x{0052}\x{0053}\x{0054}\x{0055}\x{0056}\x{0057}\x{0058}\x{0059}\x{005A}\x{005B}\x{005C}\x{005D}\x{005E}\x{005F}\x{060}\x{0061}\x{0062}\x{0063}\x{0064}\x{0065}\x{0066}\x{0067}\x{0068}\x{0069}\x{006A}\x{006B}\x{006C}\x{006D}\x{006E}\x{006F}\x{070}\x{0071}\x{0072}\x{0073}\x{0074}\x{0075}\x{0076}\x{0077}\x{0078}\x{0079}\x{007A}\x{007B}\x{007C}\x{007D}\x{007E}\x{007F}\x{0AC}\x{201A}\x{201E}\x{2026}\x{2020}\x{2021}\x{2030}\x{0160}\x{2039}\x{015A}\x{0164}\x{017D}\x{0179}\x{018}\x{2019}\x{201C}\x{201D}\x{2022}\x{2013}\x{2014}\x{2122}\x{0161}\x{203A}\x{015B}\x{0165}\x{017E}\x{017A}\x{0A0}\x{02C7}\x{02D8}\x{0141}\x{00A4}\x{0104}\x{00A6}\x{00A7}\x{00A8}\x{00A9}\x{015E}\x{00AB}\x{00AC}\x{00AD}\x{00AE}\x{017B}\x{0B0}\x{00B1}\x{02DB}\x{0142}\x{00B4}\x{00B5}\x{00B6}\x{00B7}\x{00B8}\x{0105}\x{015F}\x{00BB}\x{013D}\x{02DD}\x{013E}\x{017C}\x{154}\x{00C1}\x{00C2}\x{0102}\x{00C4}\x{0139}\x{0106}\x{00C7}\x{010C}\x{00C9}\x{0118}\x{00CB}\x{011A}\x{00CD}\x{00CE}\x{010E}\x{110}\x{0143}\x{0147}\x{00D3}\x{00D4}\x{0150}\x{00D6}\x{00D7}\x{0158}\x{016E}\x{00DA}\x{0170}\x{00DC}\x{00DD}\x{0162}\x{00DF}\x{155}\x{00E1}\x{00E2}\x{0103}\x{00E4}\x{013A}\x{0107}\x{00E7}\x{010D}\x{00E9}\x{0119}\x{00EB}\x{011B}\x{00ED}\x{00EE}\x{010F}\x{111}\x{0144}\x{0148}\x{00F3}\x{00F4}\x{0151}\x{00F6}\x{00F7}\x{0159}\x{016F}\x{00FA}\x{0171}\x{00FC}\x{00FD}\x{0163}\x{02D9}]
Debugger
Posts: 565
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

He finds nothing.
I tested in Notepad++ and EmEditor.

Image





Image
tuska
Posts: 908
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska »

Debugger wrote: Sat Mar 16, 2019 7:03 am He finds nothing.
I tested in ... EmEditor.
Any character with PCRE regex not in code page 1250.png
Any character with PCRE regex not in code page 1250.png (28.32 KiB) Viewed 5838 times
Debugger
Posts: 565
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

Your screen presents a differently configured Emeditor interface, or a version other than the latest version of EmEditor (Pro version)

A strange symbol can also be something that can not be seen.

I have no idea why can not save text with the default encoding.
Which symbol, unicode, special character, it makes it impossible and how you save eg without UTF-8 you will get the name

Code: Select all

??
tuska
Posts: 908
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska »

Debugger wrote:Your screen presents a differently configured Emeditor interface, or a version other than the latest version of EmEditor (Pro version)
That's just the find toolbar (menu "View" - "Toolbars"...) - I use EmEditor Pro Version 18.6.91 x64.

If I want to save a file with special characters (as shown in the picture) as .txt file,
then "Save as Unicode (UTF-16LE with signature)" is automatically suggested to me...
The storage in this format allows me to open the text file without any problems and later on
saving without any further request.
Debugger
Posts: 565
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

I understand. Filter Toolbar
but he still does not detect the strange char
I want to save as 1250 - for the Polish text. I do not need any UTF-16LE.
I also tried online tools to detect strange characters, but they also can not detect strange char.


Detects the Polish text, but does not detect strange characters.
[a-żA-Ż ?,:]
tuska
Posts: 908
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska »

Debugger wrote:I understand. Filter Toolbar
No, I meant the "Find Toolbar"! -> Menu "View" - "Toolbars" - "Find Toolbar"

Did you paste the code in the field "Find" of the "Find Toolbar" and
- clicked on button "Use Regular Expressions" in the "Find Toolbar" and
- then clicked on "Find Next" in the "Find Toolbar"?
I then get the special characters marked as shown.
Debugger
Posts: 565
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

But in my text there are no special characters you mentioned.

I have checked thousands of different Unicode, and still do not detect a strange or illegal symbol.

[^\x00-\x7F]+
Post Reply