How to fix the coding into Polish letters? (charset fix) (REGEX ISSUE)

Off-topic posts of interest to the "Everything" community.
Post Reply
Debugger
Posts: 256
Joined: Thu Jan 26, 2017 11:56 am

How to fix the coding into Polish letters? (charset fix) (REGEX ISSUE)

Post by Debugger » Thu Dec 06, 2018 2:55 pm

How to fix the coding into Polish letters? (charset fix). I need a regex to change all the characters throughout the text.

Mini-example:

Code: Select all

Na drodze mojej sstanąłes. Pojawiles się z nikąd. Żeśmy się

"Ä…"=>"ą"
"ć"=>"ć"
"Ä™"=>"ę"
"ó"=>"ó"
"Å‚"=>"ł"
"Å„"=>"ń"
"Å›"=>"ś"
"ż"=>"ż"
"ź"=>"ź"
"Å�"=>"Ł"
"Ó"=>"Ó"
"ü"=>"ü"
"ä"=>"ä"
"Å‘"=>"ö"
"Å�"=>"Ö"
''Å»''=>''Ż''


Or how to change *.html to a text file without losing the original coding?
Of course, the all HTML code must be removed, this is something like a page with a blog

void
Site Admin
Posts: 4265
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to fix the coding into Polish letters? (charset fix) (REGEX ISSUE)

Post by void » Fri Dec 07, 2018 7:03 am

Fixing encoding with regex would be difficult.

Did you want to use regex to find (and not fix) possible encoding issues?

Please try converting from ANSI to UTF-8:
https://superuser.com/questions/762473/ ... in-notepad

Debugger
Posts: 256
Joined: Thu Jan 26, 2017 11:56 am

Re: How to fix the coding into Polish letters? (charset fix) (REGEX ISSUE)

Post by Debugger » Fri Dec 07, 2018 11:32 am

I do not understand anything, it seems to be complicated, so I still do not know how to do it. I used to have a regular expression where it could be done in a few seconds, now I need a regex or step-by-step how to do it in EmEditor (because I use it and it's intuitive)

Post Reply