Notepad++ How to find in page with UTF-8 instead of ANSI ?
Robin Cruise last edited by Robin Cruise
hello, I have a lot of words like
stiintific(with and without diacritics/accent marks). How can I search so as to find both versions?
I can do this in all PDF and MS World files, but in notepad++ I cannot. So, is there a way to do this kind of find and also the replace just with UTF-8 ?
guy038 last edited by guy038
Hello, @robin-cruise and All,
You can achieve this kind of goal with equivalent class structures. Their global syntax is
For instance, the regex
[[=A=]]would match any of these
82Unicode chars :
AaªÀÁÂÃÄÅàáâãäåĀāĂăĄąǍǎǞǟǠǡǺǻȀȁȂȃȦȧȺɐɑɒᴀᴬᵃᵄᶏᶐᶛḀḁẚẠạẢảẤấẦầẨẩẪẫẬậẮắẰằẲẳẴẵẶặₐÅ⒜ⒶⓐⱥⱭⱯⱰ, which have a relation, in some way, with the first letter of the Latin alphabet !
Actually, the regex should be more considered as the
[=<Single_Letter>=]syntax, embedded in a usual character class
[•••••]. For instance, the regex
(?-i)[012[=A=]@b-y[=z=]|]matches all the following characters, sorted by ascending Unicode code-point :
UNICODEchars, with code over
So, practically, to match, either, your strings
stiintific, use the regex :
Robin Cruise last edited by
yes, nice answer. But very hard , because I need to change almost all words from every sentence:)