I can’t see your reported issue concerning the broken characters
Regarding the extra char, this happens when you chose Encode in Ansi, which basically is a view of the current bytes and because those chars are encoded in two
bytes in utf8 you see the additional first byte. If you choose convert to ansi, from a clean state, it shows it correctly.