bug in encoding (greek) in 7.6.3

General Discussion
9 Posts 2 Posters 2.0k Views
  • P
    patrickdrd
    last edited by Jan 28, 2019, 12:13 PM

    I tried to save an xml file which includes greek characters,
    and it gets saved with unrecognized characters,
    this issue doesn’t exist in 7.6.2

    • M
      Meta Chuh moderator @patrickdrd
      last edited by Meta Chuh Jan 28, 2019, 12:32 PM Jan 28, 2019, 12:29 PM

      hi @patrickdrd

      unfortunately the current auto detection for character encodings is still broken in notepad++.
      maybe that’s also the reason why your xml got saved with wrong characters, if you have it enabled.

      for example, if your xml was utf-8 in its original form and notepad++ detected it as another (wrong) encoding (example: encoding > character sets > vietnamese > windows-1258, as often happens with french and spanish files),
      then the characters can get messed up if you edit such a file and save it.
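A minimal sketch of the failure described above, assuming the file really is utf-8 and the editor reads it with a wrong single-byte encoding. The sample text is made up, and latin-1 is only a stand-in for whatever code page gets guessed (it is used here because it maps every byte value, so the example cannot fail on an undefined byte):

```python
# Hypothetical sample text: four plain Greek letters.
original = "αβγδ"

# The bytes on disk when the file is stored as UTF-8 (2 bytes per letter).
utf8_bytes = original.encode("utf-8")

# An editor that guesses a single-byte code page reads each byte as its own
# character. latin-1 is a stand-in here because it maps every byte value.
misread = utf8_bytes.decode("latin-1")

# Saving the misread text as UTF-8 again writes the garbage into the file.
resaved_bytes = misread.encode("utf-8")

print(repr(misread))
print(len(utf8_bytes), "bytes before,", len(resaved_bytes), "bytes after")

# As long as nothing else was changed, the damage can be undone by
# reversing the two steps.
repaired = resaved_bytes.decode("utf-8").encode("latin-1").decode("utf-8")
print(repaired == original)
```

The reverse step only works while every misread character is still intact, which is why a backup of the original file is the safer route.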

      currently, if you have it enabled, it’s recommended to disable “autodetect character encoding” in settings > preferences > misc, as shown in the screenshot below.
      then restart notepad++ and retry with your xml file (i hope you still have an old backup of it, with correct greek characters).

      [screenshot: settings - auto detect character encoding]

      hope it helps you a bit.

      • P
        patrickdrd
        last edited by Jan 28, 2019, 12:40 PM

        I disabled that one a long time ago (it was suggested to me here)

        as I said and I’ll repeat myself, the bug is definitely in 7.6.3,
        I replaced the executable and it worked fine, just fine

        • M
          Meta Chuh moderator @patrickdrd
          last edited by Jan 28, 2019, 12:50 PM

          @patrickdrd

          > I’ve got that one disabled a long time ago (it was suggested here to me)

          yes, my apologies, i forgot.

          > as I said and I’ll repeat myself, the bug is definitely in 7.6.3,
          > I replaced the executable and it worked fine, just fine

          i will test that too, with both the 7.6.2 and the 7.6.3 exe and my own files. i hope it did not get worse in 7.6.3, or at least that we find a workaround.
          using the old exe is a good idea if it works for you.

          • P
            patrickdrd
            last edited by Jan 28, 2019, 1:03 PM

            strange, I’m trying to reproduce it and it works now. something did happen though, and it broke my greek characters when I copied the content between the browser and npp

            • P
              patrickdrd
              last edited by Jan 28, 2019, 1:52 PM

              encoding is a mess anyway,
              I tried to view that same file on my mobile and
              while my default encoding is greek-iso,
              I had to switch to Unicode in order for these (greek) characters to be recognized,
              it’s very awkward having to switch from greek-iso to utf-8 and vice versa to read a file

              • P
                patrickdrd
                last edited by Jan 29, 2019, 6:49 AM

                and that android app has auto-detection too and it doesn’t work properly either

                • M
                  Meta Chuh moderator @patrickdrd
                  last edited by Meta Chuh Jan 29, 2019, 8:54 AM Jan 29, 2019, 8:54 AM

                  @patrickdrd

                  is it an xml file you can share, or does it contain private data?
                  can you find out which greek letters trigger this, by making an empty xml with just some greek words?

                  in french it is triggered, for example, by the single word Mosaïque, because of the ï.
                  @guy038 also found out that if you combine the two words mosaïque était in a new file, it will work correctly, but était alone in a file will not work.
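A rough illustration of why such a short file is hard to auto-detect, assuming it is stored as utf-8: the same few bytes decode without any error under several single-byte code pages, so a detector has nothing but statistics to go on. The list of candidate encodings is only an example, and this is not how notepad++ itself detects encodings:

```python
# The word from the post, with the ï written as an explicit code point.
word = "Mosa\u00efque"
data = word.encode("utf-8")

# Example candidates only: UTF-8 plus three single-byte code pages.
candidates = ["utf-8", "cp1252", "cp1258", "iso8859_7"]

readings = {}
for name in candidates:
    try:
        readings[name] = data.decode(name)
    except UnicodeDecodeError:
        # This encoding can be ruled out for these bytes.
        readings[name] = None

for name, text in readings.items():
    print(name, repr(text))
```

None of the candidates is ruled out by a decoding error, so only the plausibility of the resulting text separates the right answer from the wrong ones, and two non-ASCII bytes are very little to judge that on.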

                  if it’s easy to reproduce, you could file an issue at github: https://github.com/notepad-plus-plus/notepad-plus-plus/issues
                  (and then hope the developers look at it, given that there are over 2400 open issues at the moment)

                  i myself use utf-8 file encoding only, and convert all files to utf-8 if they are not.


                  something that’s quick to try out, to see if it works for you:
                  a user told us that he converts all documents with encoding problems to utf-8-bom (utf-8 with a byte order mark header), because the bom header explicitly states which format the file has, and utf-8-bom seems to be compatible with all his applications and web services.
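For reference, a small sketch of what the bom actually is: three fixed bytes (EF BB BF) at the very start of the file. The sample text and the helper name has_utf8_bom are made up for illustration; "utf-8-sig" is Python's name for utf-8 with a bom:

```python
import codecs

text = "αβγ"  # hypothetical sample content

plain = text.encode("utf-8")         # UTF-8 without a BOM
with_bom = text.encode("utf-8-sig")  # UTF-8 with the BOM prepended


def has_utf8_bom(data: bytes) -> bool:
    """Return True if the data starts with the UTF-8 byte order mark."""
    return data.startswith(codecs.BOM_UTF8)


print(has_utf8_bom(plain), has_utf8_bom(with_bom))

# Decoding with "utf-8-sig" strips the BOM again ...
decoded = with_bom.decode("utf-8-sig")
# ... while plain "utf-8" keeps it as a leading U+FEFF character.
decoded_raw = with_bom.decode("utf-8")
```

A program that checks for these three bytes does not have to guess the encoding, but one that ignores them may show a stray character at the start of the file.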

                  it would be interesting to see whether your encoding issue gets better with a bom, and what happens on android with a bom file.

                  • P
                    patrickdrd
                    last edited by Jan 29, 2019, 9:59 AM

                    thanks for the suggestions,
                    I’m having similar problems on android with an app I use to log my sms messages,
                    some text is unicode and some is not, I guess,
                    because if I select unicode some part is recognized, but not all of it,
                    and the same happens if I reverse it. I contacted the app developer and he said that fixing it is not one of his priorities right now, so there is an issue in general
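One common way to cope with a log where some messages are utf-8 and others are in a legacy greek encoding is to decode each message separately: try strict utf-8 first and fall back only when that fails. The sketch below assumes the legacy encoding is iso-8859-7 and that the raw bytes of each message are available; decode_message is a made-up helper, not part of any app mentioned here:

```python
def decode_message(raw: bytes, fallback: str = "iso8859_7") -> str:
    """Decode one message: strict UTF-8 first, legacy Greek as a fallback."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: anything that is not valid UTF-8 is ISO-8859-7.
        # Bytes that are undefined there become U+FFFD instead of raising.
        return raw.decode(fallback, errors="replace")


# The same made-up word stored in the two different encodings.
messages = ["γεια".encode("utf-8"), "γεια".encode("iso8859_7")]
decoded = [decode_message(m) for m in messages]
print(decoded)
```

This is a heuristic: it relies on legacy greek text almost never being valid utf-8 by accident, and it cannot repair text that was already saved with the wrong encoding.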

                    as for the xml file, the problem started from the fact that I couldn’t send the file by email from my work desktop to my mobile (it was rejected by the exchange server, the only mail I have access to, and I still haven’t figured out why… anyway…). so I opened it, copied the content, pasted it on github, and then opened my mobile browser and got the text from github. somewhere in that whole process the greek characters broke

                    I opened the file from the disk now and it is utf-8-bom, but it didn’t matter I guess
