    Big encoding problem with UTF-8 / UTF-8-BOM / ANSI / ISO-8859-1

    • Sybell Lirwaco

      Notepad++ v7.5.2 (32-bit)
      Windows 7 Ultimate

      I use Notepad++ with WinSCP and have a serious problem with encodings.

      My HTML and PHP files are HTML5 with <meta charset="UTF-8">.

      If I save the .htaccess as UTF-8-BOM, HEAD on the command line reports iso-8859-1 for HTML and PHP files, whether they are stored as UTF-8 or UTF-8-BOM.

      If I save the .htaccess as UTF-8, HEAD reports no charset for HTML files and UTF-8 for PHP files, again regardless of whether they are stored as UTF-8 or UTF-8-BOM.

      But if I close the .htaccess saved as UTF-8 and reopen it, it opens as ANSI!

      When I create a new file with WinSCP, it opens as ANSI, even though the default encoding in the WinSCP settings is UTF-8 and Notepad++ is set to create new documents as UTF-8-BOM.

      Where is the problem here?
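
For the reopen-as-ANSI case, it may help to look at the bytes that are actually on disk. A file saved as UTF-8 without a BOM that contains only ASCII characters is byte-for-byte identical to the same file saved as ANSI, so an editor has nothing to tell the two apart. The Python sketch below is only an illustration of that check; the helper name `describe_bytes` and the sample bytes are hypothetical and not part of Notepad++ or WinSCP.

```python
import codecs


def describe_bytes(data: bytes) -> str:
    """Classify raw file bytes by BOM presence and UTF-8 validity."""
    # A UTF-8 BOM is the three bytes EF BB BF at the very start of the file.
    if data.startswith(codecs.BOM_UTF8):
        return "UTF-8 with BOM"
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return "not valid UTF-8 (some 8-bit encoding)"
    # Only bytes below 0x80: identical in UTF-8, ANSI and ISO-8859-1.
    if all(b < 0x80 for b in data):
        return "pure ASCII (same bytes in UTF-8, ANSI and ISO-8859-1)"
    return "UTF-8 without BOM"


# Hypothetical sample contents, used only for illustration.
with_bom = codecs.BOM_UTF8 + b"# plain ASCII line\n"
ascii_only = b"# plain ASCII line\n"
utf8_text = "caf\u00e9\n".encode("utf-8")
latin1_text = "caf\u00e9\n".encode("iso-8859-1")

for sample in (with_bom, ascii_only, utf8_text, latin1_text):
    print(describe_bytes(sample))
```

To inspect a real file, read it in binary mode and pass the bytes in, for example `describe_bytes(open(".htaccess", "rb").read())`.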

      • dail

        This is a known bug in v7.5.2 and v7.5.3 so I’d suggest you stick with v7.5.1 for the time being.

        The pull request that fixes the issue is here: https://github.com/notepad-plus-plus/notepad-plus-plus/pull/3992

        • Sybell Lirwaco

          Many thanks for your very helpful hint! It saved my work.

          • Sybell Lirwaco

            Unfortunately, the problem is not solved in 7.5.1.
            npp.7.5.1.bin.minimalist.x64 opens files stored as UTF-8 as ANSI when they contain characters from the Mathematical Alphanumeric Symbols Unicode block.
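
Characters in the Mathematical Alphanumeric Symbols block lie outside the Basic Multilingual Plane, so each one takes four bytes in UTF-8. Whether that is what trips the detection here is an assumption, not something confirmed in this thread. The Python sketch below only shows how to confirm that such text is valid UTF-8 and contains four-byte sequences; the helper name `utf8_profile` is hypothetical.

```python
def utf8_profile(text: str) -> dict:
    """Summarise how a string looks once encoded as UTF-8."""
    encoded = text.encode("utf-8")
    return {
        "bytes": encoded,
        # Code points above U+FFFF need four bytes in UTF-8.
        "has_four_byte_chars": any(ord(ch) > 0xFFFF for ch in text),
        # Valid UTF-8 must decode back to exactly the same text.
        "round_trips": encoded.decode("utf-8") == text,
    }


# U+1D400 MATHEMATICAL BOLD CAPITAL A, from the block mentioned above.
profile = utf8_profile("\U0001D400")
print(profile["bytes"].hex(" "))
print(profile["has_four_byte_chars"], profile["round_trips"])
```

If a file passes this check but the editor still opens it as ANSI, the bytes on disk are fine and the issue is on the detection side.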

            What I have just noticed is that the term “ANSI” is used with two different meanings. In one place it means Windows-1252 (Menubar -> Encoding), in another it means only ISO-8859-1 (Preferences -> New Document -> Encoding). Or the problem is that files stored in Windows-1252 are opened as ANSI even if “Apply to opened ANSI file” is set.
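
Windows-1252 and ISO-8859-1 agree on most byte values but not all, which is why it matters which one “ANSI” refers to. The Python sketch below lists the byte values that the two decode differently, using Python's built-in `cp1252` and `iso-8859-1` codecs; the helper name `differing_bytes` is hypothetical, and Python's codec tables are assumed to be representative of what the editor uses.

```python
def differing_bytes() -> list:
    """Return the byte values that cp1252 and ISO-8859-1 decode differently."""
    diffs = []
    for value in range(256):
        raw = bytes([value])
        latin1 = raw.decode("iso-8859-1")
        try:
            cp1252 = raw.decode("cp1252")
        except UnicodeDecodeError:
            # A few bytes are unassigned in Python's cp1252 codec.
            cp1252 = None
        if cp1252 != latin1:
            diffs.append(value)
    return diffs


diffs = differing_bytes()
print(f"{len(diffs)} differing bytes, from {diffs[0]:#04x} to {diffs[-1]:#04x}")

# Byte 0x80 is the euro sign in Windows-1252 but a control character in ISO-8859-1.
print(repr(b"\x80".decode("cp1252")), repr(b"\x80".decode("iso-8859-1")))
```

All differences fall in the range 0x80 to 0x9F, so a file that uses none of those bytes reads the same under either interpretation.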
