• Login
Community
  • Login

Big encoding problem with UTF-8 / UTF-8-BOM / ANSI / ISO-8859-1

Scheduled Pinned Locked Moved Help wanted · · · – – – · · ·
4 Posts 2 Posters 4.6k Views
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • S
    Sybell Lirwaco
    last edited by Dec 16, 2017, 1:12 PM

    notepad++ v7.5.2 (32 bit)
    Win 7 Ultima

    I use notepad++ with WinSCP and have a huge problem with the encoding.

    HTML and PHP files are in HTML5 with <meta charset = “UTF-8”>.

    If I save the .htaccess in UFT-8-BOM, then HEAD will show on the command line for HTML and PHP files stored in UTF-8 or UTF-8-BOM: iso-8859-1.

    If I save the .htaccess in UTF-8, then HEAD shows no charset on the command line for HTML files and UTF-8 for PHP files. No matter if they are stored in UTF-8 or UTF-8-BOM.

    But if I close and reopen the .htaccess in UTF-8, it will open in ANSI !!!

    When I create a new file with WinSCP, it opens in ANSI. In the WinSCP settings the default encoding is UTF-8 and notepad++ is set to open a new document in UTF-8-BOM.

    Where is the problem here?

    1 Reply Last reply Reply Quote 0
    • D
      dail
      last edited by Dec 16, 2017, 2:12 PM

      This is a known bug in v7.5.2 and v7.5.3 so I’d suggest you stick with v7.5.1 for the time being.

      Pull request to fix the issue is here https://github.com/notepad-plus-plus/notepad-plus-plus/pull/3992

      1 Reply Last reply Reply Quote 2
      • S
        Sybell Lirwaco
        last edited by Dec 18, 2017, 7:48 AM

        Many thanks for your very helpfull hint! It saves my work.

        1 Reply Last reply Reply Quote 0
        • S
          Sybell Lirwaco
          last edited by Dec 27, 2017, 1:40 PM

          Unfortunately the problem with 7.5.1 is not solved.
          npp.7.5.1.bin.minimalist.x64 opens files stored in UTF-8 containing characters from the Mathematical Alphanumeric Symbols Unicode block as ANSI.

          What I have just seen is that the term “ANSI” means different. Once the term ANSI means Windows-1252 (Menubar -> Encoding) and once means only ISO-8859-1 (Preferences -> New Document -> Encoding). Or there are problems because files stored in Windows-1252 are opened as ANSI, even if “Apply to opened ANSI file” is set.

          1 Reply Last reply Reply Quote 0
          • First post
            Last post
          The Community of users of the Notepad++ text editor.
          Powered by NodeBB | Contributors