Community
    • Login

    Can't paste "weird" Asian characters properly. Which should be the default document encoding?

    Scheduled Pinned Locked Moved Help wanted · · · – – – · · ·
    3 Posts 2 Posters 967 Views 1 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Ohayo GosaimasO Offline
      Ohayo Gosaimas
      last edited by Ohayo Gosaimas

      Hello,

      I occasionally have to deal with “weird” Asian characters, like the ones used in kaomojis.

      Most of them load fine, but some of them… well, don’t.

      For instance this one, if I copy-paste it into a notepad++ file, instead of

      ¯\_ʘ‿ʘ_/¯
      

      the central character will have become a question mark in a rectangle.

      And it’s a display problem: I can copy-paste it from notepad++ into internet pages (like right here) or into a simple Win-R’s “run” dialogue, the character will be displayed properly again. The original character is preserved, not destroyed.

      Context, if necessary: Windows 10, with two locales on my computer, English and French, no Asian languages. Still, copy-pasting Japanese and Chinese works flawlessly, when needed.

      I’m no alien to weird encoding manipulations (recently, I had an .sql file that required to Convert to ANSI and next Encode in UTF-8, to fix a weird Latin1 database import, it’s like magic at this point, it worked, but no idea why), so I suspect something with encoding could be the solution.

      Which leads me to humbly ask: please, would you know if a notepad++ file “should” definitely have a particular encoding rather than another?

      I mean, I’d have said UTF-8 without BOM, no questions asked, but I can’t paste the emoji I mentioned in there without having it transformed, so maybe it’s got to be something else.

      Or - odd possibility, but who knows - maybe it’s officially that there are characters that simply cannot be properly displayed in Notepad++?

      Thanks if you have a possible explanation in mind, and have a good confinement day! :D

      Alan KilbornA 1 Reply Last reply Reply Quote 0
      • Alan KilbornA Offline
        Alan Kilborn @Ohayo Gosaimas
        last edited by Alan Kilborn

        @Ohayo-Gosaimas

        Which should be the default document encoding?

        This is something you have to answer for yourself.

        maybe it’s officially that there are characters that simply cannot be properly displayed in Notepad++?

        It’s all about the font.

        You probably want to have a read here: https://community.notepad-plus-plus.org/topic/16497/missing-unicode-characters-in-notepad

        1 Reply Last reply Reply Quote 2
        • Ohayo GosaimasO Offline
          Ohayo Gosaimas
          last edited by

          Oh, damn it, I didn’t even suspect it might simply be a font problem.

          Maybe because of my messy latin1 database import problem, in which it wasn’t a font issue. I feel silly now.

          Well, I’m off to experimenting with other fonts now.

          But… a pity, really, there’s no better than Inconsolata, in my biased eyes.

          Thank you Alan!

          1 Reply Last reply Reply Quote 3

          Hello! It looks like you're interested in this conversation, but you don't have an account yet.

          Getting fed up of having to scroll through the same posts each visit? When you register for an account, you'll always come back to exactly where you were before, and choose to be notified of new replies (either via email, or push notification). You'll also be able to save bookmarks and upvote posts to show your appreciation to other community members.

          With your input, this post could be even better 💗

          Register Login
          • First post
            Last post
          The Community of users of the Notepad++ text editor.
          Powered by NodeBB | Contributors