Community

    • Login
    • Search
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Search

    Notepad++ doesn't find the accented characters with the "Find in files" function

    Help wanted · · · – – – · · ·
    4
    17
    1016
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • guy038
      guy038 last edited by

      Hello, @atelier-traduction and All,

      I’m French too, but I’ve never seen that behavior, even with the last 7.8.7 N++ version !?

      Of course, my default ANSI code page is Windows-1252 where most of the accentuated characters are in range [\xC0-\xFF]

      And the Find in Files search of, for instance, the regex [\xC0-\xFF]+ does get any range of consecutive accentuated char(s) !

      Best Regards,

      guy038

      Bonjour, @atelier-procduction et à Tous :

      Je suis Français également et je n’ai jamais remarqué ce comportement, même avec la dernière version 7.8.7 !?

      Bien entendu, ma page de code ANSI, par défaut, est Windows-1252, où la plupart des caractères accentués se trouvent dans la zone [\xC0-\xFF]

      Et, par exemple, une recherche, avec Find in Files, de l’expression régulière [\xC0-\xFF]+ trouvent bien tout zone de caractère(s) accentué(s) consécutif(s) !

      Bien cordialement,

      guy038

      1 Reply Last reply Reply Quote 0
      • Atelier Traduction
        Atelier Traduction last edited by Atelier Traduction

        Well, I forgot to say I’m using Windows 10 (the same issue is on Windows 7), and we are three users to get the same issue with the default settings. So I think this is a general issue. We can see the exact problem here:

        notepad_isssue.png

        Alan Kilborn 1 Reply Last reply Reply Quote 0
        • Alan Kilborn
          Alan Kilborn @Atelier Traduction last edited by

          @Atelier-Traduction said in Notepad++ doesn't find the accented characters with the "Find in files" function:

          We can see the exact problem here

          How does that show the problem?

          I think there are going to be people that will really put in an extreme amount of effort to help you, but you’re going to have to give them more specifics to go on.

          Oftentimes people experience an issue and feel that “everyone must see the same thing, it is so obvious” and that means (because it is so obvious) no details are needed. In truth, perhaps no one is seeing it besides the person (or 3) with the problem.

          1 Reply Last reply Reply Quote 0
          • Atelier Traduction
            Atelier Traduction last edited by

            Ok, I’m on the 32-bit version of Notepad++ (64-bit version has the same problem) on Windows 10 64 bit (Windows 7 64-bit version has the same problem) and I’m using the default settings. And yeah, 3/3 users in the same case is pretty curious in my opinion, but maybe not for you, I suppose. I mentionned the last working version too (7.6.6). Everything is in its default state in the folders too with the default plugins.

            Here’s how it shows the issue: If I search for “rétablissement” in “Find in Files” with any filter and the default settings once again, no “rétablissement” is recognized in any file not opened by Notepad++ because the ‘é’ character itself is not recognized. The image above shows a □ character instead of ‘é’ if I just search for the term “em”, it’s the proof the versions above 7.6.6 don’t recognize the accented characters, or the ASCII extended ones if you prefer.

            And well, I found another very interesting fact. The bug occurs only if Notepad++ searches in not opened files which are encoded into ANSI. If it searches into UTF-8 (without BOM) not opened files with the proper UTF-8 encoding for the accented characters (so real UTF-8 and not ANSI), all the accented characters are recognized without any problem.

            So, for me, the not opened files are recognized with the UTF-8 (without BOM) format and never in ANSI if there is not a BOM, that’s why Notepad++ can’t understand the encoding of these characters.

            Of course, if you need more details, I will be glad to give you.

            Else, I’m here to try to get in touch with the developper himself, because he doesn’t want to be bothered by mail for bug reports and there is not a appropriate category in here to send bug reports of the tool itself.

            Alan Kilborn 2 Replies Last reply Reply Quote 1
            • Alan Kilborn
              Alan Kilborn @Atelier Traduction last edited by

              @Atelier-Traduction

              I don’t know about other readers, but the additional exposition definitely helped me understand more about it. Thank you.

              I think @guy038 might be in the best position to help you explore this. Hopefully he continues to contribute to this thread.

              Meanwhile, of interest might be THIS. Of course, you could be the author of that; I can’t tell from the usernames, but they are certainly dissimilar (means nothing, though).

              I’m here to try to get in touch with the developer himself, because he doesn’t want to be bothered by mail for bug reports

              Good luck with that. Many have tried. Many have failed. :-)

              there is not a appropriate category in here to send bug reports of the tool itself.

              Correct. It wouldn’t be “here”. THIS describes the actual bug report process.

              1 Reply Last reply Reply Quote 2
              • Alan Kilborn
                Alan Kilborn @Atelier Traduction last edited by

                The last working version was the 7.6.6 one.

                I see a “big” version bump in Scintilla between N++ 7.6.6 (uses Scintilla 3.5.6) and N++ 7.7 (uses Scintilla 4.1.4). Maybe there is something in that which drives this behavior change? Of course, maybe not as well…

                Atelier Traduction 1 Reply Last reply Reply Quote 3
                • Atelier Traduction
                  Atelier Traduction last edited by

                  Ok, thank you ^^
                  At least, I can try to write a bug report on Github with all the details even if others tried before, you never know…

                  Alan Kilborn 1 Reply Last reply Reply Quote 0
                  • Alan Kilborn
                    Alan Kilborn @Atelier Traduction last edited by

                    @Atelier-Traduction said in Notepad++ doesn't find the accented characters with the "Find in files" function:

                    I can try to write a bug report on Github

                    I would certainly say you should ADD to the existing one I linked earlier…did you see that link?

                    1 Reply Last reply Reply Quote 0
                    • Atelier Traduction
                      Atelier Traduction @Alan Kilborn last edited by

                      Yeah. If you think this is better, ok, I just will do that.

                      1 Reply Last reply Reply Quote 0
                      • Atelier Traduction
                        Atelier Traduction last edited by

                        https://github.com/notepad-plus-plus/notepad-plus-plus/issues/7668
                        Done, let’s pray now.

                        Alan Kilborn 1 Reply Last reply Reply Quote 2
                        • Alan Kilborn
                          Alan Kilborn @Atelier Traduction last edited by Alan Kilborn

                          @Atelier-Traduction

                          THIS seems to give some weight to my “Scintilla update” theory (or guess, actually).

                          1 Reply Last reply Reply Quote 2
                          • guy038
                            guy038 last edited by guy038

                            Hi, @atelier-traduction, @alan-kilborn and All,

                            Aaaaah, yes :-(( @atelier-traduction, you’re quite right about it. I’ve never noticed that bug, yet ! Indeed, as I replied, in that forum, in English, all my text files are, mainly, written in English, so without any accentuated characters !

                            Compared to you, with my local N++ configuration, the accentuated characters are even not displayed at all, in the Search result panel !


                            This only happens under three conditions :

                            • A search has been successful and the file is currently seen in the Search result panel

                            • The file is presently not opened in a Notepad++ session, either in the main or in the secondary session

                            • The file is ANSI encoded ( All the other Unicode encodings are OK ! )

                            Note that, in addition, if the search, itself, contained accentuated characters, the Find result panel does not show the ANSI file !


                            I also confirm, as you said, that this bug occurs since Notepad++ V7.7

                            I get no time, presently, but I’ll open an issue on GitHub very soon !

                            BR

                            guy038

                            1 Reply Last reply Reply Quote 1
                            • Atelier Traduction
                              Atelier Traduction last edited by Atelier Traduction

                              I just would like to add that the “Search result” doesn’t find anything if accentuated characters are searched for.

                              https://community.notepad-plus-plus.org/assets/uploads/files/1592392282525-notepad_isssue.png

                              You can see these squares because I put the term “em” to find. If I put “rétablissement”, the “Search result” panel doesn’t find anything, so I don’t agree with the “the file is currently seen in the Search result panel” part. Or I misinterpreted your phrasing.

                              1 Reply Last reply Reply Quote 0
                              • guy038
                                guy038 last edited by guy038

                                Hi, All,

                                Done ! See my comment on GitHub :

                                https://github.com/notepad-plus-plus/notepad-plus-plus/issues/7668#issuecomment-645602955

                                After further tests, problem occurs with any character between \x{0080} and \x{00FF}

                                BR

                                guy038

                                1 Reply Last reply Reply Quote 0
                                • donho
                                  donho last edited by

                                  Fixed in https://github.com/notepad-plus-plus/notepad-plus-plus/commit/c5a0ed7c1aaac56dc96deabba8dd5e7cba261b2d
                                  It will come with next release.

                                  1 Reply Last reply Reply Quote 5
                                  • Atelier Traduction
                                    Atelier Traduction last edited by

                                    Thank you @donho, @guy038 and @Alan-Kilborn ;)

                                    1 Reply Last reply Reply Quote 2
                                    • First post
                                      Last post
                                    Copyright © 2014 NodeBB Forums | Contributors