• Login
Community
  • Login

[New Plugin] MultiReplace

Scheduled Pinned Locked Moved Notepad++ & Plugin Development
67 Posts 11 Posters 13.7k Views
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • V
    Vitalii Dovgan @rdipardo
    last edited by Jun 14, 2023, 7:08 PM

    @rdipardo said in [New Plugin] MultiReplace:

    Except for the Double-byte Character Sets , which are (still!) the typical OEM encoding on PCs in East Asian countries. Scintilla has a dedicated API for those.

    OMG, I’ve completely forgotten about those!
    So, looks like the most proper way is to invoke MultiByteToWideChar first and then to deal with Unicode strings (that consist of WCHAR characters) since they are natively supported by modern Windows. Actually, this is exactly what I’ve been doing in my code, mostly because WCHAR is native for Windows NT family.
    Going further, this can be enhanced to properly handle Unicode Surrogate Pairs as well. (And these may not be handled correctly in my code because I did not add any specific processing for Surrogate Pairs. Actually, I am not sure whether the standard functions such as lstrlenW take Surrogate Pairs into account or not).

    1 Reply Last reply Reply Quote 3
    • W
      wonkawilly @Alan Kilborn
      last edited by Jun 14, 2023, 11:35 PM

      @Alan-Kilborn That one is an old ver. I’ve updated it…

      A 1 Reply Last reply Jun 15, 2023, 12:13 AM Reply Quote 0
      • A
        Alan Kilborn @wonkawilly
        last edited by Jun 15, 2023, 12:13 AM

        @wonkawilly said in [New Plugin] MultiReplace:

        I’ve updated it…

        I’ll hand it to you; you’re tough. Even getting banned over it doesn’t dissuade you. :-)

        W 2 Replies Last reply Jun 15, 2023, 8:36 AM Reply Quote 1
        • W
          wonkawilly @Alan Kilborn
          last edited by wonkawilly Jun 15, 2023, 9:58 AM Jun 15, 2023, 8:36 AM

          This post is deleted!
          1 Reply Last reply Reply Quote 0
          • T
            Thomas Knoefel
            last edited by Thomas Knoefel Jun 15, 2023, 10:47 AM Jun 15, 2023, 10:01 AM

            @rdipardo said in [New Plugin] MultiReplace:

            because Scintilla maps the ANSI code page identifiers to the same values as the Win32 API.

            Does it mean that UTF8 would directly match with ANSI in scintilla? I’m facing the Problem that normal characters are matching in ANSI but special Letters like Ä or Ö don’t. Anybody an idea how to convert a widestr into UTF8 for SCI_SEARCHINTARGET to find these Characters in ANSI? Unsurprisingly the letter Ä matches with Ä in ANSI if i convart Ä into ANSI. … i think i did it but pretty challenging topic.

            R 1 Reply Last reply Jun 15, 2023, 11:52 AM Reply Quote 0
            • W
              wonkawilly @Alan Kilborn
              last edited by Jun 15, 2023, 10:16 AM

              Off Topic:
              @Alan-Kilborn said in [New Plugin] MultiReplace:

              you’re tough. Even getting banned over it doesn’t dissuade you. :-)

              Big changes make always involve taking big risks. And I understand that traditions sometimes are difficult to overcome. It is perfectly normal, at least into human logic. But I also know that traditions will be overcame when people are more aware and ready to make the jump . And this is also part of life and evolution. After all evolution is the meaning of life, and without evolution life cold be less meaningful.
              This is a general rule that also applies to the case.

              1 Reply Last reply Reply Quote 0
              • R
                rdipardo @Thomas Knoefel
                last edited by Jun 15, 2023, 11:52 AM

                @Thomas-Knoefel said in [New Plugin] MultiReplace:

                @rdipardo said in [New Plugin] MultiReplace:

                because Scintilla maps the ANSI code page identifiers to the same values as the Win32 API.

                Does it mean that UTF8 would directly match with ANSI in scintilla?

                Based on how SCI_GETCODEPAGE works in practice, the alternative encoding to Unicode should be thought of as the “system default” rather than “ANSI”.

                For most of N++'s history, the “ANSI” code page was indeed single-byte (or, in the case of the legacy CJK encodings, double-byte). But the addition of a UTF-8 OEM code page in Windows version 1903 makes “ANSI” a less useful identifier, even a potentially deceptive one. The system default is no longer directly opposed to Unicode as it once was.

                So, yes, there may be times when “UTF8 would directly match with ANSI,” but only if 65001 is the value of the ACP key in the system’s registry. Check on this first:

                reg query HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\Nls\CodePage /s /f "CP"
                

                Anybody an idea how to convert a widestr into UTF8 for SCI_SEARCHINTARGET to find these Characters in ANSI?

                When you see const char * in the prototype of a Scintilla API (as you will for SCI_SEARCHINTARGET ), it means the expected input is a byte string (i.e. “ANSI”). The conversion you want is probably from wchar_t* to char*. A debugger can show you what the encoded text looks like after conversion.

                1 Reply Last reply Reply Quote 2
                • T
                  Thomas Knoefel @rdipardo
                  last edited by Thomas Knoefel Jun 15, 2023, 4:23 PM Jun 15, 2023, 4:18 PM

                  Thanks, I was still a little bit too much focused on UTF8 with the preperation of ANSI. But this part is working now all directional.

                  @rdipardo said in [New Plugin] MultiReplace:

                  Except for the Double-byte Character Sets, which are (still!) the typical OEM encoding on PCs in East Asian countries. Scintilla has a dedicated API for those.

                  I’m trying to test DBCS on my non-Asian Windows system. Is this even possible somehow? When I test all encodings in Notepad++, SCI_GETCODEPAGE returns 0 for ANSI, and all the others give me 65001. Is there no chance of obtaining one of these encodings?
                  codePage == 932 || codePage == 936 || codePage == 949 || codePage == 950 || codePage == 136
                  I tried the BIG5 and Shift_JIS encodings, both of which are DBCS, but I obtained the same result. Even saving and reopening makes no difference. I have the feeling that i’m looking in the wrong place.

                  V M 2 Replies Last reply Jun 15, 2023, 5:28 PM Reply Quote 0
                  • V
                    Vitalii Dovgan @Thomas Knoefel
                    last edited by Jun 15, 2023, 5:28 PM

                    @Thomas-Knoefel said in [New Plugin] MultiReplace:

                    I’m trying to test DBCS on my non-Asian Windows system. Is this even possible somehow?

                    Yes, go to the “Language & region” system settings, and by clicking the “Administrative language settings” a “Region” dialog is shown. This “Region” dialog has “Administrative” tab where there is a button “Change system locale” for non-Unicode programs.
                    ( This is for Windows 11, it was much faster to find in Windows 7 :) )

                    V 1 Reply Last reply Jun 15, 2023, 5:37 PM Reply Quote 4
                    • V
                      Vitalii Dovgan @Vitalii Dovgan
                      last edited by Jun 15, 2023, 5:37 PM

                      And regarding your other question about conversion between a custom multi-byte encoding (either ANSI or DBCS) and UTF-8, this actually is achieved by double conversion:

                      1. First, call MultiByteToWideChar to convert the input multi-byte string (e.g. ANSI/DBCS) to WCHAR string
                      2. Second, call WideCharToMultiByte to convert the WCHAR string from the step 1 into a resulting multi-byte string (e.g. UTF-8).

                      To convert from UTF-8 to ANSI/DBCS, just specify CP_UTF8 in the step 1 and then the desired ANSI/DBCS codepage in the step 2.

                      1 Reply Last reply Reply Quote 4
                      • M
                        Michael Vincent @Thomas Knoefel
                        last edited by Jun 16, 2023, 11:04 AM

                        @Thomas-Knoefel

                        I opened a few issues and added some pull requests to your repo.

                        If you are willing to accept pull requests, I have a few more to add once those are merged.

                        Cheers.

                        T 2 Replies Last reply Jun 16, 2023, 11:35 AM Reply Quote 3
                        • T
                          Thomas Knoefel @Michael Vincent
                          last edited by Thomas Knoefel Jun 16, 2023, 12:08 PM Jun 16, 2023, 11:35 AM

                          @Michael-Vincent Thanks, I’ve seen it, and I’m going to commit them. However, the latest updates for codepage handling have not been committed yet. I still need to set up a VMware for Chinese Language settings in order to test DBCS. Once that is finished, I’ll upload the final updates.

                          T 1 Reply Last reply Jun 16, 2023, 8:52 PM Reply Quote 4
                          • T
                            Thomas Knoefel @Thomas Knoefel
                            last edited by Jun 16, 2023, 8:52 PM

                            @Thomas-Knoefel These are the facts i fgured out. In Notepad++, when you ask about SCI_GETCODEPAGE, it’s always 0 for ANSI and 65001 for UTF8 you won’t encounter any other codepage. Asian codepages, like DBCS, only matter when you’re reading or writing files. So, these codepages won’t mess things up unless you’re working with files saved in these codepages. As for the Save and Load File feature of the plugin, which is designed for an internal store, it will always save in UTF8 format when handling CSV files.
                            I think this fact will simplify the handling of codepages.

                            1 Reply Last reply Reply Quote 0
                            • T
                              Thomas Knoefel @Michael Vincent
                              last edited by Jun 16, 2023, 9:13 PM

                              @Michael-Vincent said in [New Plugin] MultiReplace:

                              I have a few more to add once those are merged.

                              Thank you for your input! All requests are welcome. I can just learn from it.

                              1 Reply Last reply Reply Quote 1
                              • T
                                Thomas Knoefel
                                last edited by Thomas Knoefel Jun 18, 2023, 12:38 PM Jun 18, 2023, 12:37 PM

                                I have finished RC-2 version with fixed ANSI support and 32 Bit code compatibility. You can find it on Github .

                                V 1 Reply Last reply Jun 18, 2023, 7:30 PM Reply Quote 2
                                • V
                                  Vitalii Dovgan @Thomas Knoefel
                                  last edited by Jun 18, 2023, 7:30 PM

                                  Thank you! I like it!
                                  What probably may add more abilities to the plugin is: 1) to have a button that swaps the text between the Find What and Replace With fields; 2) to have checkboxes in the list to specify which Find-Replace pairs to activate and which to deactivate.

                                  T 2 Replies Last reply Jun 19, 2023, 8:28 PM Reply Quote 2
                                  • T
                                    Thomas Knoefel @Vitalii Dovgan
                                    last edited by Jun 19, 2023, 8:28 PM

                                    @Vitalii-Dovgan I will probably add both options before final release. Thanks for your input!

                                    1 Reply Last reply Reply Quote 2
                                    • T
                                      Thomas Knoefel @Vitalii Dovgan
                                      last edited by Thomas Knoefel Jun 21, 2023, 10:30 PM Jun 21, 2023, 10:04 PM

                                      @Vitalii-Dovgan said in [New Plugin] MultiReplace:

                                      What probably may add more abilities to the plugin is: 1) to have a button that swaps the text between the Find What and Replace With fields; 2) to have checkboxes in the list to specify which Find-Replace pairs to activate and which to deactivate.

                                      RC-3 has been finished!

                                      This update includes the following new features:

                                      • Set individual or multiple list entries active or inactive by clicking the select button.

                                      • The Space key enables you to select or unselect multiple highlighted entries at once. You can also select or unselect all by clicking at the Header.

                                      • The selection status will be saved in CSV

                                      • Swap Button for switching Find and Replace text.

                                      The Readme and Screenshot has been updated. Feel free for testing it! You can find it on Github .

                                      V 1 Reply Last reply Jun 22, 2023, 5:21 PM Reply Quote 3
                                      • V
                                        Vitalii Dovgan @Thomas Knoefel
                                        last edited by Jun 22, 2023, 5:21 PM

                                        @Thomas-Knoefel
                                        Thank you!
                                        For some reason I had an impression that the Find-Replace list, as well as the histories of the “Find what” and “Replace with” are automatically saved when Notepad++ exits… Could this be added? Probably the plugin may store this information under the PLUGINSCONFIGDIR folder as e.g. “NppMultiReplace.csv” and “NppMultiReplace.ini”
                                        Well, I realize this also implies some settings where one could enable/disable saving of the Find-Replace list and the “Find what” and “Replace with” histories :)

                                        V T 3 Replies Last reply Jun 22, 2023, 5:30 PM Reply Quote 2
                                        • V
                                          Vitalii Dovgan @Vitalii Dovgan
                                          last edited by Vitalii Dovgan Jun 22, 2023, 5:35 PM Jun 22, 2023, 5:30 PM

                                          As a side note, I noticed that the LocationNavigate plugin crashes on huge number of replacements, either when a lot of replacements have been done or when the replacement are undoed. (For example, I reproduced it while replacing “CMD” with “CMD” inside of “NppExecEngine.cpp” - 1149 occurrences).
                                          It seems to be a problem of the LocationNavigate plugin itself, but it was worth mentioning anyway.
                                          As maybe an idea for the future version of the MultiReplace plugin, the replacements performance could be dramatically increased if the whole text was read from Scintilla into a memory buffer first, then all the replacements were done with the text in the memory buffer, and finally the resulting text was put into Scintilla. That would probably fix the problem with the LocationNavigate plugin as well.

                                          T 1 Reply Last reply Jun 23, 2023, 9:50 AM Reply Quote 3
                                          19 out of 67
                                          • First post
                                            19/67
                                            Last post
                                          The Community of users of the Notepad++ text editor.
                                          Powered by NodeBB | Contributors