Search++: A work in progress

guy038

Hi, @coises and All,

Ah…, many thanks, in advance, for resolving bugs and trying to take into account my suggestions !

Regarding the Search++.pdb file, I can, of course, try to re-download the x64 archive. But can I, without any problem, simply delete this present file in my Search++ folder ?

BR

guy038

Coises

@guy038 said in Search++: A work in progress:

Regarding the Search++.pdb file, I can, of course, try to re-download the x64 archive. But can I, without any problem, simply delete this present file in the Search++ folder ?

Yes.

guy038

Hello, @coises and All,

Sorry to disturb you again, but would it be possible to, either :

Increase the width of the caret

OR

Allow, like in native N++, the modification of its width, in your Settings dialog

Personally, I use the value 3 in Caret Settings in Notepad++ and yours seems really tiny. So, it is rather difficult to notice the caret location at first sight, in the Find dialog !

I must admit that I have probably gotten used to this maximum value for the caret and that, now, any smaller size bothers me a bit !

Best Regards,

guy038

Coises

@guy038 said in Search++: A work in progress:

Sorry to disturb you again, but would it be possible to, either :

Increase the width of the caret

OR

Allow, like in native N++, the modification of its width, in your Settings dialog

I will add that to the list of things I copy from the active document window when I initialize my Scintilla controls, and look for other similar settings (like the blink rate) that I missed. Not everything is simple to copy, but those are.

Thank you for pointing that out.

guy038

Hello, @coises and All,

I finally succeeded in getting an almost exhaustive list of all the Unicode properties recognized by the ICU regex engine of the Search++ plugin !

As its file size is about 213 KB, I’m sharing this simple text file, named ICU.txt, on my Drive Account

https://drive.google.com/file/d/15n8ttdX0hNxIazlRkToZn2XsINus5IOW/view?usp=sharing

Of course, this is my first attempt, which probably will need some modifications !

Best Regards,

guy038

Coises

@guy038 said in Search++: A work in progress:

I finally succeeded to get an almost exhaustive list of all the Unicode properties recognized by the ICU regex engine of the Search++ plugin !

https://drive.google.com/file/d/1litn6Ggjk-nRc8UOuxYS-5iO10_J-Z_2/view?usp=sharing

Edit to match updated quoted post: the link is now

https://drive.google.com/file/d/15n8ttdX0hNxIazlRkToZn2XsINus5IOW/view?usp=sharing

That clearly entailed a lot of work!

When I get closer to a “real” release, I will ask your permission to include some or all of that information as an appendix in the documentation for Search++.

guy038

Hi, @coises,

That’s really kind of you to ask for my permission. But, considering all the times you’ve listened to me, the least I can do is, of course, give you my full permission to use this file ;-))

Just note that I noticed, at the very end of the ICU.txt file, a small section that I used to verify if the \p{...} syntaxes were written correctly

As this part should not be part of the file, I modified the current file which, of course, changed the sharing link ! And I’ve just updated this link in my previous post !

Best Regards,

guy038

guy038

Hello, @coises and All,

Here is the second version of my list of all the Unicode properties, recognized by the ICU regex engine of the Search++ plugin !

I added 5 Unicode properties and some sections ( unsupported features, deprecated properties,… ) and I corrected a lot of mistakes !

In addition, although the ICU syntax is very flexible, I tried to adopt the same scheme throughout all sections of this file !

You can download this text file, named ICU.txt, from my Drive Account, at this location :

https://drive.google.com/file/d/1PAY5C2JO0q4-j8kfGKYs3VKarKn_xVa5/view?usp=sharing

Of course, I also deleted the previous version !

Now, one question regarding ICU

So far, you have chosen to disable replacements when using the ICU regular expression engine. Why is that, exactly :

Are you worried about a possible malfunction of that feature ?
Are there any technical obstacles to implementing such a feature ?
Do you need more time to learn about and/or implement this feature ?
Or did you simply decide that it would never be used ?

Personally, I don’t see why this should be any more complicated than when using the Boost search engine when you click on the Regex button of your plugin !

Best Regards,

guy038

Coises

@guy038 said in Search++: A work in progress:

Now, one question regarding ICU

So far, you have chosen to disable replacements when using the ICU regular expression engine. Why is that, exactly :

Are you worried about a possible malfunction of that feature ?

Are there any technical obstacles to implementing such a feature ?

Do you need more time to learn about and/or implement this feature ?

Or did you simply decide that it would never be used ?

Personally, I don’t see why this should be any more complicated than when using the Boost search engine when you click on the Regex button of your plugin !

The Boost.Regex design includes an interface that accesses the text to be searched through a templated iterator. That’s a bit of a technical C++ concept. In short, template means that the programmer can specify what sort of value will be matched (I chose a full Unicode code point, UTF-32) and iterator means that rather than giving the interface a single, contiguous block of memory filled with the value type, you give it a kind of index and separately write routines that return the value at that index; increase the index to the index of the next value; and decrease the index to the index of the previous value.

Scintilla stores documents in two separate blocks with a gap between — this facilitates inserting and deleting text without having to move all the following data every time — and it stores the data either in the system default encoding (“ANSI”) or in UTF-8. The template-and-iterator concept works well with this. Notepad++ search works that way, but there were technical reasons I did not want to use the same iterator code Notepad++ uses. Writing the three iterators I needed (one for single byte character sets, one for double-byte character sets, and one for UTF-8) was one of the trickier parts of getting my Columns++ search to work. Writing the template specialization for the “character traits” of my UTF-32 values was also a bit of work.
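To make the iterator idea concrete, here is a minimal Python sketch. It is not the plugin's C++ code. The `GapBuffer` class and the helpers `value_at`, `next_index` and `prev_index` are hypothetical names chosen for illustration; they stand in for the "return the value at that index", "increase the index" and "decrease the index" routines described above. The sketch assumes valid UTF-8 and does no error handling.

```python
class GapBuffer:
    """Toy stand-in for gap-style storage: two byte blocks with a gap between."""

    def __init__(self, before: bytes, after: bytes):
        self.before = before  # bytes stored ahead of the gap
        self.after = after    # bytes stored behind the gap

    def __len__(self) -> int:
        return len(self.before) + len(self.after)

    def byte_at(self, i: int) -> int:
        # Hide the gap: callers see one logical sequence of bytes.
        if i < len(self.before):
            return self.before[i]
        return self.after[i - len(self.before)]


def value_at(buf, i):
    """Return the code point whose UTF-8 encoding starts at index i."""
    lead = buf.byte_at(i)
    if lead < 0x80:
        return lead
    if lead >= 0xF0:
        extra, cp = 3, lead & 0x07
    elif lead >= 0xE0:
        extra, cp = 2, lead & 0x0F
    else:
        extra, cp = 1, lead & 0x1F
    for k in range(1, extra + 1):
        cp = (cp << 6) | (buf.byte_at(i + k) & 0x3F)
    return cp


def next_index(buf, i):
    """Advance to the index of the next code point (skip continuation bytes)."""
    i += 1
    while i < len(buf) and (buf.byte_at(i) & 0xC0) == 0x80:
        i += 1
    return i


def prev_index(buf, i):
    """Step back to the index of the previous code point."""
    i -= 1
    while i > 0 and (buf.byte_at(i) & 0xC0) == 0x80:
        i -= 1
    return i


# Four characters of 1, 2, 3 and 4 bytes; the gap is deliberately placed
# in the middle of a character only to show that the accessor hides it.
data = "a\u00e9\u20ac\U0001F600".encode("utf-8")
buf = GapBuffer(data[:4], data[4:])

i, points = 0, []
while i < len(buf):
    points.append(value_at(buf, i))
    i = next_index(buf, i)
decoded = "".join(map(chr, points))
```

The matching code only ever sees an index plus these three operations, so it never needs to know that the bytes live in two blocks or that one value spans several bytes.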

When doing a replacement, Boost.Regex takes a structure that is produced as a result of a match and the replacement string with symbols ($1, etc.) and returns the string with replacements, etc. made. From that, I use normal Scintilla editing commands to replace the matched string with the processed replacement.
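As a rough illustration of that flow, here is a Python sketch using the standard `re` module rather than Boost.Regex. The `expand` helper is hypothetical and only understands `$0` to `$9`; Boost.Regex format strings support far more than this. The string slicing at the end stands in for the Scintilla editing commands.

```python
import re


def expand(template, match):
    """Substitute $0..$9 in template with the corresponding captured text."""
    def group_text(ref):
        number = int(ref.group(1))
        if number > match.re.groups:
            return ""  # unknown group: treated as empty in this sketch
        return match.group(number) or ""
    return re.sub(r"\$(\d)", group_text, template)


text = "width=3; blink=500"
m = re.search(r"(\w+)=(\d+)", text)          # the match result structure
replacement = expand("$2 is the $1", m)      # the processed replacement string
edited = text[:m.start()] + replacement + text[m.end():]
```

The point is the division of labor: the regex library turns a match result plus a template into a finished string, and the editor only has to replace one range with that string.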

The regular expression search in ICU is not templated. It operates strictly in UTF-16. It does not use iterators, but it does have its own way of virtualizing the text to be searched (UText). The only format directly supported by UText that is also used in Scintilla is UTF-8. Scintilla does accept a command to make all its text contiguous (moving the gap to the end), after which the text can be accessed — so long as the text is not modified and only until the next Scintilla call is performed — as a UTF-8 string. By limiting ICU search to only UTF-8 documents and allowing no modification, I could use the utext_openUTF8 interface to access the internal Scintilla buffer (after telling Scintilla to make it contiguous) in a way that is acceptable to the ICU regular expression matching interface.

The Find and Replace operation in ICU Regular Expressions is, to me, very strange. The documented way to use it is to build a new text that reproduces the entire source text, with replacements. That would make sense when reading a file and writing a new file; but it is, to me, not obvious how to apply it sensibly in the context of a text editor.

It’s probably possible to do everything necessary for good integration with Scintilla with ICU regular expressions, it’s just a big task, starting with learning more about how their UText extensibility works. I saw enough to think that it could be made to work on “ANSI” documents, it would just be a whole other side project in itself to figure out how. (The problem with just converting the document to Unicode is that you wind up with the starting and ending character positions of a match in the converted text, but no good way to convert those to positions in the original text, which is what you need to select the match in Scintilla.) Beyond the “ANSI” problem, forcing Scintilla to move its gap to the end before each search is not ideal, and for large documents that are being changed (as in find and replace) the performance loss would be bad. So even for Unicode I would need to write a different UText extension that doesn’t require contiguous UTF-8.
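The position problem in the parenthetical can be shown with a small Python sketch. It is only an illustration of the issue, not something Search++ does: the `char_to_byte_offsets` helper is a hypothetical name, code page 932 is picked arbitrarily as an example of a double-byte "ANSI" encoding, and Python string indices (code points) stand in for positions in the converted text, whereas ICU reports UTF-16 offsets.

```python
import codecs
import re


def char_to_byte_offsets(raw, encoding):
    """offsets[k] is the byte offset in raw where decoded character k starts.

    One extra final entry holds len(raw), so match end positions map too.
    """
    decoder = codecs.getincrementaldecoder(encoding)()
    offsets, start = [], 0
    for pos in range(len(raw)):
        produced = decoder.decode(raw[pos:pos + 1])  # feed one byte at a time
        for _ in produced:
            offsets.append(start)
        if produced:
            start = pos + 1
    offsets.append(len(raw))
    return offsets


# "abc" + three double-byte characters + "def", as stored in the document.
raw = "abc\u65e5\u672c\u8a9edef".encode("cp932")
text = raw.decode("cp932")                  # the converted copy that gets searched

m = re.search("\u672c\u8a9e", text)         # positions are in the converted text
offsets = char_to_byte_offsets(raw, "cp932")
byte_start, byte_end = offsets[m.start()], offsets[m.end()]
```

Here the match is at characters 4 to 6 of the converted text but at bytes 5 to 9 of the original. A brute-force table like this costs memory proportional to the document and would have to be rebuilt after every edit, which is part of why it is not an attractive approach.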

Then there’s analyzing that replacement logic and figuring out how to use it in a way that makes sense in Scintilla. It might be easier to implement replacements from scratch and completely ignore ICU’s replacement logic and syntax: based on what documentation exists, it seems likely that their supported syntax is very limited — none of the fancy stuff in Boost.Regex extended format strings. (Unfortunately, I couldn’t just use the Boost.Regex formatter, because it depends on the structured data produced by a Boost.Regex match, which is different than the structured data produced by an ICU match.)

Since I haven’t dug deeply into how ICU implements regular expressions, I can’t say how difficult it might be to customize or extend them. I have a better (not by any means comprehensive!) idea of what might be possible with Boost.Regex. Because of that, and because Notepad++ users are familiar with Boost.Regex syntax, I judged that it will probably be more practical to extend Boost.Regex with some features from ICU than to extend ICU toward Boost.Regex. Honestly, I don’t see the detailed Unicode properties of ICU as being nearly so valuable in practical, real-world use as the features Boost.Regex has that ICU doesn’t (\K and backtracking control are the ones I remember).

I do hope to extend the Boost.Regex implementation further. I’d like to implement Unicode word boundaries, but I haven’t yet gone into it deeply enough to determine whether it is practical. There might be a way to expose arbitrary Unicode properties, but that is also something I will have to study further. The biggest thing I’d like to do is figure out how to get a progress monitoring hook inside the matching process, so that annoying “too complex” message could be replaced by the ability to click cancel on a progress dialog — I know that will be challenging, and maybe not possible. I want to get the framework supporting what I have now up to a level where I feel that I can responsibly release it without classifying it as a “pre-release” before I get into those projects.

In the ICU search, I mostly included the stuff that was relatively easy — that I could copy from the same logic used for Plain and Regex. I thought it might serve as a good comparison test for whether I had implemented Unicode properties correctly in Regex — where there is a discrepancy, I should know why (such as my “ignore case insensitivity for character classes” rule). So I expected to leave it for future use to check as I try to extend Boost.Regex some more, but I thought I would probably hide it so users wouldn’t stumble over it. I never really thought it would be something many (any?) users would want.

The bottom line is that I only have so much time and capacity for concentration, and I don’t think making ICU regular expressions fully functional is likely to be the best use of it — at least not yet.

guy038

Hi, @coises,

Many thanks for your very exhaustive answer ! So OK, I understand that the ICU replacement seems really difficult to implement !

In my opinion, as the replacement syntax seemed simpler when using ICU than when using Boost, I naively thought that a solution could be implemented easily enough !

Sorry for my noob approach to the problem. And given what I now know, I won’t dare ask you about specific topics like this one again. Just follow your train of thought, which, I am convinced, will lead to a polished final Search++ plugin !

BR

guy038

Coises

@guy038 said in Search++: A work in progress:

I won’t dare ask you about specific topics like this one, again

There is no reason you shouldn’t ask. I am sorry if I sounded defensive or otherwise implied that the question was troublesome. I realize I probably gave a more exhaustive (or exhausting) reply than you really needed… I kind of got caught up “explaining it to myself.”

It’s never really possible for someone who isn’t doing the programming to know whether something will be easy, difficult or anywhere in between. Some things seem like they should be easy… and they are! Others have complications that don’t turn up until you’ve already put in a bunch of time and effort, then you discover one little detail that you can’t change that blocks your entire approach. So for me, too, it’s only a guess how much work it would be, though I have more information from which to make my guess.

Please don’t hesitate to ask questions and make suggestions. It’s possible that I have a good reason for doing or not doing any given thing; it’s equally possible that I just never thought about it. I don’t expect people who aren’t working with the code to know the difference, and neither should they expect that of themselves. Though I have to prioritize, and some things don’t make the “first cut,” all feedback helps; what doesn’t get done for one release will still be waiting to be considered in another. Even if I outright say, “No, I’m just not going to do that,” it still tells me there’s a need adjacent to what I’m building that I haven’t addressed, and I should think about how to make it better when I can.

Thank you, @guy038, for all the work you’ve done so far to test and explore this project.

Coises

Just a few small fixes in Search++ version 0.5.5:

Fix Replace All with ICU search engine should give “Command not implemented” message.
Copy current line indicator and caret settings from active document to Find and Replace box and search results list.
Use a more distinct symbol for “in selection” scope on command buttons.
Make messages for Mark and Show commands more accurate when there are null matches.

All observations, criticisms, experiences and suggestions are welcome!

guy038

Hello, @coises and All,

Thanks for that new release ! Briefly :

I quickly tested the Replace All button, while in ICU mode, as well as the new icon for the Selection scope on command buttons => I confirm the expected results !
Thanks for including the Current Line Indicator and Caret settings within Search++. It’s worth pointing out that if you modify these N++ settings during a N++ session, you’ll need to close and restart Notepad++ in order for these new parameters to be taken into account by the Search++ plugin !
Finally the message, regarding the null matches when using a Mark or Show command, is, indeed, much more explicit !

Now, I stumbled upon a weird bug : while within a N++ session :

If Search++ panel is already opened, click on the cross, on the far right to close the Search++ dialog

Now, re-open Search++ with the Plugins > Search++ > Search... option ( Note that I use the Docking mode )

Click on the current tab ( not within the text window ! )
Click on the Search++ window title
Sometimes, you’ll need to repeat these last two actions twice, in order to trigger that bug
Now, try to close the Search++ panel by clicking on the cross, on the far right => Nothing happens !?
Generally, after some clicks, the dialog finally closes !

Note that I mapped the Ctrl + Shift + N shortcut to the Plugins > Search++ > Search... option. So, if I use this shortcut, I’m able to actually close the Search++ dialog in that specific case and also in all the other cases !

I would consider that it’s a minor bug and I’m not 100 % certain about the steps to reproduce it : not quite obvious !

Best Regards,

guy038

Coises

@guy038 said in Search++: A work in progress:

Thanks for including the Current Line Indicator and Caret settings within Search++. It’s worth pointing out that if you modify these N++ settings during a N++ session, you’ll need to close and restart Notepad++ in order for these new parameters to be taken into account by the Search++ plugin !

Another way is to change between light and dark mode; that causes Search++ to reset all the appearance information it copies from the active tab. That’s because plugins get a notification when light/dark mode changes, but I don’t think there is anything that would notify Search++ when current line or caret settings change.

A good point, though: I should either document this behavior, or reset appearance every time a Search++ window gains focus.

Now, I stumbled upon a weird bug : while within a N++ session :

If Search++ panel is already opened, click on the cross, on the far right to close the Search++ dialog

Now, re-open Search++ with the Plugins > Search++ > Search... option ( Note that I use the Docking mode )

Click on the current tab ( not within the text window ! )

Click on the Search++ windows title

Sometimes, you’ll need to repeat these last two actions twice, in order to trigger that bug

Now, try to close the Search++ panel by clicking on the cross, on the far right => Nothing happens !?

Generally, after some clicks, the dialog finally closes !

Note that I mapped the Ctrl + Shift + N shortcut to the Plugins > Search++ > Search... option. So, if I use this shortcut, I’m able to actually close the Search++ dialog in that specific case and also in all the other cases !

I would consider that it’s a minor bug and I’m not 100 % certain about the steps to reproduce it : not quite obvious !

I haven’t yet been able to get this to happen on my system. Thank you for reporting it. If I can reproduce it, I’ll attempt to figure out why it happens. What’s strange is that Notepad++ manages the close button for docking dialogs. Even if I completely remove all my close (actually hide) dialog code, clicking that X still closes the docking panel and hides the search dialog. So either you have stumbled on a Notepad++ bug, or something I am doing in Search++ is interfering with normal Notepad++ behavior.

Have you ever seen this behavior with any other docking window?

guy038

Hi, @coises,

I re-tested the supposed bug and I simplified the procedure which is necessary to trigger that bug !

If opened, close the Search++ plugin by clicking on the cross, at the far right
A Re-open Search++ with the Plugins > Search++ > Search... option
B Try to close the Search++ panel by clicking on the cross, at the far right => Nothing happens !?
C After some tries, if you move the mouse slightly, you should be able to close the Search++ panel

Sometimes, you’ll need to repeat the actions A to C, up to 5 times consecutively, to trigger that bug, but it may also occur at the first try !

I also noted that, when the bug occurs, any subsequent left click on the cross does nothing until I move the mouse very slightly, without clicking. That is enough, then, to close the Search++ panel by clicking again on the cross icon. Very strange, indeed !?

As I suspected that the problem could be a hardware issue with my Bluetooth mouse, I disabled it and installed a classical USB mouse, instead. But, unfortunately, the results were identical, as was the uncertainty regarding the manifestation of the bug !

I have different portable versions of Notepad++, but the one which is concerned is the v8.9 release where I installed, both, your Columns++ and Search++ plugins

Here is my Debug info :

Notepad++ v8.9   (64-bit)
Build time: Jan 10 2026 - 02:25:19
Scintilla/Lexilla included: 5.5.8/5.4.6
Boost Regex included: 1_90
pugixml included: 1.15
nlohmann JSON included: 3.12.0
Path: D:\890_x64\Notepad++.exe
Command Line: 
Admin mode: OFF
Local Conf mode: ON
Cloud Config: OFF
Periodic Backup: OFF
Placeholders: OFF
Scintilla Rendering Mode: SC_TECHNOLOGY_DIRECTWRITE (1)
Multi-instance Mode: monoInst
asNotepad: OFF
File Status Auto-Detection: cdEnabledNew (for current file/tab only)
Dark Mode: OFF
Display Info:
    primary monitor: 1920x1080, scaling 125%
    visible monitors count: 1
    installed Display Class adapters: 
        0001: Description - Intel(R) Iris(R) Xe Graphics
        0001: DriverVersion - 32.0.101.7084
OS Name: Windows 11 Pro (64-bit)
OS Version: 25H2
OS Build: 26200.7462
Current ANSI codepage: 1252
Plugins: 
    mimeTools (3.1)
    NppConverter (4.7)
    NppExport (0.4)
    ComparePlus (2.2)
    ColumnsPlusPlus (1.3.1)
    NppUISpy (1.2)
    MultiReplace (4.6.0.33)
    Marginalize (1)
    Search++ (0.5.5)

Finally, note that this portable version of N++ is installed on a USB drive

Best Regards,

guy038

Lachlanmax

@guy038, @coises and all,

I use the dark mode display for almost all apps and I really like it, so I think we are in agreement :)

You know what they say - join the dark mode, we have cookies… 🍪

On another note, I would like to hear your thoughts as veterans on this forum. I’m working on an AI chat plugin to streamline the use of prompting an LLM into the workflow with N++. I wonder - has anyone done it before? What would be important features to implement?

I would appreciate hearing your thoughts.

Thank you!

guy038

Hello, @lachlanmax and All,

Well, presently I’m not a modern C++ developer and I tend to use the old Microsoft QBasic or sometimes the GAWK software and, more rarely, the N++ PythonScript, in addition to N++ and its plugins, of course !

However, I’ve got some documentation on C/C++. So, I’ll probably learn a lot from scratch but I’m not sure yet if I’ll be able to make improvements to our beloved editor !

So, to my mind, you should get in touch with @Richárd-stockinger and @jw, who have already developed AI tools for N++. They will surely provide you with valuable advice for your project ! Refer to :

https://github.com/krazal/nppopenai

https://github.com/Qdthon/Notepad-AIPlugin

Best Regards,

guy038

P.S. :

Although you might assume that some of my regex solutions seem to have been created with the help of an AI engine, I can assure you that it’s only my modest brain that’s at work when it comes to these solutions !

Lachlanmax

@guy038
I wasn’t familiar with these projects. Thank you for the tip-off, I have a lot to learn from these. Glad I asked your advice.

We don’t know each other too well yet, but from your replies I get the feeling you have been coding for a while now… as a relative n00b I think it’s good to learn coding the nuts-and-bolts way, not just “vibe coding” like everyone is nowadays. (Even though I’m developing an AI plugin, so a bit of a contradiction. But I like to develop plugins that I would use personally, and I don’t use it to “vibe code”. Granted though that some might.)

tl;dr Hard work pays off!

guy038

Hi, @lachlanmax and All,

I forgot to give you the Community threads of, both, @Richárd-stockinger and @jw :

https://community.notepad-plus-plus.org/topic/24444/new-plugin-nppopenai

https://community.notepad-plus-plus.org/topic/27361/ai-plugin

BR

guy038

guy038

Hello, @coises and All,

How are you, @coises ? Pretty good, I guess ! I just uploaded a new version of the ICU.txt file which simply corrects a few typos.

https://drive.google.com/file/d/1_TwEV1oorsoWUUjL97BmptcR4fpNQ98_/view?usp=sharing

First, regarding the bug I mentioned in my two previous posts, I also detect it with native N++ and, for example, the Find Results panel :

Open the Find dialog
Do any kind of search
Click on the Find All in Current Document button

=> A Search results panel opens and displays all the occurrences of the searched string

Now :

(1) First, click once, anywhere on the blue Search results title
(2) Then, moving your mouse horizontally right to the cross, at the far right, try to close the panel => Nothing happens !
(3) Just, move down the mouse within the Search results panel then re-click on the cross => This time, the Search results panel should close, as expected.
Re-open the Search results panel, using the F7 key
Repeat actions 1, 2 and 3 to trigger that bug again. Sometimes, several tries are necessary to reproduce the problem !

Remark :

If, you accidentally double click on the Search results title bar, this panel will become a floating window. To restore its initial position, simply double-click again on the Search results title bar

Note that, in the end, this bug does not really matter because I mapped the Ctrl + Shift + N shortcut to the Search... option of your plugin. So, hitting this shortcut twice does close the plugin, anyway !

I thought of an enhancement of your Search++ plugin :

Let’s suppose that I want to mark all lines which satisfy the regex below, against the latest change.log file ( release 8.9.6 ). So :

We open or select the change.log file / tab
We open the Mark dialog ( Ctrl + M )
We check the Bookmark line, Purge for each search and Wrap around options, ONLY
MARK (?i-s)(?:\G|Q).*?T
We select the Regular expression mode
Finally, we click on the Mark All button

Two lines only are bookmarked : the very first one and the point #8 of v8.9.4. However, the total number of occurrences is 8

Notepad++ v8.9.6 vulnerability fix, regression fixes & bug-fixes:
 8. Fix quote escaping causing incorrect JSON syntax highlighting (Lexilla update related).

As you can see :

For the first line, as we are at the very beginning of the file, the \G alternative finds two occurrences
For the second line, the Q alternative finds the first occurrence and the \G alternative finds the five remaining occurrences
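These counts can be checked with a small Python sketch. Python's standard `re` module has no \G, so the sketch emulates it by trying `.*?T` exactly at the end of the previous match and falling back to a search for `Q.*?T` otherwise. It only uses the two lines quoted above, not the whole change.log file, and the `find_all` helper and the 0/1 colour numbering are just illustrations, not anything Search++ does.

```python
import re

# Python's re has no \G, so the two alternatives are tried separately.
ANCHORED = re.compile(r".*?T", re.IGNORECASE)  # stands in for \G.*?T
FROM_Q = re.compile(r"Q.*?T", re.IGNORECASE)   # stands in for Q.*?T


def find_all(text):
    """Return (alternative, matched_text) pairs, in match order."""
    hits, pos = [], 0              # \G also holds at the very start of the text
    while True:
        m, kind = ANCHORED.match(text, pos), r"\G"
        if m is None:
            m, kind = FROM_Q.search(text, pos), "Q"
        if m is None:
            return hits
        hits.append((kind, m.group()))
        pos = m.end()              # the next \G position


lines = [
    "Notepad++ v8.9.6 vulnerability fix, regression fixes & bug-fixes:",
    " 8. Fix quote escaping causing incorrect JSON syntax highlighting (Lexilla update related).",
]
hits = find_all("\n".join(lines))

# Alternate between two colour slots (0 and 1) for successive occurrences.
colours = [k % 2 for k in range(len(hits))]
```

With two alternating colours, each of the eight occurrences would be visually separated from its neighbours even though they are contiguous on the same line.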

So, I was wondering if you could alternate the colors, between the present mark color and another one, in order to easily distinguish these occurrences ?

Now, near the end of your initial Search++ post, you said :

I plan to add a Save function that will let you save searches you might want to use again. Of course, once it is possible to save, it has to be possible to delete and rename and edit and organize… I haven’t designed a user interface for any of that yet.

I do support this future enhancement !

Best Regards,

guy038