Community
    • Login

    Poorman regex based styler/lexer

    Scheduled Pinned Locked Moved Help wanted · · · – – – · · ·
    16 Posts 4 Posters 11.3k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Claudia FrankC
      Claudia Frank
      last edited by

      Hello Johannes,

      will follow up on this when back from work.

      But, without callbacks would mean that you have to run it over and over again
      to make it coloring again if text changes.

      One question in the meantime, is the 8 columns as shown in your photo,
      also fix or could it be that more or less columns are required.

      Cheers
      Claudia

      1 Reply Last reply Reply Quote 1
      • Johannes DillingerJ
        Johannes Dillinger
        last edited by

        Hello Claudia,

        thanks again for the quick reply! Oh, does not have to to without callbacks, I thought it might make life easier. In my version I tried without, but it did not work, so…:)

        The 8 columns are fix.

        In the meantime I will try some more if I find the time; in case I can get something to work, will let you know…

        Thanks a lot again!
        Kind regards,

        Johannes

        1 Reply Last reply Reply Quote 0
        • Claudia FrankC
          Claudia Frank
          last edited by

          Hello Johannes,

          I do understand that you will give it a try yourself
          and you will come back in case of a question, correct?

          Cheers
          Claudia

          1 Reply Last reply Reply Quote 0
          • guy038G
            guy038
            last edited by guy038

            Hello @Claudia-frank and All,

            If you don’t mind, Claudia, I would like you to test my example text, with my own version of your script !

            Here is the sample text, in the Test.log file :

            1234567890123456789012345
            
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            12
            123456
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            
            
            1234567890123456789012345
            
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            12
            123456
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            
            
            1234567890123456789012345
            
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            12
            123456
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            
            
            1234567890123456789012345
            
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            12
            123456
            1234567890123456789012345
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            1234567890123456789012345
            1234567890
            1234567890123456789012345
            

            Now, here is the part of your script, that I slightly changed :

            # ---------------------------- configuration area -----------------------------
            
            # stlye definitions - changes here need to match setup calls in init_scintilla()
            STYLE_DEFAULT               = 0 # don't change this
            STYLE_WARNINGS              = 1
            STYLE_ERRORS                = 2
            
            STYLE_DEFAULT_FOREGROUND    = (0,0,0)           # Black
            STYLE_DEFAULT_BACKGROUND    = (255,255,255)     # White
            STYLE_WARNINGS_BACKGROUND   = (0,255,0)         # Green
            STYLE_ERRORS_BACKGROUND     = (255,0,0)         # Red
            
            REGISTERED_FILE_EXTENSIONS  = ['.log']
            USE_AS_DEFAULT_LANGUAGE     = False
            
            REGEX_DICT = OrderedDict()  # to be sure that regex get called in the same order each time
            REGEX_DICT['STYLE_WARNINGS']      = '(?-s)^.{8}'
            REGEX_DICT['STYLE_ERRORS']        = '(?-s)^.{16}\K.{4}'
            
            # -------------------------- configuration area end ---------------------------
            
            CUSTOM_STYLER_IS_RUNNING = globals().get('CUSTOM_STYLER_IS_RUNNING', False) 
            LIST_OF_FIRST_RUN_DONE = []
            DOC_NEEDS_TO_BE_COLORED = False
            
            def init_scintilla():
                # this needs to match the definitions in configuration area
                editor.styleSetFore(STYLE_DEFAULT,   STYLE_DEFAULT_FOREGROUND)
                editor.styleSetBack(STYLE_DEFAULT,   STYLE_DEFAULT_BACKGROUND)   
                editor.styleSetBack(STYLE_WARNINGS,  STYLE_WARNINGS_BACKGROUND)  
                editor.styleSetBack(STYLE_ERRORS,    STYLE_ERRORS_BACKGROUND)  
            

            Notes :

            • As you can see, as I, usually, work with the default colours, I changed the STYLE_DEFAULT_… … colours

            • Of course, I changed the two regexes for testing :-D

            • In the init_scintilla() function, the last two lines, begin with editor.styleSetBack ( instead of editor.styleSetFore )

            • And, accordingly, I renamed the two variables STYLE_WARNINGS_FOREGROUND and STYLE_ERRORS_FOREGROUND into STYLE_WARNINGS_BACKGROUND and STYLE_ERRORS_BACKGROUND

            After starting N++ and opening the Test.log file, I ran your script and… bingo : Wow, I got two coloured columns :

            • The eight characters and green one, between column 1 and column 8

            • The four characters and red one, between column 17 and column 20

            That’s great, indeed !


            Now, there an small problem : If you scroll up and down, the editor window, quickly enough, with the mouse, some parts of the text, which should not be matched, are highlighted too :-((. I, first, thought it was due to the \K syntax. Unfortunately, even if you write, as below :

            REGEX_DICT['STYLE_ERRORS'] = 'XYZ'
            

            You’ll surely notice, some green parts of text are wrongly highlighted !

            What do you think about it ?

            Cheers,

            guy038

            1 Reply Last reply Reply Quote 0
            • Claudia FrankC
              Claudia Frank
              last edited by Claudia Frank

              Hi Guy,

              scrolling that fast - you must be one of those who can read a whole book in under a minute ;-)

              I assume the asynchronous updateui callback isn’t good in such a case.
              Give it a try with the synchronous version.

              editor.callbackSync(callback_UPDATEUI, [SCINTILLANOTIFICATION.UPDATEUI])
              

              Cheers
              Claudia

              1 Reply Last reply Reply Quote 0
              • Johannes DillingerJ
                Johannes Dillinger
                last edited by

                Hi Claudia,
                got it working, thanks a lot! Also the synchronous version works well, better than the asynchronous in my case I guess.

                https://goo.gl/photos/yqRr2o8rDJUm7Z2x6

                (never mind the colors…)

                One last question… I want to grey out comments-lines, which in this language start with a dollar sign and last till the end of the line (always). The dollar-sign can appear in the middle of a line, so also after a few numbers of actual code.
                I thought that this regex might do, but it does not:

                REGEX_DICT[‘STYLE_COMM’] = ‘(?<=[$]).+$’

                Can you give me a hint on how it should look like?
                Thanks again!
                Kind regards,

                Johannes

                1 Reply Last reply Reply Quote 0
                • guy038G
                  guy038
                  last edited by guy038

                  Hello, @Claudia-frank, @johannes-dillinger and All

                  A quick reply, because it’s 3h00 am, in France, and… I work, tomorrow !

                  • @Claudia-frank :

                  It’s just fine ! No problem, anymore, with the synchronous UPDATEUI callback :-))) So, thanks to you, you have a new wonderful world to discover, with multiple colouring of, either, foreground and/or background of lines of files, with a specific extension, handled with regular expressions !!


                  • @johannes-dillinger :

                  I think that the correct regex ( tested, with a grey colour, on foreground and background ! ) should be, as below :

                  REGEX_DICT[‘STYLE_COMM’] = '(?-s)\$.*'
                  

                  Notes :

                  • The (?-s) modifier tells the regex engine that the meta-character . matches a single standard character, only

                  • The \$ searches for a literal $ character

                  • The final part .* looks for the remainder of current line, possibly empty !


                  Once more, a very powerful script, that you give us, Claudia ! A thousand thanks, Yeah !

                  Cheers,

                  guy038

                  1 Reply Last reply Reply Quote 1
                  • Johannes DillingerJ
                    Johannes Dillinger
                    last edited by

                    Hello guy,
                    perfekt!! Thanks for the solution, all happy now:-)
                    Kind regards,
                    Johannes

                    1 Reply Last reply Reply Quote 0
                    • guy038G
                      guy038
                      last edited by

                      Hi, @claudia-frank,

                      When I, first, saw the @jcrmatos post, as the address, below :

                      https://notepad-plus-plus.org/community/topic/13753/suggestion-please-consider-adding-a-secondary-vertical-edge/1

                      I thought that it should have been possible to use your great script to visualize the gap between column 73 and 79, colouring the background. So, @jcrmatos could easily see, at once, the two limits, located at columns 72 and 79 !

                      Then, from your original script, I built the script, below, which highlights, in pale blue, the columns between the columns 73 and 79 :

                      import re, os
                      from collections import OrderedDict
                      
                      # ---------------------------- configuration area -----------------------------
                      
                      # style definitions - changes here need to match setup calls in init_scintilla()
                      STYLE_DEFAULT               = 0 # don't change this
                      STYLE_EDGES                 = 1
                      
                      STYLE_DEFAULT_FOREGROUND    = (0,0,0)           # Black
                      STYLE_DEFAULT_BACKGROUND    = (255,255,255)     # White
                      STYLE_EDGES_BACKGROUND      = (208,240,255)     # Pale Blue ( &xD0 , &xF0 , &xFF )
                      
                      REGISTERED_FILE_EXTENSIONS  = ['.log']
                      USE_AS_DEFAULT_LANGUAGE     = False
                      
                      REGEX_DICT = OrderedDict()  # to be sure that regex get called in the same order each time
                      REGEX_DICT['STYLE_EDGES']  = '(?-s)^.{72}\K.{1,7}'
                      
                      # -------------------------- configuration area end ---------------------------
                      
                      CUSTOM_STYLER_IS_RUNNING = globals().get('CUSTOM_STYLER_IS_RUNNING', False) 
                      LIST_OF_FIRST_RUN_DONE = []
                      DOC_NEEDS_TO_BE_COLORED = False
                      
                      def init_scintilla():
                          # this needs to match the definitions in configuration area
                          editor.styleSetFore(STYLE_DEFAULT,   STYLE_DEFAULT_FOREGROUND)
                          editor.styleSetBack(STYLE_DEFAULT,   STYLE_DEFAULT_BACKGROUND)   
                          editor.styleSetBack(STYLE_EDGES,     STYLE_EDGES_BACKGROUND)  
                      
                      def custom_lexer(start_pos, end_pos):
                          for k in REGEX_DICT.keys():
                              matches = []
                              editor.research(REGEX_DICT[k], lambda m: matches.append(m.span()), 0, start_pos, end_pos) 
                              for match in matches:
                                  # console.write('match:{} - {}\n'.format(match, editor.getTextRange(*match)))
                                  editor.startStyling(match[0],31)
                                  editor.setStyling(match[1]-match[0], eval(k))
                      
                      def this_doc_should_be_styled():
                          global DOC_NEEDS_TO_BE_COLORED
                          current_file = notepad.getCurrentFilename()
                          _file, _ext = os.path.splitext(current_file)
                          if _ext in REGISTERED_FILE_EXTENSIONS or (USE_AS_DEFAULT_LANGUAGE and _file.startswith('new ')):
                              DOC_NEEDS_TO_BE_COLORED = True
                          else:
                              DOC_NEEDS_TO_BE_COLORED = False
                          return DOC_NEEDS_TO_BE_COLORED
                      
                      def get_visible_area():   
                          first_visible_line = editor.getFirstVisibleLine()
                          last_visible_line = editor.linesOnScreen() + first_visible_line
                          start_pos = editor.positionFromLine(first_visible_line)
                          end_pos = editor.positionFromLine(last_visible_line)
                          return start_pos, end_pos
                      
                      def callback_BUFFERACTIVATED(args): 
                          if this_doc_should_be_styled():
                              init_scintilla()
                              if args['bufferID'] not in LIST_OF_FIRST_RUN_DONE:
                                  custom_lexer(*get_visible_area())
                                  LIST_OF_FIRST_RUN_DONE.append(args['bufferID'])
                      
                      def callback_UPDATEUI(args):
                          if DOC_NEEDS_TO_BE_COLORED and args['updated'] >= 4:
                              custom_lexer(*get_visible_area())
                      
                      def main():
                          global CUSTOM_STYLER_IS_RUNNING
                          if CUSTOM_STYLER_IS_RUNNING:
                              notepad.clearCallbacks([NOTIFICATION.BUFFERACTIVATED])
                              editor.clearCallbacks([SCINTILLANOTIFICATION.UPDATEUI])
                              CUSTOM_STYLER_IS_RUNNING = False
                          else:
                              if this_doc_should_be_styled():
                                  init_scintilla()
                                  custom_lexer(*get_visible_area())
                              notepad.callback(callback_BUFFERACTIVATED, [NOTIFICATION.BUFFERACTIVATED])
                              editor.callbackSync(callback_UPDATEUI, [SCINTILLANOTIFICATION.UPDATEUI])
                              CUSTOM_STYLER_IS_RUNNING = True
                      
                      main()
                      

                      Although, it works fine on .log files, I noticed two problems :

                      • If I modify a .log file, by adding a new line, of more than 79 characters long, I have to re-run the script, twice, in order to get this new line highlighted, between the 73th and the 79th character ! ( BTW, a small drawback : the current line cannot be highlighted ! )

                      • When I decided to change the .log extension by the .py extension, I was quite surprised to get, almost, all comments, of the python script, highlighted, too ! And sometimes, the columns between positions 73 and 79 were not highlighted.

                      So, Claudia, any explanation about these behaviours ?

                      Best Regards,

                      guy038

                      1 Reply Last reply Reply Quote 0
                      • Claudia FrankC
                        Claudia Frank
                        last edited by Claudia Frank

                        Hi Guy,

                        rerun two times because I implemented my start/stop logic as in most of my scripts.
                        See main section

                        def main():
                            global CUSTOM_STYLER_IS_RUNNING
                            if CUSTOM_STYLER_IS_RUNNING:
                                notepad.clearCallbacks([NOTIFICATION.BUFFERACTIVATED])
                                editor.clearCallbacks([SCINTILLANOTIFICATION.UPDATEUI])
                                CUSTOM_STYLER_IS_RUNNING = False
                            else:
                                if this_doc_should_be_styled():
                                    init_scintilla()
                                    custom_lexer(*get_visible_area())
                                notepad.callback(callback_BUFFERACTIVATED, [NOTIFICATION.BUFFERACTIVATED])
                                editor.callbackSync(callback_UPDATEUI, [SCINTILLANOTIFICATION.UPDATEUI])
                                CUSTOM_STYLER_IS_RUNNING = True
                        

                        If CUSTOM_STYLER_IS_RUNNING it clears the callbacks so no further coloring,
                        if it is not running it registers the callbacks to make coloring happen.

                        Concerning the renaming. This script is NOT a real lexer as it doesn’t register itself
                        to be a lexer by using container object, so it is critical that you choose an extension which hasn’t
                        a lexer assigned, which by the way means, that this script can’t be used as you normally do have
                        the python lexer active.

                        Regarding the active line, yes, it gets overwritten by the active line style.

                        In general, scintilla doesn’t provide many functions when it comes to column based coloring,
                        basically the edge line is the only one I’ve encountered so far. Maybe a solution might be to
                        draw to window context itself but this would mean one have to handle window resizing messages,
                        themes etc…

                        Cheers
                        Claudia

                        1 Reply Last reply Reply Quote 0
                        • guy038G
                          guy038
                          last edited by

                          Hi, @claudia-frank,

                          Oh yes, I just forgot that your script acts, exactly, like your RegexTesterPro.py script. This just proves that I haven’t studied your excellent regex’s tester script, since a long time !!

                          Concerning the second point, I do understand your explanations. To be honest, right after changing the extension to .py and before restarting your script, I already thought that this change could lead to some unpredictable results :-((

                          Cheers,

                          guy038

                          1 Reply Last reply Reply Quote 1
                          • First post
                            Last post
                          The Community of users of the Notepad++ text editor.
                          Powered by NodeBB | Contributors