Community
    • Login

    Search for a text and copy the previous lines of codes

    Scheduled Pinned Locked Moved Help wanted · · · – – – · · ·
    27 Posts 6 Posters 3.2k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Alan KilbornA
      Alan Kilborn
      last edited by Alan Kilborn

      @Scott-Nielson said:

      so, that should be (?<!lookbehind)\K(?<!/a>)\K right (to skip a look behind and a link just before what is searched for)? Should he (the OP) or anyone else…

      Note that I wasn’t considering the OP’s problem when I made my original statement, but was just addressing a caveat involving your earlier reply.

      The OP’s problem wasn’t the clearest in the world. When that happens, I sort of lose interest in the question.

      Scott NielsonS 4 Replies Last reply Reply Quote 0
      • Scott NielsonS
        Scott Nielson @Alan Kilborn
        last edited by

        @Alan-Kilborn You Sir are a legend. I understood what you mean and I believe it will be of use for others who search online and get here! Thanks a lot.

        1 Reply Last reply Reply Quote 0
        • Scott NielsonS
          Scott Nielson @Alan Kilborn
          last edited by

          @Alan-Kilborn What should we do if we’re trying to skip 2 or more strings which are looking behind?

          1 Reply Last reply Reply Quote 0
          • Scott NielsonS
            Scott Nielson @Alan Kilborn
            last edited by Scott Nielson

            @Alan-Kilborn to make it more clear, what RegEx can help skip searching for abc in abcdef, xyz in xyzdef and xxx in xxxdef (all at once, with just one RegEx)?

            PeterJonesP 1 Reply Last reply Reply Quote 0
            • Scott NielsonS
              Scott Nielson @Alan Kilborn
              last edited by Scott Nielson

              @Alan-Kilborn What if it was abc
              def, xyz
              def and xxx
              def?

              1 Reply Last reply Reply Quote 0
              • PeterJonesP
                PeterJones @Scott Nielson
                last edited by PeterJones

                @Scott-Nielson ,

                Anything you put before the \K will be effectively “lookbehind”. Before that \K, you can use any valid regex syntax, which will carry the same meaning. So if you want to match the def, but only if it’s prefaced by abc or xyz, or one-or-more-xes, and with zero or more spaces between it and the def you could look for (?:abc|xyz|x+)\h*\Kdef – this will match the def in abcdef or abc def or abc def (with lots of spaces between, which the forum won’t let me show) or xyz def or xxxxxxx def or xdef or many others.

                (In your actual example of xxx, and without the variable number of spaces, there would have been no reason for the \K, and you could have used a normal lookbehind, because the number of characters could be exactly known by the regex engine: (?<=(abc|xyz|xxx) )def . Thus, I changed the example to something that could match any number of characters, and thus wouldn’t work in a normal lookbehind situation.)

                (I added spaces while writing this up, because your multiple posts and edits were moving in that direction)

                PeterJonesP 1 Reply Last reply Reply Quote 1
                • PeterJonesP
                  PeterJones @PeterJones
                  last edited by PeterJones

                  Examples:
                  19fdb317-fad4-47c3-95e1-9150b8536bd7-image.png

                  0f9cccc2-86ea-4abb-bbb2-85110bfe2083-image.png

                  abcdef
                  xyzdef
                  pdqdef
                  xdef
                  xxdef
                  xxxdef
                  xxxxdef
                  abc def
                  xyz def
                  pdq def
                  x def
                  xx def
                  xxx def
                  xxxx def
                  x       def
                  xx      def
                  xxx     def
                  xxxx    def
                  
                  Scott NielsonS 3 Replies Last reply Reply Quote 1
                  • Scott NielsonS
                    Scott Nielson @PeterJones
                    last edited by

                    @PeterJones I want to use the lookbehind RegEx only. Suppose the def in the string I typed above is the only unique code (or whatever it may be called), how to find all that I asked for above using the lookbehind RegEx?

                    1 Reply Last reply Reply Quote 0
                    • Scott NielsonS
                      Scott Nielson @PeterJones
                      last edited by Scott Nielson

                      @PeterJones Something like (?<!abc)(?<!xyz)(?<!xxx)(?<=def)?

                      PeterJonesP 1 Reply Last reply Reply Quote 0
                      • Scott NielsonS
                        Scott Nielson @PeterJones
                        last edited by

                        @PeterJones I believe your Regex will work but what if I wanted to use the lookbehind RegEx and skip some codes/strings just before a unique code/string?

                        1 Reply Last reply Reply Quote 0
                        • PeterJonesP
                          PeterJones @Scott Nielson
                          last edited by PeterJones

                          @Scott-Nielson ,

                          I am not here to be your personal regex writer and regex debugger, sorry. I gave you general principals. I gave you specific examples. We have linked you to regex documentation. I’m not sure what else you can reasonably expect from us.

                          As we said, if the length varies, \K is equivalent to lookbehind. You need to use whatever syntax works for your particular situation. Your brief examples aren’t enough for us to know if they’ll work for your actual needs.

                          As a parting help,

                          1. I have no idea why you suddenly switched to negative lookbehind. All your descriptions have said “I want to match the def assuming that abc or xyz or xxx come before it”. But the regeex you just proposed says “match def as long as none of abc or xyz or xxx come before it” which is literally the opposite of what you’ve previously asked for.
                          2. Why did you put def in a lookbehind when you wanted it to be part of the match itself. From that, you would just want (?<!abc)(?<!xyz)(?<!xxx)def – with the def not in a lookbehind.

                          But, to use an analogy: it’s time for you to take off the training wheels and try to learn how to balance and bicycle on your own. You need to learn regular expressions enough that you can do it without constantly relying on us to guess whether or not a regular expression will work for the data that we cannot see that you don’t accurately describe to us.

                          ----

                          Please note: This Community Forum is not a data transformation service; you should not expect to be able to always say “I have data like X and want it to look like Y” and have us do all the work for you. If you are new to the Forum, and new to regular expressions, we will often give help on the first one or two data-transformation questions, especially if they are well-asked and you show a willingness to learn; and we will point you to the documentation where you can learn how to do the data transformations for yourself in the future. But if you repeatedly ask us to do your work for you, you will find that the patience of usually-helpful Community members wears thin. The best way to learn regular expressions is by experimenting with them yourself, and getting a feel for how they work; having us spoon-feed you the answers without you putting in the effort doesn’t help you in the long term and is uninteresting and annoying for us.

                          ----

                          Do you want regex search/replace help? Then please be patient and polite, show some effort, and be willing to learn; answer questions and requests for clarification that are made of you. All example text should be marked as literal text using the </> toolbar button or manual Markdown syntax. To make regex in red (and so they keep their special characters like *), use backticks, like `^.*?blah.*?\z`. Screenshots can be pasted from the clipboard to your post using Ctrl+V to show graphical items, but any text should be included as literal text in your post so we can easily copy/paste your data. Show the data you have and the text you want to get from that data; include examples of things that should match and be transformed, and things that don’t match and should be left alone; show edge cases and make sure you examples are as varied as your real data. Show the regex you already tried, and why you thought it should work; tell us what’s wrong with what you do get. Read the official NPP Searching / Regex docs and the forum’s Regular Expression FAQ. If you follow these guidelines, you’re much more likely to get helpful replies that solve your problem in the shortest number of tries.

                          Scott NielsonS 2 Replies Last reply Reply Quote 1
                          • Scott NielsonS
                            Scott Nielson @PeterJones
                            last edited by

                            @PeterJones OK, thanks a lot. Like you said, I have been experimenting!

                            1 Reply Last reply Reply Quote 0
                            • Scott NielsonS
                              Scott Nielson @PeterJones
                              last edited by

                              @PeterJones Just for your information, I had typed, “(?<=def)” above, not, “(?<!def)”.

                              PeterJonesP 1 Reply Last reply Reply Quote 0
                              • PeterJonesP
                                PeterJones @Scott Nielson
                                last edited by

                                @Scott-Nielson said in Search for a text and copy the previous lines of codes:

                                Just for your information, I had typed, “(?<=def)” above, not, “(?<!def)”.

                                Specifically, you typed (?<!abc)(?<!xyz)(?<!xxx)(?<=def). My previous post stands correctly as a comment on that regular expression as you typed it. It was a group of four sub-expressions: three negative lookbehinds and one positive lookbehind. My point #1 referred to the first three negative lookbehinds. My point#2 referred to the final positive lookbehind. I stand by my previous assessment of your regular expression.

                                Scott NielsonS 1 Reply Last reply Reply Quote 0
                                • Scott NielsonS
                                  Scott Nielson @PeterJones
                                  last edited by

                                  @PeterJones got it, thanks.
                                  So for the string

                                  Bible.
                                  <a href=www.jw.org>Jehovah's witnesses</a>
                                  Atheists
                                  abc
                                  xyz
                                  xxx
                                  </span>
                                  </p>
                                  

                                  if I had to find just, “</span>” and
                                  “</p>” when the “</p>” is on the next line, this RegEx should be fine right: (?<!/a)(?<!abc)(?<!xyz)(?<!xxx)(?<=/span)>\s*</p>?

                                  PeterJonesP 1 Reply Last reply Reply Quote 0
                                  • PeterJonesP
                                    PeterJones @Scott Nielson
                                    last edited by PeterJones

                                    @Scott-Nielson ,

                                    this RegEx should be fine right

                                    I do not believe you have understood what I have tried to explain previously. As I have said to you multiple times, if it works for you, great! This forum doesn’t exist to be a regex “do-your-homework-for you” site, nor a “tell-me-if-my-regex-is-right” site, nor a regex tutorial site, or a “regex-for-dummies” site, nor a “let’s-talk-about-regex-and-nothing-else” site.

                                    Regular expressions are just a small part of what Notepad++ can do, and once we’ve helped a given user a couple times with regular expressions (and trust me, you’ve gone beyond “a couple” at this point), and pointed the same user multiple times to all the resources that on regular expressions – including the Notepad++ Online User Manual section on search/replace for the canonical reference to Notepad++ regular expressions, and the regular expression FAQ with links to lots of online regex-learning resources, both linked above – then we expect that user to make more use of those documents and links, and take the onus of learning regular expressions upon themselves, rather than expecting us to generate or proofread their every regular expression.

                                    Some personal advice: the only way you are going to learn more about regular expressions at this point is to experiment with them yourself, and read the resources, and try it on your own; us giving you a “seal of approval” won’t help, and us spoon-feeding you a regex that works with the small example data you share won’t help you learn how to do it on your own.

                                    Good luck.

                                    1 Reply Last reply Reply Quote 2
                                    • guy038G
                                      guy038
                                      last edited by guy038

                                      Hello, @scott-nielson, @peterjones, @alan-kilborn, @terry-r and All,

                                      @scott-nielson, when you said in an older post :

                                      what RegEx can help skip searching for abc in abcdef, xyz in xyzdef and xxx in xxxdef (all at once, with just one RegEx)?

                                      The @peterjones’s answers are just the right ones :

                                      • (?:abc|xyz|x+)\h*\Kdef     with the \K construction

                                      • (?<=(?:abc|xyz|xxx)\h*)def    with a positive look-behind


                                      Back to your last post, IF you had tried the regex (?<!/a)(?<!abc)(?<!xyz)(?<!xxx)(?<=/span)>\s*</p>, you could have seen that this regex, as well as all the following ones :

                                      (?<!abc)(?<!xyz)(?<!xxx)(?<=/span)>\s*</p>

                                      (?<!xyz)(?<!xxx)(?<=/span)>\s*</p>

                                      (?<!xxx)(?<=/span)>\s*</p>

                                      (?<=/span)>\s*</p>

                                      just match a >, followed by any number of vertical or horizontal blank characters, even none and, next, the </p> string

                                      Of course, if you’re really looking for this, the simple regex >\s*</p> seems enough ;-))

                                      Or, as you said :

                                      If I had to find just, “</span>” and “</p>” when the “</p>” is on the next line …

                                      may be, this other one </span>\s*</p>


                                      Now, if you looking for any block </span> ••••• </p>, after a line <a href ••••• > ••••• </a> which do not contain, in the middle, any text abc, xyz and xxx, with this exact case, the following regex should work :

                                      SEARCH / MARK (?s-i)</a>((?<!abc)(?<!xyz)(?<!xxx).)*?\K</span>\s*</p>

                                      Test it against this example :

                                      Bible.
                                      <a href=www.jw.org>Jehovah's witnesses</a>
                                      Atheists
                                      abc
                                      </span>
                                      </p>
                                      
                                      Bible.
                                      <a href=www.jw.org>Jehovah's witnesses</a>
                                      xyz
                                      Atheists
                                      other text
                                      added
                                      </span>      </p>
                                      
                                      Bible.
                                      <a href=www.jw.org>Jehovah's witnesses</a>
                                      Atheists
                                      Line 1
                                      xxx
                                      Line 2
                                      Line 3
                                      </span>
                                      </p>
                                      
                                      Bible.
                                      <a href=www.jw.org>Jehovah's witnesses</a>
                                      Atheists
                                      Line 1
                                      Line 2
                                      Line 3
                                      </span>
                                      
                                      
                                      </p>
                                      
                                      Bible.
                                      <a href=www.jw.org>Jehovah's witnesses</a>
                                      Atheists
                                      First sentence
                                      Second sentence
                                      Third sentence
                                      </span></p>
                                      

                                      As you can verify, this regex just matches the 4th and 5th blocks, which do not contain any text abc, xyz and xxx, between </a> and </span>


                                      Note that I use the \K syntax, because, due to non-fixed length generated by the *? non-greedy quantifier, a look-behind syntax cannot be used, in this situation !

                                      Best Regards,

                                      guy038

                                      Scott NielsonS 2 Replies Last reply Reply Quote 1
                                      • Scott NielsonS
                                        Scott Nielson @guy038
                                        last edited by

                                        @guy038 thanks a lot. I don’t know what I would have done without your help!

                                        1 Reply Last reply Reply Quote 1
                                        • Scott NielsonS
                                          Scott Nielson @guy038
                                          last edited by Scott Nielson

                                          @guy038 Suppose if I don’t have a link, that is </a> and I am not sure what to put there (because it varies), will this RegEx work if I am using the same search strings that you typed above: (?s-i)((?<!abc)(?<!xyz)(?<!xxx).)*?\K</span>\s*</p>?

                                          1 Reply Last reply Reply Quote 0
                                          • First post
                                            Last post
                                          The Community of users of the Notepad++ text editor.
                                          Powered by NodeBB | Contributors