Community
    • Login

    Replace many lines with conditions

    Scheduled Pinned Locked Moved Help wanted · · · – – – · · ·
    11 Posts 2 Posters 503 Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • guy038G
      guy038
      last edited by

      Hello, @trần-hoàng-long and All,

      OK, So, here is the road map :

      • Open your file in N++

      • Open the Replace dialog ( Ctrl + F )

      • SEARCH (?s-i)^target=18028\R\Ktemporary_opinion=\{.+?"love_opinion".+?(?=^temporary_opinion)

      • REPLACE Leave EMPTY

      • Tick the Wrap around option

      • Select the Regular expression search mode

      • Click on the Replace All only ( Do not use the Replace button )

      Et voilà !

      Notes :

      • First the modifiers (?s-i) means that the dot regex symbol, ., will match any character and that the search is sensitive to case

      • Then the part ^target=18028\R looks for the literal string target=18028, with this exact case, at beginning of line ( ^ ) followed with its EOL characters ( \R )

      • Now the \K assertion resets the working position and cancels any previous match

      • So, the part temporary_opinion=\{ look for the literal string temporary_opinion={, with this exact case

      • Followed with the smallest non-null range of any char ( .+?) …

      • Till the literal string "love_opinion"…

      • Followed with an other smallest non-null range of any char (.+?) …

      • which ends when it’s followed with an other literal string temporary_opinion which begins a line, due to the look-ahead structure (?=^temporary_opinion)

      • As the Replacement zone is empty, all this multi-lines range of text is then deleted

      Best Regards,

      guy038

      Trần Hoàng LongT 1 Reply Last reply Reply Quote 3
      • Trần Hoàng LongT
        Trần Hoàng Long @guy038
        last edited by

        @guy038 Many many thanks to you.
        I used your method but not get the result, and dont know why, so I attached my file here for you to check.
        https://drive.google.com/file/d/1WC4VHudMm2Do3_QV1xmQV1h2gAuWo26v/view?usp=sharing

        There are many paragraphs start with “owner=…” and “target=18028”, I want to delete all “temporary_opinion” which are not “love_opinion” or “friendliness_opinion” in those paragraphs.

        Thanks any help in advance.

        Trần Hoàng LongT 1 Reply Last reply Reply Quote 0
        • Trần Hoàng LongT
          Trần Hoàng Long @Trần Hoàng Long
          last edited by

          P/S: I’m using Notepad ++ v7.8.8, maybe need to turn on/off some functions to make the search work, but I dont know which functions. So if needed and anyone know, pls tell me (or tell me any good workingNotepad++ version), tks so much.

          I attach screen picture here
          Notepad++ version.jpg

          1 Reply Last reply Reply Quote 0
          • guy038G
            guy038
            last edited by

            Hi, @trần-hoàng-long and All,

            OK, I’ve got your file, that I simply changed as a .txt file. Nice !

            Now, it’s still unclear ! In your first post, you said :

            … so I want to use Replace function to delete all “love opinion”

            And in your second post, you said :

            I want to delete all “temporary_opinion” which are not “love_opinion” or “**friendliness_opinion**” in those paragraphs.

            Once I know what is exactly needed, with the true file, it should be easy to found out a solution;-))

            BR

            guy038

            Trần Hoàng LongT 1 Reply Last reply Reply Quote 1
            • Trần Hoàng LongT
              Trần Hoàng Long @guy038
              last edited by Trần Hoàng Long

              @guy038 Tks for your reply.

              I want to do exactly as 2nd post.

              In 1st post, I wanted to say in short but maybe it leads to wrong meaning. Sorry about it.

              You can open my file directly with Notepad++, no need to change to .txt :D

              1 Reply Last reply Reply Quote 0
              • guy038G
                guy038
                last edited by

                Hi, @trần-hoàng-long,

                One more question :

                • Your file contains 20 strings “love_opinion”, in target blocks different from 18028

                • Your file contains 868 strings “friendliness_opinion”, in target blocks different from 18028

                Do you want to remove or keep these lines ?

                BR

                guy038

                Trần Hoàng LongT 1 Reply Last reply Reply Quote 1
                • Trần Hoàng LongT
                  Trần Hoàng Long @guy038
                  last edited by Trần Hoàng Long

                  Hi @guy038,

                  There are some paragraphs which have them all, for example like below:

                  {
                  owner=…
                  target=18028
                  temporary_opinion={
                  modifier=…
                  …
                  }
                  temporary_opinion={
                  modifier=…
                  …
                  }
                  temporary_opinion={
                  modifier=“love_opinion”
                  …
                  }
                  temporary_opinion={
                  modifier=“friendliness_opinion”
                  …
                  }
                  }
                  In those paragraphs, I want to delete all “temporary_opinion” graphs which are not “love_opinion” or “friendliness_opinion".

                  If “love opinion” or “friendliness_opinion” stand alone, or not have “target=18028”, so just ignore them.

                  The result should be

                  {
                  owner=…
                  target=18028
                  temporary_opinion={
                  modifier=“love_opinion”
                  …
                  }
                  temporary_opinion={
                  modifier=“friendliness_opinion”
                  …
                  }
                  }

                  Thanks so much for your help.

                  1 Reply Last reply Reply Quote 1
                  • guy038G
                    guy038
                    last edited by

                    Hello, @trần-hoàng-long,

                    In progress ! Be patient !

                    guy038

                    1 Reply Last reply Reply Quote 1
                    • guy038G
                      guy038
                      last edited by guy038

                      Hello, @trần-hoàng-long,

                      Ah…Phhhffff ! I finally found out a method ! It wasn’t that easy because your data are a bit complex !


                      • Firstly , a global section, begins line with a space followed with an opening brace { till right before an other similar scheme. Asbraces are used in many other locations, we’ll have to delimit these sections more precisely !

                      • Secondly, inside a global section, the temporary_opinion sub-region may have three different syntaxes :

                      temporary_opinion •••••• { ••••••••••••••••••••••••••••••••••••• }LF   (  2,779 occurrences )
                      temporary_opinion •••••• { ••••••••• { •••••••••••• } •••••••••• }LF   ( 17,224 occurrences )
                      temporary_opinion •••••• { ••••• { ••••• } ••••• { ••••• } ••••• }LF   (     78 occurrences )
                      

                      So we’ll have to detect these three syntaxes, at the same time

                      • Thirdly, we must delete some temporary_opinion sub-region ONLY in global sections which verify these three criteria :

                        • It must contain the string target=18028

                        • It must contain the temporary_opinion sub-region with modifier = "love_opinion"

                        • It must contain the temporary_opinion sub-region with modifier = "friendliness_opinion"

                      Note that the first criterion comes first, in a section, but the two others occur in any order !

                      So we’ll need to distinguish these specific sections, where the suppression process will occur, in a second time !

                      • Fourthly, in these specific sections, we have to delete any temporary_opinion sub-region when the modifier is different from "love_opinion" and different from "friendliness_opinion".

                      But, as these other sub-regions may come before and/or after the sub-regions with "love_opinion" and "friendliness_opinion", we’ll have to add a mark at the end of general section, to be sure that all the temporary_opinion sub-regions, under target=18028, and different from "love_opinion" and "friendliness_opinion", will be matched and deleted


                      Okay, let’s go :

                      • This first S/R changes the beginning of any global section, using a symbol not used yet, in your file ( I chose the ¤ symbol )

                      • Open the Replace dialog ( Ctrl + H )

                        • SEARCH ^\x20\{

                        • REPLACE ¤

                        • Tick the Wrap around option

                        • Select the Regular expression search mode

                        • Click on the Replace All button

                      => 27,709 replacements

                      From now on, if we force the regex engine to look for characters different from ¤, we’ll be sure that any match will be a subset of a global section, only. Moreover, it clearly shows the beginning of a next global section


                      The second S/R, below, using the free-spacing mode, (?x), will add a @ symbol, in a new line, right before the next global section :

                      (?x)                                           #  FREE-SPACING mode
                      ^ \h+ target= 18028 \R   (?: [^¤\r\n]*\R )*    #  Find the line 'target=18028' and possible range of COMPLETE lines, till ...
                      
                      #  Find a **`temporary_opinion`** sub-region with string "love_opinion" or "friendliness_opinion"
                      \h+ temporary_opinion = { \R \h+ modifier= "(?:(love)|friendliness) _opinion" (?: [^¤{}]+ { [^¤{}]+ } ){0,2} [^¤{}]+ } \R
                      
                      (?: [^¤\r\n]*\R )*                             #  Find possible range of COMPLETE lines, till ...
                      
                      #  Find a **`temporary_opinion`** sub-region with string "friendliness_opinion" or "love_opinion"
                      \h+ temporary_opinion = { \R \h+ modifier= "(?(1)friendliness|love) _opinion" (?: [^¤{}]+ { [^¤{}]+ } ){0,2} [^¤{}]+ } \R
                      
                      [^¤]*                                          #  Find the GREATEST range of chars, DIFFERENT from '¤' ( so ENDING right before a '¤' )
                      

                      • Select all the text above, between (?x) and ( so ENDING right before a '¤' )

                      • Open the Replace dialog ( Ctrl + H )

                        • The Search what: zone should contain the selection

                        • REPLACE $0@\n

                        • Tick the Wrap around option

                        • Select the Regular expression search mode

                        • Click on the Replace All button

                      => 1,692 replacements


                      The third S/R, below, in the free-spacing mode too, will delete, in the specific zones ending with a @ symbol, any temporary_opinion sub-region with modifier different from, either, "love_opinion" and "friendliness_opinion" :

                      (?x)              #  The next line finds ANY temporary_opinion sub-region, DIFFERENT from "love_opinion" and "friendliness_opinion",
                                        #    WHATEVER the 3 POSSIBLE syntaxes
                      ^ \h+ temporary_opinion = { \R \h+ modifier= (?!"love_opinion") (?!"friendliness_opinion") (?: [^¤{}]+ { [^¤{}]+ } ){0,2} [^¤{}]+ } \R
                      (?= [^¤]* @ )     #  This line ensures that the SUPPRESSIONS only occur in the specific regions matching the 3 CRITERIA !   
                      

                      • Select all the text above, between (?x) and matching the 3 CRITERIA !

                      • Open the Replace dialog ( Ctrl + H )

                        • The Search what: zone should contain the selection

                        • REPLACE Leave EMPTY

                        • Tick the Wrap around option

                        • Select the Regular expression search mode

                        • Click on the Replace All button

                      => 1,483 replacements


                      Finally, this fourth regex S/R will delete all the temporary ¤ and @ symbols :

                      • Open the Replace dialog ( Ctrl + H )

                        • SEARCH ^(¤)|^@\R

                        • REPLACE ?1\x20{

                        • Tick the Wrap around option

                        • Select the Regular expression search mode

                        • Click on the Replace All button

                      => 29,401 replacements

                      Best Regards,

                      guy038

                      P.S. :

                      It remains 1 global section, with target different from 18028 :

                       {
                      			owner=30639
                      			target=15484
                      			temporary_opinion={
                      				modifier="friendliness_opinion"
                      				start_date=1074.3.4
                      				expiration_date=1082.5.23
                      				converging={
                      					days=268
                      					opinion=10.8
                      				}
                      				value=10
                      			}
                      			temporary_opinion={
                      				modifier="love_opinion"
                      				start_date=1074.3.14
                      				expiration_date=1098.11.9
                      				converging={
                      					days=258
                      					opinion=15.4
                      				}
                      				value=15
                      			}
                      			scripted_relations={
                      				potential_lover={ 0 }
                      			}
                      		}
                      

                      I’ve found it, empirically, with the regex :

                      (?x-s) target= (?!18028) .+\R (.+\R){1,4} .+? "friendliness_opinion"\R (.+\R){1,9} .+? "love_opinion"\R

                      Trần Hoàng LongT 1 Reply Last reply Reply Quote 1
                      • Trần Hoàng LongT
                        Trần Hoàng Long @guy038
                        last edited by

                        Thank you @guy038 so much, I used your method and got exact result I want. Great work !

                        I’m still trying to understand your method to know more about NPP.

                        Thanks again. Have a nice day !

                        1 Reply Last reply Reply Quote 1
                        • First post
                          Last post
                        The Community of users of the Notepad++ text editor.
                        Powered by NodeBB | Contributors