Community

    • Login
    • Search
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Search

    Sort Lines Lexicographically did not work

    Help wanted · · · – – – · · ·
    17
    56
    6276
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Alan Kilborn
      Alan Kilborn @r l last edited by Alan Kilborn

      @r-l said:

      in Notepadd++ 7.7.1 there needs to be at least one empty line after the last line of text, otherwise the sorting function doesn’t do anything

      This is NOT true when I try it. Can you provide anything more to help reproduce?

      1 Reply Last reply Reply Quote 0
      • r l
        r l last edited by

        You’re right. And I just realized what my mistake was.

        I had searched and replaced certain character sequences prior to sorting, to insert linebreaks. In Notepad++ that linebreak must be “\r\n” rather than just “\n”. With correct linebreaks, sorting works fine. With just “\n”, the sorting function sees the whole file as one single line. Likewise, when I add an empty line at the end of the file, it only sorts that empty line to the beginning of the file but doesn’t change the sequency of anything else.

        So, sorry for the noise… :-)

        Alan Kilborn 1 Reply Last reply Reply Quote 5
        • Alan Kilborn
          Alan Kilborn @r l last edited by

          @r-l said:

          In Notepad++ that linebreak must be “\r\n” rather than just “\n”

          Only if you want it to be. There are Windows files (\r\n) and there are Linux (\n) files when it comes to “line endings”. If you are doing regular expression replacements, you absolutely need to be careful about using \r\n versus using \n. There is nothing there to protect you from yourself.

          And…sorting is known to have problems when you have mixed line-endings in the data, something you shouldn’t allow to happen unless you really know what you are doing, and really need that situation.

          1 Reply Last reply Reply Quote 4
          • Bill Hepler
            Bill Hepler last edited by

            I often have problems sorting text in NotePaid++ - I’m set as Windows CR/LF etc. The problem seems to arise if spaces and tabs are interspersed as beginning of line characters…

            Here is an example:

            BuildAccount()            // Summary of Posted Transactions
            BuildAudAcct()            // Tax Records that relate to Account Records....
            BuildAudit()              // Daily & Account
            BuildBankRec()
            BuildCheque()
            BuildContainer()
            BuildCounterIDs()  // Apr 2020
            BuildDepot( )
            BuildEvents( )    // Dec 2019
            BuildFinBat()
            BuildGrower()
            BuildImpBat()
            BuildPayGrp()
            BuildPostBat()
            BuildPrice()
            BuildProcess()
            BuildProduct()
            BuildReceipts()
            BuildTax( )
            BuildVariety()    // June 2015
            

            BuildChqFmt() // Note the Spaces in front of these lines rather than tabs
            BuildRptCfg( ) // this seems to be the cause of the problems
            BuildVoid()

            Alan Kilborn 1 Reply Last reply Reply Quote 0
            • Alan Kilborn
              Alan Kilborn @Bill Hepler last edited by

              @Bill-Hepler

              The data in the black code region seems already sorted?

              What’s the significance of the 3 items below the code box?

              And I don’t see any leading spaces/tabs anywhere?

              Matthew Ewer 1 Reply Last reply Reply Quote 1
              • Matthew Ewer
                Matthew Ewer @Alan Kilborn last edited by

                @Alan-Kilborn I suspect what happened there is that the lines DID have tabs and spaces, respectively, when they pasted it into their post, but the tabs caused the black box. The three after used a different whitespace, and so were not included in the black box. (But apparently the whitespace was formatted away?) Note that, if you expect sorting to ignore leading whitespace, the three lines after the box should be interspersed between lines within the box (but are not).

                Alan Kilborn 1 Reply Last reply Reply Quote 1
                • Alan Kilborn
                  Alan Kilborn @Matthew Ewer last edited by

                  @Matthew-Ewer

                  That’s a good explanation of what might have happened here.
                  But since the OP never returned to continue the exchange, I suppose the issue is dead.

                  if you expect sorting to ignore leading whitespace

                  Just a note to say that Notepad++'s sorting doesn’t ignore leading whitespace, just in case someone thinks it can from the above.

                  1 Reply Last reply Reply Quote 1
                  • aarocaes
                    aarocaes last edited by

                    This post is deleted!
                    1 Reply Last reply Reply Quote 0
                    • aarocaes
                      aarocaes last edited by

                      Maybe it could be helpful for anyone else. I had this issue, and the problem was related with the line break codes.

                      Even if you are seeing different lines, is possible that only one of the line break codes (\n or \r) is being used. If that’s the case, sorting won’t work as it will take all the text as just one single line. You can check this enabling the “Show all characters” option:

                      2020-10-01 23_00_30-Window.png

                      LF is the code for Line Feed (\n). But it’s missing the CR (Carriage Return, \r) code. Unlike Unix systems, standard line-termination in Windows and in the Internet is \r\n.

                      So if you replace these line-termination codes activating the extended search mode (replacing \n with \r\n), it will work.

                      2020-10-01 22_58_49-Window.png

                      PeterJones 1 Reply Last reply Reply Quote 6
                      • PeterJones
                        PeterJones @aarocaes last edited by

                        @aarocaes ,

                        Interesting. It sorts correctly for me.
                        5cdccf20-469c-44e6-a765-074bf01d15ac-image.png

                        I thought maybe if you have mixed line endings, where some don’t match the current line-endings mode:
                        94dd079f-50dd-4ff3-ae3a-a124003337e8-image.png => 9cc139ab-cde8-41c8-89d1-10cb24fb14fa-image.png
                        Yeah, that’s it: correctly\nsorted and these\nlines were treated as single lines.

                        So, as long as you don’t have mixed line endings, and your line endings match the defined line endings setting shown in the status bar, it will work. But if there’s a mix or discrepancy, it won’t be treated as a line ending.

                        gitberry 1 Reply Last reply Reply Quote 5
                        • gitberry
                          gitberry @PeterJones last edited by

                          Thank you for these posts - I love everything about NotePad++ EXCEPT line sorting because I’ve never understood why NotePad++ was inconsistent with line sorting until I read this. Now I can use it with more confidence.

                          On reflection it would seem that this is an unnecessary pain for those who work in all 3 line ending environments and wondering if we couldn’t throw something into settings that could possibly be like this:
                          [ X ] SORT Line Ending Agnostic (will treat CR, CR-LF and LF equally as line endings when sorting)

                          Alan Kilborn 2 Replies Last reply Reply Quote 0
                          • Alan Kilborn
                            Alan Kilborn @gitberry last edited by

                            @gitberry

                            What is a good example of why someone would need to “mix” line-ending types within one file? I don’t think I’ve ever run across a good reason for this. I welcome being enlightened.

                            gitberry 1 Reply Last reply Reply Quote 3
                            • Terry R
                              Terry R last edited by

                              @gitberry said in Sort Lines Lexicographically did not work:

                              Line Ending Agnostic (will treat CR, CR-LF and LF equally as line endings when sorting)

                              If you read @PeterJones post his last remarks were:
                              So, as long as you don’t have mixed line endings, and your line endings match the defined line endings setting shown in the status bar, it will work.

                              So if a file ONLY contains 1 type of line ending and that matches the status bar then sorting will work as expected. This seems to me to be a reasonable assumption by the developers.

                              Maybe I’m naive when to it comes to the various line endings (Unix, Mac vs Windows) but I don’t see why a file would contain a mixture of the 3 types, unless there had been an error in encoding or reading of the file. So AFAIK a file would ONLY ever contain 1 type of line ending, and that depends on the use/environment the file is being used in.

                              Notepad++ also has a function which resides under Edit main menu, called EOL Conversion where the file is “totally” converted from one type to another. maybe this should be used before a sort operation if at all unsure of whether there is a mixture of types.

                              Terry

                              1 Reply Last reply Reply Quote 1
                              • gitberry
                                gitberry @Alan Kilborn last edited by

                                @Alan-Kilborn So true! I don’t think there is a good reason. When it happens (ie received from am uncaring/uncareful source etc) and the sort doesn’t work…

                                Alan Kilborn 1 Reply Last reply Reply Quote 0
                                • Alan Kilborn
                                  Alan Kilborn @gitberry last edited by Alan Kilborn

                                  @gitberry said in Sort Lines Lexicographically did not work:

                                  So true! I don’t think there is a good reason. When it happens (ie received from am uncaring/uncareful source etc) and the sort doesn’t work…

                                  If you are likely to get files of that nature from another source, suggest you “sanitize” them before beginning to work with them.

                                  For example, do a line-ending conversion in Notepad++, which will unify the line-endings all to one type (whichever type you desire). After that, do your sort, or whatever other data manipulations you need to do.

                                  (I guess Terry already said the same thing; sorry, didn’t see that first before crafting this reply)

                                  mathlete2 1 Reply Last reply Reply Quote 2
                                  • Alan Kilborn
                                    Alan Kilborn last edited by

                                    @Terry-R said in Sort Lines Lexicographically did not work:

                                    I don’t see why a file would contain a mixture of the 3 types, unless there had been an error in encoding or reading of the file. So AFAIK a file would ONLY ever contain 1 type of line ending, and that depends on the use/environment the file is being used in.

                                    Amen, brother, amen.

                                    However, Notepad++ (Scintilla) doesn’t enforce this.
                                    And, by default, it doesn’t let you know that you have “screwed up” files when this situation happens to occur.

                                    One way for it to occur is the aforementioned reception of files from another source.

                                    Another way for it to happen is a regex replacement where uses think that \n works to match a line-ending of any type. It does NOT ; \R should be used instead for this purpose. But, again, Notepad++ lets you do it, so line-ending weirdness can happen from this.

                                    A good way to “set it and forget it” to avoid this type of problem is by using the EditorConfig plugin. With that, you specify your desired line-ending type, and when files are saved in Notepad++, the plugin steps in and corrects any improper line-endings to your desired type.

                                    An alternative way to monitor the situation is to turn on visible line-endings and then hope you notice a mismatch. However, looking at the “heavy” line-ending character representation is too visually overwhelming for me. YMMV.

                                    1 Reply Last reply Reply Quote 1
                                    • Alan Kilborn
                                      Alan Kilborn @gitberry last edited by

                                      @gitberry said in Sort Lines Lexicographically did not work:

                                      [ X ] SORT Line Ending Agnostic (will treat CR, CR-LF and LF equally as line endings when sorting)

                                      There’s an open issue on the ISSUE-TRACKER for this; perhaps you wanna add your voice there so it can be heard by developers?

                                      Personally, I don’t think this needs a setting, I think it should ignore line-endings when sorting.

                                      But, for myself, I use the EditorConfig plugin so that I just don’t get into a situation where a sorting problem (and other problems that could occur from this) doesn’t happen.

                                      1 Reply Last reply Reply Quote 2
                                      • mathlete2
                                        mathlete2 @Alan Kilborn last edited by

                                        @Alan-Kilborn said in Sort Lines Lexicographically did not work:

                                        For example, do a line-ending conversion in Notepad++, which will unify the line-endings all to one type (whichever type you desire). After that, do your sort, or whatever other data manipulations you need to do.

                                        actually, if you open this menu on a file with a mixture of line endings, the original selection is greyed out; NP++ thinks everything is still unified. perhaps this is a bug?

                                        96805736-7d1d-47e7-85b6-e6b2593cfe43-image.png

                                        PeterJones 1 Reply Last reply Reply Quote 0
                                        • PeterJones
                                          PeterJones @mathlete2 last edited by

                                          @mathlete2 said in Sort Lines Lexicographically did not work:

                                          actually, if you open this menu on a file with a mixture of line endings, the original selection is greyed out; NP++ thinks everything is still unified. perhaps this is a bug?

                                          No. It just picked one (probably based on the line1 ending). To unify, you need to trigger at least one conversion, so picking the wrong one (like LF), and then convert back to the right one (CRLF). This is why Alan phrased is as “do a line-ending conversion”, not just “pick the line ending you want”.

                                          mathlete2 1 Reply Last reply Reply Quote 2
                                          • mathlete2
                                            mathlete2 last edited by

                                            also, FWIW, you can get yourself into these situations if you do a RegEx replacement similar to the one below to separate objects into separate lines. visually, this gets you to a the sortable state you want, but the EOL codes interfere with the actual sorting.

                                            ff7074db-b897-4d4e-930f-fe89c4432cc0-image.png

                                            Alan Kilborn 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright © 2014 NodeBB Forums | Contributors