Sort by line possible this way?
-
Hi,
Is this possible.
I have a text with a lot of lines (200k+)
I want to sort the lines with more then 3 times the same start words in the line.For example:
This is a test line.
This is a test line to show.
This is a test line to show you.So i want to bookmark/delete ALL lines that have more than 3 same start words AT ONCE.
Is this possible?
Thnx
-
Since it’s sorted, that makes it quite doable.
With the expanded data set, which has examples of mathching one, two, three, or four words from the beginning of the line:
Matching two words. Matching two. One match. One similar word. The last three match mostly. The last three match somewhat. The last three match with four words as well. This is a single line. This is a test line to show you. This is a test line to show. This is a test line. This only matches three. This only matches with three.
FIND =
(?-s)^(\w+ \w+ \w+ \w+).*$\R(\1.*(\R|\Z))+
That can be used to Mark/bookmark, or if you then replace with the empty string, it could be a replacement instead of a bookmarking.
----
Useful References
-
Thnx a lot!
-
Contrary to the title, this has nothing to do with sorting.