<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[How to sort out duplicates?]]></title><description><![CDATA[<p dir="auto">How do I search for unique values? I have a list of duplicates and want only the unique values from this list as well as a count of the resulting filter.</p>
<p dir="auto">thanks</p>
]]></description><link>https://community.notepad-plus-plus.org/topic/16073/how-to-sort-out-duplicates</link><generator>RSS for Node</generator><lastBuildDate>Fri, 13 Mar 2026 01:22:15 GMT</lastBuildDate><atom:link href="https://community.notepad-plus-plus.org/topic/16073.rss" rel="self" type="application/rss+xml"/><pubDate>Mon, 23 Jul 2018 18:52:49 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to How to sort out duplicates? on Thu, 26 Jul 2018 01:07:02 GMT]]></title><description><![CDATA[<p dir="auto">If you’re working with just a list of values, you can install the <a href="https://github.com/jokedst/CsvQuery" rel="nofollow ugc">CsvQuery</a> plugin and run a SQL type query on your list. Say you have this as your list:</p>
<p dir="auto">filter1<br />
filter1<br />
filter1<br />
filter2<br />
filter2<br />
filter2<br />
Filter1<br />
filter2<br />
Filter3</p>
<p dir="auto">Open the CsvQuery window, (you may have to hit the “read file” button), type this into the command bar:</p>
<pre><code>SELECT col1, COUNT(*) FROM THIS GROUP BY COL1 ORDER BY COL1
</code></pre>
<p dir="auto">Click Execute<br />
the output window will show this:<br />
Filter1	1<br />
Filter3	1<br />
filter1	3<br />
filter2	4</p>
<p dir="auto">If you want case insensitive searches:</p>
<pre><code>SELECT col1, COUNT(*) FROM THIS GROUP BY COL1 COLLATE NOCASE ORDER BY col1 COLLATE NOCASE
</code></pre>
<p dir="auto">That will get you this:<br />
Filter1	4<br />
filter2	4<br />
Filter3	1</p>
<p dir="auto">I had to use ctrl-c to copy the output. Right click - copy didn’t work for me. Note that in a CsvQuery query “this” is the table name (your list in the active window).<br />
Reading a file with 100k lines and executing the search each took about a second on my test case (the sample above repeated a lot). IDK if you can make a macro out of this, but it’s another option besides regex and python.</p>
<p dir="auto">May the Sort be with you.</p>
]]></description><link>https://community.notepad-plus-plus.org/post/33735</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/33735</guid><dc:creator><![CDATA[cipher-1024]]></dc:creator><pubDate>Thu, 26 Jul 2018 01:07:02 GMT</pubDate></item><item><title><![CDATA[Reply to How to sort out duplicates? on Mon, 23 Jul 2018 19:57:21 GMT]]></title><description><![CDATA[<p dir="auto">thank you. Perhaps better to do in Excel then.</p>
]]></description><link>https://community.notepad-plus-plus.org/post/33685</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/33685</guid><dc:creator><![CDATA[Jeff Taylor]]></dc:creator><pubDate>Mon, 23 Jul 2018 19:57:21 GMT</pubDate></item><item><title><![CDATA[Reply to How to sort out duplicates? on Mon, 23 Jul 2018 19:42:56 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="https://community.notepad-plus-plus.org/uid/3737">@Jeff-Taylor</a></p>
<p dir="auto">Not super simple.  Some answers <a href="https://notepad-plus-plus.org/community/topic/14129/sort-file-removing-duplicates-possible" rel="nofollow ugc">here</a> possibly.</p>
]]></description><link>https://community.notepad-plus-plus.org/post/33684</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/33684</guid><dc:creator><![CDATA[Scott Sumner]]></dc:creator><pubDate>Mon, 23 Jul 2018 19:42:56 GMT</pubDate></item></channel></rss>