<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[When case insensitive sorting will be added in Notepad++ x64 (without TextFX)?]]></title><description><![CDATA[<p dir="auto">When case insensitive sorting will be added in Notepad++ x64 (without TextFX)?</p>
<p dir="auto">insensitive sorting vs sensitive sorting</p>
<p dir="auto">A      A<br />
a      B<br />
B      C<br />
b      a<br />
C      b<br />
c      c</p>
]]></description><link>https://community.notepad-plus-plus.org/topic/15640/when-case-insensitive-sorting-will-be-added-in-notepad-x64-without-textfx</link><generator>RSS for Node</generator><lastBuildDate>Mon, 15 Jun 2026 07:37:41 GMT</lastBuildDate><atom:link href="https://community.notepad-plus-plus.org/topic/15640.rss" rel="self" type="application/rss+xml"/><pubDate>Tue, 17 Apr 2018 09:18:42 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to When case insensitive sorting will be added in Notepad++ x64 (without TextFX)? on Fri, 04 May 2018 11:03:34 GMT]]></title><description><![CDATA[<p dir="auto">Hi, <strong>All</strong>,</p>
<p dir="auto">In addition to all the <strong>Latin</strong> letters, matched by an <strong>equivalence class</strong> ( <strong><code>&lsqb;&lsqb;=</code>X<code>=&rsqb;&rsqb;</code></strong> ), in my <strong>two previous</strong> posts, here are the <strong>last</strong> tables, about :</p>
<ul>
<li>
<p dir="auto">The <strong>digit</strong> characters</p>
</li>
<li>
<p dir="auto">Some <strong>currency</strong> symbols</p>
</li>
<li>
<p dir="auto">The <strong>Greek</strong> letters and some <strong>symbols</strong></p>
</li>
</ul>
<pre><code class="language-diff">
            Different characters X found, when using an EQUIVALENCE CLASS &lsqb;&lsqb;=X=&rsqb;&rsqb; :
            ¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯


•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |
•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 0030 |   0   |      | 0035 |   5   |      | 2469 | [ ⑩ ] |      | 246F | [ ⑯ ] |      | 00A2 |   ¢   |
| 24EA |   ⓪   |     | 2464 | [ ⑤ ] |     | 247D |   ⑽   |     | 2483 |   ⒃   |      | FFE0 |   ￠   |
| FF10 |   ０   |      | 2478 | [ ⑸ ] |     | 2491 |   ⒑   |     | 2497 |   ⒗   |     •------•-------•
•------•-------•      | 248C |   ⒌   |      •------•-------•      •------•-------•      | 00A3 |   £   |
| 0031 |   1   |      | FF15 |   ５   |      | 246A | [ ⑪ ] |     | 2470 | [ ⑰ ] |     | FFE1 |   ￡   |
| 00B9 |   ¹   |      •------•-------•      | 247E |   ⑾   |      | 2484 |   ⒄   |      •------•-------•
| 2460 |   ①   |      | 0036 |   6   |      | 2492 |   ⒒   |      | 2498 |   ⒘   |      | 00A5 |   ¥   |
| 2474 | [ ⑴ ] |     | 2465 | [ ⑥ ] |     •------•-------•      •------•-------•      | FFE5 |   ￥   |
| 2488 |   ⒈   |      | 2479 | [ ⑹ ] |      | 246B | [ ⑫ ] |      | 2471 | [ ⑱ ] |      •------•-------•
| FF11 |   １   |      | 248D |   ⒍   |     | 247F |   ⑿   |     | 2485 |   ⒅   |     | 20A9 |   ₩   |
•------•-------•      | FF16 |   ６   |      | 2493 |   ⒓   |     | 2499 |   ⒙   |     | FFE6 |   ￦   |
| 0032 |   2   |      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 00B2 |   ²   |      | 0037 |   7   |      | 246C | [ ⑬ ] |      | 2472 | [ ⑲ ] |
| 01BB |   ƻ   |      | 2466 | [ ⑦ ] |      | 2480 |   ⒀   |      | 2486 |   ⒆   |
| 2461 | [ ② ] |      | 247A | [ ⑺ ] |      | 2494 |   ⒔   |      | 249A |   ⒚   |
| 2475 | [ ⑵ ] |      | 248E |   ⒎   |      •------•-------•      •------•-------•
| 2489 |   ⒉   |     | FF17 |   ７   |      | 246D | [ ⑭ ] |     | 2473 | [ ⑳ ] |
| FF12 |   ２   |      •------•-------•      | 2481 |   ⒁   |     | 2487 |   ⒇   |
•------•-------•      | 0038 |   8   |      | 2495 |   ⒕   |      | 249B |   ⒛   |
| 0033 |   3   |      | 2467 | [ ⑧ ] |      •------•-------•      •------•-------•
| 00B3 |   ³   |      | 247B | [ ⑻ ] |      | 246E | [ ⑮ ] |
| 2462 | [ ③ ] |      | 248F |   ⒏   |      | 2482 |   ⒂   |
| 2476 | [ ⑶ ] |     | FF18 |   ８   |      | 2496 |   ⒖   |
| 248A |   ⒊   |      •------•-------•      •------•-------•
| FF13 |   ３   |      | 0039 |   9   |
•------•-------•      | 2468 | [ ⑨ ] |
| 0034 |   4   |      | 247C | [ ⑼ ] |
| 2463 | [ ④ ] |      | 2490 |   ⒐   |
| 2477 | [ ⑷ ] |     | FF19 |   ９   |
| 248B |   ⒋   |      •------•-------•
| FF14 |   ４   |
•------•-------•


•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |
•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 0386 |   Ά   |      | 0389 |   Ή   |      | 039C |   Μ   |      | 03A3 |   Σ   |      | 03A8 |   Ψ   |
| 0391 |   Α   |      | 0397 |   Η   |      | 03BC |   μ   |      | 03C2 |   ς   |      | 03C8 |   ψ   |
| 03AC |   ά   |      | 03AE |   ή   |      •------•-------•      | 03C3 |   σ   |      •------•-------•
| 03B1 |   α   |      | 03B7 |   η   |      | 039D |   Ν   |      | 03F2 |   ϲ   |      | 038F |   Ώ   |
•------•-------•      •------•-------•      | 03BD |   ν   |      •------•-------•      | 03A9 |   Ω   |
| 0392 |   Β   |      | 0398 |   Θ   |      •------•-------•      | 03A4 |   Τ   |      | 03C9 |   ω   |
| 03B2 |   β   |      | 03B8 |   θ   |      | 039E |   Ξ   |      | 03C4 |   τ   |      | 03CE |   ώ   |
| 03D0 |   ϐ   |      | 03D1 |   ϑ   |      | 03BE |   ξ   |      •------•-------•      | 2126 |   Ω   |
•------•-------•      •------•-------•      •------•-------•      | 038E |   Ύ   |      | 2127 |   ℧   |
| 0393 |   Γ   |      | 038A |   Ί   |      | 038C |   Ό   |      | 03A5 |   Υ   |      | 2129 |   ℩   |
| 03B3 |   γ   |      | 0390 |   ΐ   |      | 039F |   Ο   |      | 03AB |   Ϋ   |      •------•-------•
•------•-------•      | 0399 |   Ι   |      | 03BF |   ο   |      | 03B0 |   ΰ   |      | 03DA |   Ϛ   |
| 0394 |   Δ   |      | 03AA |   Ϊ   |      | 03CC |   ό   |      | 03C5 |   υ   |      | 03DB |   ϛ   |
| 03B4 |   δ   |      | 03AF |   ί   |      •------•-------•      | 03CB |   ϋ   |      •------•-------•
•------•-------•      | 03B9 |   ι   |      | 03A0 |   Π   |      | 03CD |   ύ   |      | 03DC |   Ϝ   |
| 0388 |   Έ   |      | 03CA |   ϊ   |      | 03C0 |   π   |      | 03D2 |   ϒ   |      | 03DD |   ϝ   |
| 0395 |   Ε   |      •------•-------•      | 03D6 |   ϖ   |      | 03D3 |   ϓ   |      •------•-------•
| 03AD |   έ   |      | 039A |   Κ   |      •------•-------•      | 03D4 |   ϔ   |      | 03DE |   Ϟ   |
| 03B5 |   ε   |      | 03BA |   κ   |      | 03A1 |   Ρ   |      •------•-------•      | 03DF |   ϟ   |
•------•-------•      | 03F0 |   ϰ   |      | 03C1 |   ρ   |      | 03A6 |   Φ   |      •------•-------•
| 0396 |   Ζ   |      •------•-------•      | 03F1 |   ϱ   |      | 03C6 |   φ   |      | 03E0 |   Ϡ   |
| 03B6 |   ζ   |      | 039B |   Λ   |      •------•-------•      | 03D5 |   ϕ   |      | 03E1 |   ϡ   |
•------•-------•      | 03BB |   λ   |                            •------•-------•      •------•-------•
                      •------•-------•                            | 03A7 |   Χ   |
                                                                  | 03C7 |   χ   |
                                                                  •------•-------•
</code></pre>
<p dir="auto">Best Regards,</p>
<p dir="auto">guy038</p>
]]></description><link>https://community.notepad-plus-plus.org/post/32031</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/32031</guid><dc:creator><![CDATA[guy038]]></dc:creator><pubDate>Fri, 04 May 2018 11:03:34 GMT</pubDate></item><item><title><![CDATA[Reply to When case insensitive sorting will be added in Notepad++ x64 (without TextFX)? on Fri, 04 May 2018 11:00:36 GMT]]></title><description><![CDATA[<p dir="auto">Hi, <strong>All</strong>,</p>
<p dir="auto"><strong>Continuation</strong> of the tables, with, also, some <strong>Latin ligatures</strong> and various letters and <strong>symbols</strong> :</p>
<pre><code class="language-diff">
            Different characters X found, when using an EQUIVALENCE CLASS &lsqb;&lsqb;=X=&rsqb;&rsqb; :
            ¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯


•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |
•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 004E |   N   |      | 004F |   O   |      | 0050 |   P   |      | 0051 |   Q   |      | 0052 |   R   |
| 006E |   n   |      | 006F |   o   |      | 0070 |   p   |      | 0071 |   q   |      | 0072 |   r   |
| 00D1 |   Ñ   |      | 00BA |   º   |      | 01A4 |   Ƥ   |      | 02A0 |   ʠ   |      | 0154 | [ Ŕ ] |
| 00F1 |   ñ   |      | 00D2 |   Ò   |      | 01A5 |   ƥ   |      | 211A |   ℚ   |      | 0155 | [ ŕ ] |
| 0143 | [ Ń ] |      | 00D3 |   Ó   |      | 0278 | [ ɸ ] |      | 24AC |   ⒬   |      | 0156 | [ Ŗ ] |
| 0144 | [ ń ] |      | 00D4 |   Ô   |      | 1E54 | [ Ṕ ] |      | 24C6 |   Ⓠ   |      | 0157 | [ ŗ ] |
| 0145 | [ Ņ ] |      | 00D5 |   Õ   |      | 1E55 | [ ṕ ] |      | 24E0 |   ⓠ   |      | 0158 | [ Ř ] |
| 0146 | [ ņ ] |      | 00D6 |   Ö   |      | 1E56 | [ Ṗ ] |      | FF31 |   Ｑ   |      | 0159 | [ ř ] |
| 0147 | [ Ň ] |      | 00D8 |   Ø   |      | 1E57 | [ ṗ ] |      | FF51 |   ｑ   |      | 01A6 |   Ʀ   |
| 0148 | [ ň ] |      | 00F2 |   ò   |      | 2118 |   ℘   |      •------•-------•      | 0210 |   Ȑ   |
| 0149 | [ ŉ ] |      | 00F3 |   ó   |      | 2119 |   ℙ   |                            | 0211 |   ȑ   |
| 019D |   Ɲ   |      | 00F4 |   ô   |      | 24AB |   ⒫   |                            | 0212 |   Ȓ   |
| 019E |   ƞ   |      | 00F5 |   õ   |      | 24C5 |   Ⓟ   |                            | 0213 |   ȓ   |
| 0272 | [ ɲ ] |      | 00F6 |   ö   |      | 24DF |   ⓟ   |                            | 0279 | [ ɹ ] |
| 0273 | [ ɳ ] |      | 00F8 |   ø   |      | FF30 |   Ｐ   |                            | 027A | [ ɺ ] |
| 0274 | [ ɴ ] |      | 014C | [ Ō ] |      | FF50 |   ｐ   |                            | 027B |   ɻ   |
| 1E44 | [ Ṅ ] |      | 014D | [ ō ] |      •------•-------•                            | 027C |   ɼ   |
| 1E45 | [ ṅ ] |      | 014E | [ Ŏ ] |                                                  | 027D |   ɽ   |
| 1E46 | [ Ṇ ] |      | 014F |   ŏ   |                                                  | 027E |   ɾ   |
| 1E47 | [ ṇ ] |      | 0150 | [ Ő ] |                                                  | 027F |   ɿ   |
| 1E48 | [ Ṉ ] |      | 0151 | [ ő ] |      •------•-------•      •------•-------•      | 0280 |   ʀ   |
| 1E49 | [ ṉ ] |      | 019F |   Ɵ   |      | Code | Char. |      | Code | Char. |      | 0281 |   ʁ   |
| 1E4A | [ Ṋ ] |      | 01A0 |   Ơ   |      •------•-------•      •------•-------•      | 1E58 | [ Ṙ ] |
| 1E4B | [ ṋ ] |      | 01A1 |   ơ   |      | 0053 |   S   |      | 0054 |   T   |      | 1E59 | [ ṙ ] |
| 2115 |   ℕ   |      | 01A2 |   Ƣ   |      | 0073 |   s   |      | 0074 |   t   |      | 1E5A | [ Ṛ ] |
| 24A9 |   ⒩   |      | 01A3 |   ƣ   |      | 015A | [ Ś ] |      | 0162 | [ Ţ ] |      | 1E5B |   ṛ   |
| 24C3 |   Ⓝ   |      | 01D1 |   Ǒ   |      | 015B |   ś   |      | 0163 | [ ţ ] |      | 1E5C |   Ṝ   |
| 24DD |   ⓝ   |      | 01D2 |   ǒ   |      | 015C |   Ŝ   |      | 0164 | [ Ť ] |      | 1E5D |   ṝ   |
| FF2E |   Ｎ   |      | 01EA |   Ǫ   |     | 015D |   ŝ   |      | 0165 | [ ť ] |      | 1E5E |   Ṟ   |
| FF4E |   ｎ   |      | 01EB |   ǫ   |      | 015E |   Ş   |      | 01AB |   ƫ   |      | 1E5F |   ṟ   |
•------•-------•      | 01EC |   Ǭ   |      | 015F |   ş   |      | 01AC |   Ƭ   |      | 211B |   ℛ   |
                      | 01ED |   ǭ   |      | 0160 |   Š   |      | 01AD |   ƭ   |      | 211C |   ℜ   |
                      | 01FE |   Ǿ   |      | 0161 | [ š ] |      | 01AE |   Ʈ   |      | 211D |   ℝ   |
                      | 01FF |   ǿ   |      | 01A7 |   Ƨ   |      | 0287 |   ʇ   |      | 24AD |   ⒭   |
                      | 020C |   Ȍ   |      | 01A8 |   ƨ   |      | 0288 |   ʈ   |      | 24C7 |   Ⓡ   |
                      | 020D |   ȍ   |      | 01A9 |   Ʃ   |      | 02A7 |   ʧ   |      | 24E1 |   ⓡ   |
                      | 020E |   Ȏ   |      | 01AA |   ƪ   |      | 02A8 |   ʨ   |      | FF32 |   Ｒ   |
                      | 020F |   ȏ   |      | 0282 |   ʂ   |      | 1E6A | [ Ṫ ] |      | FF52 |   ｒ   |
                      | 0275 | [ ɵ ] |      | 1E60 |   Ṡ   |      | 1E6B | [ ṫ ] |      •------•-------•
                      | 0276 | [ ɶ ] |      | 1E61 | [ ṡ ] |      | 1E6C | [ Ṭ ] |
                      | 0277 | [ ɷ ] |      | 1E62 | [ Ṣ ] |      | 1E6D | [ ṭ ] |
                      | 1E4C | [ Ṍ ] |      | 1E63 | [ ṣ ] |      | 1E6E | [ Ṯ ] |
                      | 1E4D | [ ṍ ] |      | 1E64 | [ Ṥ ] |      | 1E6F | [ ṯ ] |
                      | 1E4E | [ Ṏ ] |      | 1E65 | [ ṥ ] |      | 1E70 | [ Ṱ ] |
                      | 1E4F | [ ṏ ] |      | 1E66 | [ Ṧ ] |      | 1E71 | [ ṱ ] |
                      | 1E50 | [ Ṑ ] |      | 1E67 | [ ṧ ] |      | 1E97 |   ẗ   |
                      | 1E51 | [ ṑ ] |      | 1E68 | [ Ṩ ] |      | 24AF |   ⒯   |
                      | 1E52 | [ Ṓ ] |      | 1E69 | [ ṩ ] |      | 24C9 |   Ⓣ   |
                      | 1E53 | [ ṓ ] |      | 24AE |   ⒮   |      | 24E3 |   ⓣ   |
                      | 1ECC |   Ọ   |      | 24C8 |   Ⓢ   |     | FF34 |   Ｔ   |
                      | 1ECD |   ọ   |      | 24E2 |   ⓢ   |     | FF54 |   ｔ   |
                      | 1ECE |   Ỏ   |      | FF33 |   Ｓ   |      •------•-------•
                      | 1ECF |   ỏ   |      | FF53 |   ｓ   |
                      | 1ED0 |   Ố   |      •------•-------•
                      | 1ED1 |   ố   |
                      | 1ED2 |   Ồ   |
                      | 1ED3 |   ồ   |
                      | 1ED4 |   Ổ   |
                      | 1ED5 |   ổ   |
                      | 1ED6 |   Ỗ   |
                      | 1ED7 |   ỗ   |
                      | 1ED8 |   Ộ   |
                      | 1ED9 |   ộ   |
                      | 1EDA |   Ớ   |
                      | 1EDB |   ớ   |
                      | 1EDC |   Ờ   |
                      | 1EDD |   ờ   |
                      | 1EDE |   Ở   |
                      | 1EDF |   ở   |
                      | 1EE0 |   Ỡ   |
                      | 1EE1 |   ỡ   |
                      | 1EE2 |   Ợ   |
                      | 1EE3 |   ợ   |
                      | 2134 |   ℴ   |
                      | 24AA |   ⒪   |
                      | 24C4 |   Ⓞ   |
                      | 24DE |   ⓞ   |
                      | FF2F |   Ｏ   |
                      | FF4F |   ｏ   |
                      •------•-------•


•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |
•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 0055 |   U   |      | 0056 |   V   |      | 0057 |   W   |      | 0058 |   X   |      | 0059 |   Y   |
| 0075 |   u   |      | 0076 |   v   |      | 0077 |   w   |      | 0078 |   x   |      | 0079 |   y   |
| 00D9 |   Ù   |      | 01B2 |   Ʋ   |      | 0174 | [ Ŵ ] |      | 1E8A |   Ẋ   |      | 00DD |   Ý   |
| 00DA |   Ú   |      | 028B |   ʋ   |      | 0175 | [ ŵ ] |      | 1E8B |   ẋ   |      | 00FD |   ý   |
| 00DB |   Û   |      | 028C |   ʌ   |      | 01BF |   ƿ   |      | 1E8C |   Ẍ   |      | 00FF |   ÿ   |
| 00DC |   Ü   |      | 1E7C |   Ṽ   |      | 028D |   ʍ   |      | 1E8D |   ẍ   |      | 0176 | [ Ŷ ] |
| 00F9 |   ù   |      | 1E7D |   ṽ   |      | 1E80 |   Ẁ   |      | 24B3 |   ⒳   |      | 0177 | [ ŷ ] |
| 00FA |   ú   |      | 1E7E |   Ṿ   |      | 1E81 |   ẁ   |      | 24CD |   Ⓧ   |      | 0178 | [ Ÿ ] |
| 00FB |   û   |      | 1E7F |   ṿ   |      | 1E82 |   Ẃ   |      | 24E7 |   ⓧ   |      | 01B3 |   Ƴ   |
| 00FC |   ü   |      | 24B1 |   ⒱   |     | 1E83 |   ẃ   |      | FF38 |   Ｘ   |      | 01B4 |   ƴ   |
| 0168 | [ Ũ ] |      | 24CB |   Ⓥ   |     | 1E84 |   Ẅ   |      | FF58 |   ｘ   |      | 028E |   ʎ   |
| 0169 | [ ũ ] |      | 24E5 |   ⓥ   |      | 1E85 |   ẅ   |      •------•-------•      | 028F |   ʏ   |
| 016A | [ Ū ] |      | FF36 |   Ｖ   |      | 1E86 |   Ẇ   |                           | 1E8E |   Ẏ   |
| 016B | [ ū ] |      | FF56 |   ｖ   |      | 1E87 |   ẇ   |                            | 1E8F |   ẏ   |
| 016C | [ Ŭ ] |      •------•-------•      | 1E88 |   Ẉ   |                            | 1E99 |   ẙ   |
| 016D | [ ŭ ] |                            | 1E89 |   ẉ   |                            | 1EF2 |   Ỳ   |
| 016E | [ Ů ] |                            | 1E98 |   ẘ   |                            | 1EF3 |   ỳ   |
| 016F | [ ů ] |                            | 24B2 |   ⒲   |                            | 1EF4 |   Ỵ   |
| 0170 | [ Ű ] |                            | 24CC |   Ⓦ   |                            | 1EF5 |   ỵ   |
| 0171 | [ ű ] |                            | 24E6 |   ⓦ   |                            | 1EF6 |   Ỷ   |
| 0172 | [ Ų ] |                            | FF37 |   Ｗ   |                           | 1EF7 |   ỷ   |
| 0173 | [ ų ] |                            | FF57 |   ｗ   |                           | 1EF8 |   Ỹ   |
| 01AF |   Ư   |                            •------•-------•                            | 1EF9 |   ỹ   |
| 01B0 |   ư   |                                                                        | 24B4 |   ⒴   |
| 01B1 |   Ʊ   |                                                                        | 24CE |   Ⓨ   |
| 01D3 |   Ǔ   |                                                                        | 24E8 |   ⓨ   |
| 01D4 |   ǔ   |                                                                        | FF39 |   Ｙ   |
| 01D5 |   Ǖ   |                                                                        | FF59 |   ｙ   |
| 01D6 |   ǖ   |                                                                        •-------•-------•
| 01D7 |   Ǘ   |
| 01D8 |   ǘ   |      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 01D9 |   Ǚ   |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |
| 01DA |   ǚ   |      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 01DB |   Ǜ   |      | 005A |   Z   |      | 00C6 |   Æ   |      | 00DE |   Þ   |      | 005E |   ^   |
| 01DC |   ǜ   |      | 007A |   z   |      | 00E6 |   æ   |      | 00FE |   þ   |      | 02C6 |   ˆ   |
| 0214 |   Ȕ   |      | 0179 | [ Ź ] |      | 01E2 |   Ǣ   |     •------•-------•      | FF3E |   ＾   |
| 0215 |   ȕ   |      | 017A |   ź   |      | 01E3 |   ǣ   |      | 00DF |   ß   |      •------•-------•
| 0216 |   Ȗ   |      | 017B |   Ż   |      | 01FC |   Ǽ   |      •------•-------•      | 005F |   _   |
| 0217 |   ȗ   |      | 017C |   ż   |      | 01FD |   ǽ   |      | 017F |   ſ   |      | FF3F |   ＿   |
| 0289 |   ʉ   |      | 017D |   Ž   |      •------•-------•      •------•-------•      •------•-------•
| 028A |   ʊ   |      | 017E |   ž   |      | 0132 |   Ĳ   |      | 01B7 |   Ʒ   |      | 0060 |   `   |
| 1E72 | [ Ṳ ] |      | 01B5 |   Ƶ   |      | 0133 |   ĳ   |      | 0292 |   ʒ   |      | 02CB |   ˋ   |
| 1E73 | [ ṳ ] |      | 01B6 |   ƶ   |       •------•------•      | 01B8 |   Ƹ   |       | FF40 |   ｀   |
| 1E74 | [ Ṵ ] |      | 0290 |   ʐ   |      | 01C4 |   Ǆ   |      | 01B9 |   ƹ   |      •------•-------•
| 1E75 |   ṵ   |      | 0291 |   ʑ   |      | 01C6 |   ǆ   |      | 01BA |   ƺ   |      | 00AF |   ¯   |
| 1E76 | [ Ṷ ] |      | 1E90 |   Ẑ   |      | 01C5 |   ǅ   |      | 01EE |   Ǯ   |      | 02C9 |   ˉ   |
| 1E77 | [ ṷ ] |      | 1E91 |   ẑ   |      | 01F1 |   Ǳ   |     | 01EF |   ǯ   |       | FFE3 |   ￣   |
| 1E78 | [ Ṹ ] |      | 1E92 |   Ẓ   |      | 01F3 |   ǳ   |      | 0293 |   ʓ   |      •------•-------•
| 1E79 | [ ṹ ] |      | 1E93 |   ẓ   |      | 01F2 |   ǲ   |      •------•-------•      | 00B4 |   ´   |
| 1E7A | [ Ṻ ] |      | 1E94 |   Ẕ   |      •------•-------•      | 01BC |   Ƽ   |      | 02CA |   ˊ   |
| 1E7B |   ṻ   |      | 1E95 |   ẕ   |      | 01C7 |   Ǉ   |      | 01BD |   ƽ   |      •------•-------•
| 1EE4 |   Ụ   |      | 2124 |   ℤ   |      | 01C9 |   ǉ   |      •------•-------•      | 01BE |   ƾ   |
| 1EE5 |   ụ   |      | 2125 |   ℥   |      | 01C8 |   ǈ   |                            | 0294 |   ʔ   |
| 1EE6 |   Ủ   |      | 2128 |   ℨ   |      •------•-------•                            | 0295 |   ʕ   |
| 1EE7 |   ủ   |      | 24B5 |   ⒵   |      | 01CA |   Ǌ   |                            | 0296 |   ʖ   |
| 1EE8 |   Ứ   |      | 24CF |   Ⓩ   |      | 01CC |   ǌ   |                            | 02A1 |   ʡ   |
| 1EE9 |   ứ   |      | 24E9 |   ⓩ   |      | 01CB |   ǋ   |                            | 02A2 |   ʢ   |
| 1EEA |   Ừ   |      | FF3A |   Ｚ   |      •------•-------•                            •------•-------•
| 1EEB |   ừ   |      | FF5A |   ｚ   |      | 02A3 |   ʣ   |                            | 02C7 |   ˇ   |
| 1EEC |   Ử   |      •------•-------•      | 02A5 |   ʥ   |                            | 02D8 |   ˘   |
| 1EED |   ử   |                            •------•-------•                            •------•-------•
| 1EEE |   Ữ   |                            | 02A6 |   ʦ   |
| 1EEF |   ữ   |                            •------•-------•
| 1EF0 |   Ự   |                            | FB05 |   ﬅ   |
| 1EF1 |   ự   |                            •------•-------•
| 24B0 |   ⒰   |                            | FB06 |   ﬆ   |
| 24CA |   Ⓤ   |                            •------•-------•
| 24E4 |   ⓤ   |
| FF35 |   Ｕ   |
| FF55 |   ｕ   |
•------•-------•

guy038</code></pre>
]]></description><link>https://community.notepad-plus-plus.org/post/32030</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/32030</guid><dc:creator><![CDATA[guy038]]></dc:creator><pubDate>Fri, 04 May 2018 11:00:36 GMT</pubDate></item><item><title><![CDATA[Reply to When case insensitive sorting will be added in Notepad++ x64 (without TextFX)? on Fri, 04 May 2018 17:02:28 GMT]]></title><description><![CDATA[<p dir="auto">Hello, <strong>All</strong>,</p>
<p dir="auto">As promised, in my <strong>previous</strong> post, here are below some <strong>tables</strong>, summarizing all the <strong>Latin word</strong> characters which are <strong>matched</strong> with the regex syntax <strong><code>&lsqb;&lsqb;=</code>X<code>=&rsqb;&rsqb;</code></strong>, where <strong>X</strong> is <strong>any single</strong> character of the <strong>second</strong> column of each table, which is <strong>NOT</strong> surrounded with <strong>square brackets</strong> !</p>
<p dir="auto">For instance, the table, regarding the <strong>Latin</strong> letter <strong><code>M</code></strong>, with :</p>
<ul>
<li>
<p dir="auto">The <strong>Unicode</strong> value of a <strong>similar</strong>  <strong><code>M</code></strong> character in  the <strong>first</strong> column</p>
</li>
<li>
<p dir="auto">The <strong>character</strong>, itself, in the <strong>second</strong> column</p>
</li>
</ul>
<p dir="auto">is the following :</p>
<pre><code class="language-diff">•------•-------•
| Code | Char. |
•------•-------•
| 004D |   M   |
| 006D |   m   |
| 019C |   Ɯ   |
| 026F | [ ɯ ] |
| 0270 | [ ɰ ] |
| 0271 | [ ɱ ] |
| 1E3E |   Ḿ   |
| 1E3F |   ḿ   |
| 1E40 |   Ṁ   |
| 1E41 | [ ṁ ] |
| 1E42 | [ Ṃ ] |
| 1E43 | [ ṃ ] |
| 2133 |   ℳ   |
| 24A8 |   ⒨   |
| 24C2 |   Ⓜ   |
| 24DC |   ⓜ   |
| FF2D |   Ｍ   |
| FF4D |   ｍ   |
•------•-------•
</code></pre>
<p dir="auto">This means that the regex <strong><code>&lsqb;&lsqb;=M=&rsqb;&rsqb;</code></strong> matches <strong>any single</strong> character of the list :</p>
<p dir="auto"><strong><code>[MmƜɯɰɱḾḿṀṁṂṃℳ⒨ⓂⓜＭｍ]</code></strong></p>
<p dir="auto"><strong>Notes</strong> :</p>
<ul>
<li>
<p dir="auto">Any <strong>similar <code>M</code></strong> character of that list, that is <strong>NOT</strong> between <strong>square</strong> brackets ( IMPORTANT ), may be inserted in the regex syntax <strong><code>&lsqb;&lsqb;==&rsqb;&rsqb;</code></strong> ! For instance the regex <strong><code>&lsqb;&lsqb;=Ɯ=&rsqb;&rsqb;</code></strong> or the regex <strong><code>&lsqb;&lsqb;=Ｍ=&rsqb;&rsqb;</code></strong> would <strong>also</strong> match <strong>any</strong> char of the list <strong>above</strong> !!</p>
</li>
<li>
<p dir="auto">Any character, between <strong>square</strong> brackets, in the table <strong>above</strong>, gives <strong>wrong</strong> results when used <strong>inside</strong> an <strong>equivalence class</strong> (  <strong><code>&lsqb;&lsqb;==&rsqb;&rsqb;</code></strong> ) <strong>Can’t</strong> explain exactly why, anyway !</p>
</li>
<li>
<p dir="auto">Probably, depending of your <strong>current</strong> font, some characters of the tables, below, may be replaced by a <strong>question mark</strong> (  <strong><code>?</code></strong> ), a small <strong>square box</strong>  ( <strong>□</strong> ) or a <strong>black square</strong> box ( <strong>■</strong> ) because your <strong>current</strong> font <strong>cannot</strong> display the <strong>glyph</strong> of these characters !</p>
</li>
<li>
<p dir="auto">You may <strong>download</strong>, just for <strong>test</strong>, a suitable font, as <strong>Fixedsys Excelsior 2.00</strong>, which <strong>correctly</strong> displays all these characters, from the link :</p>
</li>
</ul>
<p dir="auto"><a href="http://www.fixedsysexcelsior.com/fonts/FSEX2.ttf" rel="nofollow ugc">http://www.fixedsysexcelsior.com/fonts/FSEX2.ttf</a></p>
<p dir="auto"><strong>No</strong> need to <strong>install</strong> that font, on your system ! Simply :</p>
<ul>
<li>
<p dir="auto"><strong>Double-click</strong> on that <strong><code> .ttf</code></strong> file to open it</p>
</li>
<li>
<p dir="auto"><strong>Close</strong> and <strong>re-open</strong> Notepad++</p>
</li>
<li>
<p dir="auto">Select the <strong>default</strong> style ( <strong>Settings &gt; Style Configurator… Global Styles &gt; Default Style</strong> )</p>
</li>
<li>
<p dir="auto">Choose the <strong><code>Fixedsys Excelsior 2.00</code></strong> font, in the drop-down list, in the <strong>Font style</strong> zone</p>
</li>
<li>
<p dir="auto">Click on the <strong>Save &amp; Close</strong> button</p>
</li>
</ul>
<hr />
<p dir="auto">So, here are, below, the different <strong>tables</strong> of characters, matched by an <strong>equivalence class</strong> ( <strong><code>&lsqb;&lsqb;=</code>X<code>=&rsqb;&rsqb;</code></strong> )</p>
<p dir="auto"><strong>Notes</strong> :</p>
<ul>
<li>
<p dir="auto">Due to a <strong>maximum</strong> size of <strong><code>16Ko</code></strong> for a post, these tables must be split over <strong>several</strong> posts !</p>
</li>
<li>
<p dir="auto">The <strong>vertical</strong> lines, will <strong>not</strong> be, exactly, <strong>aligned</strong> because of the <strong>specific width</strong> of some characters :-(</p>
</li>
<li>
<p dir="auto"><strong>Last</strong> info :</p>
</li>
</ul>
<p dir="auto">When using the <strong><code>SciLexer.dll</code></strong> library of <strong>François-R Boyer</strong>, with N++ version <strong><code>v6.9.0</code></strong>, containing <strong>many improvements</strong> of the regex engine, with search processed in <strong>true 32 bit</strong> code-points ( instead of <strong>UTF-16</strong> ), even the <strong>Latin</strong> characters, between <strong>square</strong> brackets give <strong>correct</strong> results !</p>
<p dir="auto">Again, it’s really a <strong>pity</strong> that this <strong>improved</strong> regex engine can’t be <strong>implemented</strong> and <strong>updated</strong> with <strong>recent</strong> versions of N++ :-((( So many regex <strong>issues</strong> would disappear !</p>
<hr />
<pre><code class="language-diff">            Different characters X found, when using an EQUIVALENCE CLASS &lsqb;&lsqb;=X=&rsqb;&rsqb; :
            ¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯


•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |
•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 0041 |   A   |      | 0042 |   B   |      | 0043 |   C   |      | 0044 |   D   |      | 0045 |   E   |
| 0061 |   a   |      | 0062 |   b   |      | 0063 |   c   |      | 0064 |   d   |      | 0065 |   e   |
| 00AA |   ª   |      | 0180 |   ƀ   |      | 00C7 |   Ç   |      | 00D0 |   Ð   |      | 00C8 |   È   |
| 00C0 |   À   |      | 0181 |   Ɓ   |      | 00E7 |   ç   |      | 00F0 |   ð   |      | 00C9 |   É   |
| 00C1 |   Á   |      | 0182 |   Ƃ   |      | 0106 |   Ć   |      | 010E |   Ď   |      | 00CA |   Ê   |
| 00C2 |   Â   |      | 0183 |   ƃ   |      | 0107 |   ć   |      | 010F |   ď   |      | 00CB |   Ë   |
| 00C3 |   Ã   |      | 0184 |   Ƅ   |      | 0108 |   Ĉ   |      | 0110 |   Đ   |      | 00E8 |   è   |
| 00C4 |   Ä   |      | 0185 |   ƅ   |      | 0109 |   ĉ   |      | 0111 |   đ   |      | 00E9 |   é   |
| 00C5 |   Å   |      | 0253 | [ ɓ ] |      | 010A |   Ċ   |      | 018A |   Ɗ   |      | 00EA |   ê   |
| 00E0 |   à   |      | 0299 |   ʙ   |      | 010B |   ċ   |      | 018B |   Ƌ   |      | 00EB |   ë   |
| 00E1 |   á   |      | 1E02 |   Ḃ   |      | 010C |   Č   |      | 018C |   ƌ   |      | 0112 |   Ē   |
| 00E2 |   â   |      | 1E03 |   ḃ   |      | 010D |   č   |      | 018D |   ƍ   |      | 0113 |   ē   |
| 00E3 |   ã   |      | 1E04 |   Ḅ   |      | 0186 |   Ɔ   |      | 0256 | [ ɖ ] |      | 0114 |   Ĕ   |
| 00E4 |   ä   |      | 1E05 |   ḅ   |      | 0187 |   Ƈ   |      | 0257 | [ ɗ ] |      | 0115 |   ĕ   |
| 00E5 |   å   |      | 1E06 |   Ḇ   |      | 0188 |   ƈ   |      | 02A4 |   ʤ   |      | 0116 |   Ė   |
| 0100 |   Ā   |      | 1E07 |   ḇ   |      | 0254 | [ ɔ ] |      | 1E0A |   Ḋ   |      | 0117 |   ė   |
| 0101 |   ā   |      | 212C |   ℬ   |      | 0255 | [ ɕ ] |      | 1E0B |   ḋ   |      | 0118 |   Ę   |
| 0102 |   Ă   |      | 249D |   ⒝   |      | 0297 |   ʗ   |      | 1E0C |   Ḍ   |      | 0119 |   ę   |
| 0103 |   ă   |      | 24B7 |   Ⓑ   |      | 1E08 |   Ḉ   |      | 1E0D |   ḍ   |      | 011A |   Ě   |
| 0104 |   Ą   |      | 24D1 |   ⓑ   |      | 1E09 |   ḉ   |      | 1E0E |   Ḏ   |      | 011B |   ě   |
| 0105 |   ą   |      | FF22 |   Ｂ   |      | 2102 |   ℂ   |     | 1E0F |   ḏ   |      | 018E |   Ǝ   |
| 01CD |   Ǎ   |      | FF42 |   ｂ   |      | 2103 |   ℃   |     | 1E10 |   Ḑ   |      | 018F |   Ə   |
| 01CE |   ǎ   |      •------•-------•      | 212D |   ℭ   |      | 1E11 |   ḑ   |      | 0190 |   Ɛ   |
| 01DE |   Ǟ   |                            | 249E |   ⒞   |      | 1E12 |   Ḓ   |      | 01DD |   ǝ   |
| 01DF |   ǟ   |                            | 24B8 |   Ⓒ   |      | 1E13 |   ḓ   |      | 0204 |   Ȅ   |
| 01E0 |   Ǡ   |                            | 24D2 |   ⓒ   |      | 249F |   ⒟   |      | 0205 |   ȅ   |
| 01E1 |   ǡ   |                            | FF23 |   Ｃ   |      | 24B9 |   Ⓓ   |     | 0206 |   Ȇ   |
| 01FA |   Ǻ   |                            | FF43 |   ｃ   |      | 24D3 |   ⓓ   |     | 0207 |   ȇ   |
| 01FB |   ǻ   |                            •------•-------•      | FF24 |   Ｄ   |      | 0258 | [ ɘ ] |
| 0200 |   Ȁ   |                                                  | FF44 |   ｄ   |      | 0259 | [ ə ] |
| 0201 |   ȁ   |                                                  •------•-------•      | 025A | [ ɚ ] |
| 0202 |   Ȃ   |                                                                        | 025B |   ɛ   |
| 0203 |   ȃ   |                                                                        | 025C |   ɜ   |
| 0250 | [ ɐ ] |                                                                        | 025D |   ɝ   |
| 0251 | [ ɑ ] |      •------•-------•      •------•-------•      •------•-------•      | 025E |   ɞ   |
| 0252 | [ ɒ ] |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | 0283 |   ʃ   |
| 1E00 |   Ḁ   |      •------•-------•      •------•-------•      •------•-------•      | 0285 |   ʅ   |
| 1E01 |   ḁ   |      | 0046 |   F   |      | 0047 |   G   |      | 0048 |   H   |      | 0286 |   ʆ   |
| 1E9A |   ẚ   |      | 0066 |   f   |      | 0067 |   g   |      | 0068 |   h   |      | 029A |   ʚ   |
| 1EA0 |   Ạ   |      | 0191 |   Ƒ   |      | 011C |   Ĝ   |      | 0124 |   Ĥ   |      | 1E14 |   Ḕ   |
| 1EA1 |   ạ   |      | 0192 |   ƒ   |      | 011D |   ĝ   |      | 0125 |   ĥ   |      | 1E15 |   ḕ   |
| 1EA2 |   Ả   |      | 1E1E |   Ḟ   |      | 011E |   Ğ   |      | 0126 |   Ħ   |      | 1E16 |   Ḗ   |
| 1EA3 |   ả   |      | 1E1F |   ḟ   |      | 011F |   ğ   |      | 0127 |   ħ   |      | 1E17 |   ḗ   |
| 1EA4 |   Ấ   |      | 2109 |   ℉   |      | 0120 |   Ġ   |      | 0195 |   ƕ   |      | 1E18 |   Ḙ   |
| 1EA5 |   ấ   |      | 2131 |   ℱ   |      | 0121 |   ġ   |      | 0265 | [ ɥ ] |      | 1E19 |   ḙ   |
| 1EA6 |   Ầ   |      | 2132 |   Ⅎ   |      | 0122 |   Ģ   |      | 0266 | [ ɦ ] |      | 1E1A |   Ḛ   |
| 1EA7 |   ầ   |      | 24A1 |   ⒡   |      | 0123 |   ģ   |      | 029C |   ʜ   |      | 1E1B |   ḛ   |
| 1EA8 |   Ẩ   |      | 24BB |   Ⓕ   |      | 0193 |   Ɠ   |      | 1E22 |   Ḣ   |      | 1E1C |   Ḝ   |
| 1EA9 |   ẩ   |      | 24D5 |   ⓕ   |      | 0194 |   Ɣ   |      | 1E23 |   ḣ   |      | 1E1D |   ḝ   |
| 1EAA |   Ẫ   |      | FF26 |   Ｆ   |      | 01E4 |   Ǥ   |      | 1E24 |   Ḥ   |      | 1EB8 |   Ẹ   |
| 1EAB |   ẫ   |      | FF46 |   ｆ   |       | 01E5 |   ǥ   |      | 1E25 |   ḥ   |      | 1EB9 |   ẹ   |
| 1EAC |   Ậ   |      •------•-------•      | 01E6 |   Ǧ   |      | 1E26 |   Ḧ   |      | 1EBA |   Ẻ   |
| 1EAD |   ậ   |                            | 01E7 |   ǧ   |      | 1E27 |   ḧ   |      | 1EBB |   ẻ   |
| 1EAE |   Ắ   |                            | 01F4 |   Ǵ   |      | 1E28 |   Ḩ   |      | 1EBC |   Ẽ   |
| 1EAF |   ắ   |                            | 01F5 |   ǵ   |      | 1E29 |   ḩ   |      | 1EBD |   ẽ   |
| 1EB0 |   Ằ   |                            | 0260 | [ ɠ ] |      | 1E2A |   Ḫ   |      | 1EBE |   Ế   |
| 1EB1 |   ằ   |                            | 0261 | [ ɡ ] |      | 1E2B |   ḫ   |      | 1EBF |   ế   |
| 1EB2 |   Ẳ   |                            | 0262 | [ ɢ ] |      | 1E96 |   ẖ   |      | 1EC0 |   Ề   |
| 1EB3 |   ẳ   |                            | 0263 | [ ɣ ] |      | 210B |   ℋ   |      | 1EC1 |   ề   |
| 1EB4 |   Ẵ   |                            | 0264 | [ ɤ ] |      | 210C |   ℌ   |      | 1EC2 |   Ể   |
| 1EB5 |   ẵ   |                            | 029B |   ʛ   |      | 210D |   ℍ   |      | 1EC3 |   ể   |
| 1EB6 |   Ặ   |                            | 1E20 |   Ḡ   |      | 210E |   ℎ   |      | 1EC4 |   Ễ   |
| 1EB7 |   ặ   |                            | 1E21 |   ḡ   |      | 210F |   ℏ   |      | 1EC5 |   ễ   |
| 212B |   Å   |                            | 210A |   ℊ   |      | 24A3 |   ⒣   |      | 1EC6 |   Ệ   |
| 249C |   ⒜   |                            | 24A2 |   ⒢   |      | 24BD |   Ⓗ   |      | 1EC7 |   ệ   |
| 24B6 |   Ⓐ   |                            | 24BC |   Ⓖ   |      | 24D7 |   ⓗ   |      | 2107 |   ℇ   |
| 24D0 |   ⓐ   |                           | 24D6 |   ⓖ   |     | FF28 |   Ｈ   |      | 212E |   ℮   |
| FF21 |   Ａ   |                            | FF27 |   Ｇ   |     | FF48 |   ｈ   |      | 212F |   ℯ   |
| FF41 |   ａ   |                            | FF47 |   ｇ   |      •------•-------•      | 2130 |   ℰ   |
•------•-------•                            •------•-------•                            | 24A0 |   ⒠   |
                                                                                        | 24BA |   Ⓔ   |
                                                                                        | 24D4 |   ⓔ   |
                                                                                        | FF25 |   Ｅ   |
                                                                                        | FF45 |   ｅ   |
                                                                                        •------•-------•


•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |      | Code | Char. |
•------•-------•      •------•-------•      •------•-------•      •------•-------•      •------•-------•
| 0049 |   I   |      | 004A |   J   |      | 004B |   K   |      | 004C |   L   |      | 004D |   M   |
| 0069 |   i   |      | 006A |   j   |      | 006B |   k   |      | 006C |   l   |      | 006D |   m   |
| 00CC |   Ì   |      | 0134 |   Ĵ   |      | 0136 |   Ķ   |      | 0139 |   Ĺ   |      | 019C |   Ɯ   |
| 00CD |   Í   |      | 0135 |   ĵ   |      | 0137 |   ķ   |      | 013A |   ĺ   |      | 026F | [ ɯ ] |
| 00CE |   Î   |      | 01F0 |   ǰ   |      | 0138 |   ĸ   |      | 013B |   Ļ   |      | 0270 | [ ɰ ] |
| 00CF |   Ï   |      | 025F |   ɟ   |      | 0198 |   Ƙ   |      | 013C |   ļ   |      | 0271 | [ ɱ ] |
| 00EC |   ì   |      | 0284 |   ʄ   |      | 0199 |   ƙ   |      | 013D |   Ľ   |      | 1E3E |   Ḿ   |
| 00ED |   í   |      | 029D |   ʝ   |      | 01E8 |   Ǩ   |      | 013E |   ľ   |      | 1E3F |   ḿ   |
| 00EE |   î   |      | 24A5 |   ⒥   |      | 01E9 |   ǩ   |      | 013F |   Ŀ   |      | 1E40 |   Ṁ   |
| 00EF |   ï   |      | 24BF |   Ⓙ   |      | 029E |   ʞ   |      | 0140 |   ŀ   |      | 1E41 | [ ṁ ] |
| 0128 |   Ĩ   |      | 24D9 |   ⓙ   |      | 1E30 |   Ḱ   |      | 0141 | [ Ł ] |      | 1E42 | [ Ṃ ] |
| 0129 |   ĩ   |      | FF2A |   Ｊ   |      | 1E31 |   ḱ   |      | 0142 | [ ł ] |      | 1E43 | [ ṃ ] |
| 012A |   Ī   |      | FF4A |   ｊ   |       | 1E32 |   Ḳ   |      | 019A |   ƚ   |      | 2133 |   ℳ   |
| 012B |   ī   |      •------•-------•      | 1E33 |   ḳ   |      | 019B |   ƛ   |      | 24A8 |   ⒨   |
| 012C |   Ĭ   |                            | 1E34 |   Ḵ   |      | 026B | [ ɫ ] |      | 24C2 |   Ⓜ   |
| 012D |   ĭ   |                            | 1E35 |   ḵ   |      | 026C |   ɬ   |      | 24DC |   ⓜ   |
| 012E |   Į   |                            | 24A6 |   ⒦   |     | 026D | [ ɭ ] |       | FF2D |   Ｍ   |
| 012F |   į   |                            | 24C0 |   Ⓚ   |     | 026E | [ ɮ ] |      | FF4D |   ｍ   |
| 0130 |   İ   |                            | 24DA |   ⓚ   |      | 029F |   ʟ   |      •------•-------•
| 0131 |   ı   |                            | FF2B |   Ｋ   |      | 1E36 |   Ḷ   |
| 0196 |   Ɩ   |                             | FF4B |   ｋ   |      | 1E37 |   ḷ   |
| 0197 |   Ɨ   |                            •------•-------•      | 1E38 |   Ḹ   |
| 01CF |   Ǐ   |                                                  | 1E39 |   ḹ   |
| 01D0 |   ǐ   |                                                  | 1E3A |   Ḻ   |
| 0208 |   Ȉ   |                                                  | 1E3B |   ḻ   |
| 0209 |   ȉ   |                                                  | 1E3C |   Ḽ   |
| 020A |   Ȋ   |                                                  | 1E3D |   ḽ   |
| 020B |   ȋ   |                                                  | 2112 |   ℒ   |
| 0268 | [ ɨ ] |                                                  | 2113 |   ℓ   |
| 0269 |   ɩ   |                                                  | 24A7 |   ⒧   |
| 026A | [ ɪ ] |                                                  | 24C1 |   Ⓛ   |
| 1E2C |   Ḭ   |                                                  | 24DB |   ⓛ   |
| 1E2D |   ḭ   |                                                   | FF2C |   Ｌ   |
| 1E2E |   Ḯ   |                                                   | FF4C |   ｌ   |
| 1E2F |   ḯ   |                                                  •------•-------•
| 1EC8 |   Ỉ   |
| 1EC9 |   ỉ   |
| 1ECA |   Ị   |
| 1ECB |   ị   |
| 2110 |   ℐ   |
| 2111 |   ℑ   |
| 24A4 |   ⒤   |
| 24BE |   Ⓘ   |
| 24D8 |   ⓘ   |
| FF29 |   Ｉ   |
| FF49 |   ｉ   |
•------•-------•
</code></pre>
<p dir="auto">guy038</p>
]]></description><link>https://community.notepad-plus-plus.org/post/32029</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/32029</guid><dc:creator><![CDATA[guy038]]></dc:creator><pubDate>Fri, 04 May 2018 17:02:28 GMT</pubDate></item><item><title><![CDATA[Reply to When case insensitive sorting will be added in Notepad++ x64 (without TextFX)? on Fri, 04 May 2018 10:42:56 GMT]]></title><description><![CDATA[<p dir="auto">Hello, @piotr-Przeklasa and <strong>All</strong>,</p>
<p dir="auto">So, here is my <strong>complete</strong> method :</p>
<ul>
<li><strong>Phase <code>1</code></strong> On your list to be <strong>sorted</strong>, perform the <strong>simple</strong> regex S/R :</li>
</ul>
<p dir="auto">SEARCH <strong><code>(?-s).+</code></strong></p>
<p dir="auto">REPLACE <strong><code>$0\x1F$0</code></strong></p>
<ul>
<li><strong>Phase <code>2</code></strong> Now, use this ( <strong>more</strong> complicated ! ) regex S/R  :</li>
</ul>
<p dir="auto">SEARCH <strong><code>(?i)(?=.*\x1F)(?:([A-Z0-9]+)|(&lsqb;&lsqb;=a=&rsqb;&rsqb;)|(&lsqb;&lsqb;=b=&rsqb;&rsqb;)|(&lsqb;&lsqb;=c=&rsqb;&rsqb;)|(&lsqb;&lsqb;=d=&rsqb;&rsqb;)|(&lsqb;&lsqb;=e=&rsqb;&rsqb;)|(&lsqb;&lsqb;=f=&rsqb;&rsqb;)|(&lsqb;&lsqb;=g=&rsqb;&rsqb;)|(&lsqb;&lsqb;=h=&rsqb;&rsqb;)|(&lsqb;&lsqb;=i=&rsqb;&rsqb;)|(&lsqb;&lsqb;=j=&rsqb;&rsqb;)|(&lsqb;&lsqb;=k=&rsqb;&rsqb;)|(&lsqb;&lsqb;=l=&rsqb;&rsqb;)|(&lsqb;&lsqb;=m=&rsqb;&rsqb;)|(&lsqb;&lsqb;=n=&rsqb;&rsqb;)|(&lsqb;&lsqb;=o=&rsqb;&rsqb;)|(&lsqb;&lsqb;=p=&rsqb;&rsqb;)|(&lsqb;&lsqb;=q=&rsqb;&rsqb;)|(&lsqb;&lsqb;=r=&rsqb;&rsqb;)|(&lsqb;&lsqb;=s=&rsqb;&rsqb;)|(&lsqb;&lsqb;=t=&rsqb;&rsqb;)|(&lsqb;&lsqb;=u=&rsqb;&rsqb;)|(&lsqb;&lsqb;=v=&rsqb;&rsqb;)|(&lsqb;&lsqb;=w=&rsqb;&rsqb;)|(&lsqb;&lsqb;=x=&rsqb;&rsqb;)|(&lsqb;&lsqb;=y=&rsqb;&rsqb;)|(&lsqb;&lsqb;=z=&rsqb;&rsqb;)|(\x{00C6})|(\x{0152})|(\x{0218}|\x{0219})|(\x{021A}|\x{021B})|(\x{00A0})|[^\x1F\x20\xDE\r\n])</code></strong></p>
<p dir="auto">REPLACE <strong><code>(?1\L\1)(?2a)(?3b)(?4c)(?5d)(?6e)(?7f)(?8g)(?9h)(?10i)(?11j)(?12k)(?13l)(?14m)(?15n)(?16o)(?17p)(?18q)(?19r)(?20s)(?21t)(?22u)(?23v)(?24w)(?25x)(?26y)(?27z)(?28ae)(?29oe)(?30s)(?31t)(?32\x20)</code></strong></p>
<ul>
<li>
<p dir="auto"><strong>Phase <code>3</code></strong> Run a classical <strong>N++ sort</strong> using the menu option <strong>Edit &gt; Line Operations &gt; Sort Lines Lexicographically Ascending</strong></p>
</li>
<li>
<p dir="auto"><strong>Phase <code>4</code></strong> Perform a <strong>last</strong> regex S/R, in order to <strong>delete</strong> the <strong>primary temporary</strong> key :</p>
</li>
</ul>
<p dir="auto">SEARCH <strong><code>^.+\x1F</code></strong></p>
<p dir="auto">REPLACE <strong><code>Leave EMPTY</code></strong></p>
<p dir="auto"><strong>Notes</strong> :</p>
<p dir="auto">In search, <strong>until</strong> the <strong>US</strong> separator character is <strong>reached</strong>, the regex searches for :</p>
<ul>
<li>
<p dir="auto">Any range of <strong>consecutive word</strong> characters of <strong>group <code>1</code></strong>, ( part <strong><code>([A-Z0-9]+)</code></strong> ), <strong>whatever</strong> its case, replaced by the <strong>same</strong> range of characters, in <strong>lower</strong> case, due to the <strong>conditional</strong> replacement <strong><code>(?1\L\1)</code></strong></p>
</li>
<li>
<p dir="auto">Then, from <strong>group <code>2</code></strong>  to <strong>group <code>27</code></strong> ( part <strong><code>(&lsqb;&lsqb;=x=&rsqb;&rsqb;)</code></strong> ), any <strong>accentuated Latin</strong> letter <strong><code>x</code></strong>  is changed into a classical <strong>lower-case</strong> letter with the conditional replacement <strong><code>(?##x)</code></strong></p>
</li>
<li>
<p dir="auto">Now, <strong>group <code>28</code></strong>, ( part <strong><code>(\x{00C6})</code></strong> ), rewrites the <strong><code>Æ</code></strong>  and <strong><code>æ</code></strong> ligatures as the <strong>two lowercase</strong> letters <strong>ae</strong></p>
</li>
<li>
<p dir="auto">And <strong>group <code>29</code></strong>, ( part <strong><code>(\x{0152})</code></strong> ), rewrites the <strong><code>Œ</code></strong>  and <strong><code>œ</code></strong> ligatures as the <strong>two lowercase</strong> letters <strong>oe</strong></p>
</li>
<li>
<p dir="auto">Now, the <strong>Romanian</strong> letters <strong><code>Ș</code></strong> and <strong><code>ș</code></strong>, of <strong>group <code>30</code></strong> ( part <strong><code>(\x{0218}|\x{0219})</code></strong> ), are changed into the <strong>lowercase <code>s</code></strong> letter</p>
</li>
<li>
<p dir="auto">And the <strong>Romanian</strong> letters <strong><code>Ț</code></strong> and <strong><code>ț</code></strong>, of <strong>grouip <code>31</code></strong> ( part <strong><code>(\x{021A}|\x{021B})</code></strong> ), are changed into the <strong>lowercase <code>t</code></strong> letter</p>
</li>
</ul>
<p dir="auto"><strong>Note</strong> : The  <strong><code>Œ</code></strong> ligature and the  <strong>Romanian</strong> letters <strong><code>Ș</code></strong> and <strong><code>Ț</code></strong> are <strong>not</strong> handled by the <strong><code>&lsqb;&lsqb;=</code>X<code>=&rsqb;&rsqb;</code></strong> feature !</p>
<ul>
<li>
<p dir="auto">Then the <strong>group<code>32</code></strong> ( the <strong><code>NBSP</code></strong> character = <strong><code>(\x{00A0})</code></strong> ) is rewritten as a single <strong><code>Space</code></strong> character</p>
</li>
<li>
<p dir="auto">Finally, the regex <strong><code>[^\x1F\x20\xDE\r\n]</code></strong> matches <strong>any</strong> signle character, <strong>different</strong> from :</p>
<ul>
<li>
<p dir="auto"><strong>All</strong> the characters <strong>previously</strong> matched</p>
</li>
<li>
<p dir="auto">The <strong><code>US</code></strong> control character <strong>separator</strong></p>
</li>
<li>
<p dir="auto">The <strong><code>SPACE</code></strong> character</p>
</li>
<li>
<p dir="auto">The <strong>lower</strong> and <strong>upper</strong> case <strong><code>THORN</code></strong> letters, which are <strong>unchanged</strong> by this S/R</p>
</li>
<li>
<p dir="auto">Of course, the <strong>line-break</strong> characters, too</p>
</li>
</ul>
<p dir="auto">which are simply <strong>deleted</strong> because <strong>not stored</strong> in <strong>any existing</strong> group !</p>
</li>
</ul>
<p dir="auto">Best Regards,</p>
<p dir="auto">guy038</p>
<p dir="auto"><strong>P.S.</strong> :</p>
<p dir="auto">By the way, this method may be <strong>used</strong> when you want to sort a text, <strong>either</strong> :</p>
<ul>
<li>
<p dir="auto">From a <strong>specific</strong> colum <strong><code>c</code></strong></p>
</li>
<li>
<p dir="auto">On <strong>specific</strong> blocks of text, from columns <strong><code>c1</code></strong> to <strong><code>c2</code></strong>, <strong><code>c3</code></strong> to <strong><code>c4</code></strong>, and so on…</p>
</li>
</ul>
<p dir="auto">For instance, assuming that you would like to sort a list, with :</p>
<ul>
<li>
<p dir="auto">A <strong>primary</strong> key, from colums <strong><code>20</code></strong> to <strong><code>22</code></strong></p>
</li>
<li>
<p dir="auto">A <strong>secondary</strong> key, from colums <strong><code>8</code></strong> to <strong><code>12</code></strong></p>
</li>
<li>
<p dir="auto">A <strong>third</strong> key, using <strong>all</strong> text starting at column <strong><code>30</code></strong></p>
</li>
</ul>
<p dir="auto">Just build a regex which :</p>
<ul>
<li>
<p dir="auto">Copy the <strong><code>3</code></strong> characters, in columns <strong><code>20</code></strong> ,<strong><code>21</code></strong>  and <strong><code>22</code></strong>, at <strong>beginning</strong> of <strong>each</strong> line</p>
</li>
<li>
<p dir="auto">Then, adds the <strong><code>5</code></strong> characters, present between columns <strong><code>8</code></strong> to <strong><code>12</code></strong> of <strong>each</strong> line</p>
</li>
<li>
<p dir="auto">Then, adds the <strong>remaining</strong> text, of <strong>each</strong> line, starting with column <strong><code>30</code></strong></p>
</li>
<li>
<p dir="auto">And, finally, adds the <strong>US</strong> control character as a <strong>separator</strong>, <strong>followed</strong> with each <strong>initial</strong> line, in order to create a <strong><code>pseudo fourth</code></strong> key ( the <strong>entire</strong> contents of <strong>each</strong> line ) in the case that the <strong>first three</strong> keys would be <strong>identical</strong>, in <strong>several</strong> lines !</p>
</li>
</ul>
<p dir="auto">That is to say, the following regex :</p>
<p dir="auto">SEARCH <strong><code>(?-s)^.{7}(.....).......(...).......(.*)</code></strong></p>
<p dir="auto">REPLACE <strong><code>\2\1\3\x1F$0</code></strong></p>
]]></description><link>https://community.notepad-plus-plus.org/post/32019</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/32019</guid><dc:creator><![CDATA[guy038]]></dc:creator><pubDate>Fri, 04 May 2018 10:42:56 GMT</pubDate></item><item><title><![CDATA[Reply to When case insensitive sorting will be added in Notepad++ x64 (without TextFX)? on Tue, 08 Nov 2022 21:49:35 GMT]]></title><description><![CDATA[<p dir="auto">Hello, @piotr-Przeklasa and <strong>All</strong>,</p>
<p dir="auto">I’ve deeply thought about your problem, since a <strong>week</strong> and, finally, I found out a <strong>work-round</strong> to change the <strong>default</strong> behaviour of the N++ <strong>sort</strong> ;-)</p>
<p dir="auto">While elaborating my solution, I read some <strong>documentation</strong> about <strong>sorting</strong> and ,to my mind, it’s quite <strong>difficult</strong> to chose which <strong>sorting</strong> method is the <strong>best</strong> one, as it <strong>mainly</strong> depends on the kind of <strong>data</strong>, that you have to sort ! So, I tried to find a “<strong>middle</strong>” solution, which, sometimes, should be modified for a <strong>specific</strong> sorting way.</p>
<p dir="auto">Referring to the links, below, it appears, to everyone, that <strong>sorting</strong> things are quite <strong>complex</strong> if you consider <strong>all</strong> countries, using <strong>Latin</strong> languages, and, even if you restrict your goal to <strong>American-English</strong> language, <strong>several</strong> sorting methods are still possible !</p>
<p dir="auto"><a href="https://en.wikipedia.org/wiki/Alphabetical_order" rel="nofollow ugc">https://en.wikipedia.org/wiki/Alphabetical_order</a></p>
<p dir="auto"><a href="https://en.wikipedia.org/wiki/List_of_precomposed_Latin_characters_in_Unicode" rel="nofollow ugc">https://en.wikipedia.org/wiki/List_of_precomposed_Latin_characters_in_Unicode</a></p>
<hr />
<p dir="auto">For instance, here is a list of some <strong>streets</strong> of the <strong>Great London</strong> area :</p>
<pre><code class="language-diff">NON sorted :                             NOTEPAD++ Sort :                            Sort B :
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Oakworth                                 Oak Avenue                                  Oak Avenue
Oak Ridge                                Oak Bank Croy                               Oak Bank Croy
Oak Hill Way                             Oak Gardens                                 Oakcroft Road
Oakwood                                  Oak Hill                                    Oak Gardens
Oak Hill                                 Oak Hill Close                              Oak Hill
Oak Bank Croy                            Oak Hill Way                                Oak Hill Close
Oakhill Drive                            Oak Lane                                    Oakhill Drive
Oakley River                             Oak Loadge Avenue                           Oak Hill Way
Oak Rise                                 Oak Ridge                                   Oaklands Avenue
Oak Avenue                               Oak Rise                                    Oak Lane
Oakway Close                             Oak Road                                    Oakleigh Park
Oakcroft Road                            Oakcroft Road                               Oakley Avenue
Oak Hill Close                           Oakhill Drive                               Oakley Park
Oak Gardens                              Oaklands Avenue                             Oakley River
Oak Lane                                 Oakleigh Park                               Oak Loadge Avenue
Oakleigh Park                            Oakley Avenue                               Oak Ridge
Oakley Avenue                            Oakley Park                                 Oak Rise
Oak Loadge Avenue                        Oakley River                                Oak Road
Oakley Park                              Oakway Close                                Oakway Close
Oaklands Avenue                          Oakwood                                     Oakwood
Oak Road                                 Oakworth                                    Oakworth
</code></pre>
<ul>
<li>
<p dir="auto">The <strong>unsorted</strong> list is displayed in the <strong>first</strong> column</p>
</li>
<li>
<p dir="auto">The list sorted with the <strong>Notepad++</strong> sort, according to <strong>Unicode</strong> values of characters, is in the <strong>second</strong> column</p>
</li>
<li>
<p dir="auto">An <strong>alternate sort</strong>, generally used, in street <strong>indexes</strong>, where <strong>spaces</strong> are <strong>not</strong> taken in account, during the <strong>sort</strong> process, is added in the <strong>third</strong> column</p>
</li>
</ul>
<p dir="auto">Personally, in that matter, the <strong>N++ sort</strong> seems to give <strong>better</strong> results :-)</p>
<hr />
<p dir="auto">Now, we must discuss of all <strong>symbols</strong>, which may occur <strong>inside</strong> words. Here is a <strong>short</strong> list of some English/ American <strong>abbreviations</strong> :</p>
<pre><code class="language-diff">NON sorted :                             N++ Sort :                                  Sort B :
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Corp                                     C-in-C                                      CAT
cp                                       C.E.N.T.O                                   C.E.N.T.O
C.O.I.                                   C.O.I.                                      cert
C-in-C                                   C.S.                                        C-in-C
CAT                                      C.S.E.                                      c/o
C.S.                                     CAT                                         C.O.I.
c/o                                      Co-op                                       Co-op
C.S.E.                                   Corp                                        Corp
Co-op                                    c/o                                         cp
C.E.N.T.O                                cert                                        C.S.
cert                                     cp                                          C.S.E.
</code></pre>
<ul>
<li>
<p dir="auto">The <strong>unsorted</strong> list is displayed in the <strong>first</strong> column</p>
</li>
<li>
<p dir="auto">The N++ sorted list, in the <strong>second</strong> column, is rather <strong>difficult</strong> to read, because words with <strong>symbols</strong> appear, <strong>first</strong> !</p>
</li>
<li>
<p dir="auto">This time, the <strong>alternate</strong> list, in the <strong>third</strong> column, where symbols are <strong>not</strong> taken in account, gives <strong>better</strong> results, indeed !</p>
</li>
</ul>
<hr />
<p dir="auto">Finally, we have to talk about <strong>upper/lower</strong> case letters and about <strong>Latin accentuated</strong> characters, very common in many <strong>European</strong> languages !</p>
<p dir="auto">For instance, here is, below, a list of the <strong>given</strong> and <strong>family</strong> names of some “<strong>celebrities</strong>”, taken from the general list, below :</p>
<p dir="auto"><a href="https://en.wikipedia.org/wiki/Lists_of_people_by_nationality#By_nationality" rel="nofollow ugc">https://en.wikipedia.org/wiki/Lists_of_people_by_nationality#By_nationality</a></p>
<pre><code class="language-diff">NON sorted :                             N++ Sort :                                  Sort B :
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
İshak Alaton                             Abdi İpekçi                                 Abdi İpekçi
şükrü Gülesin                            Jón Ásgeir Jóhannesson                      aleksandrs čaks
đorđe Petrović                           León Ávalos y Vez                           antoni józef śmieszek
Thích Quảng độ                           Nazan Öncel                                 bela Čikoš Sesija
Nazan Öncel                              Olgierd łukaszewicz                         đorđe Petrović
sezer öztürk                             Radoje đerić                                eduardo Úrculo
žarko Čabarkapa                          Thích Quảng độ                              Éric millot
saúl Canelo Álvarez                      Tróndur í gøtu                              finn øglænd
Þóra Magnúsdóttir                        Viktorija Čmilytė                           henryk Średnicki
aleksandrs čaks                          aleksandrs čaks                             İshak Alaton
eduardo Úrculo                           antoni józef śmieszek                       Jón Ásgeir Jóhannesson
Radoje đerić                             bela Čikoš Sesija                           León Ávalos y Vez
antoni józef śmieszek                    eduardo Úrculo                              łukasz załuska
øystein sunde                            finn øglænd                                 mạc đăng Dung
učimir                                   henryk Średnicki                            mia Čorak Slavenska
henryk Średnicki                         mia Čorak Slavenska                         Nazan Öncel
Éric millot                              mạc đăng Dung                               Olgierd łukaszewicz
Șerban Andrei                            saúl Canelo Álvarez                         Óscar Muñoz oviedo
mia Čorak Slavenska                      sezer öztürk                                øystein sunde
Olgierd łukaszewicz                      učimir                                      Radoje đerić
Viktorija Čmilytė                        václav černý                                saúl Canelo Álvarez
Jón Ásgeir Jóhannesson                   władysław ślebodziński                      Șerban Andrei
władysław ślebodziński                   Éric millot                                 sezer öztürk
Tróndur í gøtu                           Óscar Muñoz oviedo                          şükrü Gülesin
václav černý                             Ümit Özat                                   Thích Quảng độ
łukasz załuska                           Þóra Magnúsdóttir                           Tróndur í gøtu
Óscar Muñoz oviedo                       øystein sunde                               učimir
León Ávalos y Vez                        đorđe Petrović                              Ümit Özat
Ümit Özat                                İshak Alaton                                václav černý
finn øglænd                              łukasz załuska                              Viktorija Čmilytė
mạc đăng Dung                            şükrü Gülesin                               władysław ślebodziński
bela Čikoš Sesija                        žarko Čabarkapa                             žarko Čabarkapa
Abdi İpekçi                              Șerban Andrei                               Þóra Magnúsdóttir
</code></pre>
<ul>
<li>
<p dir="auto">Again, the <strong>unsorted</strong> list is displayed in the <strong>first</strong> column</p>
</li>
<li>
<p dir="auto">The N++ sorted list, in the <strong>second</strong> column, is, obviously, <strong>poorly</strong> sorted, as :</p>
<ul>
<li>
<p dir="auto">Words, beginning with a <strong>NON</strong>-accentuated <strong>uppercase</strong> letter, come <strong>first</strong></p>
</li>
<li>
<p dir="auto">Words, beginning with a <strong>NON</strong>-accentuated <strong>lowercase</strong> letter, come in <strong>second</strong></p>
</li>
<li>
<p dir="auto">Words, beginning with a <strong>accentuated</strong> letter, come <strong>last</strong>, according to the <strong>Unicode</strong> value of its <strong>first</strong> char.</p>
</li>
</ul>
</li>
<li>
<p dir="auto">The <strong>alternate</strong> list, in the <strong>third</strong> column, seems much more <strong>readable</strong>, as :</p>
<ul>
<li>
<p dir="auto">The <strong>case</strong> of letters is <strong>not</strong> taken in account, in <strong>first</strong> approach</p>
</li>
<li>
<p dir="auto">The <strong>accentuated</strong> letters are sorted, <strong>right after</strong> their corresponding <strong>NON-accentuated</strong> letter !</p>
</li>
</ul>
</li>
</ul>
<hr />
<p dir="auto">Of course, a lot of other <strong>parameters</strong>, about sorting, could be discussed and my <strong>previous</strong> discussion just shows the complexity of the <strong>sort</strong> problem ;-)) For instance, I did <strong>NOT</strong> study sorting, involving <strong>RTL</strong> languages ( such as <strong>Hebrew</strong> or <strong>Arabic</strong> languages ) and, also, <strong>Asian</strong> languages using <strong>ideographic</strong> characters !</p>
<p dir="auto">My <strong>word-around</strong> is based on a simple idea : to <strong>customize</strong> a sort process, I simply <strong>add</strong> a <strong>specific</strong> text, in front of <strong>any</strong> line of the text to be <strong>sorted</strong> !</p>
<p dir="auto">Here is my <strong>method</strong> :</p>
<ul>
<li>
<p dir="auto">First, with a <strong>first</strong> simple regex, I copy, with a simple regex, <strong>each</strong> line of text to be sorted, <strong>twice</strong>, separating these <strong>identical</strong> blocks with a <strong>specific</strong> character, <strong>not</strong> used, anywhere else</p>
</li>
<li>
<p dir="auto">Secondly, with a <strong>second</strong> regex S/R, I modify the <strong>first</strong> part of <strong>each</strong> line, located <strong>before</strong> the <strong>separator</strong> character</p>
</li>
<li>
<p dir="auto">Thirdly,  I perform a classical <strong>N++ sort</strong>, using the menu option <strong>Edit &gt; Line Operations &gt; Sort Lines Lexicographically Ascending</strong></p>
</li>
<li>
<p dir="auto">Finally, I get rid of that <strong>temporary sorting</strong> key, as well as the <strong>separator</strong>, for <strong>each</strong> line, with a <strong>third</strong> simple regex</p>
</li>
</ul>
<hr />
<p dir="auto">In my <strong>first</strong> attempts, I used the <strong><code>#</code></strong>  symbol as a <strong>separator</strong>. However, I <strong>quickly</strong> realized that this could break a <strong>correct</strong> sort. Indeed, just imagine the <strong>two consecutive</strong> lines, obtained at the end of <strong>phase <code>2</code></strong></p>
<pre><code class="language-diff">alvaro uribe velez#álvaro uribe Vélez
alvaro uribe#álvaro Uribe
</code></pre>
<p dir="auto">It’s obvious that, using <strong>N++</strong> sort, then <strong>deleting</strong> part till the <strong><code>#</code></strong> char, leave you with the <strong>NON sorted</strong> text :</p>
<pre><code class="language-diff">álvaro uribe Vélez
álvaro Uribe
</code></pre>
<p dir="auto">because the <strong>Unicode</strong> value of the <strong><code>#</code></strong> symbol ( <strong><code>0023</code></strong> ) is <strong>higher</strong> than the <strong><code>0020</code></strong> ( <strong>space</strong> char )</p>
<p dir="auto">Now, replacing the <strong><code>#</code></strong> separator with, for instance, the  control character ( code <strong><code>001f</code></strong> ) implies a <strong>correct</strong> sort, because the <strong><code>001F</code></strong> value comes <strong>before</strong> the <strong><code>0020</code></strong> value of the <strong>space</strong> character. Hence the two <strong>records</strong> are correctly displayed :</p>
<pre><code class="language-diff">álvaro Uribe
álvaro uribe Vélez
</code></pre>
<hr />
<p dir="auto">So, according to what I said before, in <strong>each</strong> part of line, laying <strong>before</strong> the separator, the regex, used in <strong>phase <code>2</code></strong>, will :</p>
<ul>
<li>
<p dir="auto">Keep the <strong><code>Space</code></strong> character as a <strong>significant</strong> character for sorting</p>
</li>
<li>
<p dir="auto">Get rid of any <strong>NON-word</strong> character ( <strong>NO-significant</strong> character )</p>
</li>
<li>
<p dir="auto">Change any <strong><code>NBSP</code></strong> character, of <strong>Unicode</strong> value <strong><code>00A0</code></strong>, with a common <strong><code>Space</code></strong> char</p>
</li>
<li>
<p dir="auto">Change any <strong>Latin</strong> letter, <strong>accentuated</strong> or <strong>NOT</strong>, <strong>whatever</strong> its case, by its <strong>corresponding lowercase</strong> letter</p>
</li>
</ul>
<p dir="auto">The <strong>fourth</strong> point can be reached, with a <strong>special</strong> regex feature, called an <strong>equivalence class</strong> of characters, with the syntax <strong><code>&lsqb;&lsqb;=</code>X<code>=&rsqb;&rsqb;</code></strong>, where <strong><code>X</code></strong> is <strong>any single word</strong> character !</p>
<p dir="auto">For instance :</p>
<ul>
<li>The regex <strong><code>&lsqb;&lsqb;=O=&rsqb;&rsqb;</code></strong> would match <strong>any</strong> single character of the list :</li>
</ul>
<p dir="auto"><strong><code>OoºÒÓÔÕÖØòóôõöøŌōŎŏŐőƟƠơƢƣǑǒǪǫǬǭǾǿȌȍȎȏɵɶɷṌṍṎṏṐṑṒṓỌọỎỏỐốỒồỔổỖỗỘộỚớỜờỞởỠỡỢợℴ⒪ⓄⓞＯｏ</code></strong></p>
<ul>
<li>
<p dir="auto">The regex <strong><code>&lsqb;&lsqb;=X=&rsqb;&rsqb;&lsqb;&lsqb;=O=&rsqb;&rsqb;</code></strong> would match any <strong>two</strong> char. string, as <strong>ẌƟ</strong>,  <strong>Ⓧṑ</strong> or <strong>ｘƣ</strong></p>
</li>
<li>
<p dir="auto">The regex <strong><code>&lsqb;&lsqb;=X=][=O=&rsqb;&rsqb;</code></strong> would match <strong>any single</strong> character of the list :</p>
</li>
</ul>
<p dir="auto"><strong><code>OoºÒÓÔÕÖØòóôõöøŌōŎŏŐőƟƠơƢƣǑǒǪǫǬǭǾǿȌȍȎȏɵɶɷṌṍṎṏṐṑṒṓỌọỎỏỐốỒồỔổỖỗỘộỚớỜờỞởỠỡỢợℴ⒪ⓄⓞＯｏXxẊẋẌẍ⒳ⓍⓧＸｘ</code></strong></p>
<ul>
<li>The regex <strong><code>&lsqb;&lsqb;=X=]0-9]</code></strong> would match <strong>any single</strong> character of the list :</li>
</ul>
<p dir="auto"><strong><code>0123456789XxẊẋẌẍ⒳ⓍⓧＸｘ</code></strong></p>
<ul>
<li>And the regex <strong><code>&lsqb;&lsqb;=X=&rsqb;&rsqb;-[0-9]-&lsqb;&lsqb;=O=&rsqb;&rsqb;</code></strong> would match any <strong>three</strong> char. strings as <strong>Ẋ-2-Ⓧ</strong>, <strong>Ẍ-4-⒳</strong> or <strong>ｘ-0-Ẋ</strong></li>
</ul>
<p dir="auto"><strong>Remark</strong> :</p>
<p dir="auto">I intend, <strong>shortly</strong>, to create an <strong>other</strong> post, with the tables of <strong>all</strong> characters, involved in the <strong><code>&lsqb;&lsqb;=</code>X<code>=&rsqb;&rsqb;</code></strong> feature ! Just be <strong>patient</strong> !</p>
<hr />
<p dir="auto">So, given, for instance, the <strong>seven</strong> family and given <strong>names</strong>, below :</p>
<pre><code class="language-diff">đorđe Petrović
žarko Čabarkapa
øystein sunde
Șerban Andrei
Tróndur í gøtu
łukasz załuska
Óscar Muñoz oviedo
</code></pre>
<p dir="auto">We’ll obtain, at the <strong>end</strong> of <strong>phase <code>2</code></strong>, after performing the regex, containing <strong><code>&lsqb;&lsqb;=</code>X<code>=&rsqb;&rsqb;</code></strong> blocks :</p>
<pre><code class="language-diff">dorde petrovicđorđe Petrović
zarko Cabarkapažarko Čabarkapa
oystein sundeøystein sunde
serban andreiȘerban Andrei
trondur i gotuTróndur í gøtu
lukasz zaluskałukasz załuska
oscar munoz oviedoÓscar Muñoz oviedo
</code></pre>
<p dir="auto">As you can see, in the <strong>first</strong> part, before the <strong>US</strong> separator ( unfortunately <strong>not</strong> displayed ), all the <strong>Latin accentuated</strong> characters are changed into their based <strong>equivalent lowercase</strong> letter, in the <strong><code>[a-z]</code></strong> range</p>
<p dir="auto">Now, it’s <strong>easy</strong> to understand <strong>why</strong>, after the N++ <strong>Unicode</strong> sort, we get the <strong>well sorted</strong> text, below ;-))</p>
<pre><code class="language-diff">dorde petrovicđorđe Petrović                                                     đorđe Petrović
lukasz zaluskałukasz załuska                                                     łukasz załuska
oscar munoz oviedoÓscar Muñoz oviedo                                             Óscar Muñoz oviedo
oystein sundeøystein sunde                    giving finally the lines  :        øystein sunde
serban andreiȘerban Andrei                                                       Șerban Andrei
trondur i gotuTróndur í gøtu                                                     Tróndur í gøtu
zarko Cabarkapažarko Čabarkapa                                                   žarko Čabarkapa
</code></pre>
<p dir="auto">Note that if <strong>two primary</strong> keys are the <strong>same</strong> as, for instance, in :</p>
<pre><code class="language-diff">guy Thevenotguy THEVENOT
guy ThevenotGuy Thevenot
</code></pre>
<p dir="auto">it would give, after the <strong>Unicode</strong> N++ sort :</p>
<pre><code class="language-diff">guy ThevenotGuy Thevenot
guy Thevenotguy THEVENOT
</code></pre>
<p dir="auto">As letter <strong><code>G</code></strong> of <strong>Guy</strong> comes <strong>before</strong> letter <strong><code>g</code></strong> of  <strong>guy</strong> !</p>
<p dir="auto">See <strong>next</strong> post, as any post has a <strong>maximum</strong> size of <strong><code>16Ko</code></strong>  ;-))</p>
<p dir="auto">guy038</p>
]]></description><link>https://community.notepad-plus-plus.org/post/32018</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/32018</guid><dc:creator><![CDATA[guy038]]></dc:creator><pubDate>Tue, 08 Nov 2022 21:49:35 GMT</pubDate></item><item><title><![CDATA[Reply to When case insensitive sorting will be added in Notepad++ x64 (without TextFX)? on Fri, 20 Apr 2018 09:50:26 GMT]]></title><description><![CDATA[<p dir="auto">Before posting have you tested the ‘Column sorting’ plugin ?</p>
]]></description><link>https://community.notepad-plus-plus.org/post/31765</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/31765</guid><dc:creator><![CDATA[Gogo Neatza]]></dc:creator><pubDate>Fri, 20 Apr 2018 09:50:26 GMT</pubDate></item></channel></rss>