sort file removing duplicates possible?
-
ok thanks, first thing tomorrow with the morning coffee
:-D -
Hi, @patrickdrd, and All,
Just for info, doing my tests again ( Last modified: 02 Jun 2018 17:23 UTC ) with N++ sort, followed by the regex S/R, I obtained :
- Original file : 69931 lines
- With regex A ( sensitive ) : 69852 lines ( so, 65 lines deleted, in 56 matches )
- With regex B ( insensitive ) : 69817 lines ( so, 100 lines deleted, in 88 matches )
- With TextFX sort, at column 1, with the option Sort outputs only UNIQUE (at column) lines, it produced a 69285 lines file ( so, 646 lines deleted )
Now, @patrickdrd, I don’t want to confuse you with my explanations ! I just wanted to point out that, depending on the method and software used to remove duplicate lines, you should expect different results ! My solution, which first uses N++ sort and then a regex S/R, may not give correct results, because it only compares consecutive lines of the previously sorted file !
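To illustrate why comparing only consecutive lines depends on the sort that came before, here is a minimal Python sketch. It assumes, purely for illustration, a case-sensitive sort followed by a case-insensitive comparison of neighbours; it is not the regex S/R mentioned above, and `dedupe_consecutive` is a hypothetical helper name.

```python
from itertools import groupby


def dedupe_consecutive(lines, key=None):
    """Keep the first line of every run of consecutive lines that compare equal under `key`."""
    return [next(group) for _, group in groupby(lines, key=key)]


lines = ["apple", "Banana", "Apple", "banana", "apple"]

# Assumption: the sort is case-sensitive (plain code point order),
# but the neighbours are then compared case-insensitively.
by_code_point = sorted(lines)
print(dedupe_consecutive(by_code_point, key=str.lower))
# ['Apple', 'Banana', 'apple', 'banana']  -> "Apple"/"apple" are not neighbours, so both survive

# Sorting with the same key as the comparison puts all duplicates side by side.
by_lower = sorted(lines, key=str.lower)
print(dedupe_consecutive(by_lower, key=str.lower))
# ['apple', 'Banana']
```

In this sketch the two results differ only because the sort order and the comparison disagree about case, which is one possible reason why different tools report different line counts.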
And, probably, the TextFX plugin ( whose sort logic I don’t know ! ), other software, and the latest sorter.py script, from @claudia-frank, will produce better results :-))
Finally, I think that a correct script to delete duplicate lines should not rely on any sort and should just compare two individual lines at a time ! In other words, every line should simply be compared, in turn, with each other line of the file !
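The idea of not relying on any sort can be sketched in plain Python as follows. Note the swap: instead of literally comparing every line with every other line, this sketch remembers the lines already seen in a set, which gives the same outcome (only the first occurrence of each line is kept, in its original position) with far fewer comparisons. `unique_lines` and `ignore_case` are hypothetical names, and this is not the sorter.py script.

```python
def unique_lines(lines, ignore_case=False):
    """Return the lines with later duplicates removed, keeping the original order."""
    seen = set()
    result = []
    for line in lines:
        # The comparison key decides what counts as "the same line".
        key = line.lower() if ignore_case else line
        if key not in seen:
            seen.add(key)
            result.append(line)
    return result


sample = ["b", "a", "B", "a", "c"]
print(unique_lines(sample))                    # ['b', 'a', 'B', 'c']
print(unique_lines(sample, ignore_case=True))  # ['b', 'a', 'c']
```

Because no sort is involved, the result does not depend on how any particular tool orders upper-case and lower-case letters.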
Cheers,
guy038
P.S. :
BTW, I confirm that the TextFX sort tool, like N++ sort, seems stable : after running it 5 times, the output files produced are totally identical :-) -
-
@Claudia-Frank still doesn’t work:
  File "<console>", line 1, in <module>
  File "D:\Utilities\PortableApps\Notepad++\plugins\PythonScript\lib\tcl\lib-tk\Tkinter.py", line 39, in <module>
    import _tkinter # If this fails your Python may not be configured for Tk
ImportError: No module named _tkinter -
can you please run the following code in the python script console and check the output?
import os; print '\n'.join(os.listdir(r'plugins\PythonScript\lib\tcl'))
If you have something similar to mine
dde1.3
lib-tk <--- this is important
reg1.2
tcl
tcl8
tcl8.5
tcl85x.dll <--- this is important
tcl85x.lib
tclConfig.sh
tclpip85x.dll <--- this is important
tclstub85.lib
tk8.5
tk85x.dll <--- this is important
tk85x.lib
tkstub85.lib
_tkinter.pyd <--- this is important
you have to add another path to sys.path so that you have the following in your startup.py
import sys
sys.path.append(r'D:\Utilities\PortableApps\Notepad++\plugins\PythonScript\lib\tcl')
sys.path.append(r'D:\Utilities\PortableApps\Notepad++\plugins\PythonScript\lib\tcl\lib-tk')
After restarting Notepad++, PS should be able to use Tkinter to build windows.
If you don’t have the dlls etc… or it is still not working, can you please post the output from the os.listdir… ?
Cheers
Claudia -
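The manual comparison against the listing above could also be done by a small script. The following is only a sketch: `REQUIRED` simply repeats the entries marked "this is important" in that listing, `missing_entries` is a hypothetical helper name, and the relative path is the one used in the os.listdir call above, so it assumes the current directory is the Notepad++ folder.

```python
import os

# Entries marked "this is important" in the listing above.
REQUIRED = ["lib-tk", "tcl85x.dll", "tclpip85x.dll", "tk85x.dll", "_tkinter.pyd"]


def missing_entries(tcl_dir, required=REQUIRED):
    """Return the required entries that are not present in tcl_dir."""
    if not os.path.isdir(tcl_dir):
        # Nothing can be present if the folder itself does not exist.
        return list(required)
    # Compare case-insensitively, since Windows file names are not case-sensitive.
    present = {name.lower() for name in os.listdir(tcl_dir)}
    return [name for name in required if name.lower() not in present]


print(missing_entries(r'plugins\PythonScript\lib\tcl'))
```

An empty list would mean that every entry marked as important is in place; anything else names the files that still have to be copied.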
still not
dde1.3
lib-tk
reg1.2
tcl
tcl8
tcl8.5
tclConfig.sh
tk8.5
_tkinter.pyd -
import worked but sorter not:
Traceback (most recent call last):
  File "D:\Utilities\PortableApps\Notepad++\plugins\PythonScript\scripts\Sorter.py", line 110, in <module>
    SorterWindow()
  File "D:\Utilities\PortableApps\Notepad++\plugins\PythonScript\scripts\Sorter.py", line 76, in __init__
    self.window = tk.Tk()
  File "D:\Utilities\PortableApps\Notepad++\plugins\PythonScript\lib\tcl\lib-tk\Tkinter.py", line 1819, in __init__
    self.tk = _tkinter.create(screenName, baseName, className, interactive, wantobjects, useTk, sync, use)
_tkinter.TclError: Can't find a usable init.tcl in the following directories:
D:/Utilities/PortableApps/Notepad++/plugins/PythonScript/lib/lib/tcl8.5 D:/Utilities/PortableApps/lib/tcl8.5 D:/Utilities/lib/tcl8.5 D:/Utilities/PortableApps/library D:/Utilities/library D:/Utilities/tcl8.5.2/library D:/tcl8.5.2/library
This probably means that Tcl wasn't installed properly.
what’s that double lib in the first path?
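To see which folders from an error like the one above really contain an init.tcl, a short check can help. This is only a diagnostic sketch: `find_init_tcl` is a hypothetical helper name, the first entries of `candidates` are copied from the traceback, and the last entry is an assumption based on the tcl8.5 folder shown in the earlier listing. It does not fix the lookup itself.

```python
import os


def find_init_tcl(candidate_dirs):
    """Return the candidate directories that actually contain an init.tcl file."""
    return [d for d in candidate_dirs
            if os.path.isfile(os.path.join(d, "init.tcl"))]


candidates = [
    # Directories reported in the TclError message.
    "D:/Utilities/PortableApps/Notepad++/plugins/PythonScript/lib/lib/tcl8.5",
    "D:/Utilities/PortableApps/lib/tcl8.5",
    "D:/Utilities/lib/tcl8.5",
    # Assumption: the tcl8.5 folder from the os.listdir output may hold init.tcl.
    "D:/Utilities/PortableApps/Notepad++/plugins/PythonScript/lib/tcl/tcl8.5",
]
print(find_init_tcl(candidates))
```

If only the last folder is reported, the file exists but sits outside the directories that Tcl searched.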
-
maybe we should quit?
I’ll be using TextFX’s (insensitive) sort; tcl/tkinter seems so unbelievably awkward to set up!
-
Hi Patrick,
maybe we should quit?
For the time being, you are right.
The tcl/tk packages, as well as Python Script internally, have an issue which needs to be solved. I can’t say when, but it should be solved in the near future.
If you want, I will keep you updated in this thread once I have successfully tested a fixed version of the tcl/tk packages.
Then, hopefully, it should be just a matter of unzipping the packages to the right folder and setting the appropriate paths.
Cheers
Claudia -
yes, of course, I’m interested,
maybe you should zip your own folder (and files) regarding tcl and upload it somewhere for me to download? because I saw there are inconsistencies between different tcl zip downloads
-
I’m running a modified 64-bit version (one of the reasons I didn’t notice that a clean install has issues) which uses modified versions of the packages, so I assume this isn’t really a solution and might just be another source of confusion.
I’m in contact with @chcg - let’s see what we can do to get this fixed soon.
Cheers
Claudia -
Just sayin’ : The fact that the script uses Tk caused me no problems – works great! Thanks @Claudia-Frank !!
-
^([A-Z])([a-z]+)\R(?i:\1)\2$