Hello, @patrickdrd,
In a previous post, you said :
anyway, I’m posting one of my such files to take a look yourselves if and whoever has some time, thanks!
https://pastebin.com/raw/rFi6heCt
So, I downloaded your file and, after studying it for two days, I’m happy to email you back both the original file and the new one, with my formatting :-))
I managed to transform all your data into 9 tables, described below :
•----------•---------•---------•
| Fields | Width | Lines |
•----------•---------•---------•
| 10 | 9,849 | 466 |
| 4 | 146 | 43 |
| 19 | 1,565 | 216 |
| 2 | 154 | 15 |
| 12 | 9,008 | 169 |
| 12 | 5,992 | 89 |
| 9 | 8,346 | 21 |
| 9 | 226 | 39 |
| 7 | 381 | 487 |
•----------•---------•---------•
REMARK : Just one record is still odd. You can get this record by searching the regex \x{203D}{3}
I also renamed this OUTPUT text file as a Lua file, in order to get general folding, which does not interfere with your file contents !
Finally, I verified that the number of chars ( 339,792 ) is identical between your original file Patrick_BEFORE.txt and the transformed file Patrick_AFTER.lua, with the regex (?![,| \[\]•\x{203D}\r\n-])[\x{0000}-\x{D7FF}\x{E000}-\x{FFFF}], which matches every character of the Unicode BMP ( surrogates excluded ), except for the 10 characters below :
, | \x20 [ ] • ‽ \r \n -
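If you prefer to repeat this check outside Notepad++, here is a minimal Python sketch of the same idea. It is only my translation of the regex above into Python's `re` syntax, and the names `count_significant` and `same_payload` are hypothetical helpers, invented for this illustration. It counts every BMP character that is not one of the 10 formatting characters, so the count should be the same before and after the reformatting :

```python
import re

# Python translation of the Notepad++ regex quoted above (an assumption:
# Python's `re` engine is used here, not Notepad++'s Boost engine).
# The look-ahead rejects the 10 formatting characters; the class then
# accepts any other BMP code point, surrogates excluded.
SIGNIFICANT = re.compile(
    r"(?![,| \[\]\u2022\u203D\r\n-])[\u0000-\uD7FF\uE000-\uFFFF]"
)


def count_significant(text: str) -> int:
    """Count the characters that are NOT table-formatting characters."""
    return len(SIGNIFICANT.findall(text))


def same_payload(before: str, after: str) -> bool:
    """True if both texts hold the same number of significant characters."""
    return count_significant(before) == count_significant(after)


if __name__ == "__main__":
    # Tiny made-up sample, not taken from the real files
    before = "a,b\r\nc d"
    after = "| a | b |\r\n| c | d |"
    print(count_significant(before), count_significant(after))
    print(same_payload(before, after))
```

To run it on the real files, read Patrick_BEFORE.txt and Patrick_AFTER.lua with `open(..., encoding="utf-8", newline="")` ( the encoding is my guess ) and pass the two strings to `same_payload`. Note that an equal count only shows that no significant character was lost or added, not that the characters are in the same order.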
I do hope that my work will be useful to you, in some way !
Best Regards,
guy038