phonetic transformatios
-
hello, i want to do search for similar sounding words, so i have to do word transformations like in the soundex algorithm
-
Cool.
How is it you think that Notepad++ can help you with this task? -
ok , thread is opened.
now what i first wanted to post, but was rejected as spam.find phonetic similar words using methods like soundex or beider-morse phonetic matching (the headline)
hello , me as a hobbyist would like to find similar sounding words , to maybe directly see a deeper meaning or basic meaning of the words . often i am so blocked by the first learned meaning that i dont get obvious soundsimiliraties . thresh trash . me milk . hound hunt .
some words change in time and get another spelling . so clever people came up with soundmatching algorithms like “soundex” https://en.wikipedia.org/wiki/Soundex or beider-morse . there is a “apache commons lang” addition to the java-language offering both algorithms too .
but i am not into java-programming (nor any either) and maybe find it easier to do some transformations on thesasauri wordlists in regex (i know a bit) . would that be possible ? words into added integer strings , also in different speech-languages , change order , and list in alphanumeric way ?
are other ways , plugins , in npp possible ?
i already have some wordlists thesasauri, from european union translation service , aspell files , open thesasaurus , and dictionary files .
edit: oooh sry headline written wrong
-
@carypt said in phonetic transformatios:
but i am not into java-programming (nor any either) and maybe find it easier to do some transformations on thesasauri wordlists in regex (i know a bit) . would that be possible ? words into added integer strings , also in different speech-languages , change order , and list in alphanumeric way ?
Off hand, I would say no, regular expressions will not help you. The problem is that English, and many languages, are not “regular” enough that you can parse written words into spoken words. The rest of what you are seeking seems far outside the scope of this forum which is for supporting Notepad++ and the use of regular expression while doing search/replace of text within Notepad++.
-
there might be dictionary list with additional phonetic (IPA) representation . so working on the phonetics would be better.
so the concept of soundex and other phonetic algorithms is to transform phonem-types (like fricatives) into a representating number, a number string is generated which flattens out many tiny differences and makes words basically constructed.