RegEx for duplicate words (I'm not familiar withRegEx)



  • Hello,

    could some tell me how to find and delete duplicate words with RegEx?

    I have a file with several thousand words. The words are grouped, that means the heading in one line. The following group members are indented with a tab - one indented word per line. Now I have a list of words that I want to delete from this grouped file. I copied this list into the file with the grouped words. That means, at the end of the document are several words that are not indented with a tab - one word per line.

    I need to delete the duplicate words (both words). Only exacte matches need to be deleted.
    I could search for each word by hand, but this would take me days. I am not familiar with regex, but I know that there should be a simple solution for this.

    Is there someone who could help me? Please let me know, if you need further information.



  • @nafri-braut

    don’t copy the list of words into the document.
    Copy it into the find what field like this

    \b(word1|word3|word5|word7|word8...)\b
    

    leave replace empty and check regular expression.

    Cheers
    Claudia



  • Thank you!


Log in to reply