Community
    • Login

    How to easily manage IPA symbols...

    Scheduled Pinned Locked Moved General Discussion
    7 Posts 3 Posters 1.9k Views 1 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • guy038G Offline
      guy038
      last edited by guy038

      Hello, All,

      I suppose that very few people will be interested in the International Phonetic Alphabet ( known as IPA ) ! However, in this topic :

      • I’ll show some basic elements about the IPA writing system.

      • I’ll propose a monospaced font, from Clement Pit-Claudel, that supports all the IPA symbols and plenty of other symbols, as well as the Latin, Greek and Cyrillic characters.

      • I’ll describe a general regex to easily find out any IPA symbol in text files.

      NOTE : The GitHub site of Clement Pit-Claudel is https://github.com/cpitclaudel/monospacifier

      IMPORTANT : In order for that font to be displayed correctly as monospaced, you’ll need to use the GDI (most compatible) option of Direct Write , only !


      • For general information, refer to https://en.wikipedia.org/wiki/International_Phonetic_Alphabet

      • Regarding English/American IPA, see https://en.wikipedia.org/wiki/Help:IPA/English

      • Regarding French IPA, see https://fr.wikipedia.org/wiki/Aide:Alphabet_phonétique_français

      In, order to know the EN/US phonetic transcription of a word, I used :

      https://dictionary.cambridge.org/browse/pronunciation/english/

      In order to know the EN/US phonemic transcription of a word or phrase, I used :

      https://tophonetics.com/

      Finally if you need to type phonetic symbols, you may use these two sites :

      https://www.internationalphoneticassociation.org/IPAcharts/inter_chart_2018/IPA_2018.html

      https://ipa.typeit.org/full/

      As usual, these references are just a simple selection and many other sites may give interesting info, too !


      Unicode contains 3 blocks, devoted to the I.P.A., which are :

      • The IPA Extensions block, containing 96 characters, between U+0250 and U+02AF

      • The Phonetic Extensions block, containing 128 characters, between U+1D00 and U+1D7F

      • The Phonetic Extensions Supplement, containing 64 characters between U+1D80 and U+1DBF

      Unfortunately, many other characters, used by the IPA writing system, are dispersed across several UNICODE character blocks !

      The other blocks involved, in some way, in IPA, are :

      - Basic Latin
      - Latin-1 Supplement
      - Latin Extended-A
      - Latin Extended-B
      - Spacing Modifier Letters
      - Combining Diacritical Marks
      - Greek and Coptic
      - Combining Diacritical Marks Supplement
      - General Punctuation
      - Superscripts and Subscripts
      - Arrows
      - Latin Extended-C
      

      Thus, it rather difficult to find out a monospaced font able to display all these IPA symbols. I succeeded to find the serif font Symbola_Monospacified_for_Liberation_Mono.ttf, containing 9,622 characters and 9,827 glyphs :

      https://github.com/cpitclaudel/monospacifier/blob/master/fonts/Symbola_monospacified_for_LiberationMono.ttf?raw=true

      However, this font can be used, either, to display any common character or symbol !!

      Once this font downloaded, on your desktop :

      • Select and copy this font ( CTRL + C )

      • Move to the folder C:\Windows\fonts

      • Paste this font ( Ctrl + V )

      • Open Notepad++

      • Select Settings > Style Configurator... > Global Styles > Default Style

      • Choose the font Symbola Monospacified for...

      • Click on the Save & Close button

      REMARK : Note that my Windows 10 system automatically redirected this font to the folder C:\Users\Guy\AppData\Local\Microsoft\Windows\Fonts ( and not within C:\Windows\fonts )


      As explained, in this section :

      https://en.wikipedia.org/wiki/International_Phonetic_Alphabet#Brackets_and_transcription_delimiters

      The general regex, proposed below, searches any range of IPA symbols between, either :

      • Square Brackets, [], representing the phonetic notation, primary function of the IPA

      • Slashes, //, representing the phonemic notation, with distinctive features only

      However, as a lot of uncommon IPA symbols may happen in a phonetic transcription, I decided to include, in my regex, the complete IPA blocks, so a total of 288 characters, for the 3 blocks noted above.

      Note also that the delimiters are surrounded, themselves, by a \s character ( so any kind of horizontal or vertical space char ). In other words, this regex should avoid any normal text and just matches IPA transcriptions !


      The general regex, to find any range of IPA symbols, between the [] delimiters OR the // delimiters, is :

      (?-i)(?<=(?<=\s)(\[)|(?<=\s)/)[\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}\x{00E6}\x{00E7}\x{00F0}\x{00F8}\x{0127}\x{014B}\x{0153}\x{01C0}\x{01C1}\x{01C2}\x{01C3}\x{0250}-\x{02AF}\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}\x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}\x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}\x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}\x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}\x{03B2}\x{03B8}\x{03C7}\x{1D00}-\x{1D7F}\x{1D80}-\x{1DBF}\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}\x{2016}\x{203C}\x{203F}\x{207F}\x{2191}\x{2193}\x{2197}\x{2198}\x{2C71}]+(?=(?(1)\](?=\s)|/(?=\s)))

      And may also be expressed as below, to distinguish the different chars or blocks :

      (?x)
      (?-i)(?<=(?<=\s)(\[)|(?<=\s)/)
      (?:
      [\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}]                       | # Basic Latin                            29
      [\x{00E6}\x{00E7}\x{00F0}\x{00F8}]                                | # Latin-1 Supplement                      4
      [\x{0127}\x{014B}\x{0153}]                                        | # Latin Extended-A                        3
      [\x{01C0}\x{01C1}\x{01C2}\x{01C3}]                                | # Latin Extended-B                        4
      [\x{0250}-\x{02AF}]                                               | # IPA Extensions                         96
      [\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}
       \x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}]       | # Spacing Modifier Letters               19
      [\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}
       \x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}
       \x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}
       \x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}]                | # Combining Diacritical Marks            39
      [\x{03B2}\x{03B8}\x{03C7}]                                        | # Greek and Coptic                        3
      [\x{1D00}-\x{1D7F}]                                               | # Phonetic Extensions                   128
      [\x{1D80}-\x{1DBF}]                                               | # Phonetic Extensions Supplement         64
      [\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}]                | # Combining Diacritical Marks Supplement  6
      [\x{2016}\x{203C}\x{203F}]                                        | # General Punctuation                     3
       \x{207F}                                                         | # Superscripts and Subscripts             1
      [\x{2191}\x{2193}\x{2197}\x{2198}]                                | # Arrows                                  4
       \x{2C71}                                                           # Latin Extended-C                        1
      )+
      (?=(?(1)\](?=\s)|/(?=\s)))
      

      Some examples of the IPA transcriptions of English/American words, from the two sites I used :

      IPA  /ˌækəˈdɛmɪk/                                 for English / American word academic
      IPA  [ˌæk.əˈdem.]                                 for English / American word academic
      
      
      IPA  /əˈbaʊt/                                     for English / American word about
      IPA  [əˈbaʊt]                                     for English / American word about
      
      
      IPA  /ˈfæmɪʃ/             /ˈfeɪmɪʃ/               for English / American word famish
      IPA  [ˈfæm.ɪʃ]                                    for English / American word famish
      
      
      IPA  /ˌriːˈsaɪkᵊl/        /ˌriˈsaɪkᵊl/            for English / American word recycle
      IPA  [ˌriːˈsaɪ.kᵊl]                               for English / American word recycle
      
      
      IPA  /ˌdʒɛnəˈræləti/                              for English / American word generality
      IPA  [ˌdʒen.əˈræl.ə.ti]   [ˌdʒen.əˈræl.ə.t̬i]      for English / American word generality
      
      
      IPA  /ˌɪntəˈnæʃᵊnᵊl/      /ˌɪntərˈnæʃənəl/        for English / American word international
      IPA  [ˌɪn.təˈnæʃ.ᵊn.ᵊl]   [ˌɪnt̬ɚˈnæʃ.ᵊn.ᵊl]       for English / American word international
      
      
      IPA  /ˈmʌðə/              /ˈmʌðər/                for English / American word mother
      IPA  [ˈmʌð.ər]            [ˈmʌð.ɚ]                for English / American word mother
      
      
      IPA  /ədˈvɜːtɪsmənt/      /ˌædvərˈtaizmənt/       for English / American word advertisement
      IPA  [ədˈvɜː.tɪs.mənt]    [ˌæd.ˈvɚˈtaiz.mənt]     for English / American word advertisement
      

      If you run the general IPA regex against this post, you should get 25 matches. Try the Mark feature !


      In the next post, I’ll list all the Unicode blocks correctly displayed by the Symbola_Monospacified_for_Liberation_Mono.ttf font

      Best Regards,

      guy038

      mkupperM 1 Reply Last reply Reply Quote 2
      • guy038G Offline
        guy038
        last edited by

        Hi, all,

        Here is the list of all the UNICODE blocks covered by the Symbola_Monospacified_for_Liberation_Mono.ttf font :

            •-------------------------------------------------------------•---------•----------•
            |             Unicode 16.0 Blocks           |      Range      |   Chars |   Total  |
            •-------------------------------------------•-----------------•---------•----------•
            |  Basic Latin                              |   0000 -  007F  |     97  |     128  |
            |  Latin-1 Supplement                       |   0080 -  00FF  |     96  |     128  |
            |  Latin Extended-A                         |   0100 -  017F  |    128  |     128  |
            |  Latin Extended-B                         |   0180 -  024F  |    208  |     208  |
            |  IPA Extensions                           |   0250 -  02AF  |     96  |      96  |
            |  Spacing Modifier Letters                 |   02B0 -  02FF  |     80  |      80  |
            |  Combining Diacritical Marks              |   0300 -  036F  |    112  |     112  |
            |  Greek and Coptic                         |   0370 -  03FF  |    135  |     135  |
            |  Cyrillic                                 |   0400 -  04FF  |    256  |     256  |
            |  Cyrillic Supplement                      |   0500 -  052F  |     48  |      48  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Arabic                                   |   0600 -  06FF  |     10  |     255  |  Imcomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Combining Diacritical Marks Extended     |   1AB0 -  1AFF  |     15  |      15  |
            |  Cyrillic Extended-C                      |   1C80 -  1C8F  |      9  |       9  |
            |  Phonetic Extensions                      |   1D00 -  1D7F  |    128  |     128  |
            |  Phonetic Extensions Supplement           |   1D80 -  1DBF  |     64  |      64  |
            |  Combining Diacritical Marks Supplement   |   1DC0 -  1DFF  |     63  |      63  |
            |  Latin Extended Additional                |   1E00 -  1EFF  |    256  |     256  |
            |  Greek Extended                           |   1F00 -  1FFF  |    233  |     233  |
            |  General Punctuation                      |   2000 -  206F  |    111  |     111  |
            |  Superscripts and Subscripts              |   2070 -  209F  |     42  |      42  |
            |  Currency Symbols                         |   20A0 -  20CF  |     32  |      32  |
            |  Combining Diacritical Marks for Symbols  |   20D0 -  20FF  |     33  |      33  |
            |  Letterlike Symbols                       |   2100 -  214F  |     80  |      80  |
            |  Number Forms                             |   2150 -  218F  |     60  |      60  |
            |  Arrows                                   |   2190 -  21FF  |    112  |     112  |
            |  Mathematical Operators                   |   2200 -  22FF  |    256  |     256  |
            |  Miscellaneous Technical                  |   2300 -  23FF  |    256  |     256  |
            |  Control Pictures                         |   2400 -  243F  |     39  |      39  |
            |  Optical Character Recognition            |   2440 -  245F  |     11  |      11  |
            |  Enclosed Alphanumerics                   |   2460 -  24FF  |    160  |     160  |
            |  Box Drawing                              |   2500 -  257F  |    128  |     128  |
            |  Block Elements                           |   2580 -  259F  |     32  |      32  |
            |  Geometric Shapes                         |   25A0 -  25FF  |     96  |      96  |
            |  Miscellaneous Symbols                    |   2600 -  26FF  |    256  |     256  |
            |  Dingbats                                 |   2700 -  27BF  |    192  |     192  |
            |  Miscellaneous Mathematical Symbols-A     |   27C0 -  27EF  |     48  |      48  |
            |  Supplemental Arrows-A                    |   27F0 -  27FF  |     16  |      16  |
            |  Braille Patterns                         |   2800 -  28FF  |    256  |     256  |
            |  Supplemental Arrows-B                    |   2900 -  297F  |    128  |     128  |
            |  Miscellaneous Mathematical Symbols-B     |   2980 -  29FF  |    128  |     128  |
            |  Supplemental Mathematical Operators      |   2A00 -  2AFF  |    256  |     256  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Miscellaneous Symbols and Arrows         |   2B00 -  2BFF  |    207  |     250  |  Imcomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Latin Extended-C                         |   2C60 -  2C7F  |     32  |      32  |
            |  Coptic                                   |   2C80 -  2CFF  |    123  |     123  |
            |  Cyrillic Extended-A                      |   2DE0 -  2DFF  |     32  |      32  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Supplemental Punctuation                 |   2E00 -  2E7F  |     74  |      79  |  Incomplete
            |  Hiragana                                 |   3040 -  309F  |      1  |      93  |  Incomplete
            |  Enclosed CJK Letters and Months          |   3200 -  32FF  |      2  |     254  |  Incomplete
            |  CJK Unified Ideographs                   |   4E00 -  9FFF  |      1  |  20,976  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Yijing Hexagram Symbols                  |   4DC0 -  4DFF  |     64  |      64  |
            |  Cyrillic Extended-B                      |   A640 -  A69F  |     96  |      96  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Latin Extended-D                         |   A720 -  A7FF  |    160  |     163  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Latin Extended-E                         |   AB30 -  AB6F  |     54  |      54  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Alphabetic Presentation Forms            |   FB00 -  FB4F  |      7  |      58  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Variation Selectors                      |   FE00 -  FE0F  |     16  |      16  |
            |  Combining Half Marks                     |   FE20 -  FE2F  |     16  |      16  |
            |  Specials                                 |   FFF0 -  FFFF  |      5  |       5  |
            •-------------------------------------------•-----------------•---------•----------•
            |  Aegean Numbers                           |  10100 - 1013F  |     57  |      57  |
            |  Ancient Greek Numbers                    |  10140 - 1018F  |     79  |      79  |
            |  Ancient Symbols                          |  10190 - 101CF  |     13  |      13  |
            |  Phaistos Disc                            |  101D0 - 101FF  |     46  |      46  |
            |  Coptic Epact Numbers                     |  102E0 - 102FF  |     28  |      28  |
            |  Byzantine Musical Symbols                |  1D000 - 1D0FF  |    246  |     246  |
            |  Musical Symbols                          |  1D100 - 1D1FF  |    231  |     231  |
            |  Ancient Greek Musical Notation           |  1D200 - 1D24F  |     70  |      70  |
            |  Tai Xuan Jing Symbols                    |  1D300 - 1D35F  |     87  |      87  |
            |  Counting Rod Numerals                    |  1D360 - 1D37F  |     18  |      25  |
            |  Mathematical Alphanumeric Symbols        |  1D400 - 1D7FF  |    996  |     996  |
            |  Mahjong Tiles                            |  1F000 - 1F02F  |     44  |      44  |
            |  Domino Tiles                             |  1F030 - 1F09F  |    100  |     100  |
            |  Playing Cards                            |  1F0A0 - 1F0FF  |     82  |      82  |
            |  Enclosed Alphanumeric Supplement         |  1F100 - 1F1FF  |    191  |     192  |
            |  Enclosed Ideographic Supplement          |  1F200 - 1F2FF  |     64  |      64  |
            |  Miscellaneous Symbols and Pictographs    |  1F300 - 1F5FF  |    768  |     768  |
            |  Emoticons                                |  1F600 - 1F64F  |     80  |      80  |
            |  Ornamental Dingbats                      |  1F650 - 1F67F  |     48  |      48  |
            |  Transport and Map Symbols                |  1F680 - 1F6FF  |    107  |     108  |
            |  Alchemical Symbols                       |  1F700 - 1F77F  |    116  |     116  |
            |  Geometric Shapes Extended                |  1F780 - 1F7FF  |     85  |      89  |
            |  Supplemental Arrows-C                    |  1F800 - 1F8FF  |    148  |     148  |
            |  Supplemental Symbols and Pictographs     |  1F900 - 1F9FF  |    148  |     213  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Supplementary Private Use Area-A         |  F0000 - FFFFF  |    118  |  65,534  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |                  Total                    |                 |  9,622  |          |
            •-------------------------------------------•-----------------•---------•----------•
        

        In the next three posts, I’ll describe the complete list of IPA symbols, sorted by UNICODE code-point

        Best Regards,

        guy038

        1 Reply Last reply Reply Quote 2
        • guy038G Offline
          guy038
          last edited by

          Hello, All,

          Here is the list of the 185 common IPA characters, sorted by UNICODE code-point :

          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          |             Unicode Block              | Chr | Code |                   Unicode Name                    |                      IPA Name                       |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Basic Latin                            | [ ] | 0020 | space                                             | word break                                          |
          | Basic Latin                            | [.] | 002E | full stop                                         | syllable break                                      |
          | Basic Latin                            | [a] | 0061 | latin small letter a                              | open front unrounded                                |
          | Basic Latin                            | [b] | 0062 | latin small letter b                              | voiced bilabial plosive                             |
          | Basic Latin                            | [c] | 0063 | latin small letter c                              | voiceless palatal plosive                           |
          | Basic Latin                            | [d] | 0064 | latin small letter d                              | voiced alveolar plosive                             |
          | Basic Latin                            | [e] | 0065 | latin small letter e                              | close-mid front unrounded                           |
          | Basic Latin                            | [f] | 0066 | latin small letter f                              | voiceless labiodental fricative                     |
          | Basic Latin                            | [g] | 0067 | latin small letter g                              | voiced velar plosive                                |
          | Basic Latin                            | [h] | 0068 | latin small letter h                              | voiceless glottal fricative                         |
          | Basic Latin                            | [i] | 0069 | latin small letter i                              | close front unrounded                               |
          | Basic Latin                            | [j] | 006A | latin small letter j                              | voiced palatal approximant                          |
          | Basic Latin                            | [k] | 006B | latin small letter k                              | voiceless velar plosive                             |
          | Basic Latin                            | [l] | 006C | latin small letter l                              | voiced alveolar lateral approximant                 |
          | Basic Latin                            | [m] | 006D | latin small letter m                              | voiced bilabial nasal                               |
          | Basic Latin                            | [n] | 006E | latin small letter n                              | voiced alveolar nasal                               |
          | Basic Latin                            | [o] | 006F | latin small letter o                              | close-mid back rounded                              |
          | Basic Latin                            | [p] | 0070 | latin small letter p                              | voiceless bilabial plosive                          |
          | Basic Latin                            | [q] | 0071 | latin small letter q                              | voiceless uvular plosive                            |
          | Basic Latin                            | [r] | 0072 | latin small letter r                              | voiced alveolar trill                               |
          | Basic Latin                            | [s] | 0073 | latin small letter s                              | voiceless alveolar fricative                        |
          | Basic Latin                            | [t] | 0074 | latin small letter t                              | voiceless alveolar plosive                          |
          | Basic Latin                            | [u] | 0075 | latin small letter u                              | close back rounded                                  |
          | Basic Latin                            | [v] | 0076 | latin small letter v                              | voiced labiodental fricative                        |
          | Basic Latin                            | [w] | 0077 | latin small letter w                              | voiced labial-velar approximant                     |
          | Basic Latin                            | [x] | 0078 | latin small letter x                              | voiceless velar fricative                           |
          | Basic Latin                            | [y] | 0079 | latin small letter y                              | close front rounded                                 |
          | Basic Latin                            | [z] | 007A | latin small letter z                              | voiced alveolar fricative                           |
          | Basic Latin                            | [|] | 007C | vertical line                                     | minor group break (foot)                            |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Latin-1 Supplement                     | [æ] | 00E6 | latin small letter ae                             | raised open front unrounded                         |
          | Latin-1 Supplement                     | [ç] | 00E7 | latin small letter c with cedilla                 | voiceless palatal fricative                         |
          | Latin-1 Supplement                     | [ð] | 00F0 | latin small letter eth                            | voiced dental fricative                             |
          | Latin-1 Supplement                     | [ø] | 00F8 | latin small letter o with stroke                  | close-mid front rounded                             |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Latin Extended-A                       | [ħ] | 0127 | latin small letter h with stroke                  | voiceless pharyngeal fricative                      |
          | Latin Extended-A                       | [ŋ] | 014B | latin small letter eng                            | voiced velar nasal                                  |
          | Latin Extended-A                       | [œ] | 0153 | latin small ligature oe                           | open-mid front rounded                              |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Latin Extended-B                       | [ǀ] | 01C0 | latin letter dental click                         | voiceless dental click                              |
          | Latin Extended-B                       | [ǁ] | 01C1 | latin letter lateral click                        | voiceless alveolar lateral click                    |
          | Latin Extended-B                       | [ǂ] | 01C2 | latin letter alveolar click                       | voiceless palatoalveolar click                      |
          | Latin Extended-B                       | [ǃ] | 01C3 | latin letter retroflex click                      | voiceless (post)alveolar click                      |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          

          Continuation of the list on next post

          1 Reply Last reply Reply Quote 2
          • guy038G Offline
            guy038
            last edited by guy038

            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            |             Unicode Block              | Chr | Code |                   Unicode Name                    |                      IPA Name                       |
            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            | IPA Extensions                         | [ɐ] | 0250 | latin small letter turned a                       | lowered schwa                                       |
            | IPA Extensions                         | [ɑ] | 0251 | latin small letter alpha                          | open back unrounded                                 |
            | IPA Extensions                         | [ɒ] | 0252 | latin small letter turned alpha                   | open back rounded                                   |
            | IPA Extensions                         | [ɓ] | 0253 | latin small letter b with hook                    | voiced bilabial implosive                           |
            | IPA Extensions                         | [ɔ] | 0254 | latin small letter open o                         | open-mid back rounded                               |
            | IPA Extensions                         | [ɕ] | 0255 | latin small letter c with curl                    | voiceless alveolo-palatal fricative                 |
            | IPA Extensions                         | [ɖ] | 0256 | latin small letter d with tail                    | voiced retroflex plosive                            |
            | IPA Extensions                         | [ɗ] | 0257 | latin small letter d with hook                    | voiced dental/alveolar implosive                    |
            | IPA Extensions                         | [ɘ] | 0258 | latin small letter reversed e                     | close-mid central unrounded                         |
            | IPA Extensions                         | [ə] | 0259 | latin small letter schwa                          | mid-central schwa                                   |
            | IPA Extensions                         | [ɛ] | 025B | latin small letter open e                         | open-mid front unrounded                            |
            | IPA Extensions                         | [ɜ] | 025C | latin small letter reversed open e                | open-mid central unrounded                          |
            | IPA Extensions                         | [ɞ] | 025E | latin small letter closed reversed open e         | open-mid central rounded                            |
            | IPA Extensions                         | [ɟ] | 025F | latin small letter dotless j with stroke          | voiced palatal plosive                              |
            | IPA Extensions                         | [ɠ] | 0260 | latin small letter g with hook                    | voiced velar implosive                              |
            | IPA Extensions                         | [g] | 0261 | latin small letter script g                       | voiced velar plosive                                |
            | IPA Extensions                         | [ɢ] | 0262 | latin letter small capital g                      | voiced uvular plosive                               |
            | IPA Extensions                         | [ɣ] | 0263 | latin small letter gamma                          | voiced velar fricative                              |
            | IPA Extensions                         | [ɤ] | 0264 | latin small letter rams horn                      | close-mid back unrounded                            |
            | IPA Extensions                         | [ɥ] | 0265 | latin small letter turned h                       | voiced labial-palatal approximant                   |
            | IPA Extensions                         | [ɦ] | 0266 | latin small letter h with hook                    | voiced glottal fricative                            |
            | IPA Extensions                         | [ɧ] | 0267 | latin small letter heng with hook                 | simultaneous voiceless postalveolar+velar fricative |
            | IPA Extensions                         | [ɨ] | 0268 | latin small letter i with stroke                  | close central unrounded                             |
            | IPA Extensions                         | [ɪ] | 026A | latin letter small capital i                      | lax close front unrounded                           |
            | IPA Extensions                         | [ɬ] | 026C | latin small letter l with belt                    | voiceless alveolar lateral fricative                |
            | IPA Extensions                         | [ɭ] | 026D | latin small letter l with retroflex hook          | voiced retroflex lateral approximant                |
            | IPA Extensions                         | [ɮ] | 026E | latin small letter lezh                           | voiced alveolar lateral fricative                   |
            | IPA Extensions                         | [ɯ] | 026F | latin small letter turned m                       | close back unrounded                                |
            | IPA Extensions                         | [ɰ] | 0270 | latin small letter turned m with long leg         | voiced velar approximant                            |
            | IPA Extensions                         | [ɱ] | 0271 | latin small letter m with hook                    | voiced labiodental nasal                            |
            | IPA Extensions                         | [ɲ] | 0272 | latin small letter n with left hook               | voiced palatal nasal                                |
            | IPA Extensions                         | [ɳ] | 0273 | latin small letter n with retroflex hook          | voiced retroflex nasal                              |
            | IPA Extensions                         | [ɴ] | 0274 | latin letter small capital n                      | voiced uvular nasal                                 |
            | IPA Extensions                         | [ɵ] | 0275 | latin small letter barred o                       | close-mid central rounded                           |
            | IPA Extensions                         | [ɶ] | 0276 | latin letter small capital oe                     | open front rounded                                  |
            | IPA Extensions                         | [ɸ] | 0278 | latin small letter phi                            | voiceless bilabial fricative                        |
            | IPA Extensions                         | [ɹ] | 0279 | latin small letter turned r                       | voiced alveolar approximant                         |
            | IPA Extensions                         | [ɺ] | 027A | latin small letter turned r with long leg         | voiced alveolar lateral flap                        |
            | IPA Extensions                         | [ɻ] | 027B | latin small letter turned r with hook             | voiced retroflex approximant                        |
            | IPA Extensions                         | [ɽ] | 027D | latin small letter r with tail                    | voiced retroflex tap                                |
            | IPA Extensions                         | [ɾ] | 027E | latin small letter r with fishhook                | voiced alveolar tap                                 |
            | IPA Extensions                         | [ʀ] | 0280 | latin letter small capital r                      | voiced uvular trill                                 |
            | IPA Extensions                         | [ʁ] | 0281 | latin letter small capital inverted r             | voiced uvular fricative                             |
            | IPA Extensions                         | [ʂ] | 0282 | latin small letter s with hook                    | voiceless retroflex fricative                       |
            | IPA Extensions                         | [ʃ] | 0283 | latin small letter esh                            | voiceless postalveolar fricative                    |
            | IPA Extensions                         | [ʄ] | 0284 | latin small letter dotless j with stroke and hook | voiced palatal implosive                            |
            | IPA Extensions                         | [ʈ] | 0288 | latin small letter t with retroflex hook          | voiceless retroflex plosive                         |
            | IPA Extensions                         | [ʉ] | 0289 | latin small letter u bar                          | close central rounded                               |
            | IPA Extensions                         | [ʊ] | 028A | latin small letter upsilon                        | lax close back rounded                              |
            | IPA Extensions                         | [ʋ] | 028B | latin small letter v with hook                    | voiced labiodental approximant                      |
            | IPA Extensions                         | [ʌ] | 028C | latin small letter turned v                       | open-mid back unrounded                             |
            | IPA Extensions                         | [ʍ] | 028D | latin small letter turned w                       | voiceless labial-velar fricative                    |
            | IPA Extensions                         | [ʎ] | 028E | latin small letter turned y                       | voiced palatal lateral approximant                  |
            | IPA Extensions                         | [ʏ] | 028F | latin letter small capital y                      | lax close front rounded                             |
            | IPA Extensions                         | [ʐ] | 0290 | latin small letter z with retroflex hook          | voiced retroflex fricative                          |
            | IPA Extensions                         | [ʑ] | 0291 | latin small letter z with curl                    | voiced alveolo-palatal fricative                    |
            | IPA Extensions                         | [ʒ] | 0292 | latin small letter ezh                            | voiced postalveolar fricative                       |
            | IPA Extensions                         | [ʔ] | 0294 | latin letter glottal stop                         | voiceless glottal plosive                           |
            | IPA Extensions                         | [ʕ] | 0295 | latin letter pharyngeal                           | voiced fricative voiced pharyngeal fricative        |
            | IPA Extensions                         | [ʘ] | 0298 | latin letter bilabial click                       | voiceless bilabial click                            |
            | IPA Extensions                         | [ʙ] | 0299 | latin letter small capital b                      | voiced bilabial trill                               |
            | IPA Extensions                         | [ʛ] | 029B | latin letter small capital g with hook            | voiced uvular implosive                             |
            | IPA Extensions                         | [ʜ] | 029C | latin letter small capital h                      | voiceless epiglottal fricative                      |
            | IPA Extensions                         | [ʝ] | 029D | latin small letter j with crossed tail            | voiced palatal fricative                            |
            | IPA Extensions                         | [ʟ] | 029F | latin letter small capital l                      | voiced velar lateral approximant                    |
            | IPA Extensions                         | [ʡ] | 02A1 | latin letter glottal stop with stroke             | epiglottal plosive                                  |
            | IPA Extensions                         | [ʢ] | 02A2 | latin letter reversed glottal stop with stroke    | voiced epiglottal fricative                         |
            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            | Spacing Modifier Letters               | [ʰ] | 02B0 | modifier letter small h                           | aspirated                                           |
            | Spacing Modifier Letters               | [ʱ] | 02B1 | modifier letter small h with hook                 | voiced pre-aspirated                                |
            | Spacing Modifier Letters               | [ʲ] | 02B2 | modifier letter small j                           | palatalized                                         |
            | Spacing Modifier Letters               | [ʷ] | 02B7 | modifier letter small w                           | labialized                                          |
            | Spacing Modifier Letters               | [ʼ] | 02BC | modifier letter apostrophe                        | ejective                                            |
            | Spacing Modifier Letters               | [ˀ] | 02C0 | modifier letter glottal stop                      | glottalized                                         |
            | Spacing Modifier Letters               | [ˈ] | 02C8 | modifier letter vertical line                     | primary stress                                      |
            | Spacing Modifier Letters               | [ˌ] | 02CC | modifier letter low vertical line                 | secondary stress                                    |
            | Spacing Modifier Letters               | [ː] | 02D0 | modifier letter triangular colon                  | long                                                |
            | Spacing Modifier Letters               | [ˑ] | 02D1 | modifier letter half triangular colon             | half-long                                           |
            | Spacing Modifier Letters               | [˞] | 02DE | modifier letter rhotic hook                        | rhotacized                                          |
            | Spacing Modifier Letters               | [ˠ] | 02E0 | modifier letter small gamma                       | velarized                                           |
            | Spacing Modifier Letters               | [ˡ] | 02E1 | modifier letter small l                           | lateral release                                     |
            | Spacing Modifier Letters               | [ˤ] | 02E4 | modifier letter small reversed glottal stop       | pharyngealized                                      |
            | Spacing Modifier Letters                | [˥] | 02E5 | modifier letter extra-high tone bar              | extra high tone                                     |
            | Spacing Modifier Letters                | [˦] | 02E6 | modifier letter high tone bar                    | high tone                                           |
            | Spacing Modifier Letters                | [˧] | 02E7 | modifier letter mid tone bar                     | mid tone                                            |
            | Spacing Modifier Letters                | [˨] | 02E8 | modifier letter low tone bar                     | low tone                                            |
            | Spacing Modifier Letters                | [˩] | 02E9 | modifier letter extra-low tone bar               | extra low tone                                      |
            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            

            End of the list on next post

            1 Reply Last reply Reply Quote 2
            • guy038G Offline
              guy038
              last edited by guy038

              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              |             Unicode Block              | Chr | Code |                   Unicode Name                    |                      IPA Name                       |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Combining Diacritical Marks            | [ò] | 0300 | combining grave accent                            | low tone                                            |
              | Combining Diacritical Marks            | [ó] | 0301 | combining acute accent                            | high tone                                           |
              | Combining Diacritical Marks            | [ô] | 0302 | combining circumflex accent                       | falling                                             |
              | Combining Diacritical Marks            | [õ] | 0303 | combining tilde                                   | nasalized                                           |
              | Combining Diacritical Marks            | [ō] | 0304 | combining macron                                  | mid tone                                            |
              | Combining Diacritical Marks            | [ŏ] | 0306 | combining breve                                   | extra-short                                         |
              | Combining Diacritical Marks            | [ö] | 0308 | combining diaeresis                               | centralized                                         |
              | Combining Diacritical Marks            | [o̊] | 030A | combining ring above                              | voiceless (above)                                   |
              | Combining Diacritical Marks            | [ő] | 030B | combining double acute accent                     | extra high tone                                     |
              | Combining Diacritical Marks            | [ǒ] | 030C | combining caron                                   | rising                                              |
              | Combining Diacritical Marks            | [ȍ] | 030F | combining double grave accent                     | extra low tone                                      |
              | Combining Diacritical Marks            | [o̘] | 0318 | combining left tack below                         | advanced tongue root                                |
              | Combining Diacritical Marks            | [o̙] | 0319 | combining right tack below                        | retracted tongue root                               |
              | Combining Diacritical Marks            | [o̚] | 031A | combining left angle above                        | no audible release                                  |
              | Combining Diacritical Marks            | [o̜] | 031C | combining left half ring below                    | less rounded                                        |
              | Combining Diacritical Marks            | [o̝] | 031D | combining up tack below                           | raised                                              |
              | Combining Diacritical Marks            | [o̞] | 031E | combining down tack below                         | lowered                                             |
              | Combining Diacritical Marks            | [o̟] | 031F | combining plus sign below                         | advanced                                            |
              | Combining Diacritical Marks            | [o̠] | 0320 | combining minus sign below                        | retracted                                           |
              | Combining Diacritical Marks            | [o̤] | 0324 | combining diaeresis below                         | breathy voiced                                      |
              | Combining Diacritical Marks            | [o̥] | 0325 | combining ring below                              | voiceless                                           |
              | Combining Diacritical Marks            | [o̩] | 0329 | combining vertical line below                     | syllabic                                            |
              | Combining Diacritical Marks            | [o̪] | 032A | combining bridge below                            | dental                                              |
              | Combining Diacritical Marks            | [o̬] | 032C | combining caron below                             | voiced                                              |
              | Combining Diacritical Marks            | [o̮] | 032E | combining breve below                             | derhoticized                                        |
              | Combining Diacritical Marks            | [o̯] | 032F | combining inverted breve below                    | non-syllabic                                        |
              | Combining Diacritical Marks            | [o̰] | 0330 | combining tilde below                             | creaky voiced                                       |
              | Combining Diacritical Marks            | [o̴] | 0334 | combining tilde overlay                           | velarized or pharyngealized                         |
              | Combining Diacritical Marks            | [o̹] | 0339 | combining right half ring below                   | more rounded                                        |
              | Combining Diacritical Marks            | [o̺] | 033A | combining inverted bridge below                   | apical                                              |
              | Combining Diacritical Marks            | [o̻] | 033B | combining sqare below                             | aminal                                              |
              | Combining Diacritical Marks            | [o̼] | 033C | combining seagull below                           | linguolabial                                        |
              | Combining Diacritical Marks            | [o̽] | 033D | combining x above                                 | mid-centralized                                     |
              | Combining Diacritical Marks            | [o͇] | 0347 | combining equals sign below                       | non-sibilant                                        |
              | Combining Diacritical Marks            | [o͈] | 0348 | combining double vertical line below              | fortis                                              |
              | Combining Diacritical Marks            | [o͉] | 0349 | combining left angle below                        | lenis                                               |
              | Combining Diacritical Marks            | [o͓] | 0353 | combining x below                                 | frictionalized                                      |
              | Combining Diacritical Marks            | [o͜] | 035C | combining double breve below                      | tie bar (below)                                     |
              | Combining Diacritical Marks            | [o͡] | 0361 | combining double inverted breve                   | tie bar                                             |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Greek and Coptic                       | [β] | 03B2 | greek small letter beta                           | voiced bilabial fricative                           |
              | Greek and Coptic                       | [θ] | 03B8 | greek small letter theta                          | voiceless dental fricative                          |
              | Greek and Coptic                       | [χ] | 03C7 | greek small letter chi                            | voiceless uvular fricative                          |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Phonetic Extensions                    | [ᴱ] | 1D31 | modifier letter capital e                         | epilaryngeal phonation                              |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Phonetic Extensions Supplement         | [ᶑ] | 1D91 | latin small letter d with hook and tail           | voiced retroflex implosive                          |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Combining Diacritical Marks Supplement | [o᷄] | 1DC4 | combining macron-acute                            | high rising                                         |
              | Combining Diacritical Marks Supplement | [o᷅] | 1DC5 | combining grave-macron                            | low rising                                          |
              | Combining Diacritical Marks Supplement | [o᷆] | 1DC6 | combining macron-grave                            | low falling                                         |
              | Combining Diacritical Marks Supplement | [o᷇] | 1DC7 | combining acute-macron                            | high falling                                        |
              | Combining Diacritical Marks Supplement | [o᷈] | 1DC8 | combining grave-acute-grave                       | rising-falling                                      |
              | Combining Diacritical Marks Supplement | [o᷉] | 1DC9 | combining acute-grave-acute                       | falling-rising                                      |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | General Punctuation                    | [‖] | 2016 | double vertical line                              | major group break (intonation)                      |
              | General Punctuation                    | [‼] | 203C | double exclamation mark                           | retroflex click                                     |
              | General Punctuation                    | [‿] | 203F | undertie                                          | linking (absence of a break)                        |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Superscripts and Subscripts            | [ⁿ] | 207F | superscript latin small letter n                  | nasal release                                       |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Arrows                                 | [↑] | 2191 | upwards arrow                                     | global rise                                         |
              | Arrows                                 | [↓] | 2193 | downwards arrow                                   | global fall                                         |
              | Arrows                                 | [↗] | 2197 | north east arrow                                  | global rise                                         |
              | Arrows                                 | [↘] | 2198 | south east arrow                                  | global fall                                         |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Latin Extended-C                       | [ⱱ] | 2C71 | latin small letter v with right hook              | voiced labiodental tap                              |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              

              • Gather this post with the two last posts in a single .txt file

              • Then, if you apply the general IPA regex against this composite file, you should get, as expected, 185 matches !

              Best Regards,

              guy038

              1 Reply Last reply Reply Quote 2
              • mkupperM Offline
                mkupper @guy038
                last edited by

                @guy038 said in How to easily manage IPA symbols...:

                I suppose that very few people will be interested in the International Phonetic Alphabet ( known as IPA )

                Notepad++ walks into a bar and orders an IPA. @guy038 is tending the bar and so pours a glass of wine while chatting about IPA and Unicode.

                1 Reply Last reply Reply Quote 6
                • Alan KilbornA Offline
                  Alan Kilborn
                  last edited by Alan Kilborn

                  @mkupper said:

                  …so pours a glass of wine while chatting about IPA…

                  @guy038 has always been really good at answering the question that wasn’t asked. It’s not a bad thing, as usually an increase in general knowledge results. :-)

                  1 Reply Last reply Reply Quote 3

                  Hello! It looks like you're interested in this conversation, but you don't have an account yet.

                  Getting fed up of having to scroll through the same posts each visit? When you register for an account, you'll always come back to exactly where you were before, and choose to be notified of new replies (either via email, or push notification). You'll also be able to save bookmarks and upvote posts to show your appreciation to other community members.

                  With your input, this post could be even better 💗

                  Register Login
                  • First post
                    Last post
                  The Community of users of the Notepad++ text editor.
                  Powered by NodeBB | Contributors