Community
    • Login

    How to easily manage IPA symbols...

    Scheduled Pinned Locked Moved General Discussion
    7 Posts 3 Posters 213 Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • guy038G
      guy038
      last edited by guy038

      Hello, All,

      I suppose that very few people will be interested in the International Phonetic Alphabet ( known as IPA ) ! However, in this topic :

      • I’ll show some basic elements about the IPA writing system.

      • I’ll propose a monospaced font, from Clement Pit-Claudel, that supports all the IPA symbols and plenty of other symbols, as well as the Latin, Greek and Cyrillic characters.

      • I’ll describe a general regex to easily find out any IPA symbol in text files.

      NOTE : The GitHub site of Clement Pit-Claudel is https://github.com/cpitclaudel/monospacifier

      IMPORTANT : In order for that font to be displayed correctly as monospaced, you’ll need to use the GDI (most compatible) option of Direct Write , only !


      • For general information, refer to https://en.wikipedia.org/wiki/International_Phonetic_Alphabet

      • Regarding English/American IPA, see https://en.wikipedia.org/wiki/Help:IPA/English

      • Regarding French IPA, see https://fr.wikipedia.org/wiki/Aide:Alphabet_phonétique_français

      In, order to know the EN/US phonetic transcription of a word, I used :

      https://dictionary.cambridge.org/browse/pronunciation/english/

      In order to know the EN/US phonemic transcription of a word or phrase, I used :

      https://tophonetics.com/

      Finally if you need to type phonetic symbols, you may use these two sites :

      https://www.internationalphoneticassociation.org/IPAcharts/inter_chart_2018/IPA_2018.html

      https://ipa.typeit.org/full/

      As usual, these references are just a simple selection and many other sites may give interesting info, too !


      Unicode contains 3 blocks, devoted to the I.P.A., which are :

      • The IPA Extensions block, containing 96 characters, between U+0250 and U+02AF

      • The Phonetic Extensions block, containing 128 characters, between U+1D00 and U+1D7F

      • The Phonetic Extensions Supplement, containing 64 characters between U+1D80 and U+1DBF

      Unfortunately, many other characters, used by the IPA writing system, are dispersed across several UNICODE character blocks !

      The other blocks involved, in some way, in IPA, are :

      - Basic Latin
      - Latin-1 Supplement
      - Latin Extended-A
      - Latin Extended-B
      - Spacing Modifier Letters
      - Combining Diacritical Marks
      - Greek and Coptic
      - Combining Diacritical Marks Supplement
      - General Punctuation
      - Superscripts and Subscripts
      - Arrows
      - Latin Extended-C
      

      Thus, it rather difficult to find out a monospaced font able to display all these IPA symbols. I succeeded to find the serif font Symbola_Monospacified_for_Liberation_Mono.ttf, containing 9,622 characters and 9,827 glyphs :

      https://github.com/cpitclaudel/monospacifier/blob/master/fonts/Symbola_monospacified_for_LiberationMono.ttf?raw=true

      However, this font can be used, either, to display any common character or symbol !!

      Once this font downloaded, on your desktop :

      • Select and copy this font ( CTRL + C )

      • Move to the folder C:\Windows\fonts

      • Paste this font ( Ctrl + V )

      • Open Notepad++

      • Select Settings > Style Configurator... > Global Styles > Default Style

      • Choose the font Symbola Monospacified for...

      • Click on the Save & Close button

      REMARK : Note that my Windows 10 system automatically redirected this font to the folder C:\Users\Guy\AppData\Local\Microsoft\Windows\Fonts ( and not within C:\Windows\fonts )


      As explained, in this section :

      https://en.wikipedia.org/wiki/International_Phonetic_Alphabet#Brackets_and_transcription_delimiters

      The general regex, proposed below, searches any range of IPA symbols between, either :

      • Square Brackets, [], representing the phonetic notation, primary function of the IPA

      • Slashes, //, representing the phonemic notation, with distinctive features only

      However, as a lot of uncommon IPA symbols may happen in a phonetic transcription, I decided to include, in my regex, the complete IPA blocks, so a total of 288 characters, for the 3 blocks noted above.

      Note also that the delimiters are surrounded, themselves, by a \s character ( so any kind of horizontal or vertical space char ). In other words, this regex should avoid any normal text and just matches IPA transcriptions !


      The general regex, to find any range of IPA symbols, between the [] delimiters OR the // delimiters, is :

      (?-i)(?<=(?<=\s)(\[)|(?<=\s)/)[\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}\x{00E6}\x{00E7}\x{00F0}\x{00F8}\x{0127}\x{014B}\x{0153}\x{01C0}\x{01C1}\x{01C2}\x{01C3}\x{0250}-\x{02AF}\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}\x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}\x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}\x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}\x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}\x{03B2}\x{03B8}\x{03C7}\x{1D00}-\x{1D7F}\x{1D80}-\x{1DBF}\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}\x{2016}\x{203C}\x{203F}\x{207F}\x{2191}\x{2193}\x{2197}\x{2198}\x{2C71}]+(?=(?(1)\](?=\s)|/(?=\s)))

      And may also be expressed as below, to distinguish the different chars or blocks :

      (?x)
      (?-i)(?<=(?<=\s)(\[)|(?<=\s)/)
      (?:
      [\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}]                       | # Basic Latin                            29
      [\x{00E6}\x{00E7}\x{00F0}\x{00F8}]                                | # Latin-1 Supplement                      4
      [\x{0127}\x{014B}\x{0153}]                                        | # Latin Extended-A                        3
      [\x{01C0}\x{01C1}\x{01C2}\x{01C3}]                                | # Latin Extended-B                        4
      [\x{0250}-\x{02AF}]                                               | # IPA Extensions                         96
      [\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}
       \x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}]       | # Spacing Modifier Letters               19
      [\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}
       \x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}
       \x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}
       \x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}]                | # Combining Diacritical Marks            39
      [\x{03B2}\x{03B8}\x{03C7}]                                        | # Greek and Coptic                        3
      [\x{1D00}-\x{1D7F}]                                               | # Phonetic Extensions                   128
      [\x{1D80}-\x{1DBF}]                                               | # Phonetic Extensions Supplement         64
      [\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}]                | # Combining Diacritical Marks Supplement  6
      [\x{2016}\x{203C}\x{203F}]                                        | # General Punctuation                     3
       \x{207F}                                                         | # Superscripts and Subscripts             1
      [\x{2191}\x{2193}\x{2197}\x{2198}]                                | # Arrows                                  4
       \x{2C71}                                                           # Latin Extended-C                        1
      )+
      (?=(?(1)\](?=\s)|/(?=\s)))
      

      Some examples of the IPA transcriptions of English/American words, from the two sites I used :

      IPA  /ˌækəˈdɛmɪk/                                 for English / American word academic
      IPA  [ˌæk.əˈdem.]                                 for English / American word academic
      
      
      IPA  /əˈbaʊt/                                     for English / American word about
      IPA  [əˈbaʊt]                                     for English / American word about
      
      
      IPA  /ˈfæmɪʃ/             /ˈfeɪmɪʃ/               for English / American word famish
      IPA  [ˈfæm.ɪʃ]                                    for English / American word famish
      
      
      IPA  /ˌriːˈsaɪkᵊl/        /ˌriˈsaɪkᵊl/            for English / American word recycle
      IPA  [ˌriːˈsaɪ.kᵊl]                               for English / American word recycle
      
      
      IPA  /ˌdʒɛnəˈræləti/                              for English / American word generality
      IPA  [ˌdʒen.əˈræl.ə.ti]   [ˌdʒen.əˈræl.ə.t̬i]      for English / American word generality
      
      
      IPA  /ˌɪntəˈnæʃᵊnᵊl/      /ˌɪntərˈnæʃənəl/        for English / American word international
      IPA  [ˌɪn.təˈnæʃ.ᵊn.ᵊl]   [ˌɪnt̬ɚˈnæʃ.ᵊn.ᵊl]       for English / American word international
      
      
      IPA  /ˈmʌðə/              /ˈmʌðər/                for English / American word mother
      IPA  [ˈmʌð.ər]            [ˈmʌð.ɚ]                for English / American word mother
      
      
      IPA  /ədˈvɜːtɪsmənt/      /ˌædvərˈtaizmənt/       for English / American word advertisement
      IPA  [ədˈvɜː.tɪs.mənt]    [ˌæd.ˈvɚˈtaiz.mənt]     for English / American word advertisement
      

      If you run the general IPA regex against this post, you should get 25 matches. Try the Mark feature !


      In the next post, I’ll list all the Unicode blocks correctly displayed by the Symbola_Monospacified_for_Liberation_Mono.ttf font

      Best Regards,

      guy038

      mkupperM 1 Reply Last reply Reply Quote 2
      • guy038G
        guy038
        last edited by

        Hi, all,

        Here is the list of all the UNICODE blocks covered by the Symbola_Monospacified_for_Liberation_Mono.ttf font :

            •-------------------------------------------------------------•---------•----------•
            |             Unicode 16.0 Blocks           |      Range      |   Chars |   Total  |
            •-------------------------------------------•-----------------•---------•----------•
            |  Basic Latin                              |   0000 -  007F  |     97  |     128  |
            |  Latin-1 Supplement                       |   0080 -  00FF  |     96  |     128  |
            |  Latin Extended-A                         |   0100 -  017F  |    128  |     128  |
            |  Latin Extended-B                         |   0180 -  024F  |    208  |     208  |
            |  IPA Extensions                           |   0250 -  02AF  |     96  |      96  |
            |  Spacing Modifier Letters                 |   02B0 -  02FF  |     80  |      80  |
            |  Combining Diacritical Marks              |   0300 -  036F  |    112  |     112  |
            |  Greek and Coptic                         |   0370 -  03FF  |    135  |     135  |
            |  Cyrillic                                 |   0400 -  04FF  |    256  |     256  |
            |  Cyrillic Supplement                      |   0500 -  052F  |     48  |      48  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Arabic                                   |   0600 -  06FF  |     10  |     255  |  Imcomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Combining Diacritical Marks Extended     |   1AB0 -  1AFF  |     15  |      15  |
            |  Cyrillic Extended-C                      |   1C80 -  1C8F  |      9  |       9  |
            |  Phonetic Extensions                      |   1D00 -  1D7F  |    128  |     128  |
            |  Phonetic Extensions Supplement           |   1D80 -  1DBF  |     64  |      64  |
            |  Combining Diacritical Marks Supplement   |   1DC0 -  1DFF  |     63  |      63  |
            |  Latin Extended Additional                |   1E00 -  1EFF  |    256  |     256  |
            |  Greek Extended                           |   1F00 -  1FFF  |    233  |     233  |
            |  General Punctuation                      |   2000 -  206F  |    111  |     111  |
            |  Superscripts and Subscripts              |   2070 -  209F  |     42  |      42  |
            |  Currency Symbols                         |   20A0 -  20CF  |     32  |      32  |
            |  Combining Diacritical Marks for Symbols  |   20D0 -  20FF  |     33  |      33  |
            |  Letterlike Symbols                       |   2100 -  214F  |     80  |      80  |
            |  Number Forms                             |   2150 -  218F  |     60  |      60  |
            |  Arrows                                   |   2190 -  21FF  |    112  |     112  |
            |  Mathematical Operators                   |   2200 -  22FF  |    256  |     256  |
            |  Miscellaneous Technical                  |   2300 -  23FF  |    256  |     256  |
            |  Control Pictures                         |   2400 -  243F  |     39  |      39  |
            |  Optical Character Recognition            |   2440 -  245F  |     11  |      11  |
            |  Enclosed Alphanumerics                   |   2460 -  24FF  |    160  |     160  |
            |  Box Drawing                              |   2500 -  257F  |    128  |     128  |
            |  Block Elements                           |   2580 -  259F  |     32  |      32  |
            |  Geometric Shapes                         |   25A0 -  25FF  |     96  |      96  |
            |  Miscellaneous Symbols                    |   2600 -  26FF  |    256  |     256  |
            |  Dingbats                                 |   2700 -  27BF  |    192  |     192  |
            |  Miscellaneous Mathematical Symbols-A     |   27C0 -  27EF  |     48  |      48  |
            |  Supplemental Arrows-A                    |   27F0 -  27FF  |     16  |      16  |
            |  Braille Patterns                         |   2800 -  28FF  |    256  |     256  |
            |  Supplemental Arrows-B                    |   2900 -  297F  |    128  |     128  |
            |  Miscellaneous Mathematical Symbols-B     |   2980 -  29FF  |    128  |     128  |
            |  Supplemental Mathematical Operators      |   2A00 -  2AFF  |    256  |     256  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Miscellaneous Symbols and Arrows         |   2B00 -  2BFF  |    207  |     250  |  Imcomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Latin Extended-C                         |   2C60 -  2C7F  |     32  |      32  |
            |  Coptic                                   |   2C80 -  2CFF  |    123  |     123  |
            |  Cyrillic Extended-A                      |   2DE0 -  2DFF  |     32  |      32  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Supplemental Punctuation                 |   2E00 -  2E7F  |     74  |      79  |  Incomplete
            |  Hiragana                                 |   3040 -  309F  |      1  |      93  |  Incomplete
            |  Enclosed CJK Letters and Months          |   3200 -  32FF  |      2  |     254  |  Incomplete
            |  CJK Unified Ideographs                   |   4E00 -  9FFF  |      1  |  20,976  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Yijing Hexagram Symbols                  |   4DC0 -  4DFF  |     64  |      64  |
            |  Cyrillic Extended-B                      |   A640 -  A69F  |     96  |      96  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Latin Extended-D                         |   A720 -  A7FF  |    160  |     163  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Latin Extended-E                         |   AB30 -  AB6F  |     54  |      54  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Alphabetic Presentation Forms            |   FB00 -  FB4F  |      7  |      58  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Variation Selectors                      |   FE00 -  FE0F  |     16  |      16  |
            |  Combining Half Marks                     |   FE20 -  FE2F  |     16  |      16  |
            |  Specials                                 |   FFF0 -  FFFF  |      5  |       5  |
            •-------------------------------------------•-----------------•---------•----------•
            |  Aegean Numbers                           |  10100 - 1013F  |     57  |      57  |
            |  Ancient Greek Numbers                    |  10140 - 1018F  |     79  |      79  |
            |  Ancient Symbols                          |  10190 - 101CF  |     13  |      13  |
            |  Phaistos Disc                            |  101D0 - 101FF  |     46  |      46  |
            |  Coptic Epact Numbers                     |  102E0 - 102FF  |     28  |      28  |
            |  Byzantine Musical Symbols                |  1D000 - 1D0FF  |    246  |     246  |
            |  Musical Symbols                          |  1D100 - 1D1FF  |    231  |     231  |
            |  Ancient Greek Musical Notation           |  1D200 - 1D24F  |     70  |      70  |
            |  Tai Xuan Jing Symbols                    |  1D300 - 1D35F  |     87  |      87  |
            |  Counting Rod Numerals                    |  1D360 - 1D37F  |     18  |      25  |
            |  Mathematical Alphanumeric Symbols        |  1D400 - 1D7FF  |    996  |     996  |
            |  Mahjong Tiles                            |  1F000 - 1F02F  |     44  |      44  |
            |  Domino Tiles                             |  1F030 - 1F09F  |    100  |     100  |
            |  Playing Cards                            |  1F0A0 - 1F0FF  |     82  |      82  |
            |  Enclosed Alphanumeric Supplement         |  1F100 - 1F1FF  |    191  |     192  |
            |  Enclosed Ideographic Supplement          |  1F200 - 1F2FF  |     64  |      64  |
            |  Miscellaneous Symbols and Pictographs    |  1F300 - 1F5FF  |    768  |     768  |
            |  Emoticons                                |  1F600 - 1F64F  |     80  |      80  |
            |  Ornamental Dingbats                      |  1F650 - 1F67F  |     48  |      48  |
            |  Transport and Map Symbols                |  1F680 - 1F6FF  |    107  |     108  |
            |  Alchemical Symbols                       |  1F700 - 1F77F  |    116  |     116  |
            |  Geometric Shapes Extended                |  1F780 - 1F7FF  |     85  |      89  |
            |  Supplemental Arrows-C                    |  1F800 - 1F8FF  |    148  |     148  |
            |  Supplemental Symbols and Pictographs     |  1F900 - 1F9FF  |    148  |     213  |
            •-------------------------------------------•-----------------•---------•----------•--------------
            |  Supplementary Private Use Area-A         |  F0000 - FFFFF  |    118  |  65,534  |  Incomplete
            •-------------------------------------------•-----------------•---------•----------•--------------
            |                  Total                    |                 |  9,622  |          |
            •-------------------------------------------•-----------------•---------•----------•
        

        In the next three posts, I’ll describe the complete list of IPA symbols, sorted by UNICODE code-point

        Best Regards,

        guy038

        1 Reply Last reply Reply Quote 2
        • guy038G
          guy038
          last edited by

          Hello, All,

          Here is the list of the 185 common IPA characters, sorted by UNICODE code-point :

          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          |             Unicode Block              | Chr | Code |                   Unicode Name                    |                      IPA Name                       |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Basic Latin                            | [ ] | 0020 | space                                             | word break                                          |
          | Basic Latin                            | [.] | 002E | full stop                                         | syllable break                                      |
          | Basic Latin                            | [a] | 0061 | latin small letter a                              | open front unrounded                                |
          | Basic Latin                            | [b] | 0062 | latin small letter b                              | voiced bilabial plosive                             |
          | Basic Latin                            | [c] | 0063 | latin small letter c                              | voiceless palatal plosive                           |
          | Basic Latin                            | [d] | 0064 | latin small letter d                              | voiced alveolar plosive                             |
          | Basic Latin                            | [e] | 0065 | latin small letter e                              | close-mid front unrounded                           |
          | Basic Latin                            | [f] | 0066 | latin small letter f                              | voiceless labiodental fricative                     |
          | Basic Latin                            | [g] | 0067 | latin small letter g                              | voiced velar plosive                                |
          | Basic Latin                            | [h] | 0068 | latin small letter h                              | voiceless glottal fricative                         |
          | Basic Latin                            | [i] | 0069 | latin small letter i                              | close front unrounded                               |
          | Basic Latin                            | [j] | 006A | latin small letter j                              | voiced palatal approximant                          |
          | Basic Latin                            | [k] | 006B | latin small letter k                              | voiceless velar plosive                             |
          | Basic Latin                            | [l] | 006C | latin small letter l                              | voiced alveolar lateral approximant                 |
          | Basic Latin                            | [m] | 006D | latin small letter m                              | voiced bilabial nasal                               |
          | Basic Latin                            | [n] | 006E | latin small letter n                              | voiced alveolar nasal                               |
          | Basic Latin                            | [o] | 006F | latin small letter o                              | close-mid back rounded                              |
          | Basic Latin                            | [p] | 0070 | latin small letter p                              | voiceless bilabial plosive                          |
          | Basic Latin                            | [q] | 0071 | latin small letter q                              | voiceless uvular plosive                            |
          | Basic Latin                            | [r] | 0072 | latin small letter r                              | voiced alveolar trill                               |
          | Basic Latin                            | [s] | 0073 | latin small letter s                              | voiceless alveolar fricative                        |
          | Basic Latin                            | [t] | 0074 | latin small letter t                              | voiceless alveolar plosive                          |
          | Basic Latin                            | [u] | 0075 | latin small letter u                              | close back rounded                                  |
          | Basic Latin                            | [v] | 0076 | latin small letter v                              | voiced labiodental fricative                        |
          | Basic Latin                            | [w] | 0077 | latin small letter w                              | voiced labial-velar approximant                     |
          | Basic Latin                            | [x] | 0078 | latin small letter x                              | voiceless velar fricative                           |
          | Basic Latin                            | [y] | 0079 | latin small letter y                              | close front rounded                                 |
          | Basic Latin                            | [z] | 007A | latin small letter z                              | voiced alveolar fricative                           |
          | Basic Latin                            | [|] | 007C | vertical line                                     | minor group break (foot)                            |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Latin-1 Supplement                     | [æ] | 00E6 | latin small letter ae                             | raised open front unrounded                         |
          | Latin-1 Supplement                     | [ç] | 00E7 | latin small letter c with cedilla                 | voiceless palatal fricative                         |
          | Latin-1 Supplement                     | [ð] | 00F0 | latin small letter eth                            | voiced dental fricative                             |
          | Latin-1 Supplement                     | [ø] | 00F8 | latin small letter o with stroke                  | close-mid front rounded                             |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Latin Extended-A                       | [ħ] | 0127 | latin small letter h with stroke                  | voiceless pharyngeal fricative                      |
          | Latin Extended-A                       | [ŋ] | 014B | latin small letter eng                            | voiced velar nasal                                  |
          | Latin Extended-A                       | [œ] | 0153 | latin small ligature oe                           | open-mid front rounded                              |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          | Latin Extended-B                       | [ǀ] | 01C0 | latin letter dental click                         | voiceless dental click                              |
          | Latin Extended-B                       | [ǁ] | 01C1 | latin letter lateral click                        | voiceless alveolar lateral click                    |
          | Latin Extended-B                       | [ǂ] | 01C2 | latin letter alveolar click                       | voiceless palatoalveolar click                      |
          | Latin Extended-B                       | [ǃ] | 01C3 | latin letter retroflex click                      | voiceless (post)alveolar click                      |
          •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
          

          Continuation of the list on next post

          1 Reply Last reply Reply Quote 2
          • guy038G
            guy038
            last edited by guy038

            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            |             Unicode Block              | Chr | Code |                   Unicode Name                    |                      IPA Name                       |
            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            | IPA Extensions                         | [ɐ] | 0250 | latin small letter turned a                       | lowered schwa                                       |
            | IPA Extensions                         | [ɑ] | 0251 | latin small letter alpha                          | open back unrounded                                 |
            | IPA Extensions                         | [ɒ] | 0252 | latin small letter turned alpha                   | open back rounded                                   |
            | IPA Extensions                         | [ɓ] | 0253 | latin small letter b with hook                    | voiced bilabial implosive                           |
            | IPA Extensions                         | [ɔ] | 0254 | latin small letter open o                         | open-mid back rounded                               |
            | IPA Extensions                         | [ɕ] | 0255 | latin small letter c with curl                    | voiceless alveolo-palatal fricative                 |
            | IPA Extensions                         | [ɖ] | 0256 | latin small letter d with tail                    | voiced retroflex plosive                            |
            | IPA Extensions                         | [ɗ] | 0257 | latin small letter d with hook                    | voiced dental/alveolar implosive                    |
            | IPA Extensions                         | [ɘ] | 0258 | latin small letter reversed e                     | close-mid central unrounded                         |
            | IPA Extensions                         | [ə] | 0259 | latin small letter schwa                          | mid-central schwa                                   |
            | IPA Extensions                         | [ɛ] | 025B | latin small letter open e                         | open-mid front unrounded                            |
            | IPA Extensions                         | [ɜ] | 025C | latin small letter reversed open e                | open-mid central unrounded                          |
            | IPA Extensions                         | [ɞ] | 025E | latin small letter closed reversed open e         | open-mid central rounded                            |
            | IPA Extensions                         | [ɟ] | 025F | latin small letter dotless j with stroke          | voiced palatal plosive                              |
            | IPA Extensions                         | [ɠ] | 0260 | latin small letter g with hook                    | voiced velar implosive                              |
            | IPA Extensions                         | [g] | 0261 | latin small letter script g                       | voiced velar plosive                                |
            | IPA Extensions                         | [ɢ] | 0262 | latin letter small capital g                      | voiced uvular plosive                               |
            | IPA Extensions                         | [ɣ] | 0263 | latin small letter gamma                          | voiced velar fricative                              |
            | IPA Extensions                         | [ɤ] | 0264 | latin small letter rams horn                      | close-mid back unrounded                            |
            | IPA Extensions                         | [ɥ] | 0265 | latin small letter turned h                       | voiced labial-palatal approximant                   |
            | IPA Extensions                         | [ɦ] | 0266 | latin small letter h with hook                    | voiced glottal fricative                            |
            | IPA Extensions                         | [ɧ] | 0267 | latin small letter heng with hook                 | simultaneous voiceless postalveolar+velar fricative |
            | IPA Extensions                         | [ɨ] | 0268 | latin small letter i with stroke                  | close central unrounded                             |
            | IPA Extensions                         | [ɪ] | 026A | latin letter small capital i                      | lax close front unrounded                           |
            | IPA Extensions                         | [ɬ] | 026C | latin small letter l with belt                    | voiceless alveolar lateral fricative                |
            | IPA Extensions                         | [ɭ] | 026D | latin small letter l with retroflex hook          | voiced retroflex lateral approximant                |
            | IPA Extensions                         | [ɮ] | 026E | latin small letter lezh                           | voiced alveolar lateral fricative                   |
            | IPA Extensions                         | [ɯ] | 026F | latin small letter turned m                       | close back unrounded                                |
            | IPA Extensions                         | [ɰ] | 0270 | latin small letter turned m with long leg         | voiced velar approximant                            |
            | IPA Extensions                         | [ɱ] | 0271 | latin small letter m with hook                    | voiced labiodental nasal                            |
            | IPA Extensions                         | [ɲ] | 0272 | latin small letter n with left hook               | voiced palatal nasal                                |
            | IPA Extensions                         | [ɳ] | 0273 | latin small letter n with retroflex hook          | voiced retroflex nasal                              |
            | IPA Extensions                         | [ɴ] | 0274 | latin letter small capital n                      | voiced uvular nasal                                 |
            | IPA Extensions                         | [ɵ] | 0275 | latin small letter barred o                       | close-mid central rounded                           |
            | IPA Extensions                         | [ɶ] | 0276 | latin letter small capital oe                     | open front rounded                                  |
            | IPA Extensions                         | [ɸ] | 0278 | latin small letter phi                            | voiceless bilabial fricative                        |
            | IPA Extensions                         | [ɹ] | 0279 | latin small letter turned r                       | voiced alveolar approximant                         |
            | IPA Extensions                         | [ɺ] | 027A | latin small letter turned r with long leg         | voiced alveolar lateral flap                        |
            | IPA Extensions                         | [ɻ] | 027B | latin small letter turned r with hook             | voiced retroflex approximant                        |
            | IPA Extensions                         | [ɽ] | 027D | latin small letter r with tail                    | voiced retroflex tap                                |
            | IPA Extensions                         | [ɾ] | 027E | latin small letter r with fishhook                | voiced alveolar tap                                 |
            | IPA Extensions                         | [ʀ] | 0280 | latin letter small capital r                      | voiced uvular trill                                 |
            | IPA Extensions                         | [ʁ] | 0281 | latin letter small capital inverted r             | voiced uvular fricative                             |
            | IPA Extensions                         | [ʂ] | 0282 | latin small letter s with hook                    | voiceless retroflex fricative                       |
            | IPA Extensions                         | [ʃ] | 0283 | latin small letter esh                            | voiceless postalveolar fricative                    |
            | IPA Extensions                         | [ʄ] | 0284 | latin small letter dotless j with stroke and hook | voiced palatal implosive                            |
            | IPA Extensions                         | [ʈ] | 0288 | latin small letter t with retroflex hook          | voiceless retroflex plosive                         |
            | IPA Extensions                         | [ʉ] | 0289 | latin small letter u bar                          | close central rounded                               |
            | IPA Extensions                         | [ʊ] | 028A | latin small letter upsilon                        | lax close back rounded                              |
            | IPA Extensions                         | [ʋ] | 028B | latin small letter v with hook                    | voiced labiodental approximant                      |
            | IPA Extensions                         | [ʌ] | 028C | latin small letter turned v                       | open-mid back unrounded                             |
            | IPA Extensions                         | [ʍ] | 028D | latin small letter turned w                       | voiceless labial-velar fricative                    |
            | IPA Extensions                         | [ʎ] | 028E | latin small letter turned y                       | voiced palatal lateral approximant                  |
            | IPA Extensions                         | [ʏ] | 028F | latin letter small capital y                      | lax close front rounded                             |
            | IPA Extensions                         | [ʐ] | 0290 | latin small letter z with retroflex hook          | voiced retroflex fricative                          |
            | IPA Extensions                         | [ʑ] | 0291 | latin small letter z with curl                    | voiced alveolo-palatal fricative                    |
            | IPA Extensions                         | [ʒ] | 0292 | latin small letter ezh                            | voiced postalveolar fricative                       |
            | IPA Extensions                         | [ʔ] | 0294 | latin letter glottal stop                         | voiceless glottal plosive                           |
            | IPA Extensions                         | [ʕ] | 0295 | latin letter pharyngeal                           | voiced fricative voiced pharyngeal fricative        |
            | IPA Extensions                         | [ʘ] | 0298 | latin letter bilabial click                       | voiceless bilabial click                            |
            | IPA Extensions                         | [ʙ] | 0299 | latin letter small capital b                      | voiced bilabial trill                               |
            | IPA Extensions                         | [ʛ] | 029B | latin letter small capital g with hook            | voiced uvular implosive                             |
            | IPA Extensions                         | [ʜ] | 029C | latin letter small capital h                      | voiceless epiglottal fricative                      |
            | IPA Extensions                         | [ʝ] | 029D | latin small letter j with crossed tail            | voiced palatal fricative                            |
            | IPA Extensions                         | [ʟ] | 029F | latin letter small capital l                      | voiced velar lateral approximant                    |
            | IPA Extensions                         | [ʡ] | 02A1 | latin letter glottal stop with stroke             | epiglottal plosive                                  |
            | IPA Extensions                         | [ʢ] | 02A2 | latin letter reversed glottal stop with stroke    | voiced epiglottal fricative                         |
            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            | Spacing Modifier Letters               | [ʰ] | 02B0 | modifier letter small h                           | aspirated                                           |
            | Spacing Modifier Letters               | [ʱ] | 02B1 | modifier letter small h with hook                 | voiced pre-aspirated                                |
            | Spacing Modifier Letters               | [ʲ] | 02B2 | modifier letter small j                           | palatalized                                         |
            | Spacing Modifier Letters               | [ʷ] | 02B7 | modifier letter small w                           | labialized                                          |
            | Spacing Modifier Letters               | [ʼ] | 02BC | modifier letter apostrophe                        | ejective                                            |
            | Spacing Modifier Letters               | [ˀ] | 02C0 | modifier letter glottal stop                      | glottalized                                         |
            | Spacing Modifier Letters               | [ˈ] | 02C8 | modifier letter vertical line                     | primary stress                                      |
            | Spacing Modifier Letters               | [ˌ] | 02CC | modifier letter low vertical line                 | secondary stress                                    |
            | Spacing Modifier Letters               | [ː] | 02D0 | modifier letter triangular colon                  | long                                                |
            | Spacing Modifier Letters               | [ˑ] | 02D1 | modifier letter half triangular colon             | half-long                                           |
            | Spacing Modifier Letters               | [˞] | 02DE | modifier letter rhotic hook                        | rhotacized                                          |
            | Spacing Modifier Letters               | [ˠ] | 02E0 | modifier letter small gamma                       | velarized                                           |
            | Spacing Modifier Letters               | [ˡ] | 02E1 | modifier letter small l                           | lateral release                                     |
            | Spacing Modifier Letters               | [ˤ] | 02E4 | modifier letter small reversed glottal stop       | pharyngealized                                      |
            | Spacing Modifier Letters                | [˥] | 02E5 | modifier letter extra-high tone bar              | extra high tone                                     |
            | Spacing Modifier Letters                | [˦] | 02E6 | modifier letter high tone bar                    | high tone                                           |
            | Spacing Modifier Letters                | [˧] | 02E7 | modifier letter mid tone bar                     | mid tone                                            |
            | Spacing Modifier Letters                | [˨] | 02E8 | modifier letter low tone bar                     | low tone                                            |
            | Spacing Modifier Letters                | [˩] | 02E9 | modifier letter extra-low tone bar               | extra low tone                                      |
            •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
            

            End of the list on next post

            1 Reply Last reply Reply Quote 2
            • guy038G
              guy038
              last edited by guy038

              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              |             Unicode Block              | Chr | Code |                   Unicode Name                    |                      IPA Name                       |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Combining Diacritical Marks            | [ò] | 0300 | combining grave accent                            | low tone                                            |
              | Combining Diacritical Marks            | [ó] | 0301 | combining acute accent                            | high tone                                           |
              | Combining Diacritical Marks            | [ô] | 0302 | combining circumflex accent                       | falling                                             |
              | Combining Diacritical Marks            | [õ] | 0303 | combining tilde                                   | nasalized                                           |
              | Combining Diacritical Marks            | [ō] | 0304 | combining macron                                  | mid tone                                            |
              | Combining Diacritical Marks            | [ŏ] | 0306 | combining breve                                   | extra-short                                         |
              | Combining Diacritical Marks            | [ö] | 0308 | combining diaeresis                               | centralized                                         |
              | Combining Diacritical Marks            | [o̊] | 030A | combining ring above                              | voiceless (above)                                   |
              | Combining Diacritical Marks            | [ő] | 030B | combining double acute accent                     | extra high tone                                     |
              | Combining Diacritical Marks            | [ǒ] | 030C | combining caron                                   | rising                                              |
              | Combining Diacritical Marks            | [ȍ] | 030F | combining double grave accent                     | extra low tone                                      |
              | Combining Diacritical Marks            | [o̘] | 0318 | combining left tack below                         | advanced tongue root                                |
              | Combining Diacritical Marks            | [o̙] | 0319 | combining right tack below                        | retracted tongue root                               |
              | Combining Diacritical Marks            | [o̚] | 031A | combining left angle above                        | no audible release                                  |
              | Combining Diacritical Marks            | [o̜] | 031C | combining left half ring below                    | less rounded                                        |
              | Combining Diacritical Marks            | [o̝] | 031D | combining up tack below                           | raised                                              |
              | Combining Diacritical Marks            | [o̞] | 031E | combining down tack below                         | lowered                                             |
              | Combining Diacritical Marks            | [o̟] | 031F | combining plus sign below                         | advanced                                            |
              | Combining Diacritical Marks            | [o̠] | 0320 | combining minus sign below                        | retracted                                           |
              | Combining Diacritical Marks            | [o̤] | 0324 | combining diaeresis below                         | breathy voiced                                      |
              | Combining Diacritical Marks            | [o̥] | 0325 | combining ring below                              | voiceless                                           |
              | Combining Diacritical Marks            | [o̩] | 0329 | combining vertical line below                     | syllabic                                            |
              | Combining Diacritical Marks            | [o̪] | 032A | combining bridge below                            | dental                                              |
              | Combining Diacritical Marks            | [o̬] | 032C | combining caron below                             | voiced                                              |
              | Combining Diacritical Marks            | [o̮] | 032E | combining breve below                             | derhoticized                                        |
              | Combining Diacritical Marks            | [o̯] | 032F | combining inverted breve below                    | non-syllabic                                        |
              | Combining Diacritical Marks            | [o̰] | 0330 | combining tilde below                             | creaky voiced                                       |
              | Combining Diacritical Marks            | [o̴] | 0334 | combining tilde overlay                           | velarized or pharyngealized                         |
              | Combining Diacritical Marks            | [o̹] | 0339 | combining right half ring below                   | more rounded                                        |
              | Combining Diacritical Marks            | [o̺] | 033A | combining inverted bridge below                   | apical                                              |
              | Combining Diacritical Marks            | [o̻] | 033B | combining sqare below                             | aminal                                              |
              | Combining Diacritical Marks            | [o̼] | 033C | combining seagull below                           | linguolabial                                        |
              | Combining Diacritical Marks            | [o̽] | 033D | combining x above                                 | mid-centralized                                     |
              | Combining Diacritical Marks            | [o͇] | 0347 | combining equals sign below                       | non-sibilant                                        |
              | Combining Diacritical Marks            | [o͈] | 0348 | combining double vertical line below              | fortis                                              |
              | Combining Diacritical Marks            | [o͉] | 0349 | combining left angle below                        | lenis                                               |
              | Combining Diacritical Marks            | [o͓] | 0353 | combining x below                                 | frictionalized                                      |
              | Combining Diacritical Marks            | [o͜] | 035C | combining double breve below                      | tie bar (below)                                     |
              | Combining Diacritical Marks            | [o͡] | 0361 | combining double inverted breve                   | tie bar                                             |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Greek and Coptic                       | [β] | 03B2 | greek small letter beta                           | voiced bilabial fricative                           |
              | Greek and Coptic                       | [θ] | 03B8 | greek small letter theta                          | voiceless dental fricative                          |
              | Greek and Coptic                       | [χ] | 03C7 | greek small letter chi                            | voiceless uvular fricative                          |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Phonetic Extensions                    | [ᴱ] | 1D31 | modifier letter capital e                         | epilaryngeal phonation                              |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Phonetic Extensions Supplement         | [ᶑ] | 1D91 | latin small letter d with hook and tail           | voiced retroflex implosive                          |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Combining Diacritical Marks Supplement | [o᷄] | 1DC4 | combining macron-acute                            | high rising                                         |
              | Combining Diacritical Marks Supplement | [o᷅] | 1DC5 | combining grave-macron                            | low rising                                          |
              | Combining Diacritical Marks Supplement | [o᷆] | 1DC6 | combining macron-grave                            | low falling                                         |
              | Combining Diacritical Marks Supplement | [o᷇] | 1DC7 | combining acute-macron                            | high falling                                        |
              | Combining Diacritical Marks Supplement | [o᷈] | 1DC8 | combining grave-acute-grave                       | rising-falling                                      |
              | Combining Diacritical Marks Supplement | [o᷉] | 1DC9 | combining acute-grave-acute                       | falling-rising                                      |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | General Punctuation                    | [‖] | 2016 | double vertical line                              | major group break (intonation)                      |
              | General Punctuation                    | [‼] | 203C | double exclamation mark                           | retroflex click                                     |
              | General Punctuation                    | [‿] | 203F | undertie                                          | linking (absence of a break)                        |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Superscripts and Subscripts            | [ⁿ] | 207F | superscript latin small letter n                  | nasal release                                       |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Arrows                                 | [↑] | 2191 | upwards arrow                                     | global rise                                         |
              | Arrows                                 | [↓] | 2193 | downwards arrow                                   | global fall                                         |
              | Arrows                                 | [↗] | 2197 | north east arrow                                  | global rise                                         |
              | Arrows                                 | [↘] | 2198 | south east arrow                                  | global fall                                         |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              | Latin Extended-C                       | [ⱱ] | 2C71 | latin small letter v with right hook              | voiced labiodental tap                              |
              •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
              

              • Gather this post with the two last posts in a single .txt file

              • Then, if you apply the general IPA regex against this composite file, you should get, as expected, 185 matches !

              Best Regards,

              guy038

              1 Reply Last reply Reply Quote 2
              • mkupperM
                mkupper @guy038
                last edited by

                @guy038 said in How to easily manage IPA symbols...:

                I suppose that very few people will be interested in the International Phonetic Alphabet ( known as IPA )

                Notepad++ walks into a bar and orders an IPA. @guy038 is tending the bar and so pours a glass of wine while chatting about IPA and Unicode.

                1 Reply Last reply Reply Quote 6
                • Alan KilbornA
                  Alan Kilborn
                  last edited by Alan Kilborn

                  @mkupper said:

                  …so pours a glass of wine while chatting about IPA…

                  @guy038 has always been really good at answering the question that wasn’t asked. It’s not a bad thing, as usually an increase in general knowledge results. :-)

                  1 Reply Last reply Reply Quote 3
                  • First post
                    Last post
                  The Community of users of the Notepad++ text editor.
                  Powered by NodeBB | Contributors