How to easily manage IPA symbols...
-
Hello, All,
I suppose that very few people will be interested in the International Phonetic Alphabet (known as IPA)! However, in this topic:
- I’ll show some basic elements of the IPA writing system.
- I’ll propose a monospaced font, from Clement Pit-Claudel, that supports all the IPA symbols and plenty of other symbols, as well as the Latin, Greek and Cyrillic characters.
- I’ll describe a general regex to easily find any IPA symbol in text files.
NOTE: The GitHub site of Clement Pit-Claudel is https://github.com/cpitclaudel/monospacifier
IMPORTANT: For that font to be displayed correctly as monospaced, you’ll need to use only the GDI (most compatible) option of the Direct Write setting!
- For general information, refer to https://en.wikipedia.org/wiki/International_Phonetic_Alphabet
- Regarding English/American IPA, see https://en.wikipedia.org/wiki/Help:IPA/English
- Regarding French IPA, see https://fr.wikipedia.org/wiki/Aide:Alphabet_phonétique_français
In order to know the EN/US phonetic transcription of a word, I used:
https://dictionary.cambridge.org/browse/pronunciation/english/
In order to know the EN/US phonemic transcription of a word or phrase, I used:
Finally, if you need to type phonetic symbols, you may use these two sites:
https://www.internationalphoneticassociation.org/IPAcharts/inter_chart_2018/IPA_2018.html
As usual, these references are just a simple selection, and many other sites may give interesting info, too!
Unicode contains 3 blocks devoted to the IPA, which are:
- The IPA Extensions block, containing 96 characters, between U+0250 and U+02AF
- The Phonetic Extensions block, containing 128 characters, between U+1D00 and U+1D7F
- The Phonetic Extensions Supplement block, containing 64 characters, between U+1D80 and U+1DBF
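As a quick sanity check on these figures, here is a minimal Python sketch that derives each block's size from its first and last code points. The names `IPA_BLOCKS` and `block_size` are only illustrative choices; the ranges are the ones listed above.

```python
# Inclusive code-point ranges of the three IPA-related Unicode blocks listed above.
IPA_BLOCKS = {
    "IPA Extensions": (0x0250, 0x02AF),
    "Phonetic Extensions": (0x1D00, 0x1D7F),
    "Phonetic Extensions Supplement": (0x1D80, 0x1DBF),
}


def block_size(first: int, last: int) -> int:
    """Number of code points in an inclusive range."""
    return last - first + 1


sizes = {name: block_size(*bounds) for name, bounds in IPA_BLOCKS.items()}
total = sum(sizes.values())

for name, size in sizes.items():
    print(f"{name}: {size}")
print(f"Total: {total}")
```

The total is the number of characters that the complete-block ranges contribute to the general regex described later in this post.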
Unfortunately, many other characters used by the IPA writing system are dispersed across several Unicode character blocks! The other blocks involved, in some way, in IPA are:
- Basic Latin
- Latin-1 Supplement
- Latin Extended-A
- Latin Extended-B
- Spacing Modifier Letters
- Combining Diacritical Marks
- Greek and Coptic
- Combining Diacritical Marks Supplement
- General Punctuation
- Superscripts and Subscripts
- Arrows
- Latin Extended-C
Thus, it is rather difficult to find a monospaced font able to display all these IPA symbols. I succeeded in finding the serif font Symbola_Monospacified_for_Liberation_Mono.ttf, containing 9,622 characters and 9,827 glyphs.
However, this font can also be used to display any common character or symbol!
Once this font is downloaded to your desktop:
- Select and copy this font (Ctrl + C)
- Move to the folder C:\Windows\fonts
- Paste this font (Ctrl + V)
- Open Notepad++
- Select Settings > Style Configurator... > Global Styles > Default Style
- Choose the font Symbola Monospacified for...
- Click on the Save & Close button
REMARK: Note that my Windows 10 system automatically redirected this font to the folder C:\Users\Guy\AppData\Local\Microsoft\Windows\Fonts (and not to C:\Windows\fonts)
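Since the font may land in either folder, here is a small Python sketch that builds both candidate paths and reports which one exists. The helper name `candidate_font_paths` is hypothetical, and the fallback folder values are assumptions taken from the remark above; on a real system the `WINDIR` and `LOCALAPPDATA` environment variables are used instead.

```python
import os
from pathlib import PureWindowsPath

FONT_FILE = "Symbola_Monospacified_for_Liberation_Mono.ttf"


def candidate_font_paths(windir: str, local_appdata: str) -> list[PureWindowsPath]:
    """Return the system-wide and per-user locations where the font may be installed."""
    return [
        PureWindowsPath(windir) / "Fonts" / FONT_FILE,
        PureWindowsPath(local_appdata) / "Microsoft" / "Windows" / "Fonts" / FONT_FILE,
    ]


if __name__ == "__main__":
    # The fallback values are assumptions for illustration only.
    paths = candidate_font_paths(
        os.environ.get("WINDIR", r"C:\Windows"),
        os.environ.get("LOCALAPPDATA", r"C:\Users\Guy\AppData\Local"),
    )
    for path in paths:
        status = "found" if os.path.exists(str(path)) else "missing"
        print(f"{status}: {path}")
```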
As explained in this section:
https://en.wikipedia.org/wiki/International_Phonetic_Alphabet#Brackets_and_transcription_delimiters
The general regex proposed below searches for any range of IPA symbols between either:
- Square brackets, [ ], representing the phonetic notation, the primary function of the IPA
- Slashes, / /, representing the phonemic notation, with distinctive features only
However, as a lot of uncommon IPA symbols may occur in a phonetic transcription, I decided to include, in my regex, the complete IPA blocks, so a total of 288 characters for the 3 blocks noted above.
Note also that the opening delimiter must be preceded, and the closing delimiter followed, by a \s character (so any kind of horizontal or vertical space char). In other words, this regex should avoid any normal text and match only IPA transcriptions!
The general regex, to find any range of IPA symbols between the [ ] delimiters OR the / / delimiters, is:
(?-i)(?<=(?<=\s)(\[)|(?<=\s)/)[\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}\x{00E6}\x{00E7}\x{00F0}\x{00F8}\x{0127}\x{014B}\x{0153}\x{01C0}\x{01C1}\x{01C2}\x{01C3}\x{0250}-\x{02AF}\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}\x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}\x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}\x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}\x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}\x{03B2}\x{03B8}\x{03C7}\x{1D00}-\x{1D7F}\x{1D80}-\x{1DBF}\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}\x{2016}\x{203C}\x{203F}\x{207F}\x{2191}\x{2193}\x{2197}\x{2198}\x{2C71}]+(?=(?(1)\](?=\s)|/(?=\s)))
It may also be expressed as below, to distinguish the different chars or blocks:
(?x) (?-i)(?<=(?<=\s)(\[)|(?<=\s)/) (?:
[\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}]                            |  #  Basic Latin                               29
[\x{00E6}\x{00E7}\x{00F0}\x{00F8}]                                     |  #  Latin-1 Supplement                         4
[\x{0127}\x{014B}\x{0153}]                                             |  #  Latin Extended-A                           3
[\x{01C0}\x{01C1}\x{01C2}\x{01C3}]                                     |  #  Latin Extended-B                           4
[\x{0250}-\x{02AF}]                                                    |  #  IPA Extensions                            96
[\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}
 \x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}]            |  #  Spacing Modifier Letters                  19
[\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}
 \x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}
 \x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}
 \x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}]                     |  #  Combining Diacritical Marks               39
[\x{03B2}\x{03B8}\x{03C7}]                                             |  #  Greek and Coptic                           3
[\x{1D00}-\x{1D7F}]                                                    |  #  Phonetic Extensions                      128
[\x{1D80}-\x{1DBF}]                                                    |  #  Phonetic Extensions Supplement            64
[\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}]                     |  #  Combining Diacritical Marks Supplement     6
[\x{2016}\x{203C}\x{203F}]                                             |  #  General Punctuation                        3
\x{207F}                                                               |  #  Superscripts and Subscripts                1
[\x{2191}\x{2193}\x{2197}\x{2198}]                                     |  #  Arrows                                     4
\x{2C71}                                                                  #  Latin Extended-C                           1
)+ (?=(?(1)\](?=\s)|/(?=\s)))
Some examples of the IPA transcriptions of English/American words, from the two sites I used:
IPA /ˌækəˈdɛmɪk/ for English / American word academic
IPA [ˌæk.əˈdem.] for English / American word academic
IPA /əˈbaʊt/ for English / American word about
IPA [əˈbaʊt] for English / American word about
IPA /ˈfæmɪʃ/ /ˈfeɪmɪʃ/ for English / American word famish
IPA [ˈfæm.ɪʃ] for English / American word famish
IPA /ˌriːˈsaɪkᵊl/ /ˌriˈsaɪkᵊl/ for English / American word recycle
IPA [ˌriːˈsaɪ.kᵊl] for English / American word recycle
IPA /ˌdʒɛnəˈræləti/ for English / American word generality
IPA [ˌdʒen.əˈræl.ə.ti] [ˌdʒen.əˈræl.ə.t̬i] for English / American word generality
IPA /ˌɪntəˈnæʃᵊnᵊl/ /ˌɪntərˈnæʃənəl/ for English / American word international
IPA [ˌɪn.təˈnæʃ.ᵊn.ᵊl] [ˌɪnt̬ɚˈnæʃ.ᵊn.ᵊl] for English / American word international
IPA /ˈmʌðə/ /ˈmʌðər/ for English / American word mother
IPA [ˈmʌð.ər] [ˈmʌð.ɚ] for English / American word mother
IPA /ədˈvɜːtɪsmənt/ /ˌædvərˈtaizmənt/ for English / American word advertisement
IPA [ədˈvɜː.tɪs.mənt] [ˌæd.ˈvɚˈtaiz.mənt] for English / American word advertisement
If you run the general IPA regex against this post, you should get 25 matches. Try the Mark feature!
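For readers who want to try the same search outside Notepad++, here is a rough Python sketch. It is an adaptation, not the regex above verbatim: Python's `re` module writes code points as `\uHHHH` rather than `\x{HHHH}`, and instead of the conditional `(?(1)...)` I assume a simpler form with one alternative per delimiter pair. The names `BOOST_CLASS`, `PY_CLASS`, `IPA_RE` and `find_ipa` are illustrative only.

```python
import re

# Character class body copied from the regex above, in \x{HHHH} notation.
BOOST_CLASS = (
    r"\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}"
    r"\x{00E6}\x{00E7}\x{00F0}\x{00F8}"
    r"\x{0127}\x{014B}\x{0153}"
    r"\x{01C0}\x{01C1}\x{01C2}\x{01C3}"
    r"\x{0250}-\x{02AF}"
    r"\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}"
    r"\x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}"
    r"\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}"
    r"\x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}"
    r"\x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}"
    r"\x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}"
    r"\x{03B2}\x{03B8}\x{03C7}"
    r"\x{1D00}-\x{1D7F}\x{1D80}-\x{1DBF}"
    r"\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}"
    r"\x{2016}\x{203C}\x{203F}\x{207F}"
    r"\x{2191}\x{2193}\x{2197}\x{2198}\x{2C71}"
)

# Convert each \x{HHHH} escape to Python's \uHHHH form and wrap it in brackets.
PY_CLASS = "[" + re.sub(r"\\x\{([0-9A-F]{4})\}", r"\\u\1", BOOST_CLASS) + "]"

# One alternative per delimiter pair, so that "[" can only close with "]"
# and "/" can only close with "/". Both lookbehinds have a fixed width of 2.
IPA_RE = re.compile(
    rf"(?<=\s\[){PY_CLASS}+(?=\]\s)|(?<=\s/){PY_CLASS}+(?=/\s)"
)


def find_ipa(text: str) -> list[str]:
    """Return every IPA run found between whitespace-delimited [ ] or / / pairs."""
    return IPA_RE.findall(text)


sample = (
    "IPA /əˈbaʊt/ for English / American word about\n"
    "IPA [əˈbaʊt] for English / American word about\n"
)
print(find_ipa(sample))  # ['əˈbaʊt', 'əˈbaʊt']
```

Note that the lone ` / ` between "English" and "American" is not reported, because the run after it is not closed by a second slash followed by whitespace.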
In the next post, I’ll list all the Unicode blocks correctly displayed by the Symbola_Monospacified_for_Liberation_Mono.ttf font
Best Regards,
guy038
-
-
Hi, all,
Here is the list of all the Unicode blocks covered by the Symbola_Monospacified_for_Liberation_Mono.ttf font:
•-------------------------------------------•-----------------•---------•----------•
| Unicode 16.0 Blocks                       | Range           |   Chars |    Total |
•-------------------------------------------•-----------------•---------•----------•
| Basic Latin                               | 0000 - 007F     |      97 |      128 |
| Latin-1 Supplement                        | 0080 - 00FF     |      96 |      128 |
| Latin Extended-A                          | 0100 - 017F     |     128 |      128 |
| Latin Extended-B                          | 0180 - 024F     |     208 |      208 |
| IPA Extensions                            | 0250 - 02AF     |      96 |       96 |
| Spacing Modifier Letters                  | 02B0 - 02FF     |      80 |       80 |
| Combining Diacritical Marks               | 0300 - 036F     |     112 |      112 |
| Greek and Coptic                          | 0370 - 03FF     |     135 |      135 |
| Cyrillic                                  | 0400 - 04FF     |     256 |      256 |
| Cyrillic Supplement                       | 0500 - 052F     |      48 |       48 |
•-------------------------------------------•-----------------•---------•----------•--------------
| Arabic                                    | 0600 - 06FF     |      10 |      255 | Incomplete
•-------------------------------------------•-----------------•---------•----------•--------------
| Combining Diacritical Marks Extended      | 1AB0 - 1AFF     |      15 |       15 |
| Cyrillic Extended-C                       | 1C80 - 1C8F     |       9 |        9 |
| Phonetic Extensions                       | 1D00 - 1D7F     |     128 |      128 |
| Phonetic Extensions Supplement            | 1D80 - 1DBF     |      64 |       64 |
| Combining Diacritical Marks Supplement    | 1DC0 - 1DFF     |      63 |       63 |
| Latin Extended Additional                 | 1E00 - 1EFF     |     256 |      256 |
| Greek Extended                            | 1F00 - 1FFF     |     233 |      233 |
| General Punctuation                       | 2000 - 206F     |     111 |      111 |
| Superscripts and Subscripts               | 2070 - 209F     |      42 |       42 |
| Currency Symbols                          | 20A0 - 20CF     |      32 |       32 |
| Combining Diacritical Marks for Symbols   | 20D0 - 20FF     |      33 |       33 |
| Letterlike Symbols                        | 2100 - 214F     |      80 |       80 |
| Number Forms                              | 2150 - 218F     |      60 |       60 |
| Arrows                                    | 2190 - 21FF     |     112 |      112 |
| Mathematical Operators                    | 2200 - 22FF     |     256 |      256 |
| Miscellaneous Technical                   | 2300 - 23FF     |     256 |      256 |
| Control Pictures                          | 2400 - 243F     |      39 |       39 |
| Optical Character Recognition             | 2440 - 245F     |      11 |       11 |
| Enclosed Alphanumerics                    | 2460 - 24FF     |     160 |      160 |
| Box Drawing                               | 2500 - 257F     |     128 |      128 |
| Block Elements                            | 2580 - 259F     |      32 |       32 |
| Geometric Shapes                          | 25A0 - 25FF     |      96 |       96 |
| Miscellaneous Symbols                     | 2600 - 26FF     |     256 |      256 |
| Dingbats                                  | 2700 - 27BF     |     192 |      192 |
| Miscellaneous Mathematical Symbols-A      | 27C0 - 27EF     |      48 |       48 |
| Supplemental Arrows-A                     | 27F0 - 27FF     |      16 |       16 |
| Braille Patterns                          | 2800 - 28FF     |     256 |      256 |
| Supplemental Arrows-B                     | 2900 - 297F     |     128 |      128 |
| Miscellaneous Mathematical Symbols-B      | 2980 - 29FF     |     128 |      128 |
| Supplemental Mathematical Operators       | 2A00 - 2AFF     |     256 |      256 |
•-------------------------------------------•-----------------•---------•----------•--------------
| Miscellaneous Symbols and Arrows          | 2B00 - 2BFF     |     207 |      250 | Incomplete
•-------------------------------------------•-----------------•---------•----------•--------------
| Latin Extended-C                          | 2C60 - 2C7F     |      32 |       32 |
| Coptic                                    | 2C80 - 2CFF     |     123 |      123 |
| Cyrillic Extended-A                       | 2DE0 - 2DFF     |      32 |       32 |
•-------------------------------------------•-----------------•---------•----------•--------------
| Supplemental Punctuation                  | 2E00 - 2E7F     |      74 |       79 | Incomplete
| Hiragana                                  | 3040 - 309F     |       1 |       93 | Incomplete
| Enclosed CJK Letters and Months           | 3200 - 32FF     |       2 |      254 | Incomplete
| CJK Unified Ideographs                    | 4E00 - 9FFF     |       1 |   20,976 | Incomplete
•-------------------------------------------•-----------------•---------•----------•--------------
| Yijing Hexagram Symbols                   | 4DC0 - 4DFF     |      64 |       64 |
| Cyrillic Extended-B                       | A640 - A69F     |      96 |       96 |
•-------------------------------------------•-----------------•---------•----------•--------------
| Latin Extended-D                          | A720 - A7FF     |     160 |      163 | Incomplete
•-------------------------------------------•-----------------•---------•----------•--------------
| Latin Extended-E                          | AB30 - AB6F     |      54 |       54 |
•-------------------------------------------•-----------------•---------•----------•--------------
| Alphabetic Presentation Forms             | FB00 - FB4F     |       7 |       58 | Incomplete
•-------------------------------------------•-----------------•---------•----------•--------------
| Variation Selectors                       | FE00 - FE0F     |      16 |       16 |
| Combining Half Marks                      | FE20 - FE2F     |      16 |       16 |
| Specials                                  | FFF0 - FFFF     |       5 |        5 |
•-------------------------------------------•-----------------•---------•----------•
| Aegean Numbers                            | 10100 - 1013F   |      57 |       57 |
| Ancient Greek Numbers                     | 10140 - 1018F   |      79 |       79 |
| Ancient Symbols                           | 10190 - 101CF   |      13 |       13 |
| Phaistos Disc                             | 101D0 - 101FF   |      46 |       46 |
| Coptic Epact Numbers                      | 102E0 - 102FF   |      28 |       28 |
| Byzantine Musical Symbols                 | 1D000 - 1D0FF   |     246 |      246 |
| Musical Symbols                           | 1D100 - 1D1FF   |     231 |      231 |
| Ancient Greek Musical Notation            | 1D200 - 1D24F   |      70 |       70 |
| Tai Xuan Jing Symbols                     | 1D300 - 1D35F   |      87 |       87 |
| Counting Rod Numerals                     | 1D360 - 1D37F   |      18 |       25 |
| Mathematical Alphanumeric Symbols         | 1D400 - 1D7FF   |     996 |      996 |
| Mahjong Tiles                             | 1F000 - 1F02F   |      44 |       44 |
| Domino Tiles                              | 1F030 - 1F09F   |     100 |      100 |
| Playing Cards                             | 1F0A0 - 1F0FF   |      82 |       82 |
| Enclosed Alphanumeric Supplement          | 1F100 - 1F1FF   |     191 |      192 |
| Enclosed Ideographic Supplement           | 1F200 - 1F2FF   |      64 |       64 |
| Miscellaneous Symbols and Pictographs     | 1F300 - 1F5FF   |     768 |      768 |
| Emoticons                                 | 1F600 - 1F64F   |      80 |       80 |
| Ornamental Dingbats                       | 1F650 - 1F67F   |      48 |       48 |
| Transport and Map Symbols                 | 1F680 - 1F6FF   |     107 |      108 |
| Alchemical Symbols                        | 1F700 - 1F77F   |     116 |      116 |
| Geometric Shapes Extended                 | 1F780 - 1F7FF   |      85 |       89 |
| Supplemental Arrows-C                     | 1F800 - 1F8FF   |     148 |      148 |
| Supplemental Symbols and Pictographs      | 1F900 - 1F9FF   |     148 |      213 |
•-------------------------------------------•-----------------•---------•----------•--------------
| Supplementary Private Use Area-A          | F0000 - FFFFF   |     118 |   65,534 | Incomplete
•-------------------------------------------•-----------------•---------•----------•--------------
| Total                                     |                 |   9,622 |          |
•-------------------------------------------•-----------------•---------•----------•
In the next three posts, I’ll describe the complete list of IPA symbols, sorted by Unicode code-point
Best Regards,
guy038
-
Hello, All,
Here is the list of the 185 common IPA characters, sorted by Unicode code-point:
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Unicode Block                          | Chr | Code | Unicode Name                                      | IPA Name                                            |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Basic Latin                            | [ ] | 0020 | space                                             | word break                                          |
| Basic Latin                            | [.] | 002E | full stop                                         | syllable break                                      |
| Basic Latin                            | [a] | 0061 | latin small letter a                              | open front unrounded                                |
| Basic Latin                            | [b] | 0062 | latin small letter b                              | voiced bilabial plosive                             |
| Basic Latin                            | [c] | 0063 | latin small letter c                              | voiceless palatal plosive                           |
| Basic Latin                            | [d] | 0064 | latin small letter d                              | voiced alveolar plosive                             |
| Basic Latin                            | [e] | 0065 | latin small letter e                              | close-mid front unrounded                           |
| Basic Latin                            | [f] | 0066 | latin small letter f                              | voiceless labiodental fricative                     |
| Basic Latin                            | [g] | 0067 | latin small letter g                              | voiced velar plosive                                |
| Basic Latin                            | [h] | 0068 | latin small letter h                              | voiceless glottal fricative                         |
| Basic Latin                            | [i] | 0069 | latin small letter i                              | close front unrounded                               |
| Basic Latin                            | [j] | 006A | latin small letter j                              | voiced palatal approximant                          |
| Basic Latin                            | [k] | 006B | latin small letter k                              | voiceless velar plosive                             |
| Basic Latin                            | [l] | 006C | latin small letter l                              | voiced alveolar lateral approximant                 |
| Basic Latin                            | [m] | 006D | latin small letter m                              | voiced bilabial nasal                               |
| Basic Latin                            | [n] | 006E | latin small letter n                              | voiced alveolar nasal                               |
| Basic Latin                            | [o] | 006F | latin small letter o                              | close-mid back rounded                              |
| Basic Latin                            | [p] | 0070 | latin small letter p                              | voiceless bilabial plosive                          |
| Basic Latin                            | [q] | 0071 | latin small letter q                              | voiceless uvular plosive                            |
| Basic Latin                            | [r] | 0072 | latin small letter r                              | voiced alveolar trill                               |
| Basic Latin                            | [s] | 0073 | latin small letter s                              | voiceless alveolar fricative                        |
| Basic Latin                            | [t] | 0074 | latin small letter t                              | voiceless alveolar plosive                          |
| Basic Latin                            | [u] | 0075 | latin small letter u                              | close back rounded                                  |
| Basic Latin                            | [v] | 0076 | latin small letter v                              | voiced labiodental fricative                        |
| Basic Latin                            | [w] | 0077 | latin small letter w                              | voiced labial-velar approximant                     |
| Basic Latin                            | [x] | 0078 | latin small letter x                              | voiceless velar fricative                           |
| Basic Latin                            | [y] | 0079 | latin small letter y                              | close front rounded                                 |
| Basic Latin                            | [z] | 007A | latin small letter z                              | voiced alveolar fricative                           |
| Basic Latin                            | [|] | 007C | vertical line                                     | minor group break (foot)                            |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Latin-1 Supplement                     | [æ] | 00E6 | latin small letter ae                             | raised open front unrounded                         |
| Latin-1 Supplement                     | [ç] | 00E7 | latin small letter c with cedilla                 | voiceless palatal fricative                         |
| Latin-1 Supplement                     | [ð] | 00F0 | latin small letter eth                            | voiced dental fricative                             |
| Latin-1 Supplement                     | [ø] | 00F8 | latin small letter o with stroke                  | close-mid front rounded                             |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Latin Extended-A                       | [ħ] | 0127 | latin small letter h with stroke                  | voiceless pharyngeal fricative                      |
| Latin Extended-A                       | [ŋ] | 014B | latin small letter eng                            | voiced velar nasal                                  |
| Latin Extended-A                       | [œ] | 0153 | latin small ligature oe                           | open-mid front rounded                              |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Latin Extended-B                       | [ǀ] | 01C0 | latin letter dental click                         | voiceless dental click                              |
| Latin Extended-B                       | [ǁ] | 01C1 | latin letter lateral click                        | voiceless alveolar lateral click                    |
| Latin Extended-B                       | [ǂ] | 01C2 | latin letter alveolar click                       | voiceless palatoalveolar click                      |
| Latin Extended-B                       | [ǃ] | 01C3 | latin letter retroflex click                      | voiceless (post)alveolar click                      |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
Continuation of the list on next post
-
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Unicode Block                          | Chr | Code | Unicode Name                                      | IPA Name                                            |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| IPA Extensions                         | [ɐ] | 0250 | latin small letter turned a                       | lowered schwa                                       |
| IPA Extensions                         | [ɑ] | 0251 | latin small letter alpha                          | open back unrounded                                 |
| IPA Extensions                         | [ɒ] | 0252 | latin small letter turned alpha                   | open back rounded                                   |
| IPA Extensions                         | [ɓ] | 0253 | latin small letter b with hook                    | voiced bilabial implosive                           |
| IPA Extensions                         | [ɔ] | 0254 | latin small letter open o                         | open-mid back rounded                               |
| IPA Extensions                         | [ɕ] | 0255 | latin small letter c with curl                    | voiceless alveolo-palatal fricative                 |
| IPA Extensions                         | [ɖ] | 0256 | latin small letter d with tail                    | voiced retroflex plosive                            |
| IPA Extensions                         | [ɗ] | 0257 | latin small letter d with hook                    | voiced dental/alveolar implosive                    |
| IPA Extensions                         | [ɘ] | 0258 | latin small letter reversed e                     | close-mid central unrounded                         |
| IPA Extensions                         | [ə] | 0259 | latin small letter schwa                          | mid-central schwa                                   |
| IPA Extensions                         | [ɛ] | 025B | latin small letter open e                         | open-mid front unrounded                            |
| IPA Extensions                         | [ɜ] | 025C | latin small letter reversed open e                | open-mid central unrounded                          |
| IPA Extensions                         | [ɞ] | 025E | latin small letter closed reversed open e         | open-mid central rounded                            |
| IPA Extensions                         | [ɟ] | 025F | latin small letter dotless j with stroke          | voiced palatal plosive                              |
| IPA Extensions                         | [ɠ] | 0260 | latin small letter g with hook                    | voiced velar implosive                              |
| IPA Extensions                         | [ɡ] | 0261 | latin small letter script g                       | voiced velar plosive                                |
| IPA Extensions                         | [ɢ] | 0262 | latin letter small capital g                      | voiced uvular plosive                               |
| IPA Extensions                         | [ɣ] | 0263 | latin small letter gamma                          | voiced velar fricative                              |
| IPA Extensions                         | [ɤ] | 0264 | latin small letter rams horn                      | close-mid back unrounded                            |
| IPA Extensions                         | [ɥ] | 0265 | latin small letter turned h                       | voiced labial-palatal approximant                   |
| IPA Extensions                         | [ɦ] | 0266 | latin small letter h with hook                    | voiced glottal fricative                            |
| IPA Extensions                         | [ɧ] | 0267 | latin small letter heng with hook                 | simultaneous voiceless postalveolar+velar fricative |
| IPA Extensions                         | [ɨ] | 0268 | latin small letter i with stroke                  | close central unrounded                             |
| IPA Extensions                         | [ɪ] | 026A | latin letter small capital i                      | lax close front unrounded                           |
| IPA Extensions                         | [ɬ] | 026C | latin small letter l with belt                    | voiceless alveolar lateral fricative                |
| IPA Extensions                         | [ɭ] | 026D | latin small letter l with retroflex hook          | voiced retroflex lateral approximant                |
| IPA Extensions                         | [ɮ] | 026E | latin small letter lezh                           | voiced alveolar lateral fricative                   |
| IPA Extensions                         | [ɯ] | 026F | latin small letter turned m                       | close back unrounded                                |
| IPA Extensions                         | [ɰ] | 0270 | latin small letter turned m with long leg         | voiced velar approximant                            |
| IPA Extensions                         | [ɱ] | 0271 | latin small letter m with hook                    | voiced labiodental nasal                            |
| IPA Extensions                         | [ɲ] | 0272 | latin small letter n with left hook               | voiced palatal nasal                                |
| IPA Extensions                         | [ɳ] | 0273 | latin small letter n with retroflex hook          | voiced retroflex nasal                              |
| IPA Extensions                         | [ɴ] | 0274 | latin letter small capital n                      | voiced uvular nasal                                 |
| IPA Extensions                         | [ɵ] | 0275 | latin small letter barred o                       | close-mid central rounded                           |
| IPA Extensions                         | [ɶ] | 0276 | latin letter small capital oe                     | open front rounded                                  |
| IPA Extensions                         | [ɸ] | 0278 | latin small letter phi                            | voiceless bilabial fricative                        |
| IPA Extensions                         | [ɹ] | 0279 | latin small letter turned r                       | voiced alveolar approximant                         |
| IPA Extensions                         | [ɺ] | 027A | latin small letter turned r with long leg         | voiced alveolar lateral flap                        |
| IPA Extensions                         | [ɻ] | 027B | latin small letter turned r with hook             | voiced retroflex approximant                        |
| IPA Extensions                         | [ɽ] | 027D | latin small letter r with tail                    | voiced retroflex tap                                |
| IPA Extensions                         | [ɾ] | 027E | latin small letter r with fishhook                | voiced alveolar tap                                 |
| IPA Extensions                         | [ʀ] | 0280 | latin letter small capital r                      | voiced uvular trill                                 |
| IPA Extensions                         | [ʁ] | 0281 | latin letter small capital inverted r             | voiced uvular fricative                             |
| IPA Extensions                         | [ʂ] | 0282 | latin small letter s with hook                    | voiceless retroflex fricative                       |
| IPA Extensions                         | [ʃ] | 0283 | latin small letter esh                            | voiceless postalveolar fricative                    |
| IPA Extensions                         | [ʄ] | 0284 | latin small letter dotless j with stroke and hook | voiced palatal implosive                            |
| IPA Extensions                         | [ʈ] | 0288 | latin small letter t with retroflex hook          | voiceless retroflex plosive                         |
| IPA Extensions                         | [ʉ] | 0289 | latin small letter u bar                          | close central rounded                               |
| IPA Extensions                         | [ʊ] | 028A | latin small letter upsilon                        | lax close back rounded                              |
| IPA Extensions                         | [ʋ] | 028B | latin small letter v with hook                    | voiced labiodental approximant                      |
| IPA Extensions                         | [ʌ] | 028C | latin small letter turned v                       | open-mid back unrounded                             |
| IPA Extensions                         | [ʍ] | 028D | latin small letter turned w                       | voiceless labial-velar fricative                    |
| IPA Extensions                         | [ʎ] | 028E | latin small letter turned y                       | voiced palatal lateral approximant                  |
| IPA Extensions                         | [ʏ] | 028F | latin letter small capital y                      | lax close front rounded                             |
| IPA Extensions                         | [ʐ] | 0290 | latin small letter z with retroflex hook          | voiced retroflex fricative                          |
| IPA Extensions                         | [ʑ] | 0291 | latin small letter z with curl                    | voiced alveolo-palatal fricative                    |
| IPA Extensions                         | [ʒ] | 0292 | latin small letter ezh                            | voiced postalveolar fricative                       |
| IPA Extensions                         | [ʔ] | 0294 | latin letter glottal stop                         | voiceless glottal plosive                           |
| IPA Extensions                         | [ʕ] | 0295 | latin letter pharyngeal voiced fricative          | voiced pharyngeal fricative                         |
| IPA Extensions                         | [ʘ] | 0298 | latin letter bilabial click                       | voiceless bilabial click                            |
| IPA Extensions                         | [ʙ] | 0299 | latin letter small capital b                      | voiced bilabial trill                               |
| IPA Extensions                         | [ʛ] | 029B | latin letter small capital g with hook            | voiced uvular implosive                             |
| IPA Extensions                         | [ʜ] | 029C | latin letter small capital h                      | voiceless epiglottal fricative                      |
| IPA Extensions                         | [ʝ] | 029D | latin small letter j with crossed tail            | voiced palatal fricative                            |
| IPA Extensions                         | [ʟ] | 029F | latin letter small capital l                      | voiced velar lateral approximant                    |
| IPA Extensions                         | [ʡ] | 02A1 | latin letter glottal stop with stroke             | epiglottal plosive                                  |
| IPA Extensions                         | [ʢ] | 02A2 | latin letter reversed glottal stop with stroke    | voiced epiglottal fricative                         |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Spacing Modifier Letters               | [ʰ] | 02B0 | modifier letter small h                           | aspirated                                           |
| Spacing Modifier Letters               | [ʱ] | 02B1 | modifier letter small h with hook                 | voiced pre-aspirated                                |
| Spacing Modifier Letters               | [ʲ] | 02B2 | modifier letter small j                           | palatalized                                         |
| Spacing Modifier Letters               | [ʷ] | 02B7 | modifier letter small w                           | labialized                                          |
| Spacing Modifier Letters               | [ʼ] | 02BC | modifier letter apostrophe                        | ejective                                            |
| Spacing Modifier Letters               | [ˀ] | 02C0 | modifier letter glottal stop                      | glottalized                                         |
| Spacing Modifier Letters               | [ˈ] | 02C8 | modifier letter vertical line                     | primary stress                                      |
| Spacing Modifier Letters               | [ˌ] | 02CC | modifier letter low vertical line                 | secondary stress                                    |
| Spacing Modifier Letters               | [ː] | 02D0 | modifier letter triangular colon                  | long                                                |
| Spacing Modifier Letters               | [ˑ] | 02D1 | modifier letter half triangular colon             | half-long                                           |
| Spacing Modifier Letters               | [˞] | 02DE | modifier letter rhotic hook                       | rhotacized                                          |
| Spacing Modifier Letters               | [ˠ] | 02E0 | modifier letter small gamma                       | velarized                                           |
| Spacing Modifier Letters               | [ˡ] | 02E1 | modifier letter small l                           | lateral release                                     |
| Spacing Modifier Letters               | [ˤ] | 02E4 | modifier letter small reversed glottal stop       | pharyngealized                                      |
| Spacing Modifier Letters               | [˥] | 02E5 | modifier letter extra-high tone bar               | extra high tone                                     |
| Spacing Modifier Letters               | [˦] | 02E6 | modifier letter high tone bar                     | high tone                                           |
| Spacing Modifier Letters               | [˧] | 02E7 | modifier letter mid tone bar                      | mid tone                                            |
| Spacing Modifier Letters               | [˨] | 02E8 | modifier letter low tone bar                      | low tone                                            |
| Spacing Modifier Letters               | [˩] | 02E9 | modifier letter extra-low tone bar                | extra low tone                                      |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
End of the list on next post
-
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Unicode Block                          | Chr | Code | Unicode Name                                      | IPA Name                                            |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Combining Diacritical Marks            | [ò] | 0300 | combining grave accent                            | low tone                                            |
| Combining Diacritical Marks            | [ó] | 0301 | combining acute accent                            | high tone                                           |
| Combining Diacritical Marks            | [ô] | 0302 | combining circumflex accent                       | falling                                             |
| Combining Diacritical Marks            | [õ] | 0303 | combining tilde                                   | nasalized                                           |
| Combining Diacritical Marks            | [ō] | 0304 | combining macron                                  | mid tone                                            |
| Combining Diacritical Marks            | [ŏ] | 0306 | combining breve                                   | extra-short                                         |
| Combining Diacritical Marks            | [ö] | 0308 | combining diaeresis                               | centralized                                         |
| Combining Diacritical Marks            | [o̊] | 030A | combining ring above                              | voiceless (above)                                   |
| Combining Diacritical Marks            | [ő] | 030B | combining double acute accent                     | extra high tone                                     |
| Combining Diacritical Marks            | [ǒ] | 030C | combining caron                                   | rising                                              |
| Combining Diacritical Marks            | [ȍ] | 030F | combining double grave accent                     | extra low tone                                      |
| Combining Diacritical Marks            | [o̘] | 0318 | combining left tack below                         | advanced tongue root                                |
| Combining Diacritical Marks            | [o̙] | 0319 | combining right tack below                        | retracted tongue root                               |
| Combining Diacritical Marks            | [o̚] | 031A | combining left angle above                        | no audible release                                  |
| Combining Diacritical Marks            | [o̜] | 031C | combining left half ring below                    | less rounded                                        |
| Combining Diacritical Marks            | [o̝] | 031D | combining up tack below                           | raised                                              |
| Combining Diacritical Marks            | [o̞] | 031E | combining down tack below                         | lowered                                             |
| Combining Diacritical Marks            | [o̟] | 031F | combining plus sign below                         | advanced                                            |
| Combining Diacritical Marks            | [o̠] | 0320 | combining minus sign below                        | retracted                                           |
| Combining Diacritical Marks            | [o̤] | 0324 | combining diaeresis below                         | breathy voiced                                      |
| Combining Diacritical Marks            | [o̥] | 0325 | combining ring below                              | voiceless                                           |
| Combining Diacritical Marks            | [o̩] | 0329 | combining vertical line below                     | syllabic                                            |
| Combining Diacritical Marks            | [o̪] | 032A | combining bridge below                            | dental                                              |
| Combining Diacritical Marks            | [o̬] | 032C | combining caron below                             | voiced                                              |
| Combining Diacritical Marks            | [o̮] | 032E | combining breve below                             | derhoticized                                        |
| Combining Diacritical Marks            | [o̯] | 032F | combining inverted breve below                    | non-syllabic                                        |
| Combining Diacritical Marks            | [o̰] | 0330 | combining tilde below                             | creaky voiced                                       |
| Combining Diacritical Marks            | [o̴] | 0334 | combining tilde overlay                           | velarized or pharyngealized                         |
| Combining Diacritical Marks            | [o̹] | 0339 | combining right half ring below                   | more rounded                                        |
| Combining Diacritical Marks            | [o̺] | 033A | combining inverted bridge below                   | apical                                              |
| Combining Diacritical Marks            | [o̻] | 033B | combining square below                            | laminal                                             |
| Combining Diacritical Marks            | [o̼] | 033C | combining seagull below                           | linguolabial                                        |
| Combining Diacritical Marks            | [o̽] | 033D | combining x above                                 | mid-centralized                                     |
| Combining Diacritical Marks            | [o͇] | 0347 | combining equals sign below                       | non-sibilant                                        |
| Combining Diacritical Marks            | [o͈] | 0348 | combining double vertical line below              | fortis                                              |
| Combining Diacritical Marks            | [o͉] | 0349 | combining left angle below                        | lenis                                               |
| Combining Diacritical Marks            | [o͓] | 0353 | combining x below                                 | frictionalized                                      |
| Combining Diacritical Marks            | [o͜] | 035C | combining double breve below                      | tie bar (below)                                     |
| Combining Diacritical Marks            | [o͡] | 0361 | combining double inverted breve                   | tie bar                                             |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Greek and Coptic                       | [β] | 03B2 | greek small letter beta                           | voiced bilabial fricative                           |
| Greek and Coptic                       | [θ] | 03B8 | greek small letter theta                          | voiceless dental fricative                          |
| Greek and Coptic                       | [χ] | 03C7 | greek small letter chi                            | voiceless uvular fricative                          |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Phonetic Extensions                    | [ᴱ] | 1D31 | modifier letter capital e                         | epilaryngeal phonation                              |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Phonetic Extensions Supplement         | [ᶑ] | 1D91 | latin small letter d with hook and tail           | voiced retroflex implosive                          |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Combining Diacritical Marks Supplement | [o᷄] | 1DC4 | combining macron-acute                            | high rising                                         |
| Combining Diacritical Marks Supplement | [o᷅] | 1DC5 | combining grave-macron                            | low rising                                          |
| Combining Diacritical Marks Supplement | [o᷆] | 1DC6 | combining macron-grave                            | low falling                                         |
| Combining Diacritical Marks Supplement | [o᷇] | 1DC7 | combining acute-macron                            | high falling                                        |
| Combining Diacritical Marks Supplement | [o᷈] | 1DC8 | combining grave-acute-grave                       | rising-falling                                      |
| Combining Diacritical Marks Supplement | [o᷉] | 1DC9 | combining acute-grave-acute                       | falling-rising                                      |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| General Punctuation                    | [‖] | 2016 | double vertical line                              | major group break (intonation)                      |
| General Punctuation                    | [‼] | 203C | double exclamation mark                           | retroflex click                                     |
| General Punctuation                    | [‿] | 203F | undertie                                          | linking (absence of a break)                        |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Superscripts and Subscripts            | [ⁿ] | 207F | superscript latin small letter n                  | nasal release                                       |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Arrows                                 | [↑] | 2191 | upwards arrow                                     | global rise                                         |
| Arrows                                 | [↓] | 2193 | downwards arrow                                   | global fall                                         |
| Arrows                                 | [↗] | 2197 | north east arrow                                  | global rise                                         |
| Arrows                                 | [↘] | 2198 | south east arrow                                  | global fall                                         |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
| Latin Extended-C                       | [ⱱ] | 2C71 | latin small letter v with right hook              | voiced labiodental tap                              |
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
- Gather this post and the two previous posts in a single .txt file
- Then, if you apply the general IPA regex against this composite file, you should get, as expected, 185 matches!
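As a side note, the Unicode Name column of these tables can be cross-checked with Python's standard `unicodedata` module. The sketch below is only an illustration (the helper name `describe` is mine); the IPA Name column has no equivalent source there and remains hand-written.

```python
import unicodedata


def describe(code_point: int) -> str:
    """Format one code point as 'HHHH  char  lowercase Unicode name'."""
    char = chr(code_point)
    name = unicodedata.name(char, "<unnamed>").lower()
    return f"{code_point:04X}  {char}  {name}"


# Three code points taken from the tables above.
for cp in (0x0259, 0x02C8, 0x2C71):
    print(describe(cp))
```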
Best Regards,
guy038
-
-
@guy038 said in How to easily manage IPA symbols...:
I suppose that very few people will be interested in the International Phonetic Alphabet ( known as IPA )
Notepad++ walks into a bar and orders an IPA. @guy038 is tending the bar and so pours a glass of wine while chatting about IPA and Unicode.