How to easily manage IPA symbols...
-
Hello, All,
I suppose that very few people will be interested in the
International Phonetic Alphabet
( known asIPA
) ! However, in this topic :-
I’ll show some basic elements about the
IPA
writing system. -
I’ll propose a monospaced font, from
Clement Pit-Claudel
, that supports all theIPA
symbols and plenty of other symbols, as well as theLatin
,Greek
andCyrillic
characters. -
I’ll describe a general regex to easily find out any
IPA
symbol in text files.
NOTE : The
GitHub
site of Clement Pit-Claudel is https://github.com/cpitclaudel/monospacifierIMPORTANT : In order for that font to be displayed correctly as monospaced, you’ll need to use the
GDI (most compatible)
option of Direct Write , only !
-
For general information, refer to https://en.wikipedia.org/wiki/International_Phonetic_Alphabet
-
Regarding English/American IPA, see https://en.wikipedia.org/wiki/Help:IPA/English
-
Regarding French IPA, see https://fr.wikipedia.org/wiki/Aide:Alphabet_phonétique_français
In, order to know the EN/US phonetic transcription of a word, I used :
https://dictionary.cambridge.org/browse/pronunciation/english/
In order to know the EN/US phonemic transcription of a word or phrase, I used :
Finally if you need to type phonetic symbols, you may use these two sites :
https://www.internationalphoneticassociation.org/IPAcharts/inter_chart_2018/IPA_2018.html
As usual, these references are just a simple selection and many other sites may give interesting info, too !
Unicode contains
3
blocks, devoted to theI.P.A.
, which are :-
The
IPA Extensions
block, containing96
characters, betweenU+0250
andU+02AF
-
The
Phonetic Extensions
block, containing128
characters, betweenU+1D00
andU+1D7F
-
The
Phonetic Extensions Supplement
, containing64
characters betweenU+1D80
andU+1DBF
Unfortunately, many other characters, used by the
IPA
writing system, are dispersed across several UNICODE character blocks !The other blocks involved, in some way, in
IPA
, are :- Basic Latin - Latin-1 Supplement - Latin Extended-A - Latin Extended-B - Spacing Modifier Letters - Combining Diacritical Marks - Greek and Coptic - Combining Diacritical Marks Supplement - General Punctuation - Superscripts and Subscripts - Arrows - Latin Extended-C
Thus, it rather difficult to find out a monospaced font able to display all these
IPA
symbols. I succeeded to find the serif fontSymbola_Monospacified_for_Liberation_Mono.ttf
, containing9,622
characters and9,827
glyphs :However, this font can be used, either, to display any common character or symbol !!
Once this font downloaded, on your desktop :
-
Select and copy this font (
CTRL + C
) -
Move to the folder
C:\Windows\fonts
-
Paste this font (
Ctrl + V
) -
Open Notepad++
-
Select
Settings > Style Configurator... > Global Styles > Default Style
-
Choose the font
Symbola Monospacified for...
-
Click on the
Save & Close
button
REMARK : Note that my Windows 10 system automatically redirected this font to the folder
C:\Users\Guy\AppData\Local\Microsoft\Windows\Fonts
( and not withinC:\Windows\fonts
)
As explained, in this section :
https://en.wikipedia.org/wiki/International_Phonetic_Alphabet#Brackets_and_transcription_delimiters
The general regex, proposed below, searches any range of
IPA
symbols between, either :-
Square Brackets,
[]
, representing the phonetic notation, primary function of theIPA
-
Slashes,
//
, representing the phonemic notation, with distinctive features only
However, as a lot of uncommon
IPA
symbols may happen in a phonetic transcription, I decided to include, in my regex, the completeIPA
blocks, so a total of288
characters, for the3
blocks noted above.Note also that the delimiters are surrounded, themselves, by a
\s
character ( so any kind of horizontal or vertical space char ). In other words, this regex should avoid any normal text and just matchesIPA
transcriptions !
The general regex, to find any range of
IPA
symbols, between the[]
delimiters OR the//
delimiters, is :(?-i)(?<=(?<=\s)(\[)|(?<=\s)/)[\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}\x{00E6}\x{00E7}\x{00F0}\x{00F8}\x{0127}\x{014B}\x{0153}\x{01C0}\x{01C1}\x{01C2}\x{01C3}\x{0250}-\x{02AF}\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC}\x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F}\x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329}\x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D}\x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}\x{03B2}\x{03B8}\x{03C7}\x{1D00}-\x{1D7F}\x{1D80}-\x{1DBF}\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}\x{2016}\x{203C}\x{203F}\x{207F}\x{2191}\x{2193}\x{2197}\x{2198}\x{2C71}]+(?=(?(1)\](?=\s)|/(?=\s)))
And may also be expressed as below, to distinguish the different chars or blocks :
(?x) (?-i)(?<=(?<=\s)(\[)|(?<=\s)/) (?: [\x{0020}\x{002E}\x{0061}-\x{007A}\x{007C}] | # Basic Latin 29 [\x{00E6}\x{00E7}\x{00F0}\x{00F8}] | # Latin-1 Supplement 4 [\x{0127}\x{014B}\x{0153}] | # Latin Extended-A 3 [\x{01C0}\x{01C1}\x{01C2}\x{01C3}] | # Latin Extended-B 4 [\x{0250}-\x{02AF}] | # IPA Extensions 96 [\x{02B0}\x{02B1}\x{02B2}\x{02B7}\x{02BC}\x{02C0}\x{02C8}\x{02CC} \x{02D0}\x{02D1}\x{02DE}\x{02E0}\x{02E1}\x{02E4}-\x{02E9}] | # Spacing Modifier Letters 19 [\x{0300}-\x{0304}\x{0306}\x{0308}\x{030A}\x{030B}\x{030C}\x{030F} \x{0318}\x{0319}\x{031A}\x{031C}-\x{0320}\x{0324}\x{0325}\x{0329} \x{032A}\x{032C}\x{032E}\x{032F}\x{0330}\x{0334}\x{0339}-\x{033D} \x{0347}\x{0348}\x{0349}\x{0353}\x{035C}\x{0361}] | # Combining Diacritical Marks 39 [\x{03B2}\x{03B8}\x{03C7}] | # Greek and Coptic 3 [\x{1D00}-\x{1D7F}] | # Phonetic Extensions 128 [\x{1D80}-\x{1DBF}] | # Phonetic Extensions Supplement 64 [\x{1DC4}\x{1DC5}\x{1DC6}\x{1DC7}\x{1DC8}\x{1DC9}] | # Combining Diacritical Marks Supplement 6 [\x{2016}\x{203C}\x{203F}] | # General Punctuation 3 \x{207F} | # Superscripts and Subscripts 1 [\x{2191}\x{2193}\x{2197}\x{2198}] | # Arrows 4 \x{2C71} # Latin Extended-C 1 )+ (?=(?(1)\](?=\s)|/(?=\s)))
Some examples of the
IPA
transcriptions of English/American words, from the two sites I used :IPA /ˌækəˈdɛmɪk/ for English / American word academic IPA [ˌæk.əˈdem.] for English / American word academic IPA /əˈbaʊt/ for English / American word about IPA [əˈbaʊt] for English / American word about IPA /ˈfæmɪʃ/ /ˈfeɪmɪʃ/ for English / American word famish IPA [ˈfæm.ɪʃ] for English / American word famish IPA /ˌriːˈsaɪkᵊl/ /ˌriˈsaɪkᵊl/ for English / American word recycle IPA [ˌriːˈsaɪ.kᵊl] for English / American word recycle IPA /ˌdʒɛnəˈræləti/ for English / American word generality IPA [ˌdʒen.əˈræl.ə.ti] [ˌdʒen.əˈræl.ə.t̬i] for English / American word generality IPA /ˌɪntəˈnæʃᵊnᵊl/ /ˌɪntərˈnæʃənəl/ for English / American word international IPA [ˌɪn.təˈnæʃ.ᵊn.ᵊl] [ˌɪnt̬ɚˈnæʃ.ᵊn.ᵊl] for English / American word international IPA /ˈmʌðə/ /ˈmʌðər/ for English / American word mother IPA [ˈmʌð.ər] [ˈmʌð.ɚ] for English / American word mother IPA /ədˈvɜːtɪsmənt/ /ˌædvərˈtaizmənt/ for English / American word advertisement IPA [ədˈvɜː.tɪs.mənt] [ˌæd.ˈvɚˈtaiz.mənt] for English / American word advertisement
If you run the general
IPA
regex against this post, you should get25
matches. Try theMark
feature !
In the next post, I’ll list all the Unicode blocks correctly displayed by the
Symbola_Monospacified_for_Liberation_Mono.ttf
fontBest Regards,
guy038
-
-
Hi, all,
Here is the list of all the UNICODE blocks covered by the
Symbola_Monospacified_for_Liberation_Mono.ttf
font :•-------------------------------------------------------------•---------•----------• | Unicode 16.0 Blocks | Range | Chars | Total | •-------------------------------------------•-----------------•---------•----------• | Basic Latin | 0000 - 007F | 97 | 128 | | Latin-1 Supplement | 0080 - 00FF | 96 | 128 | | Latin Extended-A | 0100 - 017F | 128 | 128 | | Latin Extended-B | 0180 - 024F | 208 | 208 | | IPA Extensions | 0250 - 02AF | 96 | 96 | | Spacing Modifier Letters | 02B0 - 02FF | 80 | 80 | | Combining Diacritical Marks | 0300 - 036F | 112 | 112 | | Greek and Coptic | 0370 - 03FF | 135 | 135 | | Cyrillic | 0400 - 04FF | 256 | 256 | | Cyrillic Supplement | 0500 - 052F | 48 | 48 | •-------------------------------------------•-----------------•---------•----------•-------------- | Arabic | 0600 - 06FF | 10 | 255 | Imcomplete •-------------------------------------------•-----------------•---------•----------•-------------- | Combining Diacritical Marks Extended | 1AB0 - 1AFF | 15 | 15 | | Cyrillic Extended-C | 1C80 - 1C8F | 9 | 9 | | Phonetic Extensions | 1D00 - 1D7F | 128 | 128 | | Phonetic Extensions Supplement | 1D80 - 1DBF | 64 | 64 | | Combining Diacritical Marks Supplement | 1DC0 - 1DFF | 63 | 63 | | Latin Extended Additional | 1E00 - 1EFF | 256 | 256 | | Greek Extended | 1F00 - 1FFF | 233 | 233 | | General Punctuation | 2000 - 206F | 111 | 111 | | Superscripts and Subscripts | 2070 - 209F | 42 | 42 | | Currency Symbols | 20A0 - 20CF | 32 | 32 | | Combining Diacritical Marks for Symbols | 20D0 - 20FF | 33 | 33 | | Letterlike Symbols | 2100 - 214F | 80 | 80 | | Number Forms | 2150 - 218F | 60 | 60 | | Arrows | 2190 - 21FF | 112 | 112 | | Mathematical Operators | 2200 - 22FF | 256 | 256 | | Miscellaneous Technical | 2300 - 23FF | 256 | 256 | | Control Pictures | 2400 - 243F | 39 | 39 | | Optical Character Recognition | 2440 - 245F | 11 | 11 | | Enclosed Alphanumerics | 2460 - 24FF | 160 | 160 | | Box Drawing | 2500 - 257F | 128 | 128 | | Block Elements | 2580 - 259F | 32 | 32 | | Geometric Shapes | 25A0 - 25FF | 96 | 96 | | Miscellaneous Symbols | 2600 - 26FF | 256 | 256 | | Dingbats | 2700 - 27BF | 192 | 192 | | Miscellaneous Mathematical Symbols-A | 27C0 - 27EF | 48 | 48 | | Supplemental Arrows-A | 27F0 - 27FF | 16 | 16 | | Braille Patterns | 2800 - 28FF | 256 | 256 | | Supplemental Arrows-B | 2900 - 297F | 128 | 128 | | Miscellaneous Mathematical Symbols-B | 2980 - 29FF | 128 | 128 | | Supplemental Mathematical Operators | 2A00 - 2AFF | 256 | 256 | •-------------------------------------------•-----------------•---------•----------•-------------- | Miscellaneous Symbols and Arrows | 2B00 - 2BFF | 207 | 250 | Imcomplete •-------------------------------------------•-----------------•---------•----------•-------------- | Latin Extended-C | 2C60 - 2C7F | 32 | 32 | | Coptic | 2C80 - 2CFF | 123 | 123 | | Cyrillic Extended-A | 2DE0 - 2DFF | 32 | 32 | •-------------------------------------------•-----------------•---------•----------•-------------- | Supplemental Punctuation | 2E00 - 2E7F | 74 | 79 | Incomplete | Hiragana | 3040 - 309F | 1 | 93 | Incomplete | Enclosed CJK Letters and Months | 3200 - 32FF | 2 | 254 | Incomplete | CJK Unified Ideographs | 4E00 - 9FFF | 1 | 20,976 | Incomplete •-------------------------------------------•-----------------•---------•----------•-------------- | Yijing Hexagram Symbols | 4DC0 - 4DFF | 64 | 64 | | Cyrillic Extended-B | A640 - A69F | 96 | 96 | •-------------------------------------------•-----------------•---------•----------•-------------- | Latin Extended-D | A720 - A7FF | 160 | 163 | Incomplete •-------------------------------------------•-----------------•---------•----------•-------------- | Latin Extended-E | AB30 - AB6F | 54 | 54 | •-------------------------------------------•-----------------•---------•----------•-------------- | Alphabetic Presentation Forms | FB00 - FB4F | 7 | 58 | Incomplete •-------------------------------------------•-----------------•---------•----------•-------------- | Variation Selectors | FE00 - FE0F | 16 | 16 | | Combining Half Marks | FE20 - FE2F | 16 | 16 | | Specials | FFF0 - FFFF | 5 | 5 | •-------------------------------------------•-----------------•---------•----------• | Aegean Numbers | 10100 - 1013F | 57 | 57 | | Ancient Greek Numbers | 10140 - 1018F | 79 | 79 | | Ancient Symbols | 10190 - 101CF | 13 | 13 | | Phaistos Disc | 101D0 - 101FF | 46 | 46 | | Coptic Epact Numbers | 102E0 - 102FF | 28 | 28 | | Byzantine Musical Symbols | 1D000 - 1D0FF | 246 | 246 | | Musical Symbols | 1D100 - 1D1FF | 231 | 231 | | Ancient Greek Musical Notation | 1D200 - 1D24F | 70 | 70 | | Tai Xuan Jing Symbols | 1D300 - 1D35F | 87 | 87 | | Counting Rod Numerals | 1D360 - 1D37F | 18 | 25 | | Mathematical Alphanumeric Symbols | 1D400 - 1D7FF | 996 | 996 | | Mahjong Tiles | 1F000 - 1F02F | 44 | 44 | | Domino Tiles | 1F030 - 1F09F | 100 | 100 | | Playing Cards | 1F0A0 - 1F0FF | 82 | 82 | | Enclosed Alphanumeric Supplement | 1F100 - 1F1FF | 191 | 192 | | Enclosed Ideographic Supplement | 1F200 - 1F2FF | 64 | 64 | | Miscellaneous Symbols and Pictographs | 1F300 - 1F5FF | 768 | 768 | | Emoticons | 1F600 - 1F64F | 80 | 80 | | Ornamental Dingbats | 1F650 - 1F67F | 48 | 48 | | Transport and Map Symbols | 1F680 - 1F6FF | 107 | 108 | | Alchemical Symbols | 1F700 - 1F77F | 116 | 116 | | Geometric Shapes Extended | 1F780 - 1F7FF | 85 | 89 | | Supplemental Arrows-C | 1F800 - 1F8FF | 148 | 148 | | Supplemental Symbols and Pictographs | 1F900 - 1F9FF | 148 | 213 | •-------------------------------------------•-----------------•---------•----------•-------------- | Supplementary Private Use Area-A | F0000 - FFFFF | 118 | 65,534 | Incomplete •-------------------------------------------•-----------------•---------•----------•-------------- | Total | | 9,622 | | •-------------------------------------------•-----------------•---------•----------•
In the next
three
posts, I’ll describe the complete list ofIPA
symbols, sorted by UNICODE code-pointBest Regards,
guy038
-
Hello, All,
Here is the list of the
185
commonIPA
characters, sorted by UNICODE code-point :•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Unicode Block | Chr | Code | Unicode Name | IPA Name | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Basic Latin | [ ] | 0020 | space | word break | | Basic Latin | [.] | 002E | full stop | syllable break | | Basic Latin | [a] | 0061 | latin small letter a | open front unrounded | | Basic Latin | [b] | 0062 | latin small letter b | voiced bilabial plosive | | Basic Latin | [c] | 0063 | latin small letter c | voiceless palatal plosive | | Basic Latin | [d] | 0064 | latin small letter d | voiced alveolar plosive | | Basic Latin | [e] | 0065 | latin small letter e | close-mid front unrounded | | Basic Latin | [f] | 0066 | latin small letter f | voiceless labiodental fricative | | Basic Latin | [g] | 0067 | latin small letter g | voiced velar plosive | | Basic Latin | [h] | 0068 | latin small letter h | voiceless glottal fricative | | Basic Latin | [i] | 0069 | latin small letter i | close front unrounded | | Basic Latin | [j] | 006A | latin small letter j | voiced palatal approximant | | Basic Latin | [k] | 006B | latin small letter k | voiceless velar plosive | | Basic Latin | [l] | 006C | latin small letter l | voiced alveolar lateral approximant | | Basic Latin | [m] | 006D | latin small letter m | voiced bilabial nasal | | Basic Latin | [n] | 006E | latin small letter n | voiced alveolar nasal | | Basic Latin | [o] | 006F | latin small letter o | close-mid back rounded | | Basic Latin | [p] | 0070 | latin small letter p | voiceless bilabial plosive | | Basic Latin | [q] | 0071 | latin small letter q | voiceless uvular plosive | | Basic Latin | [r] | 0072 | latin small letter r | voiced alveolar trill | | Basic Latin | [s] | 0073 | latin small letter s | voiceless alveolar fricative | | Basic Latin | [t] | 0074 | latin small letter t | voiceless alveolar plosive | | Basic Latin | [u] | 0075 | latin small letter u | close back rounded | | Basic Latin | [v] | 0076 | latin small letter v | voiced labiodental fricative | | Basic Latin | [w] | 0077 | latin small letter w | voiced labial-velar approximant | | Basic Latin | [x] | 0078 | latin small letter x | voiceless velar fricative | | Basic Latin | [y] | 0079 | latin small letter y | close front rounded | | Basic Latin | [z] | 007A | latin small letter z | voiced alveolar fricative | | Basic Latin | [|] | 007C | vertical line | minor group break (foot) | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Latin-1 Supplement | [æ] | 00E6 | latin small letter ae | raised open front unrounded | | Latin-1 Supplement | [ç] | 00E7 | latin small letter c with cedilla | voiceless palatal fricative | | Latin-1 Supplement | [ð] | 00F0 | latin small letter eth | voiced dental fricative | | Latin-1 Supplement | [ø] | 00F8 | latin small letter o with stroke | close-mid front rounded | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Latin Extended-A | [ħ] | 0127 | latin small letter h with stroke | voiceless pharyngeal fricative | | Latin Extended-A | [ŋ] | 014B | latin small letter eng | voiced velar nasal | | Latin Extended-A | [œ] | 0153 | latin small ligature oe | open-mid front rounded | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Latin Extended-B | [ǀ] | 01C0 | latin letter dental click | voiceless dental click | | Latin Extended-B | [ǁ] | 01C1 | latin letter lateral click | voiceless alveolar lateral click | | Latin Extended-B | [ǂ] | 01C2 | latin letter alveolar click | voiceless palatoalveolar click | | Latin Extended-B | [ǃ] | 01C3 | latin letter retroflex click | voiceless (post)alveolar click | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
Continuation of the list on next post
-
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Unicode Block | Chr | Code | Unicode Name | IPA Name | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | IPA Extensions | [ɐ] | 0250 | latin small letter turned a | lowered schwa | | IPA Extensions | [ɑ] | 0251 | latin small letter alpha | open back unrounded | | IPA Extensions | [ɒ] | 0252 | latin small letter turned alpha | open back rounded | | IPA Extensions | [ɓ] | 0253 | latin small letter b with hook | voiced bilabial implosive | | IPA Extensions | [ɔ] | 0254 | latin small letter open o | open-mid back rounded | | IPA Extensions | [ɕ] | 0255 | latin small letter c with curl | voiceless alveolo-palatal fricative | | IPA Extensions | [ɖ] | 0256 | latin small letter d with tail | voiced retroflex plosive | | IPA Extensions | [ɗ] | 0257 | latin small letter d with hook | voiced dental/alveolar implosive | | IPA Extensions | [ɘ] | 0258 | latin small letter reversed e | close-mid central unrounded | | IPA Extensions | [ə] | 0259 | latin small letter schwa | mid-central schwa | | IPA Extensions | [ɛ] | 025B | latin small letter open e | open-mid front unrounded | | IPA Extensions | [ɜ] | 025C | latin small letter reversed open e | open-mid central unrounded | | IPA Extensions | [ɞ] | 025E | latin small letter closed reversed open e | open-mid central rounded | | IPA Extensions | [ɟ] | 025F | latin small letter dotless j with stroke | voiced palatal plosive | | IPA Extensions | [ɠ] | 0260 | latin small letter g with hook | voiced velar implosive | | IPA Extensions | [g] | 0261 | latin small letter script g | voiced velar plosive | | IPA Extensions | [ɢ] | 0262 | latin letter small capital g | voiced uvular plosive | | IPA Extensions | [ɣ] | 0263 | latin small letter gamma | voiced velar fricative | | IPA Extensions | [ɤ] | 0264 | latin small letter rams horn | close-mid back unrounded | | IPA Extensions | [ɥ] | 0265 | latin small letter turned h | voiced labial-palatal approximant | | IPA Extensions | [ɦ] | 0266 | latin small letter h with hook | voiced glottal fricative | | IPA Extensions | [ɧ] | 0267 | latin small letter heng with hook | simultaneous voiceless postalveolar+velar fricative | | IPA Extensions | [ɨ] | 0268 | latin small letter i with stroke | close central unrounded | | IPA Extensions | [ɪ] | 026A | latin letter small capital i | lax close front unrounded | | IPA Extensions | [ɬ] | 026C | latin small letter l with belt | voiceless alveolar lateral fricative | | IPA Extensions | [ɭ] | 026D | latin small letter l with retroflex hook | voiced retroflex lateral approximant | | IPA Extensions | [ɮ] | 026E | latin small letter lezh | voiced alveolar lateral fricative | | IPA Extensions | [ɯ] | 026F | latin small letter turned m | close back unrounded | | IPA Extensions | [ɰ] | 0270 | latin small letter turned m with long leg | voiced velar approximant | | IPA Extensions | [ɱ] | 0271 | latin small letter m with hook | voiced labiodental nasal | | IPA Extensions | [ɲ] | 0272 | latin small letter n with left hook | voiced palatal nasal | | IPA Extensions | [ɳ] | 0273 | latin small letter n with retroflex hook | voiced retroflex nasal | | IPA Extensions | [ɴ] | 0274 | latin letter small capital n | voiced uvular nasal | | IPA Extensions | [ɵ] | 0275 | latin small letter barred o | close-mid central rounded | | IPA Extensions | [ɶ] | 0276 | latin letter small capital oe | open front rounded | | IPA Extensions | [ɸ] | 0278 | latin small letter phi | voiceless bilabial fricative | | IPA Extensions | [ɹ] | 0279 | latin small letter turned r | voiced alveolar approximant | | IPA Extensions | [ɺ] | 027A | latin small letter turned r with long leg | voiced alveolar lateral flap | | IPA Extensions | [ɻ] | 027B | latin small letter turned r with hook | voiced retroflex approximant | | IPA Extensions | [ɽ] | 027D | latin small letter r with tail | voiced retroflex tap | | IPA Extensions | [ɾ] | 027E | latin small letter r with fishhook | voiced alveolar tap | | IPA Extensions | [ʀ] | 0280 | latin letter small capital r | voiced uvular trill | | IPA Extensions | [ʁ] | 0281 | latin letter small capital inverted r | voiced uvular fricative | | IPA Extensions | [ʂ] | 0282 | latin small letter s with hook | voiceless retroflex fricative | | IPA Extensions | [ʃ] | 0283 | latin small letter esh | voiceless postalveolar fricative | | IPA Extensions | [ʄ] | 0284 | latin small letter dotless j with stroke and hook | voiced palatal implosive | | IPA Extensions | [ʈ] | 0288 | latin small letter t with retroflex hook | voiceless retroflex plosive | | IPA Extensions | [ʉ] | 0289 | latin small letter u bar | close central rounded | | IPA Extensions | [ʊ] | 028A | latin small letter upsilon | lax close back rounded | | IPA Extensions | [ʋ] | 028B | latin small letter v with hook | voiced labiodental approximant | | IPA Extensions | [ʌ] | 028C | latin small letter turned v | open-mid back unrounded | | IPA Extensions | [ʍ] | 028D | latin small letter turned w | voiceless labial-velar fricative | | IPA Extensions | [ʎ] | 028E | latin small letter turned y | voiced palatal lateral approximant | | IPA Extensions | [ʏ] | 028F | latin letter small capital y | lax close front rounded | | IPA Extensions | [ʐ] | 0290 | latin small letter z with retroflex hook | voiced retroflex fricative | | IPA Extensions | [ʑ] | 0291 | latin small letter z with curl | voiced alveolo-palatal fricative | | IPA Extensions | [ʒ] | 0292 | latin small letter ezh | voiced postalveolar fricative | | IPA Extensions | [ʔ] | 0294 | latin letter glottal stop | voiceless glottal plosive | | IPA Extensions | [ʕ] | 0295 | latin letter pharyngeal | voiced fricative voiced pharyngeal fricative | | IPA Extensions | [ʘ] | 0298 | latin letter bilabial click | voiceless bilabial click | | IPA Extensions | [ʙ] | 0299 | latin letter small capital b | voiced bilabial trill | | IPA Extensions | [ʛ] | 029B | latin letter small capital g with hook | voiced uvular implosive | | IPA Extensions | [ʜ] | 029C | latin letter small capital h | voiceless epiglottal fricative | | IPA Extensions | [ʝ] | 029D | latin small letter j with crossed tail | voiced palatal fricative | | IPA Extensions | [ʟ] | 029F | latin letter small capital l | voiced velar lateral approximant | | IPA Extensions | [ʡ] | 02A1 | latin letter glottal stop with stroke | epiglottal plosive | | IPA Extensions | [ʢ] | 02A2 | latin letter reversed glottal stop with stroke | voiced epiglottal fricative | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Spacing Modifier Letters | [ʰ] | 02B0 | modifier letter small h | aspirated | | Spacing Modifier Letters | [ʱ] | 02B1 | modifier letter small h with hook | voiced pre-aspirated | | Spacing Modifier Letters | [ʲ] | 02B2 | modifier letter small j | palatalized | | Spacing Modifier Letters | [ʷ] | 02B7 | modifier letter small w | labialized | | Spacing Modifier Letters | [ʼ] | 02BC | modifier letter apostrophe | ejective | | Spacing Modifier Letters | [ˀ] | 02C0 | modifier letter glottal stop | glottalized | | Spacing Modifier Letters | [ˈ] | 02C8 | modifier letter vertical line | primary stress | | Spacing Modifier Letters | [ˌ] | 02CC | modifier letter low vertical line | secondary stress | | Spacing Modifier Letters | [ː] | 02D0 | modifier letter triangular colon | long | | Spacing Modifier Letters | [ˑ] | 02D1 | modifier letter half triangular colon | half-long | | Spacing Modifier Letters | [˞] | 02DE | modifier letter rhotic hook | rhotacized | | Spacing Modifier Letters | [ˠ] | 02E0 | modifier letter small gamma | velarized | | Spacing Modifier Letters | [ˡ] | 02E1 | modifier letter small l | lateral release | | Spacing Modifier Letters | [ˤ] | 02E4 | modifier letter small reversed glottal stop | pharyngealized | | Spacing Modifier Letters | [˥] | 02E5 | modifier letter extra-high tone bar | extra high tone | | Spacing Modifier Letters | [˦] | 02E6 | modifier letter high tone bar | high tone | | Spacing Modifier Letters | [˧] | 02E7 | modifier letter mid tone bar | mid tone | | Spacing Modifier Letters | [˨] | 02E8 | modifier letter low tone bar | low tone | | Spacing Modifier Letters | [˩] | 02E9 | modifier letter extra-low tone bar | extra low tone | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
End of the list on next post
-
•----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Unicode Block | Chr | Code | Unicode Name | IPA Name | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Combining Diacritical Marks | [ò] | 0300 | combining grave accent | low tone | | Combining Diacritical Marks | [ó] | 0301 | combining acute accent | high tone | | Combining Diacritical Marks | [ô] | 0302 | combining circumflex accent | falling | | Combining Diacritical Marks | [õ] | 0303 | combining tilde | nasalized | | Combining Diacritical Marks | [ō] | 0304 | combining macron | mid tone | | Combining Diacritical Marks | [ŏ] | 0306 | combining breve | extra-short | | Combining Diacritical Marks | [ö] | 0308 | combining diaeresis | centralized | | Combining Diacritical Marks | [o̊] | 030A | combining ring above | voiceless (above) | | Combining Diacritical Marks | [ő] | 030B | combining double acute accent | extra high tone | | Combining Diacritical Marks | [ǒ] | 030C | combining caron | rising | | Combining Diacritical Marks | [ȍ] | 030F | combining double grave accent | extra low tone | | Combining Diacritical Marks | [o̘] | 0318 | combining left tack below | advanced tongue root | | Combining Diacritical Marks | [o̙] | 0319 | combining right tack below | retracted tongue root | | Combining Diacritical Marks | [o̚] | 031A | combining left angle above | no audible release | | Combining Diacritical Marks | [o̜] | 031C | combining left half ring below | less rounded | | Combining Diacritical Marks | [o̝] | 031D | combining up tack below | raised | | Combining Diacritical Marks | [o̞] | 031E | combining down tack below | lowered | | Combining Diacritical Marks | [o̟] | 031F | combining plus sign below | advanced | | Combining Diacritical Marks | [o̠] | 0320 | combining minus sign below | retracted | | Combining Diacritical Marks | [o̤] | 0324 | combining diaeresis below | breathy voiced | | Combining Diacritical Marks | [o̥] | 0325 | combining ring below | voiceless | | Combining Diacritical Marks | [o̩] | 0329 | combining vertical line below | syllabic | | Combining Diacritical Marks | [o̪] | 032A | combining bridge below | dental | | Combining Diacritical Marks | [o̬] | 032C | combining caron below | voiced | | Combining Diacritical Marks | [o̮] | 032E | combining breve below | derhoticized | | Combining Diacritical Marks | [o̯] | 032F | combining inverted breve below | non-syllabic | | Combining Diacritical Marks | [o̰] | 0330 | combining tilde below | creaky voiced | | Combining Diacritical Marks | [o̴] | 0334 | combining tilde overlay | velarized or pharyngealized | | Combining Diacritical Marks | [o̹] | 0339 | combining right half ring below | more rounded | | Combining Diacritical Marks | [o̺] | 033A | combining inverted bridge below | apical | | Combining Diacritical Marks | [o̻] | 033B | combining sqare below | aminal | | Combining Diacritical Marks | [o̼] | 033C | combining seagull below | linguolabial | | Combining Diacritical Marks | [o̽] | 033D | combining x above | mid-centralized | | Combining Diacritical Marks | [o͇] | 0347 | combining equals sign below | non-sibilant | | Combining Diacritical Marks | [o͈] | 0348 | combining double vertical line below | fortis | | Combining Diacritical Marks | [o͉] | 0349 | combining left angle below | lenis | | Combining Diacritical Marks | [o͓] | 0353 | combining x below | frictionalized | | Combining Diacritical Marks | [o͜] | 035C | combining double breve below | tie bar (below) | | Combining Diacritical Marks | [o͡] | 0361 | combining double inverted breve | tie bar | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Greek and Coptic | [β] | 03B2 | greek small letter beta | voiced bilabial fricative | | Greek and Coptic | [θ] | 03B8 | greek small letter theta | voiceless dental fricative | | Greek and Coptic | [χ] | 03C7 | greek small letter chi | voiceless uvular fricative | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Phonetic Extensions | [ᴱ] | 1D31 | modifier letter capital e | epilaryngeal phonation | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Phonetic Extensions Supplement | [ᶑ] | 1D91 | latin small letter d with hook and tail | voiced retroflex implosive | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Combining Diacritical Marks Supplement | [o᷄] | 1DC4 | combining macron-acute | high rising | | Combining Diacritical Marks Supplement | [o᷅] | 1DC5 | combining grave-macron | low rising | | Combining Diacritical Marks Supplement | [o᷆] | 1DC6 | combining macron-grave | low falling | | Combining Diacritical Marks Supplement | [o᷇] | 1DC7 | combining acute-macron | high falling | | Combining Diacritical Marks Supplement | [o᷈] | 1DC8 | combining grave-acute-grave | rising-falling | | Combining Diacritical Marks Supplement | [o᷉] | 1DC9 | combining acute-grave-acute | falling-rising | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | General Punctuation | [‖] | 2016 | double vertical line | major group break (intonation) | | General Punctuation | [‼] | 203C | double exclamation mark | retroflex click | | General Punctuation | [‿] | 203F | undertie | linking (absence of a break) | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Superscripts and Subscripts | [ⁿ] | 207F | superscript latin small letter n | nasal release | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Arrows | [↑] | 2191 | upwards arrow | global rise | | Arrows | [↓] | 2193 | downwards arrow | global fall | | Arrows | [↗] | 2197 | north east arrow | global rise | | Arrows | [↘] | 2198 | south east arrow | global fall | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------• | Latin Extended-C | [ⱱ] | 2C71 | latin small letter v with right hook | voiced labiodental tap | •----------------------------------------•-----•------•---------------------------------------------------•-----------------------------------------------------•
-
Gather this post with the two last posts in a single
.txt
file -
Then, if you apply the general
IPA
regex against this composite file, you should get, as expected,185
matches !
Best Regards,
guy038
-
-
@guy038 said in How to easily manage IPA symbols...:
I suppose that very few people will be interested in the International Phonetic Alphabet ( known as IPA )
Notepad++ walks into a bar and orders an IPA. @guy038 is tending the bar and so pours a glass of wine while chatting about IPA and Unicode.
-