You are currently using the General_American lexicon
Switch lexicon:- British RP_beta
- CMU_0.6
- New York_beta
- Scottish__Southern_Borders_beta
- Shakespeare
- Upper Midwest_beta
No documents marked yet.
Switch to advanced pattern specification interface.
Usage
This page is primarily used to mark the letters in the words of a text file which correspond to a set of sound patterns. This page can also be used to transcribe the words in a file into IPA or a custom dictionary-style transcription system. The transcription options can be used either in conjunction with marking sound patterns or on their own. To include IPA transcription, simply check the box next to Add line-by-line IPA transcription to the output file. To include dictionary-style transcription, simply check the box next to Add line-by-line dictionary-style transcription to the output file.. You can also customize the dictionary-style transcription by clicking Show/hide transcription options. You can uncheck sounds which you want to retain their orignal written form in the output. You can also enter custom symbols in the box next to each sound. You can also use the checkboxes to specify how stressed syllables should be marked and whether syllables should be separated by dashes. If you choose to include a transcription, you can also choose to discard the original text and have only the transcription in the result file by checking the box next to Output transcription only without the original text which will appear once you check the box for including a transcription. If you only want to transcribe the file, leave the fields for entering sound patterns blank.The input file should be plain text (.txt) and should be encoded in ASCII. The file can also be UTF-8 encoded, but most non-ASCII characters will be dropped from the result file.
Up to 9 patterns can be marked at once. The default color for each pattern is the next color in the list of available colors. The colors can be customized for each pattern using the last drop-down box. The file will be returned as an RTF with the patterns marked and formatting preserved, when possible.
The pattern specification interface is very similar to that used for sound searches. Each field specifies a sound pattern that the program will search for in the pronunciations of words in the document. It will then mark the corresponding portions of the written words in the document. The only difference is that only one sound can be specified at a time. If you wish to have multiple sounds or postions marked in the same color, please use the next drop-down box and choose the same color you used for the previous sound.
If IPA transcription is included, the transcription of each line will follow its regular form. Please note that the lines of the file are determined by the presence of line-breaks so that transcribing a file which only has line breaks between paragraphs will result in paragraph by paragraph transcriptions. The pronunciations used to determine if a word matches a pattern and for IPA transcription are based on the ones in the lexicon selected.
Examples
- To mark /L/'s at the beginning of words, set Word start to L=/l/.
- To mark medial voiceless stops (excluding stops at the start or end of a word), set Word medial to Voiceless-stop.
- To mark r-colored schwas anywhere in a word in red, set Anywhere in word to AXR=/ɚ/.
- To mark word-final /S/'s, set Word end to S=/s/.
Available sounds and categories:
Sound category descriptions:
- Vowel : Any vowel including R-vowels
- Stressed Vowel : Any vowel which has a primary stress
- Unstressed Vowel : Any vowel which does not have a primary stress (including ones with secondary stress)
- Consonant : Any consonant (stops, liquids, glides, fricatives and affricates).
- R-colored-vowel : Vowels that are followed an R sound in the same syllable, including ER
- Vowel-no-R : Vowels that are not followed by an R sound
- Nasal : n, m and ŋ
- Liquid : l or ɹ
- Glide : j or w
- Sonorant : A vowel, nasal, liquid or glide
- Sonorant-consonant : A nasal, liquid or glide (but not a vowel)
- Stop : Any voiced or voiceless stop
- Voiced-stop : b d, g or ʔ
- Voiceless-stop : p t or k
- Fricative : Any voiced or voiceless fricative
- Affricate : tʃ or dʒ
- Voiced-fricative : v, ð, z, or ʒ
- Voiceless-fricative : f, θ, s, ʃ or h
- Stop-or-flap : Any voiced or voiceless stop or a flap(ɾ). Note that flap(ɾ) is not used in the CMU dictionary.
Individual Arpabet sounds:
Arpabet | IPA | Dictionary style phonics |
IY | /i/ | [ē] as in "beat" (IY) |
IH | /ɪ/ | [ĭ] as in "hit" (IH) |
EH | /ɛ/ | [ĕ] as in "pet" (EH) |
AE | /æ/ | [ă] as in "hat" (AE) |
AH | /ʌ/ | [ʌ] as in "cup" (AH) |
UW | /u/ | [Ū] as in "shoe" (UW) |
UH | /ʊ/ | [Ŭ] as in "could" (UH) |
AO | /ɔ/ | [ô] as in "ball" (AO) |
AA | /ɑ/ | [ä] as in "father" (AA) |
EY | /eɪ/ | [ā] as in "made" (EY) |
AY | /aɪ/ | [ī] as in "tight" (AY) |
OY | /ɔɪ/ | [oi] as in "voice" (OY) |
OW | /oʊ/ | [ō] as in "go" (OW) |
AW | /ɑʊ/ | [ow] as in "cow" (AW) |
ER | /ɝ/ | [ər] as in "heard" (ER) |
IH R | /ɪɚ/ | [ear] as in "beer" (IH R) |
EH R | /ɛɚ/ | [air] as in "bare" (EH R) |
UH R | /ʊɚ/ | [oor] as in "cure" (UH R) |
AO R | /ɔɚ/ | [oar] as in "door" (AO R) |
AA R | /ɑɚ/ | [ar] as in "car" (AA R) |
AX | /ə/ | [ə] as in "about" (AX) |
AXR | /ɚ/ | [ər] as in "another" (AXR) |
P | /p/ | [p] as in "pan" (P) |
B | /b/ | [b] as in "bat" (B) |
T | /t/ | [t] as in "tag" (T) |
D | /d/ | [d] as in "dog" (D) |
K | /k/ | [k] as in "kite" (K) |
G | /ɡ/ | [ɡ] as in "game" (G) |
CH | /tʃ/ | [ch] as in "chair" (CH) |
JH | /dʒ/ | [dg] as in "judge" (JH) |
F | /f/ | [f] as in "fan" (F) |
V | /v/ | [v] as in "van" (V) |
TH | /θ/ | [th] as in "thin" (TH) |
DH | /ð/ | [th] as in "these" (DH) |
S | /s/ | [s] as in "some" (S) |
Z | /z/ | [z] as in "zoo" (Z) |
SH | /ʃ/ | [sh] as in "ship" (SH) |
ZH | /ʒ/ | [zh] as in "rouge" (ZH) |
HH | /h/ | [h] as in "hand" (HH) |
M | /m/ | [m] as in "move" (M) |
N | /n/ | [n] as in "nose" (N) |
NG | /ŋ/ | [ng] as in "sing" (NG) |
L | /l/ | [l] as in "late" (L) |
R | /ɹ/ | [r] as in "red" (R) |
DX | /ɾ/ | [t] as in "matter" (DX) |
Y | /j/ | [y] as in "yellow" (Y) |
W | /w/ | [w] as in "will" (W) |