You are not logged in and can only preview the speech tools.

Please log in or sign up.


You are currently using the General_American lexicon

Switch lexicon:

No documents marked yet.


Switch to basic pattern specification interface.

Must be plain text (.txt).

Pattern 1:

Use pattern for between words only.

Preceding boundary:

Preceding sound:

Primary sound:

Following sound:

Post-pattern boundary:

Allow patterns to cross syllable boundaries?  

Markup color:

Save this pattern.


Pattern 2:

Use pattern for between words only.

Preceding boundary:

Preceding sound:

Primary sound:

Following sound:

Post-pattern boundary:

Allow patterns to cross syllable boundaries?  

Markup color:

Save this pattern.


Add line-by-line IPA transcription to the output file.


Add line-by-line dictionary-style transcription to the output file.

You must be logged in to use the speech apps. Please log in or sign up.

Usage

This page is primarily used to mark the letters in the words of a text file which correspond to a set of sound patterns. This page can also be used to transcribe the words in a file into IPA or a custom dictionary-style transcription system. The transcription options can be used either in conjunction with marking sound patterns or on their own. To include IPA transcription, simply check the box next to Add line-by-line IPA transcription to the output file. To include dictionary-style transcription, simply check the box next to Add line-by-line dictionary-style transcription to the output file.. You can also customize the dictionary-style transcription by clicking Show/hide transcription options. You can uncheck sounds which you want to retain their orignal written form in the output. You can also enter custom symbols in the box next to each sound. You can also use the checkboxes to specify how stressed syllables should be marked and whether syllables should be separated by dashes. If you choose to include a transcription, you can also choose to discard the original text and have only the transcription in the result file by checking the box next to Output transcription only without the original text which will appear once you check the box for including a transcription. If you only want to transcribe the file, leave the fields for entering sound patterns blank.

The input file should be plain text (.txt) and should be encoded in ASCII. The file can also be UTF-8 encoded, but most non-ASCII characters will be dropped from the result file.

Up to 9 patterns can be marked at once. The default color for each pattern is the next color in the list of available colors. The colors can be customized for each pattern using the last drop-down box. The file will be returned as an RTF with the patterns marked and formatting preserved, when possible.

The pattern specification interface is identical to that used for advanced sound search. Each field specifies part of a sound pattern that the program will search for in the pronunciations of words in the document. It will then mark the corresponding portions of the writtens word in the document.

Using the Include: field allows you to enter multiple sounds for a portion of the sound pattern. The Exclude: field will match any sounds except the ones entered in the text box. The sounds listed in either of these boxes should be separated by commas. For example, a pattern containing "AA1" will only mark words with stressed /AA/ sounds in the corresponding position. For R-colored vowels other than /ER/, stress should be specified after the primary vowel and before the /R/. For example, "EH1 R" can be used to mark only stressed occurrences of that diphthong. Consonants can never have a stress marker (excluding the "R" within /ER/).

Please note that non-rhotic forms of vowels cannot immediately be followed by an /R/ within the same syllable. For example, a pattern which marks an /EH/ followed by an /R/ will mark the "e" corresponding to /EH/ in words such as "very", in which the /EH/ and /R/ are in separate syllables, but not words such as "fair" in which they are in the same syllable and form a diphthong.

Sound pattern fields

  • Preceding boundary: specify whether the pattern should be at the start of a word or syllable.
  • Preceding sound: a sound that must come before the primary sound being searched for.
  • Primary sound: the primary sound to whose written form should be marked. Must always be included in a pattern.
  • Following sound: a sound that should follow the primary sound.
  • Post-pattern boundary: specify whether the pattern should to be at the end of a word or syllable.
  • Allow patterns to cross syllable boundaries?: specify whether the entire pattern must be contained in a single syllable to be marked.

Examples

  • To mark word-final /L/'s:
    • Set Primary sound to L=/l/.
    • Set Post-pattern boundary to Word-end.
  • To mark intervocalic /T/'s:
    • Set Preceding sound to Vowel.
    • Set Primary Sound to T=/t/.
    • Set Following sound to Vowel.
  • To mark all vowels without r-coloring:
    • Set Primary Sound to Vowel-no-R.

Available patterns and categories:

Positions:

  • Syllable-start : Specify that the pattern should start at the beginning of a syllable. Must occur at the start of the preceding context (but after Syllable, if used).
  • Word-start : Specify that the pattern should start at the beginning of a word. Must occur at the start of the preceding context (but after Syllable, if used).
  • NOT_Syllable-start : Specify that the beginning of the pattern, including its preceding context, must not occur at a syllable boundary. Must occur at the start of the preceding context (but after Syllable, if used).
  • NOT_Word-start : Specify that the beginning of the pattern, including its preceding context, must not occur at a word boundary. Must occur at the start of the preceding context (but after Syllable, if used).
  • Syllable-end : Specify that the pattern should end at the end of a syllable. Must occur at the end of the following context.
  • Word-end : Specify that the pattern should end at the end of a word. Must occur at the end of the following context.
  • NOT_Syllable-end : Specify that the pattern, including its following context must match sounds occurring before the end of a syllable. Must occur at the end of the following context.
  • NOT_Word-end : Specify that the pattern, including its following context must match sounds occurring before the end of a word. Must occur at the end of the following context.

Sound category descriptions:

  • Vowel : Any vowel including R-vowels
  • Stressed Vowel : Any vowel which has a primary stress
  • Unstressed Vowel : Any vowel which does not have a primary stress (including ones with secondary stress)
  • Consonant : Any consonant (stops, liquids, glides, fricatives and affricates).
  • R-colored-vowel : Vowels that are followed an R sound in the same syllable, including ER
  • Vowel-no-R : Vowels that are not followed by an R sound
  • Nasal : n, m and ŋ
  • Liquid : l or ɹ
  • Glide : j or w
  • Sonorant : A vowel, nasal, liquid or glide
  • Sonorant-consonant : A nasal, liquid or glide (but not a vowel)
  • Stop : Any voiced or voiceless stop
  • Voiced-stop : b d, g or ʔ
  • Voiceless-stop : p t or k
  • Fricative : Any voiced or voiceless fricative
  • Affricate : tʃ or dʒ
  • Voiced-fricative : v, ð, z, or ʒ
  • Voiceless-fricative : f, θ, s, ʃ or h
  • Stop-or-flap : Any voiced or voiceless stop or a flap(ɾ). Note that flap(ɾ) is not used in the CMU dictionary.

Individual Arpabet sounds:

Arpabet IPA Dictionary style phonics
IY /i/ [ē] as in "beat" (IY)
IH /ɪ/ [ĭ] as in "hit" (IH)
EH /ɛ/ [ĕ] as in "pet" (EH)
AE /æ/ [ă] as in "hat" (AE)
AH /ʌ/ [ʌ] as in "cup" (AH)
UW /u/ [Ū] as in "shoe" (UW)
UH /ʊ/ [Ŭ] as in "could" (UH)
AO /ɔ/ [ô] as in "ball" (AO)
AA /ɑ/ [ä] as in "father" (AA)
EY /eɪ/ [ā] as in "made" (EY)
AY /aɪ/ [ī] as in "tight" (AY)
OY /ɔɪ/ [oi] as in "voice" (OY)
OW /oʊ/ [ō] as in "go" (OW)
AW /ɑʊ/ [ow] as in "cow" (AW)
ER /ɝ/ [ər] as in "heard" (ER)
IH R /ɪɚ/ [ear] as in "beer" (IH R)
EH R /ɛɚ/ [air] as in "bare" (EH R)
UH R /ʊɚ/ [oor] as in "cure" (UH R)
AO R /ɔɚ/ [oar] as in "door" (AO R)
AA R /ɑɚ/ [ar] as in "car" (AA R)
AX /ə/ [ə] as in "about" (AX)
AXR /ɚ/ [ər] as in "another" (AXR)
P /p/ [p] as in "pan" (P)
B /b/ [b] as in "bat" (B)
T /t/ [t] as in "tag" (T)
D /d/ [d] as in "dog" (D)
K /k/ [k] as in "kite" (K)
G /ɡ/ [ɡ] as in "game" (G)
CH /tʃ/ [ch] as in "chair" (CH)
JH /dʒ/ [dg] as in "judge" (JH)
F /f/ [f] as in "fan" (F)
V /v/ [v] as in "van" (V)
TH /θ/ [th] as in "thin" (TH)
DH /ð/ [th] as in "these" (DH)
S /s/ [s] as in "some" (S)
Z /z/ [z] as in "zoo" (Z)
SH /ʃ/ [sh] as in "ship" (SH)
ZH /ʒ/ [zh] as in "rouge" (ZH)
HH /h/ [h] as in "hand" (HH)
M /m/ [m] as in "move" (M)
N /n/ [n] as in "nose" (N)
NG /ŋ/ [ng] as in "sing" (NG)
L /l/ [l] as in "late" (L)
R /ɹ/ [r] as in "red" (R)
DX /ɾ/ [t] as in "matter" (DX)
Y /j/ [y] as in "yellow" (Y)
W /w/ [w] as in "will" (W)