Based on Macintosh Speech Synthesis Programming Phonemes list

Pronunciation Notes


ae = bat
ey = bait
ao = caught
ax = about
iy = beet
eh = bet
ih = bit
ay = bite
ix = roses
aa = father
uw = boot
uh = book
ux = bud
ow = boat
aw = bout
oy = boy


ae ey ao ax iy eh ih ay ix aa uw uh ux ow aw oy
bae bey bao bax biy beh bih bay bix baa buw buh bux bow baw boy
aeb eyb aob axb iyb ehb ihb ayb ixb aab uwb uhb uxb owb awb oyb
chae chey chao chax chiy cheh chih chay chix chaa chuw chuh chux chow chaw choy
aech eych aoch axch iych ehch ihch aych ixch aach uwch uhch uxch owch awch oych
dae dey dao dax diy deh dih day dix daa duw duh dux dow daw doy
aed eyd aod axd iyd ehd ihd ayd ixd aad uwd uhd uxd owd awd oyd
thae they thao thax thiy theh thih thay thix thaa thuw thuh thux thow thaw thoy
aeth eyth aoth axth iyth ehth ihth ayth ixth aath uwth uhth uxth owth awth oyth
fae fey fao fax fiy feh fih fay fix faa fuw fuh fux fow faw foy
aef eyf aof axf iyf ehf ihf ayf ixf aaf uwf uhf uxf owf awf oyf
gae gey gao gax giy geh gih gay gix gaa guw guh gux gow gaw goy
aeg eyg aog axg iyg ehg ihg ayg ixg aag uwg uhg uxg owg awg oyg
hae hey hao hax hiy heh hih hay hix haa huw huh hux how haw hoy
jae jey jao jax jiy jeh jih jay jix jaa juw juh jux jow jaw joy
aej eyj aoj axj iyj ehj ihj ayj ixj aaj uwj uhj uxj owj awj oyj
kae key kao kax kiy keh kih kay kix kaa kuw kuh kux kow kaw koy
aek eyk aok axk iyk ehk ihk ayk ixk aak uwk uhk uxk owk awk oyk
lae ley lao lax liy leh lih lay lix laa luw luh lux low law loy
ael eyl aol axl iyl ehl ihl ayl ixl aal uwl uhl uxl owl awl oyl
mae mey mao max miy meh mih may mix maa muw muh mux mow maw moy
aem eym aom axm iym ehm ihm aym ixm aam uwm uhm uxm owm awm oym
nae ney nao nax niy neh nih nay nix naa nuw nuh nux now naw noy
aen eyn aon axn iyn ehn ihn ayn ixn aan uwn uhn uxn own awn oyn
ngae ngey ngao ngax ngiy ngeh ngih ngay ngix ngaa nguw nguh ngux ngow ngaw ngoy
aeng eyng aong axng iyng ehng ihng ayng ixng aang uwng uhng uxng owng awng oyng
pae pey pao pax piy peh pih pay pix paa puw puh pux pow paw poy
aep eyp aop axp iyp ehp ihp ayp ixp aap uwp uhp uxp owp awp oyp
rae rey rao rax riy reh rih ray rix raa ruw ruh rux row raw roy
aer eyr aor axr iyr ehr ihr ayr ixr aar uwr uhr uxr owr awr oyr
sae sey sao sax siy seh sih say six saa suw suh sux sow saw soy
aes eys aos axs iys ehs ihs ays ixs aas uws uhs uxs ows aws oys
shae shey shao shax shiy sheh shih shay shix shaa shuw shuh shux show shaw shoy
aesh eysh aosh axsh iysh ehsh ihsh aysh ixsh aash uwsh uhsh uxsh owsh awsh oysh
tae tey tao tax tiy teh tih tay tix taa tuw tuh tux tow taw toy
aet eyt aot axt iyt eht iht ayt ixt aat uwt uht uxt owt awt oyt
THae THey THao THax THiy THeh THih THay THix THaa THuw THuh THux THow THaw THoy
aeTH eyTH aoTH axTH iyTH ehTH ihTH ayTH ixTH aaTH uwTH uhTH uxTH owTH awTH oyTH
vae vey vao vax viy veh vih vay vix vaa vuw vuh vux vow vaw voy
aev eyv aov axv iyv ehv ihv ayv ixv aav uwv uhv uxv owv awv oyv
wae wey wao wax wiy weh wih way wix waa wuw wuh wux wow waw woy
yae yey yao yax yiy yeh yih yay yix yaa yuw yuh yux yow yaw yoy
zae zey zao zax ziy zeh zih zay zix zaa zuw zuh zux zow zaw zoy
aez eyz aoz axz iyz ehz ihz ayz ixz aaz uwz uhz uxz owz awz oyz
zhae zhey zhao zhax zhiy zheh zhih zhay zhix zhaa zhuw zhuh zhux zhow zhaw zhoy
aezh eyzh aozh axzh iyzh ehzh ihzh ayzh ixzh aazh uwzh uhzh uxzh owzh awzh oyzh

**h, w and y aren't put at the end because they would be impossible to pronounce there. You can try to pronounce them if you like, but I won't include them in the reclist.
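The list above follows a simple pattern: one line of bare vowels, then for each consonant a consonant+vowel line and (except for h, w and y) a vowel+consonant line. Here is a minimal Python sketch that rebuilds it, in case you want to regenerate or extend the list. The names VOWELS, CONSONANTS, INITIAL_ONLY and build_reclist are just made up for this example.

```python
# Vowels in the same order as the pronunciation notes.
VOWELS = ["ae", "ey", "ao", "ax", "iy", "eh", "ih", "ay",
          "ix", "aa", "uw", "uh", "ux", "ow", "aw", "oy"]

# Consonants in the same order as the reclist.
CONSONANTS = ["b", "ch", "d", "th", "f", "g", "h", "j", "k", "l", "m", "n",
              "ng", "p", "r", "s", "sh", "t", "TH", "v", "w", "y", "z", "zh"]

# These only appear at the start of a syllable in the reclist.
INITIAL_ONLY = {"h", "w", "y"}


def build_reclist():
    """Return the reclist as a list of lines (strings)."""
    lines = [" ".join(VOWELS)]  # bare vowels first
    for c in CONSONANTS:
        # consonant + vowel line, e.g. "bae bey bao ..."
        lines.append(" ".join(c + v for v in VOWELS))
        if c not in INITIAL_ONLY:
            # vowel + consonant line, e.g. "aeb eyb aob ..."
            lines.append(" ".join(v + c for v in VOWELS))
    return lines


if __name__ == "__main__":
    print("\n".join(build_reclist()))
```

To add a sound, append it to VOWELS or CONSONANTS and run the script again.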


@ = Breaths (inhale/exhale/sigh) Usually encoded as 'br' or 'breath' (and usually numbered)
Forced Multilingualism = Record phonemes from other languages and include them in the voicebank, or record more phonemes to make pronunciation of other languages easier.
English UTAU reclist. Based on…
Hope it's easy enough to use ^^;

Just to clear things up, this list, like all the other English reclists I've seen, is a CVVC (consonant-vowel, vowel-consonant) reclist.
This is because English, like many other languages, has initial as well as final consonants (that is, there can be a consonant sound at the beginning or at the end of a word).
SilveryRoseney Featured By Owner Jun 4, 2015  Hobbyist Artisan Crafter
Is there a tut on how to oto an English UTAU?
GrayOblivion Featured By Owner Jun 4, 2015  Hobbyist General Artist
The tutorial that I used previously appears to have been taken down, so my advice is to look at CV oto tutorials for the CV sections of the voicebank (it's pretty much the same as a CV in any other language), then take a look at existing CVVC voicebanks for the VC part. For the CV part, I think this is a good tutorial, but there are many more over YouTube and other platforms:…
Sorry I couldn't be of more help ^^
SilveryRoseney Featured By Owner Jun 4, 2015  Hobbyist Artisan Crafter
Thank you so much for the help. I hate to be a bother, but are these dyslexic-friendly? I sometimes have a hard time following fast-paced tuts.
GrayOblivion Featured By Owner Edited Jun 4, 2015  Hobbyist General Artist
I'm not sure if it's dyslexic friendly or not, sorry. You can take a look at the tutorial and see for yourself.
There are several tutorials on how to make CV voicebanks all over the internet, I'm sure you'll find one that's easier to follow if the one I linked is unclear.

If you're looking for a video tutorial, perhaps this will help:…
chibinagi Featured By Owner May 12, 2015
Umm... will a CVC VB be OTO'd faster than a CVVC?
GrayOblivion Featured By Owner May 12, 2015  Hobbyist General Artist
I haven't tried oto'ing a CVC before, but I think there are far more samples required in a CVC than a CVVC as it joins consonant-vowel-consonant sounds as opposed to a CVVC where you record the beginning and ending of sounds separately. I haven't tried myself though, so I think you should ask around.
chibinagi Featured By Owner May 18, 2015
Oh, okay. Thanks!
ToriaDion Featured By Owner Feb 5, 2015
How the Fruk do I OTO all these?
How long will this even take?
GrayOblivion Featured By Owner Feb 5, 2015  Hobbyist General Artist
OTO'ing CVVC voicebanks does take a lot of time. I haven't counted myself, but perhaps around the same amount of time required for a VCV voicebank, and definitely more time than for a CV voicebank.

Perhaps these videos might help:……
KiruaZorudikku Featured By Owner Feb 2, 2015  Hobbyist General Artist
Do you happen to know any UTAUs with their English VCV library up for download? I'd like to use one for further reference ÓwÒ 
GrayOblivion Featured By Owner Edited Feb 2, 2015  Hobbyist General Artist
Hmm... I don't know of any English VCV voicebanks, only CV VC (like this reclist) voicebanks.
If that's what you were looking for, I suggest Tony Yonne or Mona Minarai, though both of them have different reclists.
KiruaZorudikku Featured By Owner Feb 4, 2015  Hobbyist General Artist
I recall Tony being CVVC because I've tried him, but I'll check out Mona
H4NN4H-Chan Featured By Owner Oct 14, 2014  Hobbyist General Artist
This is awesome!! Thank you so much for taking the time to make this, it's so clear and easy to follow!! ^_^
GrayOblivion Featured By Owner Oct 14, 2014  Hobbyist General Artist
No problem :D I hope this will be useful to you~!
Pastagirl97 Featured By Owner Jun 28, 2014
Um, how do you make the oto for this? I'm still trying to figure out how I can do this with Audacity too, so can you give this rookie some help?
GrayOblivion Featured By Owner Jun 28, 2014  Hobbyist General Artist
Audacity is just used for recording. You record the sounds in Audacity, then export them to a folder so you can use them in UTAU. Some people like to record in strings, others like to record the sounds individually, but recording in strings is faster and generally takes up less disk space.
For a basic intro to the parameters of the oto.ini, here's a tutorial.
Doing the oto.ini for the sounds that go consonant-vowel is pretty much the same as oto'ing a CV (Diphone-only) Japanese voicebank, if you know how to do that. Many popular UTAUs like Kasane Teto or Momone Momo have well-oto'd CV voicebanks, so you could take a look at them for examples of consonant-vowel sounds.

Vowel-consonant sounds are a bit more difficult, since you also have to find a way to make the sounds 'connect' to the previous note. I think this is a reasonably simple tutorial for oto'ing the vowel-consonant part, as long as you can see the characteristics of the waveforms in your own recordings; if not, the tutorial may seem confusing. If that tutorial doesn't work, ChezzieChan has a series of tutorials that might be useful.
Hope this helps, and I hope it wasn't too confusing ^^; I do not have first-hand experience creating an UTAU myself, so sorry if something I've mentioned doesn't work.
Pastagirl97 Featured By Owner Jun 28, 2014
Thanks I am a dummy! :)
IvyDevi Featured By Owner Jun 10, 2014  Hobbyist Digital Artist
Would we save the sounds under these 'titles' when recorded? Will it work with USTs that way?
GrayOblivion Featured By Owner Jun 10, 2014  Hobbyist General Artist
Yes, but you'll have to edit the UST to fit with the sounds. It's not like Japanese voicebanks, where pretty much every voicebank is aliased the same, since the language permits that. There are several English UTAU voicebanks, and depending on the creator, the 'titles' of each sound and the aliasing may differ, since English has many more sounds.
IvyDevi Featured By Owner Jun 11, 2014  Hobbyist Digital Artist
Ahh kay, thank you~! cx
XxXTetoKasaneXxX Featured By Owner May 16, 2014
T-T If only making an English UTAU was as easy as making a Japanese...
GrayOblivion Featured By Owner May 16, 2014  Hobbyist General Artist
Very true :XD:
Lineair Featured By Owner Jan 27, 2014
:o Woo, reclist! How would you pronounce stuff like aej? Sorry if the question is silly.
GrayOblivion Featured By Owner Jan 27, 2014  Hobbyist General Artist
There's a pronunciation key at the top of the reclist. The 'ae' sound corresponds to the 'a' sound in 'bat' or 'cat' or 'mat'.
In IPA (the International Phonetic Alphabet), the 'aej' sound would probably be transcribed as <æd͡ʒ>, if that makes any more sense.
Hope this helped :D
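If it helps, here is a small Python sketch of how to read any entry in the reclist: find the two-letter vowel at the start or the end, look it up in the pronunciation key, and whatever is left over is the consonant. The function names split_alias and describe are just made up for this example.

```python
# Pronunciation key copied from the top of the reclist.
KEY = {
    "ae": "bat", "ey": "bait", "ao": "caught", "ax": "about",
    "iy": "beet", "eh": "bet", "ih": "bit", "ay": "bite",
    "ix": "roses", "aa": "father", "uw": "boot", "uh": "book",
    "ux": "bud", "ow": "boat", "aw": "bout", "oy": "boy",
}


def split_alias(alias):
    """Return (initial consonant, vowel, final consonant) for one entry."""
    if alias[:2] in KEY:       # vowel first, e.g. "aej" or a bare "ae"
        return "", alias[:2], alias[2:]
    if alias[-2:] in KEY:      # vowel last, e.g. "chae"
        return alias[:-2], alias[-2:], ""
    raise ValueError(f"no known vowel in {alias!r}")


def describe(alias):
    """Spell out an entry using the example words from the key."""
    initial, vowel, final = split_alias(alias)
    return f"{initial}[{vowel} as in '{KEY[vowel]}']{final}"


if __name__ == "__main__":
    for entry in ["aej", "chae", "oyng"]:
        print(entry, "=", describe(entry))
```

So 'aej' comes out as the 'ae' of 'bat' followed by a final 'j'.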
Lineair Featured By Owner Feb 27, 2014
Not yet but it will make sense soon. xD; I'm a language student and we're going into phonetics soon haha...
akunouta Featured By Owner Dec 26, 2013  Hobbyist Traditional Artist
How do you make the oto.ini?
GrayOblivion Featured By Owner Dec 26, 2013  Hobbyist General Artist
This is a CV VC reclist. If you are familiar with CV voicebanks (which are common in Japanese), then the CV part is the same: you configure the consonant and vowel as usual. For the VC part, the vowel part of the sample is put in the part of the oto that would be the 'consonant', and the consonant part of the sample becomes the 'vowel'. I don't know exactly how to deal with preutterance or overlap in CV VC. Sorry if my answer is confusing, I don't have any experience doing CV VC oto.ini files.
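For reference, each line of an oto.ini has the form filename=alias,offset,consonant,cutoff,preutterance,overlap, with the numbers in milliseconds. Below is a minimal Python sketch that reads one such line. The file name and all the numbers are made up for illustration, and OtoEntry and parse_oto_line are hypothetical names, not part of UTAU. It only shows the layout of a line; it doesn't settle how preutterance or overlap should be chosen for CV VC.

```python
from dataclasses import dataclass


@dataclass
class OtoEntry:
    filename: str        # the recorded .wav file
    alias: str           # the name you type into a note, e.g. "bae"
    offset: float        # where the sample starts in the file (ms)
    consonant: float     # length of the fixed, unstretched region (ms)
    cutoff: float        # where the sample ends (ms)
    preutterance: float  # how far the sample starts before the note (ms)
    overlap: float       # crossfade with the previous note (ms)


def parse_oto_line(line):
    """Parse one 'filename=alias,offset,consonant,cutoff,preutterance,overlap' line."""
    filename, _, rest = line.strip().partition("=")
    fields = rest.split(",")
    if len(fields) != 6:
        raise ValueError(f"expected 6 comma-separated fields, got {len(fields)}")
    alias = fields[0]
    # Treat an empty numeric field as 0.
    numbers = [float(x) if x.strip() else 0.0 for x in fields[1:]]
    return OtoEntry(filename, alias, *numbers)


if __name__ == "__main__":
    # Made-up example values, not taken from a real voicebank.
    print(parse_oto_line("bae.wav=bae,50,120,-300,80,30"))
```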
RossCuth Featured By Owner Nov 5, 2013  Student Interface Designer
CV or VCV?
GrayOblivion Featured By Owner Nov 5, 2013  Hobbyist General Artist
This reclist is CV VC.
RossCuth Featured By Owner Nov 5, 2013  Student Interface Designer
Mmm okay thanks XD
TofaTheDragonRider Featured By Owner Nov 1, 2013  Student
An English reclist I can ACTUALLY understand :'D
memecow1 Featured By Owner Oct 3, 2013  Hobbyist General Artist
Wow... well, easier than anything I was going to do.
