User avatar
🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca
4h
@hosford42 In general, for training the rules for pronouncing English, the CMU pronouncing dictionary is used: www.speech.cs.cmu.edu/cgi-bin/cmudict

When it comes to open-source speech data, LJSpeech is the best we have, though far from perfect:
keithito.com/LJ-Speech-Dataset/

And here's a link to GnuSpeech, the only open-source fully articulatory text to speech system I'm aware of:
github.com/mym-br/gnuspeech_sa?tab=readme-ov-file

I'm afraid I don't have any particular data of my own.