1y
So this looks like a high quality, fast, natural, and open source TTS system in Python. A key candidate for an . Unfortunately, I find addon development super confusing. Is there a good template to start from or something? github.com/thewh1teagle/kokoro-onnx
11
15
8
0
1y
Yeah, I am deeply confused about how buffers work and how to indicate when speaking is complete and do indexing and so-on. If this is going to be an addon, someone else will have to do it.
1
0
1
0
User avatar
Tyler Spivey @tspivey@dragonscave.space
1y
@fastfinge You need support from the synth for some features. This one doesn't have anything. Once it starts speaking, it blocks until it's done, so you can't interrupt it.
1
0
0
0
1y
@tspivey Wouldn't you just stop playing the samples it gave you?
1
0
0
0
User avatar
Tyler Spivey @tspivey@dragonscave.space
1y
@fastfinge That works. But you're still sitting there waiting a few seconds for it to finish generating them.
2
0
0
0
User avatar
Tyler Spivey @tspivey@dragonscave.space
1y
@fastfinge Taking this sentence and passing it straight through, it pauses after highly. That's not even that many words. He had quietly gone to Madame Pomfrey, who had regretfully told him that Dreamless Sleep was highly addicting and that while she could give him the occasional dose, it would have to be spread out enough to prevent it from becoming addicting – meaning he could only take it one night out of every two weeks or so.
3
0
0
0
User avatar
Zach Bennoui @ZBennoui@dragonscave.space
1y
@tspivey @fastfinge I'm not sure this is the reason for pausing, but the model has a total context size of 500 characters and will not do well with input longer than that. It may also just be bad training data, sentences not ending with correct punctuation, primarily trained on paragraphs, etc. I’ve trained many TTS models over the last few years and data quality is extremely important, something lacking in most open source TTS systems out there.
1
0
1
0
1y
@ZBennoui @tspivey I think it's something with the onnx implementation actually. The pytorch version doesn't have this issue. There's an open issue looking into it.
0
0
0
0