Note by @fastfinge

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

1y

So this looks like a high quality, fast, natural, and open source TTS system in Python. A key candidate for an #NVDA #addon. Unfortunately, I find #nvdasr addon development super confusing. Is there a good template to start from or something? github.com/thewh1teagle/kokoro-onnx

11

15

8

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

1y

Yeah, I am deeply confused about how buffers work and how to indicate when speaking is complete and do indexing and so-on. If this is going to be an #NVDA addon, someone else will have to do it.

1

0

1

0

Tyler Spivey @tspivey@dragonscave.space

1y

@fastfinge You need support from the synth for some features. This one doesn't have anything. Once it starts speaking, it blocks until it's done, so you can't interrupt it.

1

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

1y

@tspivey Wouldn't you just stop playing the samples it gave you?

1

0

Tyler Spivey @tspivey@dragonscave.space

1y

@fastfinge That works. But you're still sitting there waiting a few seconds for it to finish generating them.

2

0

Tyler Spivey @tspivey@dragonscave.space

1y

@fastfinge Taking this sentence and passing it straight through, it pauses after highly. That's not even that many words. He had quietly gone to Madame Pomfrey, who had regretfully told him that Dreamless Sleep was highly addicting and that while she could give him the occasional dose, it would have to be spread out enough to prevent it from becoming addicting – meaning he could only take it one night out of every two weeks or so.

3

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

1y

@tspivey Also, how does NVDA chunk text it passes to a synth? Even that's not really documented anywhere LOL. I think Kokoro inference would need running in its own thread, so the thread could be killed when we wanted to stop speech rather than generating extra samples, and a knew thread could be started so you could start the new speech quickly, like when someone's pressing down arrow rapidly. But I don't have the time, and I'm not smart enough.

1

0

Tyler Spivey @tspivey@dragonscave.space

1y

@fastfinge It doesn't. It leaves that up to the synth. If you're doing say all, then it tries to split by sentence and does it badly.

1

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

1y

@tspivey So if I cursor up onto a line with fifty thousand characters, that's why it just dies. Ah.

2

0

Tyler Spivey @tspivey@dragonscave.space

1y

@fastfinge To do that, you need toggleX, then NVDA+0, z. www.dropbox.com/scl/fi/qgz98942oyhv4b3crrpr5/toggleX.nvda-addon?rlkey=hhnevrqlrheiqk9fryoujprwu&dl=1

1

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

1y

@tspivey That gives me: TypeError: GlobalPlugin.script_toggleX.<locals>.<lambda>() got an unexpected keyword argument 'normalize'

2

0

Tyler Spivey @tspivey@dragonscave.space

1y

@fastfinge Ok, redownload and that should be fixed.

1

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

1y

@tspivey Yup, fixed! Are there docs?

1

0

Tyler Spivey @tspivey@dragonscave.space

1y

@fastfinge Nope. There's a toggle.txt in the root of the addon, but I don't know how updated that is. This thing has been hacked on over the years.

1

0

1

0