Note by @fastfinge

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

6mo

The State of Modern AI Text To Speech Systems for Screen Reader Users: The past year has seen an explosion in new text to speech engines based on neural networks, large language models, and machine learning. But has any of this advancement offered anything to those using screen readers? stuff.interfree.ca/2026/01/05/ai-tts-for-screenreaders.html #ai #tts #llm #accessibility #a11y #screenreaders

12

41

15

0

Amir @amir@dragonscave.space

6mo

@fastfinge What an interesting read! Needless to say, I read it with Eloquence - LOL!

1

0

1

0

Sean Randall @cachondo@defcon.social

6mo

@amir @fastfinge It's crazy that everyone is layering it in wrappers nowadays.
Do you know if codefactory are doing the same with their new android build?

2

0

Andre Louis @FreakyFwoof@universeodon.com

6mo

@cachondo @amir @fastfinge I sincerely hope someone will do the same for Orpheus. I'd even pay for it.

5

0

James Scholes @jscholes@dragonscave.space

6mo

@FreakyFwoof There is a 32-bit compatibility layer in the works for NVDA itself (although it currently only references SAPI4). But with any luck the need for every add-on to implement its own will go away.

github.com/nvaccess/nvda/pull/19412

@cachondo @amir @fastfinge

2

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

6mo

@jscholes @cachondo @FreakyFwoof @amir My understanding is that when this comes to addons, it's going to require some kind of secure addons API/layer. And it won't be ready for 2026.1, or maybe not even 2026.2.

1

0

James Scholes @jscholes@dragonscave.space

6mo

@fastfinge Where are you getting the first part of that understanding from? I.e. the dependence on the secure add-on runtime. @cachondo @FreakyFwoof @amir

1

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

6mo

@jscholes @cachondo @FreakyFwoof @amir It was mentioned in the roadmap NVDA released a while back.

1

0

James Scholes @jscholes@dragonscave.space

6mo

@fastfinge I see the "Secure add-on runtime" on the roadmap, with the note that "The first version of the runtime will provide support for speech synthesis and braille devices."

I don't see any implication that any 32-bit compatibility layer will only work for secure add-ons, which is hopefully a bit of a leap.

Still, the fact that people don't know what will or won't be happening, or whether their preferred synthesiser(s) will work or not, continues to be a big part of the problem. @cachondo @FreakyFwoof @amir

1

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

6mo

@jscholes @cachondo @FreakyFwoof @amir That's my assumption because the only things that really need a 32-bit compatibility layer are speech synthesizers and braille devices.

3

0

clv1 has moved @clv1@mastodon.social

6mo

@fastfinge @jscholes @cachondo @FreakyFwoof @amir And what about recording new voices for RHVoice?

1

0

1

0

🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca

6mo

@clv1 @jscholes @cachondo @FreakyFwoof @amir The issue is that both of these are effectively concatenative or parametric, rather than formant, systems. So they will never be as intelligible as eloquence.

0