User avatar
🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca
4mo
The State of Modern AI Text To Speech Systems for Screen Reader Users: The past year has seen an explosion in new text to speech engines based on neural networks, large language models, and machine learning. But has any of this advancement offered anything to those using screen readers? stuff.interfree.ca/2026/01/05/ai-tts-for-screenreaders.html
12
41
15
0
User avatar
Amir @amir@dragonscave.space
4mo
@fastfinge What an interesting read! Needless to say, I read it with Eloquence - LOL!
1
0
1
0
User avatar
Sean Randall @cachondo@defcon.social
4mo
@amir @fastfinge It's crazy that everyone is layering it in wrappers nowadays.
Do you know if codefactory are doing the same with their new android build?
2
0
0
0
User avatar
Andre Louis @FreakyFwoof@universeodon.com
4mo
@cachondo @amir @fastfinge I sincerely hope someone will do the same for Orpheus. I'd even pay for it.
5
0
0
0
User avatar
James Scholes @jscholes@dragonscave.space
4mo
@FreakyFwoof There is a 32-bit compatibility layer in the works for NVDA itself (although it currently only references SAPI4). But with any luck the need for every add-on to implement its own will go away.

github.com/nvaccess/nvda/pull/19412

@cachondo @amir @fastfinge
2
0
0
0
User avatar
🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca
4mo
@jscholes @cachondo @FreakyFwoof @amir My understanding is that when this comes to addons, it's going to require some kind of secure addons API/layer. And it won't be ready for 2026.1, or maybe not even 2026.2.
1
0
0
0
User avatar
James Scholes @jscholes@dragonscave.space
4mo
@fastfinge Where are you getting the first part of that understanding from? I.e. the dependence on the secure add-on runtime. @cachondo @FreakyFwoof @amir
1
0
0
0
User avatar
🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca
4mo
@jscholes @cachondo @FreakyFwoof @amir It was mentioned in the roadmap NVDA released a while back.
1
0
0
0
User avatar
James Scholes @jscholes@dragonscave.space
4mo
@fastfinge I see the "Secure add-on runtime" on the roadmap, with the note that "The first version of the runtime will provide support for speech synthesis and braille devices."

I don't see any implication that any 32-bit compatibility layer will only work for secure add-ons, which is hopefully a bit of a leap.

Still, the fact that people don't know what will or won't be happening, or whether their preferred synthesiser(s) will work or not, continues to be a big part of the problem.
@cachondo @FreakyFwoof @amir
1
0
0
0
User avatar
🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca
4mo
@jscholes @cachondo @FreakyFwoof @amir That's my assumption because the only things that really need a 32-bit compatibility layer are speech synthesizers and braille devices.
3
0
0
0
User avatar
clv1 has moved @clv1@mastodon.social
4mo
@fastfinge @jscholes @cachondo @FreakyFwoof @amir And what about recording new voices for RHVoice?
1
0
1
0
User avatar
🇨🇦Samuel Proulx🇨🇦 @fastfinge@interfree.ca
4mo
@clv1 @jscholes @cachondo @FreakyFwoof @amir The issue is that both of these are effectively concatenative or parametric, rather than formant, systems. So they will never be as intelligible as eloquence.
0
0
0
0