@fastfinge I have been developing a neural TTS system, focused on screen reading for many months, which offers instant responsiveness, but maintains good synthesis quality at the same time. And, BTW, it is not recommended at all to use espeak as a phonemizer backend as breaks the text embeddings during model training, especially if we use linguistic information. And, please consider to avoid overeading NVDA's python environment in your add-ons.
@rmcpantoja Yes, the only way to avoid messing with the NVDA Python environment would be to do an IPC server. But at that point, you're really just rewriting SAPI and it seems pointless.