Extend Lumo to have integrated TTS/STT for full Voice Interaction
For general Chat, its really nice to be able to have a conversation using voice rather than keyboard. And while I know that the Android/IOS apps offers STT, its unfortunate that I have to use the phones TTS to complete the loop. For my tablet and/or webbrowser a full TTS/STT feature would be nice.
-
Proton Nutzer
commented
Even for the use on iPhones the local STT does not cut it,
becase you loose alle the information that is carried by your intonation (you even loose the interpunctuation).
Also if the STT is done locally, Lumo cannot adjust to your dialect and speech patters (the things we're trying to keep out of the hands of the data brokers)..
On the reverse path, having to use the local TTS ist just a bad joke.
I want to have a proper conversation with Lumo, without having to press a button to read the answers.
So instead of just the "microphone STT button" and a "read replies TTS button", an additional selector should be there,
something like a "conversation toggle switch", that activates both, until you turn it off..
And i.e. when replying by voice,
Lumo should not just read what's on the screen, cuz. if lumo starts reading a table, that does not carry very well in speech.
So Lumo should point you to the full table in the chat, but curate what it replies for the auditive communication channel. -
Jolene Cook commented
Ditto!
-
Tony
commented
Not only that, but the STT doesn't realise that we finished our question/sentence and waits till we manually click send, making the experience not hands-free even with the use of the OS TTS. I think this is the future and there needs to be full voice mode like Grok, ChatGPT..etc
-
Ananda
commented
I think there needs to be a full Voice Mode (Like Claude and ChatGPT) and also an icon below every response from Lumo that enables reading each message individually! Those two options together form a complete voice integration which increases usability and accessibility! Which is a win-win situation :) Agreed with John Doe, such feature is absolutely necessary, otherwise it's like having a gigantic mansion with only one entrance at the back ;) I use AI assistants on voice mode 95% of the time! Thanks!
-
John Doe
commented
Please - This is severely needed. An LLM without voice mode is only half useful. Ideally, it would be a full conversation mode where you are able to speak, the AI realize you are done speaking and reply, and then you are able to reply...etc. All without clicking the microphone icon multiple times or clicking "play" on the AI's response.