- Text-to-Speech (TTS)
- Automatic Speech Recognition (ASR)
Setting Up TTS
Open the TTS tab inside Settings → Speech. Tap Add Provider and select the provider you want to use. Fill in the required fields, then tap Save. You can add multiple providers and switch between them at any time.To hear a message read aloud, tap the speaker icon that appears on any assistant message bubble.Available TTS Providers
OpenAI TTS
OpenAI TTS
Uses OpenAI’s neural TTS API. Produces natural-sounding speech with a choice of six preset voices.
Your OpenAI API key. Find this in your OpenAI dashboard.
Base URL for the OpenAI-compatible API. Change this if you are routing through a proxy or using a third-party OpenAI-compatible endpoint.
The TTS model to use.
gpt-4o-mini-tts is fast and cost-effective; swap in tts-1-hd for higher fidelity output.Voice preset. OpenAI offers
alloy, echo, fable, onyx, nova, and shimmer.Gemini TTS
Gemini TTS
Uses Google’s Gemini multimodal TTS. Supports a wide range of voices including the expressive Gemini 2.5 family.
Your Google AI Studio API key.
Base URL for the Gemini API.
Gemini model to use for speech synthesis.
Voice name. Refer to the Google AI voice list for all available options.
MiniMax TTS
MiniMax TTS
High-quality Chinese and multilingual TTS from MiniMax.
Your MiniMax API key.
MiniMax API base URL.
TTS model identifier.
Voice ID. Browse available voices in the MiniMax console.
Emotional tone of the synthesized voice, e.g.
calm, happy, sad.Playback speed multiplier. Values below
1.0 slow speech down; values above 1.0 speed it up.Qwen TTS
Qwen TTS
Alibaba DashScope’s Qwen TTS, optimised for Chinese and multilingual speech.
Your DashScope API key.
DashScope API base URL.
TTS model.
qwen3-tts-flash is the low-latency variant.Voice name. Check the DashScope documentation for the full list.
Language hint.
Auto lets the model detect the language automatically.Groq TTS
Groq TTS
Fast TTS via Groq’s inference infrastructure, powered by Orpheus models.
Your Groq API key.
Groq API base URL.
Orpheus TTS model to use.
Voice preset name.
xAI TTS
xAI TTS
MiMo TTS
MiMo TTS
System TTS
System TTS
Uses Android’s built-in text-to-speech engine. No API key required — works fully on-device using whatever voice packs you have installed in Android settings.
Speed of synthesized speech.
1.0 is normal speed; increase for faster delivery, decrease for slower.Pitch of the synthesized voice.
1.0 is the default pitch for the selected Android voice.The quality and language support of System TTS depend on the voice packs installed on your device. You can install additional voices in Android Settings → Accessibility → Text-to-speech output.
Using Speech in the Chat Screen
Once you have at least one provider configured in each category, the controls appear directly in the chat interface:Microphone (ASR)
Tap the microphone icon in the chat input bar to start recording. Speak your message naturally — Rikka streams the audio to your ASR provider and populates the text field as words are transcribed. Release or tap again to stop.
Speaker (TTS)
Tap the speaker icon on any assistant message to have it read aloud by your configured TTS provider. Tap again to stop playback mid-sentence.