FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning
IntermediateTanyu Chen, Tairan Chen et al.Jan 16arXiv
Chroma 1.0 is a real-time, end-to-end speech-to-speech system that can talk back in your own cloned voice with sub-second delay.
#end-to-end speech-to-speech#personalized voice cloning#streaming TTS