Solaria-3: Our new speech-to-text model
Solaria-3 is built for production audio: noisy, fast-paced, and conversational. Best-in-class on real customer recordings in English and core European languages, with higher precision on the names, terms, and entities that matter most in business scenarios.
- Best on real English audio: 9.6% WER on Gladia's internal production dataset of real customer calls, annotated by humans, and 26% better than Solaria-1.
- #1 on business calls and telephone speech: Leading WER on Earnings22 (6.4%, only model under 7%) and Switchboard (33.9%, all competing providers above 42%).
- Most accurate model for European languages: Consistent accuracy gains across English (-26%), French (-18%), Italian (-10%), Spanish (-9%), and German (-3%) vs. Solaria-1 on real customer audio.
Get started with Solaria-3
Try it now on the Developer Console, or read more about benchmarks and use cases on gladia.io/solaria-3.











