Avalon lands at #6 on OpenASR Leaderboard

October 8, 2025

Finnian Brown

OpenASR ran their speech recognition benchmarks against Avalon, and it is now the #1 commercial model in the world, surpassing OpenAI Whisper-large-v3, ElevenLabs Scribe v1, and Rev AI Fusion.

Avalon was the best performing proprietary model, with a WER average of 6.24.🔥

Avalon also outperformed open models like NVIDIA Canary 1B, CrisperWhisper, Voxtral Mini 3B, and Distil Whisper.

Shows ASR leaderboard with a blue ring around aquavoice/avalon-v1-en with a WER of 6.24%.

You can explore the OpenASR Leaderboard on Hugging Face to see the full breakdown.

Availability

Avalon is available on the Aqua Voice app for English and via our API. We're making the API available for free until October 30th.

About Avalon

Avalon is a transcription model optimized for human-computer interaction. Our goal was to fix the annoying mistakes that are common in other ASR systems when used in human-talking-to-computer interactions.

When we were building Avalon, our goal wasn't to reduce overall word error rate, but to get better at programming and coding terms and company names that are often mistranscribed.

We evaluated our performance on these terms in our AISpeak benchmark, which we unpack in Introducing Avalon. Compared to a baseline accuracy on these jargon terms of 65% for Whisper Large v3 and 78% for ElevenLabs Scribe v1, a leading commercial model, Avalon achieves an accuracy of 97%.

However, when we evaluated Avalon on the industry standard benchmarks, we were pleasantly surprised to see a significant reduction in overall word error rate.

Why OpenASR matters

OpenASR is the industry standard benchmark for transcription models. It measures the accuracy in word error rate of models on a variety of public audio datasets compared with human labels. The benchmark suite currently consists of seven datasets spanning different audio domains.

The leaderboard highlights Avalon's balance of technical fluency and broad accuracy:

#1 proprietary model globally. Avalon beats models like Whisper Large v3, ElevenLabs Scribe, and Rev AI Fusion.
Top-10 performance overall. Avalon ranks #6 across every model—open source and proprietary.
Practical accuracy gains. The same optimizations that help Avalon nail jargon also reduce total word errors in standard industry benchmarks.

We're excited to see more teams build with Avalon. If you'd like early access or to chat about integrations, get in touch with us.