Bitcoin

Bitcoin

$119,345.72

BTC 2.58%

Ethereum

Ethereum

$3,237.84

ETH 6.53%

  • Login
  • Register
Metaverse Media Group
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
Metaverse Media Group

Mistral unveils Voxtral, an open-source speech model with lower costs than proprietary rivals

Mistral unveils Voxtral, an open-source speech model with lower costs than proprietary rivals

The Decoderby The Decoder
16 July 2025
French AI company Mistral AI presents Voxtral, two open-source models for speech understanding that are designed to replace proprietary solutions at less than half the cost. The article Mistral unveils Voxtral, an open-source speech model with lower costs than proprietary rivals appeared first on THE DECODER….

summary
Summary

French AI company Mistral unveils Voxtral, an open-source speech understanding model that aims to replace proprietary solutions at less than half the cost.

The Voxtral models come in two versions: a 24B variant for production applications and a compact 3B model for local and edge deployments. Both support a 32,000-token context window, which Mistral says can handle audio files up to 30 minutes for transcription or 40 minutes for comprehension tasks.

Unlike basic transcription tools, Voxtral builds in Q&A and summarization features without requiring separate speech recognition and language models. It also lets users trigger backend functions directly through voice commands by automatically translating spoken requests into API calls.

Scatterplot: Preis (USD/min) vs. Wortfehlerrate im FLEURS-Datensatz, zeigt Voxtral Small als beste Kosten-Fehler-Balance.
Voxtral Small has a significantly lower error rate, but undercuts Whisper large in terms of price. | Image: Mistral
Recommend our article

The models support automatic speech recognition in English, Spanish, French, Portuguese, Hindi, German, Dutch and Italian while retaining the text comprehension capabilities of Mistral Small 3.1‘s language model backbone.

THE DECODER Newsletter
The most important AI news straight to your inbox.
✓ Weekly
✓ Cancel at any time

Benchmark performance exceeds competition

Mistral’s tests show Voxtral Small outperforming leading open-source model Whisper large-v3, along with GPT-4o mini Transcribe and Gemini 2.5 Flash across all tested tasks. For English short-form tasks and Mozilla’s Common Voice benchmark, it reportedly beats ElevenLabs Scribe – currently one of the strongest performers.

Balkendiagramm: Voxtral Mini/Small, GPT-4o mini Audio und Gemini 2.5 Flash bei Speech-Benchmarks und FLEURS BLEU
According to Mistral’s benchmarks, Voxtral can keep up with much larger models such as GPT-4o mini and Gemini 2.5 Flash. | Picture: Mistral

In the FLEURS multilingual speech recognition benchmark, Voxtral Small allegedly surpasses Whisper in all nine tested languages. For audio comprehension tasks, it performs comparably to GPT-4o-mini and Gemini 2.5 Flash while delivering state-of-the-art results in speech translation.

Pricing undercuts proprietary alternatives

Mistral positions Voxtral as a budget-friendly option, with API pricing starting at $0.001 per minute. The company claims Voxtral Mini Transcribe outperforms OpenAI’s Whisper at less than half the cost for price-sensitive applications, while Voxtral Small matches ElevenLabs Scribe’s performance at similar savings.

Enterprise features include private deployment options for regulated industries and domain-specific fine-tuning. Coming updates will add speaker segmentation, audio markups for age/emotion detection, and word-level timestamps.

Coming to Le Chat’s Voice Mode

Both Voxtral versions are available under Apache-2.0 license for download on Hugging Face, with Mistral also offering API access. The models will power the Voice Mode in Le Chat, which rolls out to all users in coming weeks.

Join our community
Join the DECODER community on Discord, Reddit or Twitter – we can’t wait to meet you.

Recommendation
Read the full article on The-Decoder.com
in AI
Reading Time: 3 mins read
0
0
20
VIEWS
Share on TwitterShare on Facebook

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now
ADVERTISEMENT

Related Posts

Zuckerberg predicts that not wearing AI glasses in the future will put you at a cognitive disadvantage
AI

Zuckerberg predicts that not wearing AI glasses in the future will put you at a cognitive disadvantage

4 hours ago
21
OpenAI is testing ChatGPT agents that create and edit presentations and spreadsheets in chat
AI

OpenAI is testing ChatGPT agents that create and edit presentations and spreadsheets in chat

5 hours ago
20
Grok 4 is not officially instructed to follow Musk’s views but often does on sensitive subjects
AI

xAI says Grok 4 is no longer searching for Musk’s views before it answers

21 hours ago
21

Comments

Please login to join discussion
ADVERTISEMENT

Latest News

  • All
  • Crypto
  • NFTs
  • Technology
  • Business
Future of Decentralized Intelligence The Lightchain AI Virtual Machine (AIVM) Inference
Crypto

Future of Decentralized Intelligence The Lightchain AI Virtual Machine (AIVM) Inference

Bitcoin.com News
by Bitcoin.com News
38 minutes ago
20
Fourth of July OG Whale Strikes Again: Another 10,000 BTC Moved as 30,000 Still Sit Idle
Crypto

Fourth of July OG Whale Strikes Again: Another 10,000 BTC Moved as 30,000 Still Sit Idle

Bitcoin.com News
by Bitcoin.com News
1 hour ago
21
These four charts show where AI companies could go next in the US
Technology

These four charts show where AI companies could go next in the US

Techonolgy Review
by Techonolgy Review
2 hours ago
20
Bitcoin Price Watch: Bulls Eye $120K as Market Tests Critical Resistance
Crypto

Bitcoin Price Watch: Bulls Eye $120K as Market Tests Critical Resistance

Bitcoin.com News
by Bitcoin.com News
2 hours ago
21
Former Top Google Researchers Have Made A New Kind of AI Agent
Business

Former Top Google Researchers Have Made A New Kind of AI Agent

Wired
by Wired
3 hours ago
21
BTCC Exchange Reports 132% Total Reserve Ratio With Ethereum Leading at 170% in July 2025
Crypto

BTCC Exchange Reports 132% Total Reserve Ratio With Ethereum Leading at 170% in July 2025

Bitcoin.com News
by Bitcoin.com News
3 hours ago
21
Load More
Next Post
Gemini: What is the Bitcoin Credit Card™ and How Does It Work?

Gemini: What is the Bitcoin Credit Card™ and How Does It Work?

ADVERTISEMENT

Follow Us

Categories

  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
Subscribe to our Newsletter

© 2022 Metaverse Media Group – The Metaverse Mecca

Privacy and Cookie Policy | Sitemap

Welcome Back!

Sign In with Google
OR

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Sign Up with Google
OR

Fill the forms below to register

*By registering into our website, you agree to the Terms & Conditions and Privacy Policy.
All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
Bitcoin

Bitcoin

$119,345.72

BTC 2.58%

Ethereum

Ethereum

$3,237.84

ETH 6.53%

  • Login
  • Sign Up
This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.

Subscribe to our newsletter

Get the latest news & win monthly prizes

Subscribe to our newsletter

For the Latest News and Monthly Prize Giveaways

Join Now
Join Now