Back to Blog
Product

Introducing SpeechLLM 2.0: Our Most Accurate Model Yet

Major update: SpeechLLM 2.0 now achieves 99.8% accuracy with 40% faster inference. New features include real-time confidence scores, improved accents handling, and support for 15 additional languages. Available now for all enterprise customers.

Sarah Chen

VP of Engineering

March 8, 2026
7 min read

We're thrilled to announce SpeechLLM 2.0, a major advancement in AI answering machine detection that builds on months of research and feedback from 500+ enterprise customers. This update represents a fundamental improvement in accuracy, speed, and global language support.

Key Improvements in SpeechLLM 2.0

Accuracy: Now 99.8%

SpeechLLM 2.0 achieves 99.8% detection accuracy—up from 99.7% in the previous version. While this appears incremental, for high-volume operations processing 100,000+ calls daily, this means recovering hundreds of valuable conversations that would have been incorrectly filtered.

What changed:

  • Upgraded from CNN-LSTM to pure transformer architecture for better long-range pattern detection
  • Integrated adversarial training against synthetic adversarial voicemails designed to fool detection
  • New ensemble approach combining multiple model architectures for edge cases

Speed: 40% Faster Processing

We've optimized inference to run in under 30ms (down from 50ms), ensuring zero perceptible delay in call routing:

  • Optimized transformer attention mechanisms reduce computational overhead by 35%
  • Streaming inference begins analysis before the complete audio buffer arrives
  • Better GPU memory management enables batch processing at scale

Language Support: 65+ Languages

We've expanded from 50 to 65+ supported languages including:

  • Southeast Asian: Thai, Vietnamese, Indonesian, Filipino
  • Eastern European: Polish, Czech, Hungarian, Romanian
  • Middle Eastern: Hebrew, Arabic, Farsi, Turkish
  • African: Swahili, Yoruba, Amharic, Somali

Each language was trained on 10,000+ hours of real voicemail greetings with native speaker validation.

New Features

Confidence Scores: Each detection now includes a 0-100% confidence metric, allowing you to set custom thresholds for different campaigns.

Accent Recognition: Improved handling of regional accents, non-native speakers, and accent variations within the same language.

Real-time Feedback Loop: Enterprise customers can now feed back false positives to continuously improve their custom models.

Performance Benchmarks

MetricPreviousSpeechLLM 2.0Improvement
Accuracy99.7%99.8%+0.1%
Detection Latency50ms30ms-40%
Languages5065+15 languages
False Positive Rate0.4%0.15%-62%
Model Size95MB85MB-10%
Inference Cost$0.0025/call$0.0015/call-40%

Availability and Migration

SpeechLLM 2.0 is available now for:

  • ✅ All new customers (default)
  • ✅ Enterprise customers (contact support for activation)
  • 📅 Standard/Professional plans (rolling out March 2026)

Migration is seamless—simply update your API endpoint and you're using the new model immediately. All previous features remain fully compatible.

Looking Ahead

We're already developing SpeechLLM 3.0, which will feature:

  • Real-time voicemail greeting transcription
  • Caller intent analysis
  • Custom model fine-tuning for enterprise deployments

Upgrade to SpeechLLM 2.0 or contact sales to see a live demo.