UN sanctions on Iran to be reimposed, France's Macron says
European powers will likely reimpose international sanctions on Iran by the end of the month after their latest round of talks with Tehran aimed at pr...
AI startup ElevenLabs, known for its cutting-edge audio-generation technology, has unveiled its first stand-alone speech-to-text model, named Scribe.
The company, fresh off a $180 million funding round and valued at $3.3 billion, is now expanding its technology portfolio to compete in the speech detection arena.
Scribe supports over 99 languages at launch, with more than 25 languages achieving an “excellent” accuracy rating—defined as a word error rate of less than 5%. This list includes English, with a claimed accuracy rate of 97%, as well as French, German, Hindi, Indonesian, Japanese, Kannada, Malayalam, Polish, Portuguese, Spanish, and Vietnamese. Other languages are categorized into high, good, and moderate accuracy levels based on their word error rates.
According to benchmark tests using FLEURS and Common Voice datasets, Scribe has outperformed competitors such as Google Gemini 2.0 Flash and OpenAI’s Whisper Large V3 across multiple languages. Previously, ElevenLabs developed a speech-to-text component for its AI conversational agent platform, but Scribe marks the first time the company is releasing a dedicated, stand-alone speech detection model.
CEO Mati Staniszewski told TechCrunch last month, “We want to understand what’s being said by you in a conversation better. We are working on ways to move away from only generating content and understanding and transcribing speech.” He noted that while many consider speech-to-text a solved problem, performance for many languages remains suboptimal. “We think we can build better speech detection models because we have in-house teams to annotate data and give us quick feedback,” Staniszewski added.
In addition to accurate transcription, Scribe incorporates smart speaker diarization to identify who is speaking, provides word-level timestamps for precise subtitle generation, and auto-tags sound events such as audience laughter. The model currently processes pre-recorded audio formats, and ElevenLabs plans to release a low-latency real-time version in the near future, which would extend its use to meeting transcriptions and live voice note-taking.
Scribe is priced competitively at $0.40 per hour of transcribed audio, although some rival services offer lower prices with different feature sets. As ElevenLabs continues to push the boundaries of generative AI technology, the launch of Scribe marks another significant step in expanding its influence across both audio-generation and speech detection markets.
AnewZ has learned that India has once again blocked Azerbaijan’s application for full membership in the Shanghai Cooperation Organisation, while Pakistan’s recent decision to consider diplomatic relations with Armenia has been coordinated with Baku as part of Azerbaijan’s peace agenda.
A day of mourning has been declared in Portugal to pay respect to victims who lost their lives in the Lisbon Funicular crash which happened on Wednesday evening.
A Polish Air Force pilot was killed on Thursday when an F-16 fighter jet crashed during a training flight ahead of the 2025 Radom International Air Show.
At least eight people have died and more than 90 others were injured following a catastrophic gas tanker explosion on a major highway in Mexico City’s Iztapalapa district on Wednesday, authorities confirmed.
Palaeontologists in Peru unveiled the fossilized skeleton of an ancient, dolphin-like creature estimated to be between 8 and 12 million years old.
China has entered the United Nations’ annual list of the world’s ten most innovative nations for the first time, displacing Germany, Europe’s largest economy, as companies in Beijing ramp up investment in research and development.
Microsoft and OpenAI announced Thursday a non-binding deal outlining terms that would allow OpenAI to restructure into a for-profit company, marking a key step in the high-profile partnership fueling ChatGPT’s growth.
The U.S. Federal Trade Commission has launched an inquiry into seven technology companies over how their AI chatbots interact with children, amid rising concerns about safety and mental health risks.
Nvidia (NVDA.O) announced on Tuesday that it plans to release a new artificial intelligence chip by the end of next year, designed to manage complex tasks like video creation and software development.
You can download the AnewZ application from Play Store and the App Store.
What is your opinion on this topic?
Leave the first comment