Venezuelan oil exports drop sharply after U.S. tanker seizure
Venezuela’s oil shipments have plunged following the United States’ seizure of a tanker earlier this week....
AI startup ElevenLabs, known for its cutting-edge audio-generation technology, has unveiled its first stand-alone speech-to-text model, named Scribe.
The company, fresh off a $180 million funding round and valued at $3.3 billion, is now expanding its technology portfolio to compete in the speech detection arena.
Scribe supports over 99 languages at launch, with more than 25 languages achieving an “excellent” accuracy rating—defined as a word error rate of less than 5%. This list includes English, with a claimed accuracy rate of 97%, as well as French, German, Hindi, Indonesian, Japanese, Kannada, Malayalam, Polish, Portuguese, Spanish, and Vietnamese. Other languages are categorized into high, good, and moderate accuracy levels based on their word error rates.
According to benchmark tests using FLEURS and Common Voice datasets, Scribe has outperformed competitors such as Google Gemini 2.0 Flash and OpenAI’s Whisper Large V3 across multiple languages. Previously, ElevenLabs developed a speech-to-text component for its AI conversational agent platform, but Scribe marks the first time the company is releasing a dedicated, stand-alone speech detection model.
CEO Mati Staniszewski told TechCrunch last month, “We want to understand what’s being said by you in a conversation better. We are working on ways to move away from only generating content and understanding and transcribing speech.” He noted that while many consider speech-to-text a solved problem, performance for many languages remains suboptimal. “We think we can build better speech detection models because we have in-house teams to annotate data and give us quick feedback,” Staniszewski added.
In addition to accurate transcription, Scribe incorporates smart speaker diarization to identify who is speaking, provides word-level timestamps for precise subtitle generation, and auto-tags sound events such as audience laughter. The model currently processes pre-recorded audio formats, and ElevenLabs plans to release a low-latency real-time version in the near future, which would extend its use to meeting transcriptions and live voice note-taking.
Scribe is priced competitively at $0.40 per hour of transcribed audio, although some rival services offer lower prices with different feature sets. As ElevenLabs continues to push the boundaries of generative AI technology, the launch of Scribe marks another significant step in expanding its influence across both audio-generation and speech detection markets.
Japan has lifted a tsunami advisory issued after an earthquake with a magnitude of 6.9 hit the country's northeastern region on Friday (12 December), the Japan Meteorological Agency (JMA) said. The JMA had earlier put the earthquake's preliminary magnitude at 6.7.
The United States issued new sanctions targeting Venezuela on Thursday, imposing curbs on three nephews of President Nicolas Maduro's wife, as well as six crude oil tankers and shipping companies linked to them, as Washington ramps up pressure on Caracas.
Iran is preparing to host a multilateral regional meeting next week in a bid to mediate between Afghanistan and Pakistan.
Kyiv has escalated its naval campaign against Moscow’s economic lifelines, claiming a successful strike on a vessel suspected of skirting international sanctions within the Black Sea.
An extratropical cyclone has caused widespread disruption across Brazil’s São Paulo state, with powerful winds toppling trees and power lines, blocking streets and leaving large parts of the region without electricity.
Time Magazine has chosen the creators behind artificial intelligence as its 2025 Person of the Year, highlighting the technology’s sweeping impact on global business, politics and daily life.
Children are forming new patterns of trust and attachment with artificial intelligence (AI) companions, entering a world where digital partners shape their play, their confidence and the conversations they no longer share with adults.
The International Robot Exhibition (IREX) opened in Tokyo on 3 December, bringing together visitors to explore robotics applications for industry, healthcare, logistics, and everyday life.
A bipartisan group of U.S. senators, including prominent Republican China hawk Tom Cotton, introduced the SAFE CHIPS Act on Thursday, aiming to prevent the Trump administration from easing restrictions on China’s access to advanced artificial intelligence (AI) chips for a period of 2.5 years.
A former Apple engineer has unveiled a new Chinese chip designed to compete directly with Apple’s Vision Pro headset.
You can download the AnewZ application from Play Store and the App Store.
What is your opinion on this topic?
Leave the first comment