French PM proposes cutting two public holidays to reduce debt
French Prime Minister François Bayrou has unveiled a sweeping budget plan that includes scrapping two public holidays—Easter Monday and 8 May, whic...
AI startup ElevenLabs, known for its cutting-edge audio-generation technology, has unveiled its first stand-alone speech-to-text model, named Scribe.
The company, fresh off a $180 million funding round and valued at $3.3 billion, is now expanding its technology portfolio to compete in the speech detection arena.
Scribe supports over 99 languages at launch, with more than 25 languages achieving an “excellent” accuracy rating—defined as a word error rate of less than 5%. This list includes English, with a claimed accuracy rate of 97%, as well as French, German, Hindi, Indonesian, Japanese, Kannada, Malayalam, Polish, Portuguese, Spanish, and Vietnamese. Other languages are categorized into high, good, and moderate accuracy levels based on their word error rates.
According to benchmark tests using FLEURS and Common Voice datasets, Scribe has outperformed competitors such as Google Gemini 2.0 Flash and OpenAI’s Whisper Large V3 across multiple languages. Previously, ElevenLabs developed a speech-to-text component for its AI conversational agent platform, but Scribe marks the first time the company is releasing a dedicated, stand-alone speech detection model.
CEO Mati Staniszewski told TechCrunch last month, “We want to understand what’s being said by you in a conversation better. We are working on ways to move away from only generating content and understanding and transcribing speech.” He noted that while many consider speech-to-text a solved problem, performance for many languages remains suboptimal. “We think we can build better speech detection models because we have in-house teams to annotate data and give us quick feedback,” Staniszewski added.
In addition to accurate transcription, Scribe incorporates smart speaker diarization to identify who is speaking, provides word-level timestamps for precise subtitle generation, and auto-tags sound events such as audience laughter. The model currently processes pre-recorded audio formats, and ElevenLabs plans to release a low-latency real-time version in the near future, which would extend its use to meeting transcriptions and live voice note-taking.
Scribe is priced competitively at $0.40 per hour of transcribed audio, although some rival services offer lower prices with different feature sets. As ElevenLabs continues to push the boundaries of generative AI technology, the launch of Scribe marks another significant step in expanding its influence across both audio-generation and speech detection markets.
A series of earthquakes have struck Guatemala on Tuesday afternoon, leading authorities to advise residents to evacuate from buildings as a precaution against possible aftershocks.
Authorities in North Carolina are investigating three potential storm-related deaths linked to severe flooding from the remnants of Tropical Storm Chantal, officials said Tuesday.
Start your day informed with AnewZ Morning Brief: here are the top news stories for 10th July, covering the latest developments you need to know.
China and the Association of Southeast Asian Nations will send an upgraded ‘version 3.0’ free-trade agreement to their heads of government for approval in October, Chinese Foreign Minister Wang Yi said on Saturday after regional talks in Kuala Lumpur.
Chinese automaker Chery has denied an industry-ministry audit that disqualified more than $53 million in state incentives for thousands of its electric and hybrid vehicles, insisting it followed official guidance and committed no fraud.
Apple and mining company MP Materials announced a joint $500 million investment to develop a rare earth magnet recycling facility, with plans to bolster U.S.-based production and reduce reliance on China.
Meta CEO Mark Zuckerberg announced plans to invest hundreds of billions of dollars into building next-generation AI data centres, signalling an aggressive long-term bet on superintelligence and reaffirming Meta’s leadership ambitions in the global AI race.
Peggy Whitson, NASA retiree turned private astronaut, headed for splashdown in the Pacific on Tuesday after her fifth trip to the International Space Station, joined by crewmates from India, Poland, and Hungary returning from their countries’ first ISS mission.
A team led by Prof. Mingtai Wang at the Hefei Institutes of Physical Science has developed a breakthrough method to control the spacing of titanium dioxide nanorods without changing their size, significantly improving solar cell efficiency.
Israeli researchers have unveiled an artificial intelligence tool that can determine a person’s true biological age from tiny DNA samples with remarkable precision.
You can download the AnewZ application from Play Store and the App Store.
What is your opinion on this topic?
Leave the first comment