Google releases SpeciesNet, an AI model to identify wildlife
Google has open sourced SpeciesNet, an artificial intelligence model designed to automatically identify animal species from photos captured by camera traps.
AI startup ElevenLabs, known for its cutting-edge audio-generation technology, has unveiled its first stand-alone speech-to-text model, named Scribe.
The company, fresh off a $180 million funding round and valued at $3.3 billion, is now expanding its technology portfolio to compete in the speech detection arena.
Scribe supports over 99 languages at launch, with more than 25 languages achieving an “excellent” accuracy rating—defined as a word error rate of less than 5%. This list includes English, with a claimed accuracy rate of 97%, as well as French, German, Hindi, Indonesian, Japanese, Kannada, Malayalam, Polish, Portuguese, Spanish, and Vietnamese. Other languages are categorized into high, good, and moderate accuracy levels based on their word error rates.
According to benchmark tests using FLEURS and Common Voice datasets, Scribe has outperformed competitors such as Google Gemini 2.0 Flash and OpenAI’s Whisper Large V3 across multiple languages. Previously, ElevenLabs developed a speech-to-text component for its AI conversational agent platform, but Scribe marks the first time the company is releasing a dedicated, stand-alone speech detection model.
CEO Mati Staniszewski told TechCrunch last month, “We want to understand what’s being said by you in a conversation better. We are working on ways to move away from only generating content and understanding and transcribing speech.” He noted that while many consider speech-to-text a solved problem, performance for many languages remains suboptimal. “We think we can build better speech detection models because we have in-house teams to annotate data and give us quick feedback,” Staniszewski added.
In addition to accurate transcription, Scribe incorporates smart speaker diarization to identify who is speaking, provides word-level timestamps for precise subtitle generation, and auto-tags sound events such as audience laughter. The model currently processes pre-recorded audio formats, and ElevenLabs plans to release a low-latency real-time version in the near future, which would extend its use to meeting transcriptions and live voice note-taking.
Scribe is priced competitively at $0.40 per hour of transcribed audio, although some rival services offer lower prices with different feature sets. As ElevenLabs continues to push the boundaries of generative AI technology, the launch of Scribe marks another significant step in expanding its influence across both audio-generation and speech detection markets.
Royal Australian Air Force (RAAF) pilots, monitoring a Chinese navy warship as it navigated Australian waters, were alerted to a live-fire exercise via a civilian radio broadcast, defense officials revealed on Tuesday.
A powerful 7.7-magnitude earthquake struck Myanmar’s Sagaing region, followed by a 6.4-magnitude tremor, killing 2056 people and leaving 3,900 injured. The quake caused building collapses in Myanmar and Thailand, prompting emergency declarations and ongoing rescue efforts.
As the world shifts toward clean energy at an ever-accelerating pace, large economies are scrambling to secure reliable supply chains for rare earth minerals. These minerals, once seen as mere industrial components, have become a political tool in the global power struggle
Russian forces carried out a drone attack on Ukraine’s second-largest city, Kharkiv, late Wednesday, injuring at least twenty one people and causing structural damage, according to Ukrainian officials.
Neuralink plans to implant its first Blindsight vision chip in a human by the end of the year, enabling vision for those born blind, according to Elon Musk. The device could eventually surpass natural vision, allowing users to see in infrared, ultraviolet, and radar ranges.
Researchers at Rice University have made a groundbreaking discovery in the field of strange metals—materials that defy conventional understanding of electricity and magnetism
Airbus UK wins a £150 million-contract to engineer landing platform that will safely deliver the first European rover on Mars. First British-built rover will explore the red planet in 2030 for signs of present and past life on Mars.
The model aims to enhance Alibaba’s presence in the generative AI sector and is available as open-source.
OpenAI has asked the US government to permit AI companies to use copyrighted material for training to maintain America's leadership in AI development, as part of a proposal aligned with President Trump's upcoming "AI Action Plan."
You can download the AnewZ application from Play Store and the App Store.
What is your opinion on this topic?
Leave the first comment