-->

Speech Technology News

Phonic Launches End-to-End Speech-to-Speech Platform for Building Voice Agents

Phonic's intelligent decision system and hyper-realistic voices form the basis of its voice AI.

Krikey AI Launches Talking Avatars with ElevenLabs

Krikey AI users can create talking avatars with ElevenLabs' voice generator and text-to-speech.

SyncWords Introduces Ultra-Low Latency AI Captions with Kobe Muxer

SyncWord's Kobe Muxer is a video captioning solution with near-real-time availability.

Deepdub Launches Deepdub Live for Global Events

Deepdub Live brings expressive, multilingual voice localization to live sports, esports, and news events.

Gladia Launches Solaria, a Multilingual Speech-to-Text Model 

Gladia's Solaria delivers native-level transcription in 100 languages.

aiOla Launches Jargonic Speech Recognition Model

aiOla's Jargonic is a accurate speech model with specialized recognition of industry-specific terminologyfo.

Northeastern Researchers Develop AI App to Help Speech-Impaired

Two Northeastern University professors are developing an app to give users speech recognition, text, whole-word selection, and emojis on their mobile devices.

XL8 Delivers Real-Time Spanish Translation Captions to U.S. Public Broadcasters

XL8 Spanish translation captions mark the first commercial use of AI-based real-time translation technology in broadcasting.

OpenAI Introduces Speech-to-Text and Text-to-Speech Audio Models

OpenAI's new suite of audio models to power voice agents is now available to developers worldwide through its API.

Hona Launches Voice AI

Hona's Voice AI is an advanced voice artificial intelligence solution for managing law firms' client communications and client intake. 

SoundHound AI Delivers Voice Assistants at Scale with NVIDIA

SoundHound is pairing its advanced Voice AI with NVIDIA Ai Enterprise.

AI Virtual Assistants Market to Hit $2.45 Billion by 2030

Valuates Reports expects 16.5 percent growth for AI-powered virtual assistants, with voice interfaces as a major catalyst.

Kardome Mobility Now Available on NVIDIA AGX Platform

Kardome Mobility on NVIDIA AGX enhances the in-vehicle voice experience.

Wispr Launches Wispr Flow for Windows

Wispr's voice dictation product now expands to the Windows platform after a successful Mac launch in 2024.

Microsoft Releases .NET MAUI Toolkit V. 11 with Offline Speech Recognition

Microsoft's new open-source developer framework supports speech-to-text conversions with or without an internet connection.

Agora Launches Conversational AI Toolkit for IoT Devices 

Agora's new partnerships with Beken and Robopoet showcase the future of interactive toys and connected devices.

Mood Media Launches Messaging Copilot

Mood Media's Messaging Copilot helps retailers create and manage in-store audio.

Agora Launches Conversational AI Engine

Agora's new solution allows developers to use any AI model to create voice agents optimized for ultra-low latency and natural conversation flow.

Deepgram Launches Nova-3 Medical

Deepgram's Nova-3 Medical is a healthcare-specific speech-to-text model

Deepdub Partners with AWS

Deepdub's AI voice technology and media localization solutions are now available in AWS Marketplace.