-->

Speech Technology News

OpenAI Introduces Speech-to-Text and Text-to-Speech Audio Models

OpenAI's new suite of audio models to power voice agents is now available to developers worldwide through its API.

Hona Launches Voice AI

Hona's Voice AI is an advanced voice artificial intelligence solution for managing law firms' client communications and client intake. 

SoundHound AI Delivers Voice Assistants at Scale with NVIDIA

SoundHound is pairing its advanced Voice AI with NVIDIA Ai Enterprise.

AI Virtual Assistants Market to Hit $2.45 Billion by 2030

Valuates Reports expects 16.5 percent growth for AI-powered virtual assistants, with voice interfaces as a major catalyst.

Kardome Mobility Now Available on NVIDIA AGX Platform

Kardome Mobility on NVIDIA AGX enhances the in-vehicle voice experience.

Wispr Launches Wispr Flow for Windows

Wispr's voice dictation product now expands to the Windows platform after a successful Mac launch in 2024.

Microsoft Releases .NET MAUI Toolkit V. 11 with Offline Speech Recognition

Microsoft's new open-source developer framework supports speech-to-text conversions with or without an internet connection.

Agora Launches Conversational AI Toolkit for IoT Devices 

Agora's new partnerships with Beken and Robopoet showcase the future of interactive toys and connected devices.

Mood Media Launches Messaging Copilot

Mood Media's Messaging Copilot helps retailers create and manage in-store audio.

Agora Launches Conversational AI Engine

Agora's new solution allows developers to use any AI model to create voice agents optimized for ultra-low latency and natural conversation flow.

Deepgram Launches Nova-3 Medical

Deepgram's Nova-3 Medical is a healthcare-specific speech-to-text model

Deepdub Partners with AWS

Deepdub's AI voice technology and media localization solutions are now available in AWS Marketplace.

StudyFetch Launches Conversational TutorMe App

StudyFetch's TutorMe is a voice-enabled, conversational, and personalized tutor.

DeliverHealth Partners with Google Cloud on Documentation Solutions

DeliverHelth gains access to GoogleCloud's GenAI technologies for its clinical documentation solutions.

Zeta Launches Selene, a Speech-Enabled Customer Support Agent for Banks and Fintechs

Zeta's Selene leverages generative AI with banking-grade features to handle 100 percent of customer support calls. (Featured on SmartCustomerService.com.)

SoundHound Enhances Dynamic Drive-Thru

SoundHound's Dynamic Drive-Thru now includes omnichannel ordering and additional AI capabilities.

Teleperformance Partners with Sanas

Teleperformance has bought an equity stake in Sanas and will become a reseller of its technology. (Featured on SmartCustomerService.com.)

Deepgram Achieves Key Milestone in Delivering a Speech-to-Speech Architecture

Deepgram's new model will be able to deliver speech-to-speech technology without intermediate text representation.

Panjaya.ai Unveils Pod Pro with Localization and Multilingual Sync

Panjaya's Pod Pro is a free dubbing platform for podcasts.

Curve Dental Integrates with Mango Voice for AI-Powered Call Documentation

The integration of Curve Dental and Mango Voice provides automated call documentation for dental practices.