Development Tools and APIs

Development tools and APIs designed to let users create custom speech technology applications are at the foundation of the speech technology industry. See below for the latest development tools and API news, trends, and solutions.

Features

Voice Is Poised to Take a Quantum Leap

Exploring quantum computing's expected impact on the speech technology market.

Industry-Standard Speech App Building Blocks Take Shape

Interface interoperability is getting closer to reality, but more work is needed.

The Top Speech Technologies and Vendors: The 2023 Speech Industry Awards

AI, AI, and more AI: The technology is disrupting everything, and it's found everywhere in our speech industry achievements for 2023.

2023 Speech Industry Award Winner: D-ID Gives a Human Face and Voice to AI

D-ID, an Israeli company founded in 2017, is providing superpowers to individual creators and businesses alike, uniquely enabling them to transform any picture into an interactive video in seconds.

Industry Voices

Why Speech Researchers Need Better Benchmarks

Long-form speech recognition is here and growing. With updated datasets, we can accurately train and test ASR models for real-world use cases.

Four Pitfalls to Avoid When Building Compelling Voice Experiences

As voice experiences grow in popularity, here are some pitfalls developers can avoid when creating voice-focused products. 

Mitigating TDMA Noise in Microphone Lines

Here are a few countermeasures that designers can incorporate to mitigate TDMA noise without affecting the signals.

Protecting User Data: How Close is the US to its Own GDPR?

GDPR has already had wide-ranging consequences for companies collecting data, and now some are calling for federal regulations in the U.S. Voice data isn't exempt from the regulations, and vendors need to be ready.

Columns

Let’s Continue to Prioritize Innovation

The next waves of innovation in the speech technology space promise to be more transformative than ever before.

Putting Teams of GenAI Agents to Work

Multi-agent collaboration is the best approach to problem solving.

OpenAI Was the Biggest Disrupter. Now, That Could Change

Chinese AI lab DeepSeek's open-source large language model immediately sent ripples through the tech world.

Speech’s Next Big Thing Is Moving Fast

Quantum computing is making a resurgence, and speech tools could be beneficiaries.

Development Tools and APIs Companies and Suppliers