Research Staff, Voice AI Foundations

at Deepgram

United StatesRemote

Responsibilities

- Develop embedding systems that cleanly factorize the codec latent space into interpretable dimensions of speaker, content, style, environment, and channel effects -- enabling precise control over each aspect and the ability to massively amplify an existing seed dataset through “latent recombination”.
- Design model architectures, training schemes, and inference algorithms that are adapted for hardware at the bare metal enabling cost efficient training on billion-hour datasets and powering real-time inference for hundreds of millions of concurrent conversations.

Requirements

COMPANY OVERVIEW Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building production-grade voice agents at scale.
COMPANY OPERATING RHYTHM At Deepgram, we expect an AI-first mindset—AI use and comfort aren’t optional, they’re core to how we operate, innovate, and measure performance.
Every team member who works at Deepgram is expected to actively use and experiment with advanced AI tools, and even build your own into your everyday work.
We measure how effectively AI is applied to deliver results, and consistent, creative use of the latest AI capabilities is key to success here.
Candidates should be comfortable adopting new models and modes quickly, integrating AI into their workflows, and continuously pushing the boundaries of what these technologies can do.
Additionally, we move at the pace of AI.
However, current sequence modeling paradigms based on jointly scaling model and data cannot deliver voice AI capable of universal human interaction.
We believe that entirely new paradigms for audio AI are needed to overcome these challenges and make voice interaction accessible to everyone.
THE ROLE As a Member of the Research Staff, you will pioneer the development of Latent Space Models (LSMs), a new approach that aims to solve the fundamental data, scale, and cost challenges associated with building robust, contextualized voice AI.
THE CHALLENGE We are seeking researchers who: - See "unsolved" problems as opportunities to pioneer entirely new approaches - Can identify the one critical experiment that will validate or kill an idea in days, not months - Have the vision to scale successful proofs-of-concept 100x - Are obsessed with using AI to automate and amplify your own impact If you find yourself energized rather than daunted by these expectations—if you're already thinking about five ideas to try while reading this—you might be the

Research Staff, Voice AI Foundations

Responsibilities

Requirements

Browse by category

Browse by skills

Browse by role

Benefits

Additional details

Browse by location