Voice Agents: tech sessions and community events, 10k dollars in free credits

New · 4 Weeks · Cohort-based Course

Learn about core technologies and best practices for realtime, conversational AI. Explore models and tools with hands-on help from experts.


Hosted by

Kwindla Hultman Kramer and swyx

Brought to you by the architect of Pipecat and the inventor of "AI Engineer."

Course overview

Understand the core technologies driving voice AI today

📢 🛜 🎉 $10,000 in credits from partner companies for all students in the class 🎉 🛜 📢


➡️ $500 from Modal (https://modal.com/)

➡️ $500 from Tavus (https://tavus.io/)

➡️ $500 from Rime (https://www.rime.ai/)

➡️ $1,000 from Layercode (https://layercode.com/)

➡️ $500 from Cerebrium (https://www.cerebrium.ai/)

➡️ $1,000 from Gladia (https://www.gladia.io/)

➡️ One free month from Coval (https://www.coval.dev/)

➡️ $1,000 from Ultravox (https://fixie.ai)

➡️ $50 from OpenPipe (https://openpipe.ai/)

➡️ $500 from PlayAI (https://play.ai/)

➡️ $500 from Baseten (https://www.baseten.co/)

➡️ $1,000 from Freeplay (https://freeplay.ai/)

➡️ $500 from Fireworks (https://fireworks.ai/)

➡️ $500 from Deepgram (https://deepgram.com/)

➡️ $500 from Vapi (https://vapi.ai)

➡️ $100 from Groq (https://groq.com/)

➡️ $500 from fal (https://fal.ai/)

➡️ $1,000 from Pipecat Cloud (https://pipecat.daily.co/)


Four weeks of presentations and hands-on sessions with industry leaders in realtime AI, including:


Aidan Hornsby (Layercode). Batuhan Taskaya (fal). Brooke Hopkins (Coval). Charles Frye (Modal). Damien Tanner (Layercode). Dominik Kundel (OpenAI). Elizabeth Trykin (Vapi). Freddy Boulton (Hugging Face). Hamel H. (Parlance Labs). Hassaan Raza (Tavus). Ian Cairns (Freeplay). Jean-Louis Quéguiner (Gladia). Joe Heitzeberg (AI Tinkerers). Jordan Dearsley (Vapi). Karan Goel (Cartesia). Kyle Corbitt (OpenPipe). Lily Clifford (Rime Labs). Mahmoud Felfel (PlayAI). Michael Louis (Cerebrium). Natalie Rutgers (Deepgram). Paige Bailey (Google DeepMind). Philip Kiely (Baseten). Prince Canuma (MLX hacker). Quinn Favret (Tavus). Rajiv Ayyangar (Product Hunt). Raymond Thai (Fireworks AI). Sam Stowers (Weights & Biases). Shrestha Basu Mallick (Google DeepMind). Sean DuBois (OpenAI). swyx (Latent Space). Taranjeet Singh (mem0). Vaibhav Srivastav (Hugging Face). Zach Koch (Fixie.ai).


Hear from people training models, designing APIs, and shipping AI agents at scale in production. Learn about the current state of the art in voice AI for use cases like:


➡️ Customer support agents

➡️ Outbound agents that qualify sales leads

➡️ Agents that answer the phone for restaurants, healthcare providers, and other small businesses

➡️ Games and social experiences


1. Core technology and tooling for voice agents


➡️ Using transcription models, language models, and voice models

➡️ Minimizing latency

➡️ Managing multimodal context

➡️ Turn detection and interruption handling

➡️ Audio codecs, telephony integration, and network transport

➡️ Voice AI evals

➡️ Scripting and workflows

➡️ Content guardrails

➡️ Retrieval augmented generation and conversation memory

➡️ Function calling for realtime AI, including how to handle async, parallel, and composite function calls


2. Deploying agents to production


➡️ Hosting

➡️ Monitoring and observability


3. Plus special sections on


➡️ Realtime video

➡️ Voice-enabled programming environments


Listen to the Latent Space podcast about the course.


https://www.youtube.com/watch?v=AbToUiWRhn4


Who is this course for

01

You are an AI Engineer working on a conversational voice project.

02

You are a technical manager adding voice AI to your organization's products.

03

You develop models and APIs and are exploring how to expand your realtime capabilities.

What you’ll get out of this course

Understand the landscape of models and tools people are using to deploy voice agents to production today.


Hear from labs developing the next generation of transcription, LLM, voice, and speech-to-speech models.


Build and deploy your own voice AI agent.


Attend hands-on workshops with developer relations engineers from the leading labs and infrastructure providers.


Explore the emerging areas of realtime conversational video and voice-enabled programming tools.

This course includes

28 interactive live sessions

Lifetime access to course materials

2 in-depth lessons

Direct access to instructor

Projects to apply learnings

Guided feedback & reflection

Private community of peers

Course certificate upon completion

Maven Satisfaction Guarantee

This course is backed by Maven’s guarantee. You can receive a full refund within 14 days after the course ends, provided you meet the completion criteria in our refund policy.

Course syllabus

Week 1

May 7—May 11

    An overview of the voice AI landscape

    May

    7

    Overview of the Voice AI Landscape

    Wed 5/7 4:00 PM - 5:30 PM (UTC)

    May

    8

    Deploying to production: hosting, infrastructure, telephony

    Thu 5/8 4:00 PM - 5:30 PM (UTC)

Week 2

May 12—May 18

    Building voice agents for production

    May

    14

    AI Evals

    Wed 5/14 4:00 PM - 5:30 PM (UTC)

    May

    15

    Scripting and workflows

    Thu 5/15 4:00 PM - 5:30 PM (UTC)

    May

    12

    Optional: Gladia office hours

    Mon 5/12 4:00 PM - 5:00 PM (UTC)

    May

    12

    Optional: Modal office hours

    Mon 5/12 5:30 PM - 6:30 PM (UTC)

    May

    13

    Optional: Cartesia office hours

    Tue 5/13 4:00 PM - 5:00 PM (UTC)

    May

    13

    Optional: Vapi office hours

    Tue 5/13 5:30 PM - 6:30 PM (UTC)

    May

    13

    Optional: Cerebrium office hours

    Tue 5/13 7:00 PM - 8:00 PM (UTC)

    May

    14

    Optional: Baseten office hours

    Wed 5/14 5:30 PM - 6:30 PM (UTC)

    May

    16

    Optional: Rime Labs office hours

    Fri 5/16 4:00 PM - 5:00 PM (UTC)

    May

    16

    Optional: Fine-tuning LLMs, hosted by OpenPipe

    Fri 5/16 5:30 PM - 6:30 PM (UTC)

    May

    16

    Optional: PlayAI office hours

    Fri 5/16 7:00 PM - 8:00 PM (UTC)

Week 3

May 19—May 25

    Office hours

    May

    19

    Optional: Layercode office hours

    Mon 5/19 4:00 PM - 5:00 PM (UTC)

    May

    20

    Optional: Coval office hours

    Tue 5/20 5:00 PM - 6:00 PM (UTC)

    May

    20

    Optional: Fireworks office hours

    Tue 5/20 5:30 PM - 6:30 PM (UTC)

    May

    23

    Optional: Building global community — AI Tinkerers and Product Hunt

    Fri 5/23 4:00 PM - 5:00 PM (UTC)

    May

    23

    Optional: WebRTC, FastRTC, and realtime networking

    Fri 5/23 5:30 PM - 6:30 PM (UTC)

    May

    23

    Optional: OpenAI office hours

    Fri 5/23 7:00 PM - 8:00 PM (UTC)

Week 4

May 26—May 29

    Up next

    May

    27

    Web and native mobile apps

    Tue 5/27 4:00 PM - 5:15 PM (UTC)

    May

    27

    Local models and hybrid architectures

    Tue 5/27 5:30 PM - 6:45 PM (UTC)

    May

    27

    Optional: Weights & Biases Weave office hours

    Tue 5/27 7:00 PM - 8:00 PM (UTC)

    May

    28

    Programming with voice

    Wed 5/28 4:00 PM - 5:15 PM (UTC)

    May

    28

    Optional: Freeplay office hours

    Wed 5/28 5:30 PM - 6:30 PM (UTC)

    May

    28

    Optional: Deepgram office hours

    Wed 5/28 7:00 PM - 8:00 PM (UTC)

    May

    29

    Optional: Google Gemini office hours

    Thu 5/29 4:00 PM - 5:00 PM (UTC)

    May

    29

    Optional: fal office hours

    Thu 5/29 5:30 PM - 6:30 PM (UTC)

    May

    29

    Optional: Conversational video — hosted by Tavus

    Thu 5/29 7:00 PM - 8:15 PM (UTC)

Join an upcoming cohort

Voice Agents: tech sessions and community events, 10k dollars in free credits

Cohort 1

$500

Dates

May 7—30, 2025

Payment Deadline

May 29, 2025
Get reimbursed

Course schedule

1 - 4 hours per week

  • Live Presentations

    12:00pm - 1:00pm ET

    Live presentations from people training models, developing APIs and tools, and shipping voice agents to production.

  • Office Hours

    Hands-on sessions and Q&A opportunities with technical engineers from leading labs, infrastructure companies, and startups making interesting new tools.
