
Conversation

@richiejp (Collaborator) commented Sep 10, 2025

Description

Add enough realtime API features to allow talking with an LLM using only audio.

Presently the realtime API only supports transcription, which is a minor use-case for it. This PR should allow it to be used as a basic voice assistant.

This PR ignores many of the options and edge cases. For example, it relies solely on server-side VAD to commit conversation items.
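As a sketch of what relying on server-side VAD looks like from the client's side, the snippet below builds a `session.update` event that enables `server_vad` turn detection, following the OpenAI-compatible realtime event shape. The exact fields LocalAI accepts (e.g. `silence_duration_ms`) are assumptions here, not confirmed by this PR.

```python
import json

# Hypothetical client-side sketch: enable server-side VAD so the server
# decides when a turn of audio is committed to the conversation, instead
# of the client sending an explicit commit. Event shape follows the
# OpenAI realtime API; the fields LocalAI honours may differ.
session_update = {
    "type": "session.update",
    "session": {
        "modalities": ["audio", "text"],
        "turn_detection": {
            "type": "server_vad",        # server commits audio on silence
            "silence_duration_ms": 500,  # assumed tunable threshold
        },
    },
}

# This JSON text would be sent over the realtime websocket connection.
payload = json.dumps(session_update)
print(payload)
```

With this in place the client only has to stream audio; detecting end-of-turn and committing the buffered audio becomes the server's job.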

Notes for Reviewers

  • Configure a model pipeline or use a multi-modal model.
  • Commit client audio to the conversation
  • Generate a text response (optional)
  • Generate an audio response
  • Interrupt generation on voice detection?
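The checklist above can be sketched as a sequence of client events, again assuming OpenAI-compatible realtime event names (`input_audio_buffer.append`, `response.create`, `response.cancel`); whether LocalAI handles each of these is exactly what this PR is adding, so treat this as illustrative only.

```python
import base64
import json

# Hypothetical sketch of the client event flow described in the
# reviewer notes. With server_vad enabled, committing audio is
# automatic; interruption maps to cancelling the in-flight response.

def append_audio_event(pcm_bytes: bytes) -> str:
    """Stream a chunk of microphone audio to the server; with server-side
    VAD the server commits it to the conversation on detected silence."""
    return json.dumps({
        "type": "input_audio_buffer.append",
        "audio": base64.b64encode(pcm_bytes).decode("ascii"),
    })

def response_create_event() -> str:
    """Ask the server to generate a response (text and/or audio)."""
    return json.dumps({
        "type": "response.create",
        "response": {"modalities": ["audio", "text"]},
    })

def cancel_event() -> str:
    """Interrupt generation, e.g. when the server reports
    input_audio_buffer.speech_started while a response is playing."""
    return json.dumps({"type": "response.cancel"})

# Example: 320 bytes of placeholder PCM audio streamed as one chunk.
chunk = append_audio_event(b"\x00\x01" * 160)
```

Each of these strings would be sent as a websocket text frame over the realtime connection; the server replies with its own events (audio deltas, transcripts, etc.).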

Fixes: #3714 (but we'll need follow-up issues)

Signed commits

  • Yes, I signed my commits.


netlify bot commented Sep 10, 2025

Deploy Preview for localai ready!

Name Link
🔨 Latest commit 2eae0d9
🔍 Latest deploy log https://app.netlify.com/projects/localai/deploys/68c3c9475b886b00083272f3
😎 Deploy Preview https://deploy-preview-6245--localai.netlify.app

@mudler mudler added the roadmap label Sep 11, 2025
@richiejp (Collaborator, Author) commented:
It's not clear to me if we have audio support in llama.cpp: ggml-org/llama.cpp#15194

@richiejp (Collaborator, Author) commented:
ggml-org/llama.cpp#13759

@richiejp (Collaborator, Author) commented:
ggml-org/llama.cpp#13784

Development

Successfully merging this pull request may close these issues:

Add support for realtime API