Add live audio transcription streaming support to Foundry Local C# SDK by rui-ren · Pull Request #485 · microsoft/Foundry-Local

rui-ren · 2026-03-05T18:29:47Z

Here's the updated PR description based on the latest changes (renamed types, CoreInterop routing fix, mermaid updates):

Title: Add live audio transcription streaming support to Foundry Local C# SDK

Description:

Adds real-time audio streaming support to the Foundry Local C# SDK, enabling live microphone-to-text transcription via ONNX Runtime GenAI's StreamingProcessor API (Nemotron ASR).

The existing OpenAIAudioClient only supports file-based transcription. This PR introduces LiveAudioTranscriptionSession that accepts continuous PCM audio chunks (e.g., from a microphone) and returns partial/final transcription results as an async stream.

What's included

New files

src/OpenAI/LiveAudioTranscriptionClient.cs — Streaming session with StartAsync(), AppendAsync(), GetTranscriptionStream(), StopAsync()
src/OpenAI/LiveAudioTranscriptionTypes.cs — LiveAudioTranscriptionResult and CoreErrorResponse types

Modified files

src/OpenAI/AudioClient.cs — Added CreateLiveTranscriptionSession() factory method
src/Detail/ICoreInterop.cs — Added StreamingRequestBuffer struct, StartAudioStream, PushAudioData, StopAudioStream interface methods
src/Detail/CoreInterop.cs — Routes audio commands through existing execute_command / execute_command_with_binary native entry points (no separate audio exports needed)
src/Detail/JsonSerializationContext.cs — Registered LiveAudioTranscriptionResult for AOT compatibility
test/FoundryLocal.Tests/Utils.cs — Updated to use CreateLiveTranscriptionSession()

Documentation

API surface

var audioClient = await model.GetAudioClientAsync();
var session = audioClient.CreateLiveTranscriptionSession();

session.Settings.SampleRate = 16000;
session.Settings.Channels = 1;
session.Settings.Language = "en";

await session.StartAsync();

// Push audio from microphone callback
await session.AppendAsync(pcmBytes);

// Read results as async stream
await foreach (var result in session.GetTranscriptionStream())
{
    Console.Write(result.Text);
}

await session.StopAsync();

Design highlights

Internal push queue — Bounded Channel<T> serializes audio pushes from any thread (safe for mic callbacks) with backpressure
Retry policy — Transient native errors retried with exponential backoff (3 attempts); permanent errors terminate the session
Settings freeze — Audio format settings are snapshot-copied at StartAsync() and immutable during the session
Cancellation-safe stop — StopAsync always calls native stop even if cancelled, preventing native session leaks
Dedicated session CTS — Push loop uses its own CancellationTokenSource, decoupled from the caller's token
Routes through existing exports — StartAudioStream and StopAudioStream route through execute_command; PushAudioData routes through execute_command_with_binary — no new native entry points required

Core integration (neutron-server)

The Core side (AudioStreamingSession.cs) uses StreamingProcessor + Generator + Tokenizer + TokenizerStream from onnxruntime-genai to perform real-time RNNT decoding. The native commands (audio_stream_start/push/stop) are handled as cases in NativeInterop.ExecuteCommandManaged / ExecuteCommandWithBinaryManaged.

Verified working

✅ SDK build succeeds (0 errors)
✅ GenAI StreamingProcessor pipeline verified with WAV file (correct transcript)
✅ Core TranscribeChunk byte[] PCM path matches reference float[] path exactly
✅ Full E2E simulation: SDK Channel + JSON serialization + session management (32 partial + 1 final result)
✅ Live microphone test: 67s real-time transcription through SDK → Core → GenAI
✅ Full SDK → Core → GenAI E2E with locally built Core DLL and GenAI NuGet 0.13.0-dev

vercel · 2026-03-05T18:29:52Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
foundry-local	Ready	Preview, Comment	Mar 13, 2026 8:27pm

support audio streaming-csharp

c045bf3

delete dll mock test

3970936

vercel bot deployed to Preview March 5, 2026 21:50 View deployment

update core api

ef2e9e0

vercel bot deployed to Preview March 5, 2026 23:52 View deployment

ruiren_microsoft added 2 commits March 10, 2026 18:09

update sdk

535b735

update the api

f5bd916

vercel bot deployed to Preview March 13, 2026 01:53 View deployment

rename LiveAudioTranscription

6d067e0

vercel bot deployed to Preview March 13, 2026 19:17 View deployment

Merge branch 'main' into ruiren/audio-streaming-support-sdk

eb6598d

vercel bot deployed to Preview March 13, 2026 19:18 View deployment

rui-ren changed the title ~~Add real-time audio streaming support (Microphone ASR) - c#~~ Add live audio transcription streaming support to Foundry Local C# SDK Mar 13, 2026

fix: add missing using directives for EnumeratorCancellation and Channel

6dee740

vercel bot deployed to Preview March 13, 2026 20:22 View deployment

update test

b89e1bd

vercel bot deployed to Preview March 13, 2026 20:27 View deployment

rui-ren requested a review from kunal-vaishnavi March 13, 2026 20:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add live audio transcription streaming support to Foundry Local C# SDK#485

Add live audio transcription streaming support to Foundry Local C# SDK#485
rui-ren wants to merge 9 commits intomainfrom
ruiren/audio-streaming-support-sdk

rui-ren commented Mar 5, 2026 •

edited

Loading

Uh oh!

vercel bot commented Mar 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rui-ren commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What's included

New files

Modified files

Documentation

API surface

Design highlights

Core integration (neutron-server)

Verified working

Uh oh!

vercel bot commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rui-ren commented Mar 5, 2026 •

edited

Loading

vercel bot commented Mar 5, 2026 •

edited

Loading