ElBruno.Whisper 0.2.0

.NET 8.0

dotnet add package ElBruno.Whisper --version 0.2.0

NuGet\Install-Package ElBruno.Whisper -Version 0.2.0

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="ElBruno.Whisper" Version="0.2.0" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

<PackageVersion Include="ElBruno.Whisper" Version="0.2.0" />
                    

                            Directory.Packages.props

<PackageReference Include="ElBruno.Whisper" />
                    

                            Project file

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.

paket add ElBruno.Whisper --version 0.2.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: ElBruno.Whisper, 0.2.0"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package ElBruno.Whisper@0.2.0

#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

#addin nuget:?package=ElBruno.Whisper&version=0.2.0
                    

                            Install as a Cake Addin

#tool nuget:?package=ElBruno.Whisper&version=0.2.0
                    

                            Install as a Cake Tool

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

ElBruno.Whisper

Run local Whisper speech-to-text in .NET 🎤

Transcribe audio to text in .NET using OpenAI's Whisper model. Powered by ONNX Runtime with automatic model download from HuggingFace.

Features

📦 Automatic model download — models are fetched from HuggingFace on first use
🔊 Multiple model sizes — tiny → base → small → medium → large (pick your speed/accuracy tradeoff)
🚀 Zero friction — works out of the box with sensible defaults (tiny.en)
🌍 Multilingual support — transcribe 99+ languages with multilingual models
💉 DI-friendly — register with AddWhisper() in ASP.NET Core
📊 Progress reporting — track model downloads with real-time callbacks
🎯 English-optimized models — dedicated .en variants for best accuracy on English audio

Installation

dotnet add package ElBruno.Whisper

Quick Start

using ElBruno.Whisper;

// Create client (downloads tiny.en model on first run)
using var client = await WhisperClient.CreateAsync();

var result = await client.TranscribeAsync("audio.wav");
Console.WriteLine(result.Text);

First Run

The first time you create a WhisperClient, the model is downloaded from HuggingFace to your local cache directory (~75 MB - 3 GB depending on model size). This typically takes 10-60 seconds depending on your internet connection and chosen model.

Track download progress:

using var client = await WhisperClient.CreateAsync(
    progress: new Progress<ElBruno.HuggingFace.DownloadProgress>(p =>
    {
        if (p.Stage == ElBruno.HuggingFace.DownloadStage.Downloading)
            Console.WriteLine($"{p.CurrentFile}: {p.PercentComplete:F0}%");
        else
            Console.WriteLine($"{p.Stage}: {p.Message}");
    })
);

Subsequent runs load instantly from cache (%LOCALAPPDATA%/ElBruno/Whisper/models).

Model Selection

Whisper offers various model sizes. English-optimized models (.en suffix) are smaller and faster for English audio:

using var client = await WhisperClient.CreateAsync(new WhisperOptions
{
    Model = KnownWhisperModels.WhisperSmallEn
});

var result = await client.TranscribeAsync("english-audio.wav");
Console.WriteLine(result.Text);

Available Models

Size	English	Multilingual	Parameters	Approx Size	Speed
tiny	tiny.en	tiny	39M	75 MB	⚡⚡⚡⚡⚡
base	base.en	base	74M	140 MB	⚡⚡⚡⚡
small	small.en	small	244M	460 MB	⚡⚡⚡
medium	medium.en	medium	769M	1.5 GB	⚡⚡
large	—	large	1550M	3.0 GB	⚡

Use English-optimized (.en) models for:

English audio only (slightly smaller, faster, better accuracy on English)

Use Multilingual models for:

Non-English audio
Mixed-language content
Language auto-detection

Progress Tracking

Monitor both file downloads and transcription progress:

var downloadProgress = new Progress<ElBruno.HuggingFace.DownloadProgress>(p =>
{
    if (p.Stage == ElBruno.HuggingFace.DownloadStage.Downloading)
        Console.Write($"\r⬇️ {p.PercentComplete:F0}%");
    else
        Console.WriteLine($"\n✓ {p.Message}");
});

using var client = await WhisperClient.CreateAsync(progress: downloadProgress);

var result = await client.TranscribeAsync("audio.wav");
Console.WriteLine($"✓ Transcribed: {result.Text}");

Dependency Injection

builder.Services.AddWhisper(options =>
{
    options.Model = KnownWhisperModels.WhisperBaseEn;
});

// Inject WhisperClient anywhere
public class TranscriptionService(WhisperClient whisper) { ... }

Transcription Result

The TranscriptionResult includes:

var result = await client.TranscribeAsync("audio.wav");

Console.WriteLine(result.Text);                    // Transcribed text
Console.WriteLine(result.DetectedLanguage);       // Detected language (for multilingual models)
Console.WriteLine(result.Duration);               // Audio duration

Troubleshooting

Model download fails?

Check your internet connection
For private HuggingFace models, set the HF_TOKEN environment variable

Out of memory?

Use a smaller model (tiny or base instead of medium/large)
Transcribe shorter audio files in chunks

For detailed troubleshooting, see docs.

Samples

Sample	Description
HelloWhisper	Minimal console transcription
BlazorWhisper	Blazor app with audio recording and real-time transcription

Documentation

Getting Started — installation, first steps, configuration
API Reference — full API documentation
Architecture — design decisions and internal structure
Testing Guide — running tests, test organization, CI/CD pipeline
Test Audio Files — audio resources for testing and transcription validation
Image Prompts — prompts for generating blog and social media images
Publishing — NuGet package publishing with OIDC

Building from Source

git clone https://github.com/elbruno/ElBruno.Whisper
cd ElBruno.Whisper
dotnet build ElBruno.Whisper.slnx
dotnet test ElBruno.Whisper.slnx --filter "Category!=Integration"

Testing

The repository includes comprehensive unit and integration tests:

Quick test run (unit tests, no model download):

dotnet test ElBruno.Whisper.slnx --filter "Category!=Integration"

Full test run (includes integration with real models):

dotnet test ElBruno.Whisper.slnx

Test audio files are provided in testdata/audio/ for validation and transcription testing. For details, see the Testing Guide.

🤝 Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📄 License

This project is licensed under the MIT License — see the LICENSE file for details.

ElBruno.LocalLLMs — Run local LLMs in .NET
ElBruno.HuggingFace — HuggingFace model utilities for .NET

🙏 Acknowledgments

ONNX Runtime — inference engine
OpenAI Whisper — speech-to-text model
Hugging Face — model hosting and community
ONNX Community — ONNX model conversions

👋 About the Author

Made with ❤️ by Bruno Capuano (ElBruno)

📝 Blog: elbruno.com
📺 YouTube: youtube.com/elbruno
🔗 LinkedIn: linkedin.com/in/elbruno
𝕏 Twitter: twitter.com/elbruno
🎙️ Podcast: notienenombre.com

Product	Compatible and additional computed target framework versions.
.NET	net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed.

Product

.NET

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

net8.0
- ElBruno.HuggingFace.Downloader (>= 0.6.0)
- Microsoft.Extensions.DependencyInjection.Abstractions (>= 9.0.0)
- Microsoft.Extensions.Logging.Abstractions (>= 9.0.0)
- Microsoft.ML.OnnxRuntime (>= 1.22.0)

NuGet packages (1)

Showing the top 1 NuGet packages that depend on ElBruno.Whisper:

Package	Downloads
ElBruno.MarkItDotNet.Whisper Local audio transcription for ElBruno.MarkItDotNet using OpenAI Whisper via ONNX Runtime. Converts audio files to Markdown transcripts offline.	417

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
0.2.0	46	4/11/2026
0.1.6	45	4/10/2026
0.1.5	205	4/2/2026
0.1.2	91	3/30/2026
0.1.1	77	3/30/2026
0.1.0	86	3/30/2026