RapidOcrNet 2.0.0

.NET 8.0

dotnet add package RapidOcrNet --version 2.0.0

NuGet\Install-Package RapidOcrNet -Version 2.0.0

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="RapidOcrNet" Version="2.0.0" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

<PackageVersion Include="RapidOcrNet" Version="2.0.0" />
                    

                            Directory.Packages.props

<PackageReference Include="RapidOcrNet" />
                    

                            Project file

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.

paket add RapidOcrNet --version 2.0.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: RapidOcrNet, 2.0.0"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package RapidOcrNet@2.0.0

#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

#addin nuget:?package=RapidOcrNet&version=2.0.0
                    

                            Install as a Cake Addin

#tool nuget:?package=RapidOcrNet&version=2.0.0
                    

                            Install as a Cake Tool

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

RapidOcrNet

Cross-platform OCR processing library using PaddleOCR ONNX models, and based on original code from RapidAI's RapidOCR.

Available as NuGet package: https://www.nuget.org/packages/RapidOcrNet/

The code was optimised to remove dependencies on System.Drawing and OpenCV. Image processing is now done only using SkiaSharp and PContourNet. The library targets net8.0 and net10.0, is AOT-compatible, and ships the PP-OCRv5 models out of the box (v4 and v3 are also supported — see #3).

How the pipeline works

A single Detect() call runs three ONNX models in sequence:

Detector (DBNet) — finds polygons (text boxes) in the input image.
Classifier (cls) — predicts whether each crop is upside-down (0° vs 180°) and rotates it if so.
Recognizer (CRNN) — reads the characters in each upright crop and returns text + per-char confidences.

You get back an OcrResult containing one TextBlock per detected line, plus optional word-level / character-level polygons.

Installing the models

All ONNX models can be downloaded from: https://github.com/RapidAI/RapidOCR/blob/main/python/rapidocr/default_models.yaml

You need 4 files for the pipeline to work. The defaults bundled with the NuGet package are the PP-OCRv5 latin set:

Detection: ch_PP-OCRv5_mobile_det.onnx
Classification: ch_ppocr_mobile_v2.0_cls_infer.onnx
Recognition: latin_PP-OCRv5_rec_mobile_infer.onnx
Model dictionary: ppocrv5_latin_dict.txt

When using the NuGet package, these are copied to models/v5/ next to your binary automatically. If you want a different language (Chinese, Japanese, Korean, etc.), download the matching *_rec_mobile_infer.onnx + *_dict.txt pair and pass their paths to InitModels (see Using custom models / other languages).

Quick start

The shortest path to recognized text — load defaults, run, print:

using RapidOcrNet;
using SkiaSharp;

using var ocr = new RapidOcr();
ocr.InitModels();                                  // loads bundled PP-OCRv5 latin models

OcrResult result = ocr.Detect("image.png", RapidOcrOptions.Default);

Console.WriteLine(result.StrRes);                  // full recognized text, one line per block
foreach (var block in result.TextBlocks)
{
    Console.WriteLine($"{block.Text}  (avg score: {block.CharScores!.Average():F2})");
}

Detect has two overloads — pass a file path (string) for convenience, or an SKBitmap if you already have the image decoded.

Drawing the boxes back onto the image

Each TextBlock exposes the four corners of its detected polygon in BoxPoints (in clockwise order, in source-image pixel coordinates).

using var bmp = SKBitmap.Decode("image.png");
OcrResult result = ocr.Detect(bmp, RapidOcrOptions.Default);

using var canvas = new SKCanvas(bmp);
using var paint  = new SKPaint { Color = SKColors.Red, IsStroke = true };

foreach (var block in result.TextBlocks)
{
    var p = block.BoxPoints;
    canvas.DrawLine(p[0], p[1], paint);
    canvas.DrawLine(p[1], p[2], paint);
    canvas.DrawLine(p[2], p[3], paint);
    canvas.DrawLine(p[3], p[0], paint);
}

using var fs = File.Create("image_ocr.png");
bmp.Encode(fs, SKEncodedImageFormat.Png, 100);

Word-level and character-level boxes

By default the recognizer only returns one polygon per line. To also get one polygon per word (or per character), enable ReturnWordBox:

var options = RapidOcrOptions.Default with { ReturnWordBox = true };
OcrResult result = ocr.Detect("image.png", options);

foreach (var block in result.TextBlocks)
{
    Console.WriteLine(block.Text);
    foreach (var word in block.WordResults!)      // null when ReturnWordBox = false
    {
        Console.WriteLine($"   {word.Text}  score={word.Score}");
    }
}

Behavior:

Latin/numeric lines: one WordBox per whitespace-separated word.
Lines containing CJK characters: one WordBox per character (whitespace is meaningless inside CJK).
Setting ReturnSingleCharBox = true forces Latin/numeric lines to per-character granularity as well.

The polygons are reconstructed from CTC time-column positions, so they align tightly with the actual glyphs — useful for layout extraction, highlighting, or rebuilding searchable PDFs.

Choosing an options preset

There are two ready-made presets on RapidOcrOptions:

Preset	Pre-padding	Resize strategy	Best for
`RapidOcrOptions.Default`	50 px white border	`ImgResize = 1024` (legacy max-side cap)	The bundled PP-OCRv5 model — best accuracy on most photos and screenshots.
`RapidOcrOptions.PythonCompat`	0 (no border)	`LimitSideLen = 736` (short-side adaptive, like Python `rapidocr`)	Reproducing results from the Python `rapidocr` library or feeding very wide / very tall inputs.

Use a with-expression to tweak any individual option:

var options = RapidOcrOptions.Default with
{
    ReturnWordBox = true,
    DoAngle = false,       // skip the 180° classifier if you know text is upright
    TextScore = 0.7f,      // drop low-confidence lines more aggressively
};

Key options at a glance

Option	What it does
`Padding`	Extra white border added before detection. Helps when text touches the image edge. `Default` uses 50; `PythonCompat` uses 0.
`ImgResize`	If `> 0`, caps the longer side at this size. Set to `0` to use Python-style short-side adaptive resize via `LimitSideLen`.
`LimitSideLen`	When `ImgResize == 0`, the detector upscales the short side up to this value (default 736).
`MaxSideLen` / `MinSideLen`	Hard bounds on image size before detection.
`WidthHeightRatio` / `MinHeight`	Adds a vertical letterbox to very wide / very short inputs so the detector sees enough vertical context. `-1` disables.
`TextScore`	Filter threshold on the average per-character score of each line. Blocks below this are dropped.
`ClsThresh`	Confidence the classifier needs before it actually applies a 180° flip. Default 0.9 — anything lower would risk inverting clean upright text and producing garbage.
`ClsPreserveAspectRatio`	When `true`, crops are aspect-preserved + midgray-padded before classification (matches Python rapidocr). When `false`, legacy stretch-to-192×48.
`BoxScoreThresh` / `BoxThresh` / `UnClipRatio`	DBNet hyperparameters — keep at defaults unless you're tuning detection.
`DoAngle`	Run the 180° classifier at all. Disable if you know your text is upright.
`MostAngle`	Decide each crop's angle from the majority vote across all crops, instead of per-crop.
`ReturnWordBox` / `ReturnSingleCharBox`	See the word/char-box section above.

GPU acceleration and custom session options

The ONNX session is configurable. Build a SessionOptions (see the ONNX Runtime API docs) and pass it to InitModels:

using var ocr = new RapidOcr();

using var sessionOptions = RapidOcr.GetDefaultSessionOptions();   // sensible defaults: ORT_ENABLE_EXTENDED
try   { sessionOptions.AppendExecutionProvider_CUDA(); }          // NVIDIA GPU
catch { sessionOptions.AppendExecutionProvider_CPU();  }          // fallback

ocr.InitModels(sessionOptions);

GetDefaultSessionOptions(int numThread = 0) also takes an optional thread count if you'd rather keep CPU execution but pin Inter/Intra-op parallelism.

Using custom models / other languages

Pass explicit paths if you've downloaded different ONNX files (e.g. Chinese, Japanese, or a heavier _server_ recognizer):

using var ocr = new RapidOcr();

ocr.InitModels(
    detPath:  "models/v5/ch_PP-OCRv5_det_server.onnx",
    clsPath:  "models/v5/ch_ppocr_mobile_v2.0_cls_infer.onnx",
    recPath:  "models/v5/korean_PP-OCRv5_rec_mobile.onnx",
    keysPath: "models/v5/ppocrv5_korean_dict.txt");

// Or, with custom session options:
// ocr.InitModels(detPath, clsPath, recPath, keysPath, sessionOptions);

The recognizer's character set is defined by the *_dict.txt file — it must match the *_rec_*.onnx you load, otherwise the output will be gibberish.

Notice

Based on source code originally developed in the RapidOCR project (Apache-2.0 license).

https://github.com/RapidAI/RapidOCR

Uses parts of source code originally developed in the PdfPig project (Apache-2.0 license).

https://github.com/UglyToad/PdfPig

The dependency on OpenCV was removed thanks to the PContour library and its C# port.

The models made available are from the PaddleOCR project (Apache-2.0 license) and were downloaded from https://github.com/RapidAI/RapidOCR/blob/main/python/rapidocr/default_models.yaml

Product	Compatible and additional computed target framework versions.
.NET	net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed.

Product

.NET

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

net10.0
- Clipper2 (>= 2.0.0)
- Microsoft.ML.OnnxRuntime (>= 1.24.3)
- Microsoft.ML.OnnxRuntime.Managed (>= 1.24.3)
- SkiaSharp (>= 3.119.1)
- SkiaSharp.NativeAssets.Linux (>= 3.119.1)
net8.0
- Clipper2 (>= 2.0.0)
- Microsoft.ML.OnnxRuntime (>= 1.24.3)
- Microsoft.ML.OnnxRuntime.Managed (>= 1.24.3)
- SkiaSharp (>= 3.119.1)
- SkiaSharp.NativeAssets.Linux (>= 3.119.1)

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
2.0.0	52	5/13/2026
1.0.2	1,299	3/29/2026
1.0.1	407	3/14/2026
1.0.0	1,434	11/22/2025