IronOcr 2020.12.2

IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET.

It provides Tesseract OCR on macOS, Windows, Linux, Azure, and Docker for:
* .NET Framework 4.0+
* .NET Standard 2.0+
* .NET Core 2.0+
* .NET 5
* Mono for macOS and Linux
* Xamarin for macOS

IronOCR reads text, barcodes, and QR codes from all major image and PDF formats using the latest Tesseract 5 engine. The library adds OCR functionality to desktop, console, and web applications in minutes.

IronOCR's unique features:
* Pure .NET OCR API
* All OCR tasks run locally (no SaaS)
* 125 languages
* Barcode and QR code reading
* Corrects low-quality, noisy, and distorted scans
* Performance-tuned beyond other known builds of Tesseract OCR
* Reads PDFs
* Reads multi-page TIFFs
* Can save any OCR scan to a searchable PDF document or XHTML

Data output options include: Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters.
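
As a rough illustration of these output options, the sketch below reads one image and prints both the plain text and the paragraph-level structure. It assumes the `IronTesseract`, `OcrInput`, and `OcrResult` types and the `Text` and `Paragraphs` members as described in the vendor's documentation for the 2020.x releases; check them against the installed version. The file name `scan.png` is a placeholder.

```csharp
using System;
using IronOcr;

class Program
{
    static void Main()
    {
        // Assumed entry point for the Tesseract 5 engine in IronOcr 2020.x.
        var ocr = new IronTesseract();

        // "scan.png" is a hypothetical input file.
        using (var input = new OcrInput("scan.png"))
        {
            OcrResult result = ocr.Read(input);

            // Plain-text output for the whole document.
            Console.WriteLine(result.Text);

            // Structured output: iterate over the recognized paragraphs.
            foreach (var paragraph in result.Paragraphs)
            {
                Console.WriteLine(paragraph.Text);
            }
        }
    }
}
```

Lines, words, and characters are expected to be exposed in a similar way to paragraphs, but that is an assumption to confirm in the API reference.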

Language Support:
125 languages, including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, and Spanish. Custom language packs can also be created.

Licensing & Support available for commercial deployments. Email: developers@ironsoftware.com

For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/

There is a newer version of this package available.
See the version list below for details.
Package Manager:
Install-Package IronOcr -Version 2020.12.2

.NET CLI:
dotnet add package IronOcr --version 2020.12.2

PackageReference:
<PackageReference Include="IronOcr" Version="2020.12.2" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.

Paket CLI:
paket add IronOcr --version 2020.12.2
The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Script & Interactive:
#r "nuget: IronOcr, 2020.12.2"
#r directive can be used in F# Interactive, C# scripting and .NET Interactive. Copy this into the interactive tool or source code of the script to reference the package.

Cake:
// Install IronOcr as a Cake Addin
#addin nuget:?package=IronOcr&version=2020.12.2

// Install IronOcr as a Cake Tool
#tool nuget:?package=IronOcr&version=2020.12.2
The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Release Notes

* Added OcrResult.SaveAsSearchablePdfBytes Method
* Added OcrResult.SaveAsHocrString Method
* Resolved OcrData Installation Issues
* Improved MVC, WinForms and Azure Function Compatibility
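
The two methods added in this release could be used roughly as sketched below. The method names come from the release notes above; their exact signatures are an assumption here (no parameters, returning `byte[]` and `string` respectively), as are the `IronTesseract` and `OcrInput` types and the placeholder file names.

```csharp
using System.IO;
using IronOcr;

class Program
{
    static void Main()
    {
        var ocr = new IronTesseract();

        // "scan.png" is a hypothetical input file.
        using (var input = new OcrInput("scan.png"))
        {
            OcrResult result = ocr.Read(input);

            // Assumed signature: returns the searchable PDF as a byte array.
            byte[] pdfBytes = result.SaveAsSearchablePdfBytes();
            File.WriteAllBytes("scan-searchable.pdf", pdfBytes);

            // Assumed signature: returns the hOCR markup as a string.
            string hocr = result.SaveAsHocrString();
            File.WriteAllText("scan.hocr", hocr);
        }
    }
}
```

Returning the data in memory instead of writing a file is useful in web and Azure Function hosts, where the result is typically streamed to a response or to blob storage.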

NuGet packages (130)

Showing the top 5 NuGet packages that depend on IronOcr:

Package Downloads
IronOcr.Languages.Hebrew
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads barcodes and QR codes.
OCR dictionaries in this package:
* Hebrew
* HebrewBest
* HebrewFast
* HebrewAlphabet
* HebrewAlphabetBest
* HebrewAlphabetFast
Hebrew-language OCR in C# & .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images, and PDFs to text.
This package installs IronOCR and also Hebrew support, including:
* Hebrew (also known as עברית) OCR for screenshots, cameras, image files, TIFFs, and PDFs in .NET
* Custom OCR that can significantly out-perform Tesseract CLI on real-world documents
* Can read scans with distortion, skewing, low resolution and contrast, and digital noise
* Also supports Tesseract 3, 4, and 5 in Hebrew
* Support for 125 total international languages available
Additional features include:
* Barcode & QR reading
* Output of searchable, search-engine indexable PDF documents
* Inspect fonts, headings, paragraphs, lines, words, and characters as structured data
Supports:
* .NET Framework (4.5+)
* .NET Core (2.0+)
* .NET Standard (2.0+)
Works on:
* Windows
* MacOS
* Linux
* Docker
* Azure and other Cloud hosting platforms
* Web, Console, WinForms, WPF and Services
Reads:
* Images
* TIFFs
* PDFs
* Screenshots
* Scans
* Barcodes
* QR codes
Commercial support available. Email: developers@ironsoftware.com
C# & VB examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Arabic
This package installs IronOCR and also Arabic (also known as العربية) support, including Tesseract 3, 4, and 5 in Arabic.
OCR dictionaries in this package:
* Arabic
* ArabicBest
* ArabicFast
* ArabicAlphabet
* ArabicAlphabetBest
* ArabicAlphabetFast
Arabic-language OCR in C# & .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images, and PDFs to text.
The features, supported frameworks, platforms, and readable inputs are the same as those listed for IronOcr.Languages.Hebrew above.
Commercial support available. Email: developers@ironsoftware.com
C# & VB examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Japanese
This package installs IronOCR and also Japanese (also known as 日本語 (にほんご)) support, including Tesseract 3, 4, and 5 in Japanese.
OCR dictionaries in this package:
* JapaneseAlphabet
* JapaneseAlphabetBest
* JapaneseAlphabetFast
* JapaneseVerticalAlphabet
* JapaneseVerticalAlphabetBest
* JapaneseVerticalAlphabetFast
* Japanese
* JapaneseBest
* JapaneseFast
* JapaneseVertical
* JapaneseVerticalBest
* JapaneseVerticalFast
Japanese-language OCR in C# and .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images, and PDFs to text.
The features, supported frameworks, platforms, and readable inputs are the same as those listed for IronOcr.Languages.Hebrew above.
Commercial support available. Email: developers@ironsoftware.com
C# & VB examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Portuguese
This package installs IronOCR and also Portuguese (also known as Português) support, including Tesseract 3, 4, and 5 in Portuguese.
OCR dictionaries in this package:
* Portuguese
* PortugueseBest
* PortugueseFast
Portuguese-language OCR in C# and .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images, and PDFs to text.
The features, supported frameworks, platforms, and readable inputs are the same as those listed for IronOcr.Languages.Hebrew above.
Commercial support available. Email: developers@ironsoftware.com
C# & VB examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.German
This package installs IronOCR and also German (also known as Deutsch) support, including Tesseract 3, 4, and 5 in German.
OCR dictionaries in this package:
* German
* GermanBest
* GermanFast
* GermanFraktur
German-language OCR in C# & .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images, and PDFs to text.
The features, supported frameworks, platforms, and readable inputs are the same as those listed for IronOcr.Languages.Hebrew above.
Commercial support available. Email: developers@ironsoftware.com
C# & VB examples: https://ironsoftware.com/csharp/ocr/languages/

GitHub repositories

This package is not used by any popular GitHub repositories.

Version History

Version      Downloads   Last updated
2021.6.0     4,659       6/24/2021
2021.2.1     17,500      2/24/2021
2020.12.2    6,217       12/14/2020
2020.11.2    11,302      11/13/2020
4.4.0        151,252     6/21/2018
4.3.0.1      16,292      4/9/2018
4.2.2.51     3,026       1/22/2018
4.2.2.1      2,039       12/1/2017
4.2.1.5      2,523       9/9/2017
4.1.1        2,621       8/4/2017
4.0.10       1,549       1/12/2017
4.0.9        895         12/20/2016