IronOcr 2021.11.0
Prefix ReservedSee the version list below for details.
dotnet add package IronOcr --version 2021.11.0
NuGet\Install-Package IronOcr -Version 2021.11.0
<PackageReference Include="IronOcr" Version="2021.11.0" />
paket add IronOcr --version 2021.11.0
#r "nuget: IronOcr, 2021.11.0"
// Install IronOcr as a Cake Addin #addin nuget:?package=IronOcr&version=2021.11.0 // Install IronOcr as a Cake Tool #tool nuget:?package=IronOcr&version=2021.11.0
IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET
IronOCR provides Tesseract OCR on Mac, Windows, Linux, Azure and Docker for:
- .Net Framework 4.0 +
- .Net Standard 2.0 +
- .Net Core 2.0 +
- .Net 5
- Mono for MacOS and Linux
- Xamarin for MacOS
IronOCR reads Text, Barcodes & QR from all major image and PDF formats using the latest Tesseract 5 engine. This library adds OCR functionality to Desktop, Console and Web applications in minutes.
IronOCR's Unique Features
- Pure .Net OCR API
- All OCR tasks run locally (no SAAS)
- 125 languages
- Barcode & QR Code reading
- Corrects low quality, noisy and distorted scans
- Performance tuned above and beyond any other known build of Tesseract OCR.
- Reads PDFs and multi-page TIFFs
- Can save any OCR Scan to a searchable PDF document or XHTML
Data Output Options Include
Output Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters.
International Language Support
125 Languages supported including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, Spanish... Custom language packs can also be created.
Licensing & Support available
For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/ Email: developers@ironsoftware.com
Get Started Code Example
string Result = new IronOcr.IronTesseract().Read("scan.pdf").Text;
Why C# developers choose IronOCR over Vanilla Tesseract:
- Achieve 99.8%+ OCR accuracy without using external web services.
- Includes for Tesseract 5 , 4 and 3 Engines out of the box.
- Blazing Speed and MultiThreading
- MVC, WebApp, Desktop, Console & Server Application compatible
- No Exes or C++ code to work with
- Full PDF OCR support
- To perform OCR an almost any Image file or PDF
- Full .Net Core, Standard and FrameWork support
- Deploy on Windows, Mac, Linux, Azure, Docker, Lambda, AWS
- Read barcodes and QR codes
- Export OCR as to XHTML
- Export OCR to searchable PDF documents
- Multithreading support
- 125 international languages all managed via Nuget or OcrData files
- Extract Images, Coordinates, Statistics and Fonts. Not just text.
- Can be used to redistribute Tesseract OCR inside commercial & proprietary applications.
- Supports:
IronOCR shines when working with real world images and imperfect documents such as photographs, or scans of low resolution which may have digital noise or imperfections. Other free OCR libraries for the .NET platform such other .Net Tesseract APIs and web services do not perform so well on these real world use cases.
OCR with Tesseract 5 - Start Coding in C#
These code examples below shows how easy it is to read text from an image using C# or VB .NET.
Configurable Hello World
using IronOcr;
var Ocr = new IronTesseract();
using (var Input = new OcrInput()){
Input.AddImage("images/sample.jpeg")
//... you can add any number of images
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
C# PDF OCR
The same approach can similarly be used to extract text from any PDF document.
var Ocr = new IronTesseract();
using (var input = new OcrInput())
{
input.AddPdf("example.pdf","password");
// We can also select specific PDF page numnbers to OCR
var Result = Ocr.Read(input);
Console.WriteLine(Result.Text);
Console.WriteLine($"{Result.Pages.Count()} Pages");
// 1 page for every page of the PDF
}
C# OCR Working Code Examples
- OCR in 1 line of code
- PDF OCR Text Extraction
- OCR with Barcode & QR Reading
- 125 International OCR Languages
- Fixing Low Quality Scans & Images
- Fast OCR Configuration
- OCR Image Optimization Filters
- OcrResult Class
- Create Searchable PDFs by OCR
- Tesseract 5 for .NET
- Tesseract 4 for .NET
- Tesseract 3 Legacy for .NET
- Tesseract Detailed Configuration
- OcrInput Class
- OCR a Region of an Image
- TIFF to Searchable PDF Converter
- Image Resolution Optimization (DPI)
- MultiThreaded Tesseract OCR
- OCR for MultiPage TIFF Files
- Make any PDF have Searchable, Copyable Text
- Using Custom Tesseract Language Files
- Multiple Languages for 1 Document
- Exporting Images of OCR Elements
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net40 is compatible. net403 was computed. net45 was computed. net451 was computed. net452 was computed. net46 was computed. net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETFramework 4.0
- No dependencies.
-
.NETStandard 2.0
- System.Drawing.Common (>= 5.0.2)
NuGet packages (132)
Showing the top 5 NuGet packages that depend on IronOcr:
Package | Downloads |
---|---|
IronOcr.Languages.German
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * German * GermanBest * GermanFast * GermanFraktur ==================================== Deutschsprachige OCR in C# & .NET. Optimierte C# Tesseract 5 OCR in einer eigenständigen .NET OCR-API. Konvertiert Scannerdokumente, Bilder und PDF in Text. C# & VB Beispiele: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also German support including: * German (also known as Deutsch) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in German * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/ |
|
IronOcr.Languages.Japanese
Japanese Language pack for the IronOCR C# and VB.Net OCR library. Reads Japanese language text from images and PDFs in .NET. Ocr Dictionaries in this package: * JapaneseAlphabet * JapaneseAlphabetBest * JapaneseAlphabetFast * JapaneseVerticalAlphabet * JapaneseVerticalAlphabetBest * JapaneseVerticalAlphabetFast * Japanese * JapaneseBest * JapaneseFast * JapaneseVertical * JapaneseVerticalBest * JapaneseVerticalFast This package installs IronOCR and also Japanese support including: * Japanese (also known as 日本語 (にほんご)) OCR for screenshots, cameras, images files, tiffs and PDFs. * Custom OCR that significantly outperforms Tesseract on real world documents. * Can read scans with distortion, skewing, low resolution & contrast, and digital noise. * Also supports Tesseract 3, 4 and 5 in Japanese. * Support for 122 other languages also available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework(4.5 +) * .NET CORE(2.0 +) * .NET Standard(2.0 +) Works on: *Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: * Images * TIFFS * PDFs * Screenshots * Camera Input * Scans * Barcodes * QR codes This package also installs: https://www.nuget.org/packages/IronOcr/ For product and licensing support please email us at developers@ironsoftware.com ====== C# と .NET での日本語 OCR スタンドアロン .NET OCR API で最適化された C# Tesseract 5 OCR。 スキャナーのドキュメント、画像、PDF をテキストに変換します。 C# と VB の例: https://ironsoftware.com/csharp/ocr/languages/Japanese/ |
|
IronOcr.Languages.Spanish
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Spanish * SpanishBest * SpanishFast * SpanishOld * SpanishOldBest * SpanishOldFast ==================================== OCR en español en C# y .NET. C# Tesseract 5 OCR optimizado en una API de .NET OCR independiente. Convierte documentos, imágenes y PDF del escáner en texto. Ejemplos de C# y VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Spanish support including: * Spanish (also known as Español, Castellano) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Spanish * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/ |
|
IronOcr.Languages.French
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * French * FrenchBest * FrenchFast ==================================== OCR de langue française en C# & .NET. OCR C# Tesseract 5 optimisé dans une API OCR .NET autonome. Convertit les documents du scanner, les images et les PDF en texte. Exemples C# et VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also French support including: * French (also known as Français, Langue Française) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in French * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/ |
|
IronOcr.Languages.Arabic
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Arabic * ArabicBest * ArabicFast * ArabicAlphabet * ArabicAlphabetBest * ArabicAlphabetFast ==================================== OCR للغة العربية في C# & .NET. محسن C# Tesseract 5 OCR في .NET OCR API مستقل. يحول مستندات الماسح الضوئي والصور و PDF إلى نص. أمثلة على C# و VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Arabic support including: * Arabic (also known as العربية) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Arabic * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/ |
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
2024.11.3 | 471 | 11/11/2024 |
2024.10.3 | 9,045 | 10/3/2024 |
2024.9.6 | 8,547 | 9/9/2024 |
2024.8.21 | 15,864 | 7/31/2024 |
2024.7.13 | 10,163 | 7/10/2024 |
2024.6.3 | 17,255 | 6/4/2024 |
2024.5.25 | 13,054 | 4/29/2024 |
2024.4.6 | 13,929 | 4/5/2024 |
2024.3.4 | 23,972 | 3/8/2024 |
2024.2.41 | 28,438 | 1/29/2024 |
2024.1.17 | 14,080 | 12/29/2023 |
2023.12.34 | 115,822 | 11/27/2023 |
2023.11.35 | 13,152 | 10/27/2023 |
2023.10.9 | 13,192 | 9/26/2023 |
2023.9.4 | 7,377 | 9/8/2023 |
2023.8.34 | 40,682 | 8/1/2023 |
2023.7.28 | 54,222 | 7/3/2023 |
2023.6.6 | 49,731 | 5/29/2023 |
2023.5.35 | 21,836 | 5/1/2023 |
2023.4.13 | 19,662 | 4/6/2023 |
2023.3.2 | 34,414 | 3/1/2023 |
2023.1.11644 | 24,688 | 1/18/2023 |
2022.12.10830 | 32,057 | 12/5/2022 |
2022.11.10109 | 69,799 | 10/26/2022 |
2022.10.9390 | 15,876 | 9/27/2022 |
2022.8.8198 | 49,868 | 8/18/2022 |
2022.8.7804 | 79,884 | 7/26/2022 |
2022.3.0 | 191,044 | 3/10/2022 |
2022.1.0 | 54,317 | 1/17/2022 |
2021.12.0 | 22,225 | 12/21/2021 |
2021.11.0 | 755,432 | 10/29/2021 |
2021.9.0 | 27,774 | 8/24/2021 |
2021.6.0 | 33,644 | 6/24/2021 |
2021.2.1 | 41,218 | 2/24/2021 |
2020.12.2 | 32,214 | 12/14/2020 |
2020.11.2 | 146,512 | 11/13/2020 |
4.4.0 | 305,609 | 6/21/2018 |
4.3.0.1 | 37,207 | 4/9/2018 |
4.2.2.51 | 5,658 | 1/22/2018 |
4.2.2.1 | 4,537 | 12/1/2017 |
4.2.1.5 | 5,732 | 9/9/2017 |
4.1.1 | 9,841 | 8/4/2017 |
4.0.10 | 3,125 | 1/12/2017 |
4.0.9 | 2,772 | 12/20/2016 |
* Bug Fixed: Azure Function Compatibility
* Bug Fixed: Works with read-only OCR dictionaries
* Bug Fixed: Now works with .Net 5.07
* Bug Fixed: License Keys reading from project configuration files (edge casess)
* Feature: Color replacement methods added to OcrInput
* Feature: Load specfic frames from TIFF and PDF files
* Improved: Updated System.Drawing.Common