Tesseract is probably the most accurate open source OCR engine available. Combined with the Leptonica Image Processing Library it can read a wide variety of image formats and convert them to text in over 60 languages. It was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but since then it has been improved extensively by Google. It is released under the Apache License 2.0.
See the version list below for details.
Install-Package Tesseract -Version 22.214.171.124
dotnet add package Tesseract --version 126.96.36.199
<PackageReference Include="Tesseract" Version="188.8.131.52" />
paket add Tesseract --version 184.108.40.206
#r "nuget: Tesseract, 220.127.116.11"
// Install Tesseract as a Cake Addin #addin nuget:?package=Tesseract&version=18.104.22.168 // Install Tesseract as a Cake Tool #tool nuget:?package=Tesseract&version=22.214.171.124
*Note:* Version 2 was initially going to introduce support for Tesseract 3.03 however as this hasn't been released yet and we have a few minor breaking changes due to Mono support which require a version increment (we use semantic versioning).
#### Breaking changes from 1.0
* Tesseract.Interop is now internal which means we can make as many interop changes as we like as long as the public version doesn't change
* TesseractEngine.Handle, Pix.Handle, and PixColormap.Handle are now internal
* Logging is done to the ``Tesseract`` source, not ``Default``.
## New features
* Support for multi-page tiffs - Issue 50
* Support for linux\mono - Issue 23
## Bug fixes
* Fixed UTF8 handling for SetVariable (support for non-english languages) - Issue 120 & Issue 68
This package has no dependencies.
NuGet packages (26)
Showing the top 5 NuGet packages that depend on Tesseract:
Adds support for interop with System.Drawing to Tesseract such as passing Bitmap to Tesseract.
This helps to read simple text (string or number) from the images using Tesseract without additional configuration. IMPORTANT : Change the properties of all the files in the "tessdata" folder for "Copy To Output Directory" as "Copy always". Sample Project : https://github.com/rohitvipin/TesseractHelper.Demo
GitHub repositories (4)
Showing the top 4 popular GitHub repositories that depend on Tesseract:
Free open-source OCR application for the Windows Desktop - A modern GUI front-end for the Tesseract OCR engine. The application also includes support for reading and OCR'ing PDF files.
Samples for the Tesseract.Net wrapper