Toxy 1.4.0

Toxy is a .NET framework for extracting data and text, similar to Apache Tika for Java. It supports many popular formats, including docx, xlsx, xls, pdf, csv, txt, epub and html.

Neuzilla is the studio behind Toxy.

There is a newer version of this package available.
See the version list below for details.
Package Manager: Install-Package Toxy -Version 1.4.0
.NET CLI: dotnet add package Toxy --version 1.4.0
PackageReference: <PackageReference Include="Toxy" Version="1.4.0" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
Paket CLI: paket add Toxy --version 1.4.0
The NuGet Team does not provide support for the Paket client. Please contact its maintainers for support.

Release Notes

a. Implement doc text extraction
b. Implement cnm mail extraction
c. Add CreateEmail method to ParserFactory
d. Replace iTextSharp with PDFSharp

The top GitHub repository that depends on Toxy is a .NET based webcrawler.

Version History

Version   Downloads   Last updated
          13,878      3/5/2016
1.6.1     665         3/5/2016
1.4.0     2,283       3/9/2015