UTF.Unknown 2.0.0-rc1

Detect the character set of files, streams and other bytes.

This package is based on Ude (https://github.com/errepi/ude), which is a port of the Mozilla Universal Charset Detector (https://mxr.mozilla.org/mozilla/source/extensions/universalchardet/).

- Detects 28 charsets
- Easy to use API  
- .NET Standard 1.0 and 2.0 support
- Strong-named
- XML documentation included

This is a prerelease version of UTF.Unknown.
- Package Manager: Install-Package UTF.Unknown -Version 2.0.0-rc1
- .NET CLI: dotnet add package UTF.Unknown --version 2.0.0-rc1
- PackageReference (for projects that support it, copy this XML node into the project file): <PackageReference Include="UTF.Unknown" Version="2.0.0-rc1" />
- Paket CLI: paket add UTF.Unknown --version 2.0.0-rc1

Release Notes

Compared to Ude:

- Refactored the API and namespaces, and removed dead code
- Added some docs
- Improved error handling
- Improved unit tests

Bug fixes:

- EUCTW: System.IndexOutOfRangeException
- Pure ASCII detection issue
- Bug in the Reset function of the SBCSGroupProber class
- Detection fails on a particular, simple ANSI file

See https://github.com/CharsetDetector/UTF-unknown/milestone/1?closed=1

Version History

Version        Downloads   Last updated
2.0.0-rc1      132         3/27/2019
1.0.0          485         2/15/2019
1.0.0-rc1      73          2/9/2019
1.0.0-beta1    8,172       4/7/2017