TikaOnDotnet.TextExtractor 1.17.1

.NET Framework

dotnet add package TikaOnDotnet.TextExtractor --version 1.17.1

NuGet\Install-Package TikaOnDotnet.TextExtractor -Version 1.17.1

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="TikaOnDotnet.TextExtractor" Version="1.17.1" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

<PackageVersion Include="TikaOnDotnet.TextExtractor" Version="1.17.1" />
                    

                            Directory.Packages.props

<PackageReference Include="TikaOnDotnet.TextExtractor" />
                    

                            Project file

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.

paket add TikaOnDotnet.TextExtractor --version 1.17.1

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: TikaOnDotnet.TextExtractor, 1.17.1"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package TikaOnDotnet.TextExtractor@1.17.1

#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

#addin nuget:?package=TikaOnDotnet.TextExtractor&version=1.17.1
                    

                            Install as a Cake Addin

#tool nuget:?package=TikaOnDotnet.TextExtractor&version=1.17.1
                    

                            Install as a Cake Tool

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Classes for running Apache Tika through **TikaOnDotNet**. Just use TextExtractor.Extract() and you'll be on your way.

Product	Compatible and additional computed target framework versions.
.NET Framework	net is compatible.

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

- TikaOnDotnet (= 1.17.1)

NuGet packages (8)

Showing the top 5 NuGet packages that depend on TikaOnDotnet.TextExtractor:

Package	Downloads
Contrib.Sitecore.ContentSearch.TikaOnDotnet Contribution project for Sitecore ContentSearch	24.5K
DevelopmentHelpers.FileContentReader This package combine many open sources packages and allow one interface to read may types of content files. for example:use open.xml to read docx file	9.4K
Cogworks.ExamineFileIndexer An examine indexer that uses Apache TIKA	9.2K
Skybrud.Umbraco.Search.DocumentIndexer This package makes it possible to index and search a wide variety of filetypes in Umbraco, including .pdf and .docx	2.9K
ZeroStack.FileMetaKit 一个应用程序框架，您可以将它集成到任何 .NET/C# 应用程序中。	1.9K

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
1.17.1	757,214	4/3/2018
1.17.0	32,040	2/15/2018
1.16.0	174,178	7/30/2017
1.15.0	9,146	7/30/2017
1.14.2	130,080	4/22/2017
1.14.2-pre	3,836	4/15/2017
1.14.1	19,383	1/13/2017
1.14.0	11,021	12/8/2016
1.13.1	11,119	8/16/2016
1.13.0	17,087	6/30/2016
1.12.2	20,284	4/12/2016
1.12.1	2,070	4/12/2016
1.12.0	2,277	4/11/2016

- Add new overloads to the `TextExtractor.Extract` allowing users to provide their own extraction result assemblers. Example:
```cs
public class CustomResult
{
public string Text { get; set; }
public IDictionary<string, string[]> Metadata { get; set; }
}
public static CustomResult CreateCustomResult(string text, Metadata metadata)
{
var metaDataDictionary = metadata.names().ToDictionary(name => name, metadata.getValues);
return new CustomResult
{
Metadata = metaDataDictionary,
Text = text,
};
}
[Test]
public void should_extract_author_list_from_pdf()
{
var textExtractionResult = new TextExtractor().Extract("file_with_authors.pdf", CreateCustomResult);
textExtractionResult.Metadata["meta:author"].Should().ContainInOrder("Fred Jones, M. D.", "Donald Evans D. M.");
}
```

Total 1.2M

Current version 757.2K

Per day average 317