MyNixOS website logo
Description

A toolkit for extracting metadata and text from over a thousand different file types.

The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more.

Metadata

Version

2.9.2

License

Maintainers (1)

Platforms (6)

    Linux
Show all
  • aarch64-linux
  • armv6l-linux
  • armv7l-linux
  • i686-linux
  • powerpc64le-linux
  • x86_64-linux