Description

Toolkit for extracting metadata and text from over a thousand different file types.

The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more.

Metadata

Version

2.9.3

License

Maintainers (1)

Platforms (9)

    Darwin
    Linux
  • aarch64-darwin
  • aarch64-linux
  • armv6l-linux
  • armv7l-linux
  • i686-linux
  • powerpc64le-linux
  • riscv64-linux
  • x86_64-darwin
  • x86_64-linux