Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Small edits

...

This page is documentation on accessing Tika as a RESTful API via the Tika server (tika-server module).

Table of Contents

Installation of Tika Server

The current installation process for Tika server post 1.23 and prior to 1.24 is a bit in flux.  Read on below for some options:

...

The Tika Server binary is a standalone runnable jar.  Download the latest stable release binary from the Apache Tika downloads page, via your favourite favorite local mirror. You want the tika-server-1.x.jar file, eg e.g. tika-server-1.23.jar

You can start it by calling java with the -jar option, eg something like java -jar tika-server-1.23.jar

...

There is an unofficial image for Tika that has been available for years.  You can download and start it with:

No Format
java -jar tika-server-x.x.jar --host=intranet.local --port=12345 

...

Shipping in Tika 1.24 is a new Service Installation Script that lets you install Tika server as a service on Linux.  This script was heavily influenced by the Apache Solr project's script, so read up on that documentation if you want to customize the script..

Currently the script only supports CentOS, Debian, Red Hat, Suse and Ubuntu Linxu distributions. Before running the script, you need to determine a few parameters about your setup. Specifically, you need to decide where to install Tika server and which system user should be the owner of the Tika files and process

To run the scripts, you'll need the 1.24 (or later) Solr Tika distribution.  It will have a -bin suffice eg suffix e.g. tika-server-1.24-SNAPSHOT-bin.tgz.  Extract the installation script from the distribution via:

...

This will extract the install_tika_service.sh script from the archive into the current directory.  If installing on Red Hat, please make sure lsof is installed before running the Solr Tika installation script (sudo yum install lsof). The installation script must be run as root:

...

Once the script completes, Tika server will be installed as a service and running in the background on your server (on port 9998). To verify, you can do:

...

Your specific customization to Tika server setup are stored in the /etc/init.d/tika file.

...