Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

When using lang detection, you need to change also the dependencies from 2.0.0. It was before:

And it's now:

Code Block
languagexml
titlepom.xml 12.0.270
<dependency>
  <groupId>org.apache.tika</groupId>
  <artifactId>tika-langdetect-langdetect<optimaize</artifactId>
  <version>1<version>2.0.27<0</version>
</dependency>



tika-app

tika-server

General

Configuration

tika-pipes

See the tika-pipes page.

tika-eval

tika-langid

In the 1.x branch, the default (hardwired) language identification wrapper was the wrapper around optimaize.  In 1.x, you'd useAnd it's now:

Code Block
languagexml
titlepom.xml 21.0.027
<dependency>
  <groupId>org.apache.tika</groupId>
  <artifactId>tika-langdetect-tika<langdetect</artifactId>
  <version>2<version>1.0.0<27</version>
</dependency>

In 2.x, change this to:

No Format

<dependency>
  <groupId>org.apache.tika</groupId>
  <artifactId>tika-langdetect-optimaize</artifactId>
  <version>2.0.0<x</version>
</dependency>

tika-app

tika-server

General

Configuration

tika-pipes

See the tika-pipes page.

tika-eval

...

The legacy homegrown language id component that used to be in tika-core is now in the tika-langdetect-tika module.