Stay tuned for prerequisites, resources and an agenda!

The following is all a work in progress.  Please check back right before the workshop!

Prerequisites:

  1. Java 8 or later
  2. tika-eval-app and tika-app jars: https://dlcdn.apache.org/tika/2.1.0/tika-eval-app-2.1.0.jar and https://dlcdn.apache.org/tika/2.1.0/tika-app-2.1.0.jar
  3. JSON editor/viewer (I like Sublime with the PrettyJSON plugin https://github.com/dzhibas/SublimePrettyJson)
  4. XLSX viewer (Excel or Open/LibreOffice)
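
A quick way to check the Java prerequisite is sketched below. The helper name java_major_version is hypothetical, and the script assumes that `java -version` prints the version as the first quoted string in its output, which holds for common JDK builds but is not guaranteed for every vendor.

```shell
#!/bin/sh
# Hypothetical helper: extract the major version from a Java version string.
# Handles both the legacy "1.8.0_292" scheme and the newer "11.0.12" scheme.
java_major_version() {
    version="$1"
    case "$version" in
        1.*) version="${version#1.}" ;;   # legacy scheme: 1.8.0_292 -> 8.0_292
    esac
    # Keep only the leading run of digits.
    echo "$version" | sed 's/[^0-9].*$//'
}

if command -v java >/dev/null 2>&1; then
    # `java -version` writes to stderr; assume the version is the first quoted string.
    raw=$(java -version 2>&1 | sed -n 's/.*version "\([^"]*\)".*/\1/p' | head -n 1)
    major=$(java_major_version "$raw")
    if [ -n "$major" ] && [ "$major" -ge 8 ]; then
        echo "Java $raw looks sufficient (major version $major)."
    else
        echo "Java $raw may be too old; version 8 or later is required." >&2
    fi
else
    echo "java not found on PATH." >&2
fi
```

If the script reports that Java is missing or too old, install a newer JDK before downloading the jars.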

Example tika-config files: TBD

Before the class, extract tika-eval-workshop-docs.tgz (tar -xzvf tika-eval-workshop-docs.tgz) and run tika-app on the extracted documents: java -jar tika-app-2.1.0.jar -J -t -i tika-eval-workshop-docs -o extracts
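
After the tika-app run finishes, a rough sanity check is to compare the number of input documents with the number of files written to the extracts directory. The sketch below uses a hypothetical helper, count_files, and assumes that tika-app writes roughly one extract file per input document; that assumption is not confirmed here, so treat a mismatch as a prompt to look closer rather than as a definite error.

```shell
#!/bin/sh
# Hypothetical helper: count regular files under a directory, recursively.
# Prints 0 if the directory does not exist.
count_files() {
    if [ -d "$1" ]; then
        find "$1" -type f | wc -l | tr -d ' '
    else
        echo 0
    fi
}

# Directory names are taken from the tika-app command on this page.
docs=$(count_files tika-eval-workshop-docs)
extracts=$(count_files extracts)
echo "input documents: $docs, extract files: $extracts"

# Assumption: one extract file per input document.
if [ "$extracts" -lt "$docs" ]; then
    echo "Fewer extracts than documents; some files may have failed to parse." >&2
fi
```

Run it from the directory that contains both tika-eval-workshop-docs and extracts.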