THIS IS A TEST INSTANCE. ALL YOUR CHANGES WILL BE LOST!!!!
...
The following is all a work in progress. Please check back right before the workshop!
Prerequisites:
- java >= 8
- tika-eval app and tika-app jars: https://dlcdn.apache.org/tika/2.1.0/tika-eval-app-2.1.0.jar and https://dlcdn.apache.org/tika/2.1.0/tika-app-2.1.0.jar
- JSON editor/viewer (I like Sublime with the PrettyJSON plugin https://github.com/dzhibas/SublimePrettyJson)
- XLSX viewer (Excel or Open/LibreOffice)
Optional materials:
- tika-server-standard jar: https://dlcdn.apache.org/tika/2.1.0/tika-server-standard-2.1.0.jar
...
- tika-eval-core.jar: https://repo1.maven.org/maven2/org/apache/tika/tika-eval-core/2.1.0/tika-eval-core-2.1.0.jar
Example docs, extracts and config files: tika-eval-workshop-
...
20211109.tgz
...
Before the class, you should unzip the tika-eval-workshop-docs20211109.tgz (tar -xzvf tika-eval-workshop-docs20211109.tgz
) and run tika-app on them java -jar tika-app-2.1.0.jar -J -t -i tika-eval-workshop- docs -o extracts/my_extracts