Use a fetcher with the traditional /tika and /rmeta endpoints

Update the fetcher's <basePath> element in configs/tika-config-basic.xml to the full path of tika-pipes-tutorial-20221202/docs:

  <fetcher class="org.apache.tika.pipes.fetcher.fs.FileSystemFetcher">
    <params>
      <name>fsf</name>
      <basePath>/Users/allison/Desktop/tika-pipes-tutorial-20221202/docs</basePath>
    </params>
  </fetcher>

Start the server:
java -cp "server-bin/*" org.apache.tika.server.core.TikaServerCli -c configs/tika-config-basic.xml

Then request the metadata for a file under <basePath> through the fetcher:
curl -X PUT http://localhost:9998/rmeta -H "fetcherName:fsf" -H "fetchKey:testPDF.pdf" | jq --sort-keys
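
For readers who would rather script the call than use curl, the same request can be sketched in Python. This is a minimal sketch, not part of the tutorial bundle: it assumes the server is running on localhost:9998 as above, and the function names build_rmeta_request and fetch_metadata are hypothetical. The request has no body; the fetcherName and fetchKey headers tell the server which file to fetch.

```python
import json
import urllib.request

# Endpoint taken from the curl command in this tutorial.
TIKA_RMETA_URL = "http://localhost:9998/rmeta"


def build_rmeta_request(fetcher_name: str, fetch_key: str,
                        url: str = TIKA_RMETA_URL) -> urllib.request.Request:
    """Build a body-less PUT whose headers name the fetcher and the file key.

    Hypothetical helper: it mirrors `curl -X PUT ... -H fetcherName -H fetchKey`.
    """
    return urllib.request.Request(
        url,
        method="PUT",
        headers={"fetcherName": fetcher_name, "fetchKey": fetch_key},
    )


def fetch_metadata(fetcher_name: str, fetch_key: str) -> str:
    """Send the request and return the JSON response with sorted keys.

    Sorting the keys plays the role of `jq --sort-keys` in the curl pipeline.
    Requires a running tika-server with the fetcher configured.
    """
    request = build_rmeta_request(fetcher_name, fetch_key)
    with urllib.request.urlopen(request) as response:
        metadata = json.load(response)
    return json.dumps(metadata, indent=2, sort_keys=True)
```

With the server started as above, print(fetch_metadata("fsf", "testPDF.pdf")) should print roughly what the curl and jq pipeline prints.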

Use the /pipes handler to read from and write to a local file share

Update the emitter's <basePath> element in configs/tika-config-basic.xml to the full path of tika-pipes-tutorial-20221202/extracts:

  <emitters>
    <emitter class="org.apache.tika.pipes.emitter.fs.FileSystemEmitter">
      <params>
        <name>fse</name>
        <basePath>/Users/allison/Desktop/tika-pipes-tutorial-20221202/extracts</basePath>
      </params>
    </emitter>
  </emitters>

Start the server:
java -cp "server-bin/*" org.apache.tika.server.core.TikaServerCli -c configs/tika-config-basic.xml

Command line for the /pipes request: TBD

Configure a metadata handler and rerun step 2 (the /pipes example above).
Use the /async handler to go from file share to file share.
Configure a Solr/OpenSearch/Elasticsearch emitter and run the /pipes handler.
Run the async processor via tika-app.
