...
No Format |
---|
$ curl -T price.xls http://localhost:9998/tika/text --header "Accept: application/json" |
Skip Embedded Files/Attachments
No Format |
---|
$ curl -T test_recursive_embedded.docx http://localhost:9998/tika --header "Accept: text/plain" --header "X-Tika-Skip-Embedded: true" |
Multipart Support
Tika Resource also accepts files as multipart/form-data attachments sent with POST. Posting a file as a multipart attachment can be useful when the file is too large to PUT directly in the request body. Note that the Tika JAX-RS server makes a best effort to store some of the multipart content on disk while still supporting streaming:
...
Note that the address has an extra "/form" path segment.
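As a rough sketch of how the extra "/form" segment fits into a request, the helper below builds the multipart address from the base URL used in the examples on this page and posts a file with curl's `-F` option. The function names `form_endpoint` and `post_multipart` and the form field name `upload` are assumptions made for illustration and are not taken from the Tika documentation.

```shell
#!/bin/sh
# Base URL of a locally running Tika server (same host and port as the other examples).
TIKA_URL="http://localhost:9998"

# Hypothetical helper: append the extra "/form" path segment to a resource name.
form_endpoint() {
    printf '%s/%s/form\n' "$TIKA_URL" "$1"
}

# Hypothetical helper: POST a file as a multipart/form-data attachment.
# curl -F sends the request as multipart/form-data with POST.
# "upload" is an assumed form field name.
post_multipart() {
    curl -F "upload=@$2" "$(form_endpoint "$1")"
}

# Example call (needs a running server and an existing file):
# post_multipart tika price.xls
```

Keeping the URL construction in its own function makes it possible to check the address without a running server.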
Skip Embedded Files/Attachments
No Format |
---|
$ curl -T test_recursive_embedded.docx http://localhost:9998/rmeta --header "X-Tika-Skip-Embedded: true" |
Specifying Limits
As of Tika 1.25, you can limit the maximum number of embedded resources and the write limit per handler.
...