Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Note that the address has an extra "/form" path segment.

Specifying Limits

As of Tika 1.25, you can limit the maximum number of embedded resources and the write limit per handler.

To specify the maximum number of embedded documents, set the maxEmbeddedResources in the header.  Note that the container document does not count towards this number.  The following will return the results for the container document only.

No Format
curl -T test_recursive_embedded.docx --header "maxEmbeddedResources: 0" http://localhost:9998/rmeta


To specify a write limit per handler, set the writeLimit  parameter in a header.   This limit applies to each handler (each embedded document).  If this is triggered, the parse will continue on to the next embedded object and store  X-TIKA:Exception:write_limit_reached = "true"  in the metadata object for the embedded file that triggered the write limit.


No Format
curl -T test_recursive_embedded.docx --header "writeLimit: 1000" http://localhost:9998/rmeta


Unpack Resource

No Format
/unpack

...