The table below highlights some of the differences between the handlers. I've temporarily left in question marks for items we need to confirm.

Feature                                                       tika (text|body)  tika (html)  tika (json)  rmeta  meta  unpack
Text (including text of embedded documents)                   Y                 Y            Y            Y      N     Y (with /unpack/all)
Metadata of main document                                     N                 Y            Y            Y      Y     ? Y (with /unpack/all)
Metadata of embedded documents/attachments                    N                 N            N            Y      N     N
Notification of parse exception                               Y/N[1]            Y/N[1]       Y            Y      Y     Y?
Specific stacktrace if server is started with the -s option   N                 N            Y            Y      N     N
Notification of parse exception in embedded document          N                 N            N            Y      N     N?
Specific stacktrace for parse exception in embedded document  N                 N            N            Y      N     N
Streaming write[2]                                            Y                 Y            N            N      N     N
Actual attachments (raw bytes)                                N                 N            N            N      N     Y


[1] If the parse exception occurs early in the parse, before streaming starts (as with an EncryptedDocumentException), you'll get an HTTP status 422 from /tika (text) and /tika (html). If the parse exception happens after content has started streaming, the stream simply stops. With /tika (text) you'll have no indication that a parse exception occurred; with /tika (html) you'll see truncated HTML.
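
To make the two failure modes in this footnote concrete, here is a minimal client-side sketch of how a caller might classify a /tika response. The function name, the handler labels ("text", "html") and the returned strings are hypothetical and not part of Tika. Treating a missing closing </html> tag as a sign of truncation is an assumption of this sketch, not documented server behavior.

```python
def classify_tika_response(status: int, body: str, handler: str) -> str:
    """Classify a /tika response along the lines of the footnote above.

    `handler` is a hypothetical label, "text" or "html", for which /tika
    output was requested. The returned strings are illustrative only.
    """
    if status == 422:
        # The parse exception occurred before streaming started.
        return "parse exception before streaming"
    if status != 200:
        return "other error"
    if handler == "html":
        # Assumption: complete output ends with a closing </html> tag, so a
        # missing tag suggests the stream stopped partway through the parse.
        if body.rstrip().lower().endswith("</html>"):
            return "complete"
        return "possibly truncated"
    # Plain text carries no end marker, so a mid-stream parse exception
    # cannot be told apart from a normal end of content.
    return "unknown (truncation not detectable)"
```

For example, a 200 response from the html handler whose body stops at "<html><body><p>partial" would be classified as "possibly truncated", while the same body from the text handler can only be reported as "unknown (truncation not detectable)". If you need reliable notification of parse exceptions, the table above suggests using /tika (json) or /rmeta instead.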

...