Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. XML Descriptor files are the original method used to create pipelines in Apache UIMA™.  Though self-descriptive, they are verbose and error prone.
  2. uimaFIT™ enables creation of pipelines through Java code.  This greatly simplifies unit testing and experimentation.
  3. The PipelineBuilder class in ctakes-core is a facade for uimaFIT™ factories and objects.

  4. Piper files are a modern equivalent of the XML descriptor files.  Piper files list basic commands and parameters in a flat format.
CommandParameter 1Parameters 2-nDescription
packagepackage path 

Add to known packages. Shortens load and add specifications.

loadPiper file path Load external piper file.
setname=value<name=value ...>Add global parameter values.
cliname=char<name=char ...>

Add global parameter values based upon command-line character option values.

readerCR name<name=value ...>Set the collection reader for pipeline input data.
readFilesinput directory Set the collection reader for pipeline input data to the files in directory tree reader.
addAE or CC name<name=value ...>Add AE/CC to pipeline.
addDescriptionAE or CC name<value ...>

Add AE/CC to pipeline using its .createDescription method.

addLoggedAE or CC name<name=value ...>Add AE/CC to pipeline with Start/Finish logging.
addLastAE or CC name<name=value ...>Add AE/CC to the end of pipeline. Useful if the pipeline is meant to be extended.
writeXmisoutput directory Add XMI writer to the pipeline.
// or # or !comment text Line Comment.

...

  1. Create an empty text file.  The standard file extension for piper files is .piper
  2. Set a reader for your pipeline.   To set values to parameters used by the reader class, simply add one or more name=value pairs after the class name.   * readFiles is a convenience command.  "readFiles my/data" is equivalent to "reader FileTreeReader InputDirectory=my/data".
  3. add annotation engines and cas consumers to your pipeline. To set values to parameters used by the annotation engine class, simply add one or more name=value pairs after the class name. 
  4. load common groups of components from another piper file.  See Table 2 for piper files in cTAKES.
  5. reader, load and add* commands all take class names or file directories as their first parameter.  If the class is not in a standard cTAKES module's cr ae or cc package, or a piper file is not in a standard module's pipeline/ directory then the package or path must be specified for that component / file.
  6. Use package to simplify adding multiple pipeline components from a package not standard to cTAKES.
    Gliffy Diagram
    chromemin
    nameUse of package
  7. Use set to assign a value to a parameter used by following components.
    Gliffy Diagram
    nameset global value

    *A name=value pair on a component line will, for that component, override a set parameter value.
  8. cli is a special type of set that sets a parameter to some value entered by the User on a command line.

    Gliffy Diagram
    nameUse of cli
    * cli can only be used with the PiperFileRunner class or the bin/runPiperFile script.
    * Reserved parameters unavailable for cli are listed in Table 3.

  9. et Set a reader for your pipeline.  readFiles is a convenience method.  "readFiles my/data" is equivalent to "reader FileTreeReader InputDirectory=my/data".

  10. To add common groups of components, load another piper file.  See Table 2 for piper files in cTAKES.
  11. add annotation engines and cas consumers to your pipeline. 

...