...
Section | ||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
Overview of Document Preprocessor
This component provides a CdaCasInitializer SECTION annotator that transforms a Clinical Document Architecture (CDA) document into plain text, provided the CDA document conforms to the DTD schema.
...
Info |
---|
This does not handle all CDA documents. The CDA document must conform to the DTD resources/cda/NotesIIST_RTF.DTD. |
Analysis engines (annotators)
AggregateAE.xml
The file cTAKESdesc/docpredesc/AggregateAE.xml defines a pipeline for preprocessing documents. The pipeline is a simple pipeline with only one delegate analysis engine (one annotator), the CdaCasInitializer, and is included for testing. Typically the CdaCasInitializer.xml descriptor is included in a more complete pipeline rather than using the AggregateAE.xml descriptor that is in this project.
CdaCasInitializer.xml
The CdaCasInitializer descriptor defines the analysis engine (annotator) for preprocessing documents. It creates a plain text view from a CDA view. The plain text view can then be annotated, using other components, for tokens, parts of speech, chunks, etc.
...