Overview of Document Preprocessor

This component provides the CdaCasInitializer annotator, which transforms a Clinical Document Architecture (CDA) document into plain text, provided the CDA document conforms to the supported DTD.

...

Info

The annotator does not handle all CDA documents. The input CDA document must conform to the DTD resources/cda/NotesIIST_RTF.DTD.

Analysis engines (annotators)

AggregateAE.xml

The file cTAKESdesc/docpredesc/AggregateAE.xml defines a pipeline for preprocessing documents. The pipeline contains a single delegate analysis engine (annotator), the CdaCasInitializer, and is included for testing. Typically, the CdaCasInitializer.xml descriptor is included in a more complete pipeline instead of the AggregateAE.xml descriptor from this project.
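To make the idea of an aggregate with a single delegate concrete, here is a minimal Python sketch that lists the delegates declared in an aggregate descriptor. The embedded XML is a hypothetical example modelled on the general layout of UIMA aggregate descriptors. It is not copied from the actual AggregateAE.xml, and the key name and import location in it are assumptions.

```python
import xml.etree.ElementTree as ET

# Namespace used by UIMA resource specifier descriptors.
NS = {"uima": "http://uima.apache.org/resourceSpecifier"}

# Hypothetical aggregate descriptor (assumption, not the real AggregateAE.xml).
SAMPLE_AGGREGATE = """
<analysisEngineDescription xmlns="http://uima.apache.org/resourceSpecifier">
  <frameworkImplementation>org.apache.uima.java</frameworkImplementation>
  <primitive>false</primitive>
  <delegateAnalysisEngineSpecifiers>
    <delegateAnalysisEngine key="CdaCasInitializer">
      <import location="CdaCasInitializer.xml"/>
    </delegateAnalysisEngine>
  </delegateAnalysisEngineSpecifiers>
  <analysisEngineMetaData>
    <name>AggregateAE</name>
    <flowConstraints>
      <fixedFlow>
        <node>CdaCasInitializer</node>
      </fixedFlow>
    </flowConstraints>
  </analysisEngineMetaData>
</analysisEngineDescription>
"""


def delegate_keys(descriptor_xml):
    """Return the key of every delegate analysis engine in the descriptor."""
    root = ET.fromstring(descriptor_xml.strip())
    path = "uima:delegateAnalysisEngineSpecifiers/uima:delegateAnalysisEngine"
    return [delegate.get("key") for delegate in root.findall(path, NS)]


def flow_nodes(descriptor_xml):
    """Return the delegate keys in the order the fixed flow runs them."""
    root = ET.fromstring(descriptor_xml.strip())
    return [node.text for node in root.findall(".//uima:fixedFlow/uima:node", NS)]


keys = delegate_keys(SAMPLE_AGGREGATE)
nodes = flow_nodes(SAMPLE_AGGREGATE)
print(keys, nodes)
```

A fuller pipeline would list further delegates after the CdaCasInitializer in the same two places: one more delegate entry and one more flow node per annotator.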

CdaCasInitializer.xml

The CdaCasInitializer descriptor defines the analysis engine (annotator) for preprocessing documents. It creates a plain text view from a CDA view. Other components can then annotate the plain text view with tokens, parts of speech, chunks, and so on.
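The step from a CDA view to a plain text view can be illustrated with a short Python sketch. This is not the CdaCasInitializer implementation. The element names in the sample input, the view names, and the `cda_to_plain_text` helper are all assumptions chosen for illustration, and the sample does not follow the NotesIIST_RTF DTD.

```python
import xml.etree.ElementTree as ET

# Hypothetical CDA-like input; element names are illustrative only.
SAMPLE_CDA = """
<ClinicalDocument>
  <section>
    <title>History of Present Illness</title>
    <text>Patient reports chest pain
      for two days.</text>
  </section>
  <section>
    <title>Medications</title>
    <text>Aspirin 81 mg daily.</text>
  </section>
</ClinicalDocument>
"""


def cda_to_plain_text(cda_xml):
    """Flatten each <section> into a title line followed by its body text."""
    root = ET.fromstring(cda_xml.strip())
    blocks = []
    for section in root.iter("section"):
        title = (section.findtext("title") or "").strip()
        text_elem = section.find("text")
        raw = "".join(text_elem.itertext()) if text_elem is not None else ""
        body = " ".join(raw.split())  # collapse markup-induced whitespace
        blocks.append("\n".join(part for part in (title, body) if part))
    return "\n\n".join(blocks)


# Two views of the same document, keyed by (assumed) view name.
views = {"cda": SAMPLE_CDA}
views["plaintext"] = cda_to_plain_text(views["cda"])
print(views["plaintext"])
```

Keeping the original markup in one view and the derived text in another means downstream annotators only ever see plain text, while the source document stays available unchanged.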

...