
...

Step

Example

1. If you do not have a UMLS username and password, you may request one at UMLS Terminology Services.

No example

2. Edit the following files. Find the line in each script that runs java and add the ctakes.umlsuser and ctakes.umlspw parameters to the java command with your credentials. Make sure you substitute your actual ID and password if you cut and paste the example.

Windows:

Code Block

language	none

<cTAKES_HOME>\bin\runctakesCVD.bat
<cTAKES_HOME>\bin\runctakesCPE.bat

Linux:

Code Block

language	none

<cTAKES_HOME>/bin/runctakesCVD.sh
<cTAKES_HOME>/bin/runctakesCPE.sh

Code Block

language	none

 
java -Dctakes.umlsuser=<YOUR_UMLS_ID_HERE> -Dctakes.umlspw=<YOUR_UMLS_PASSWORD_HERE> -cp ...

For example, if your username and password were literally myusername and mypassword, you could insert them before the -cp option so the start of the java command would look like this:

Code Block
java -Dctakes.umlsuser=myusername -Dctakes.umlspw=mypassword -cp ...
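
If you would rather not hard-code credentials in the scripts, one option is to assemble the two options from environment variables. The sketch below is only an illustration: `UMLS_USER`, `UMLS_PW`, and the helper name `umls_java_opts` are hypothetical names chosen here, not something cTAKES defines, and the function only prints the option string.

```shell
#!/bin/sh
# Print the two -D options that this guide adds to the java command.
# UMLS_USER and UMLS_PW are hypothetical environment variable names
# chosen for this sketch; they are not defined by cTAKES.
umls_java_opts() {
    if [ -z "${UMLS_USER:-}" ] || [ -z "${UMLS_PW:-}" ]; then
        echo "UMLS_USER and UMLS_PW must both be set" >&2
        return 1
    fi
    printf '%s %s\n' "-Dctakes.umlsuser=${UMLS_USER}" "-Dctakes.umlspw=${UMLS_PW}"
}
```

With `UMLS_USER=myusername` and `UMLS_PW=mypassword`, the function prints the same two options as the example above. Splicing the output into the java line by unquoted command substitution would split a password that contains spaces, so treat this as a starting point rather than a finished solution.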

Process documents using cTAKES

...

Step

Example

1. Open a command prompt and change to the cTAKES_HOME directory.

Note
Make <cTAKES_HOME> your current directory before running any of the commands below. The scripts change directories as they run, so starting from <cTAKES_HOME> is the safest choice.

Windows:

Code Block

language	none

cd \apache-ctakes-3.2.1

Linux:

Code Block

language	none

cd /usr/local/apache-ctakes-3.2.1

2. Start the CAS Visual Debugger by running this command:
The application may take a minute to start on slower hardware.

Windows:

Code Block

language	none

bin\runctakesCVD.bat

Linux:

Code Block

language	none

bin/runctakesCVD.sh
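
Before launching, it can help to confirm that the directory you changed into really contains the launch scripts. The following is a minimal sketch for Linux; `check_ctakes_home` is a hypothetical helper written for this page, not part of cTAKES, and it only checks for the two script names mentioned earlier.

```shell
#!/bin/sh
# Check that a directory looks like a cTAKES install before launching CVD.
# check_ctakes_home is a hypothetical helper, not shipped with cTAKES.
check_ctakes_home() {
    home=$1
    for script in bin/runctakesCVD.sh bin/runctakesCPE.sh; do
        if [ ! -f "$home/$script" ]; then
            echo "missing: $home/$script" >&2
            return 1
        fi
    done
    return 0
}
```

For example, `check_ctakes_home /usr/local/apache-ctakes-3.2.1 && cd /usr/local/apache-ctakes-3.2.1 && bin/runctakesCVD.sh` starts CVD only if both scripts are present.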

3. Copy the example text from the next cell in this table and paste the contents into the Text section of CVD, replacing the text that is already there.

You can also download a copy of the file from here

Code Block

lang	none

Dr. Nutritious

Medical Nutrition Therapy for Hyperlipidemia
Referral from: Julie Tester, RD, LD, CNSD
Phone contact: (555) 555-1212
Height: 144 cm Current Weight: 45 kg Date of current weight: 02-29-2001
Admit Weight: 53 kg BMI: 18 kg/m2
Diet: General
Daily Calorie needs (kcals): 1500 calories, assessed as HB + 20% for activity.
Daily Protein needs: 40 grams, assessed as 1.0 g/kg.
Pt has been on a 3-day calorie count and has had an average intake of 1100 calories.
She was instructed to drink 2-3 cans of liquid supplement to help promote weight gain.
She agrees with the plan and has my number for further assessment. May want a Resting
Metabolic Rate as well. She takes an aspirin a day for knee pain.

4. An analysis engine (AE) needs to be loaded in order to process text.
If you installed the UMLS resources, use

No Format
AggregatePlaintextFastUMLSProcessor.xml

in this step.

From the menu bar, choose Run -> Load AE, then navigate to the file:

Code Block

language	none

<cTAKES_HOME>
  /desc
    /ctakes-clinical-pipeline
      /desc
        /analysis_engine
          /AggregatePlaintextFastUMLSProcessor.xml

Click Open.
Loading the analysis engine may take a minute. Once the analysis engine has successfully loaded you should see a tree in the Analysis Results frame:

CAS Index Repository
* SofaIndex [0]
* AnnotationIndex [1]
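
If the Load AE dialog cannot find the descriptor, you can check the path from a shell first. This sketch assumes the directory layout shown in this step; `ae_descriptor_path` is a hypothetical helper name, and the default file name is the one this step loads.

```shell
#!/bin/sh
# Build the descriptor path from step 4 and confirm the file is readable.
# ae_descriptor_path is a hypothetical helper; the directory layout is the
# one shown in this step.
ae_descriptor_path() {
    home=$1
    name=${2:-AggregatePlaintextFastUMLSProcessor.xml}
    path="$home/desc/ctakes-clinical-pipeline/desc/analysis_engine/$name"
    if [ ! -r "$path" ]; then
        echo "descriptor not found: $path" >&2
        return 1
    fi
    printf '%s\n' "$path"
}
```

Run from <cTAKES_HOME>, `ae_descriptor_path .` prints the relative path of the descriptor, or reports on standard error that it is missing.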

5. From the menu bar, click Run -> Run AggregatePlaintextFastUMLSProcessor.

Note: To test a few simple annotators and confirm that cTAKES works without UMLS, you can instead load:

<cTAKES_HOME>/desc/ctakes-core/desc/analysis_engine/SentencesAndTokensAggregate.xml

6. The Analysis Results frame now lists all the annotations for this clinical document. Annotations produced by the pipeline, such as named entities and sentence boundaries, are viewable. To see one, in the Analysis Results frame, click on the key in front of:

CAS Index Repository
* AnnotationIndex
* uima.tcas.Annotation
* org.apache.ctakes.typesystem.type.textsem.IdentifiedAnnotation
* org.apache.ctakes.typesystem.type.textsem.EventMention

This shows an AnnotationIndex in the lower frame. Select any annotation in that lower frame and you will see the corresponding text in the text frame on the right, for example the disease/disorder concept Hyperlipidemia.

For a medication example, select this:

CAS Index Repository
* AnnotationIndex
* uima.tcas.Annotation
* org.apache.ctakes.typesystem.type.textsem.IdentifiedAnnotation
* org.apache.ctakes.typesystem.type.textsem.EventMention
* org.apache.ctakes.typesystem.type.textsem.MedicationMention

Now select items in the lower frame to see the text being annotated.

You may close the CAS Visual Debugger (CVD) application if you wish.

...

Annotator	Description	Example Aggregate Analysis Engine (AE)	Example Collection Processing Engine (CPE)
Clinical Document Pipeline	The complete cTAKES pipeline, which produces the majority of cTAKES annotations	<cTAKES_HOME>/desc/ctakes-clinical-pipeline/desc/analysis_engine/AggregatePlaintextUMLSProcessor.xml	<cTAKES_HOME>/desc/ctakes-clinical-pipeline/desc/collection_processing_engine/test1.xml
Chunker	Obtain cTAKES chunk annotations	NA	NA
Dependency Parser	Obtain dependency parsing tree	<cTAKES_HOME>/desc/ctakes-dependency-parser/desc/analysis_engine/ClearParserSRLTokenizedInfPosAggregate.xml	<cTAKES_HOME>/desc/ctakes-dependency-parser/desc/collection_processing_engine/ClearParserTestCPE.xml
Drug NER	The annotator to obtain drug annotations	<cTAKES_HOME>/desc/ctakes-drug-ner/desc/analysis_engine/DrugAggregatePlaintextUMLSProcesor.xml	<cTAKES_HOME>/desc/ctakes-drug-ner/desc/collection_processing_engine/DrugNER_PlainText_CPE.xml
Dictionary Lookup	Mapping cTAKES annotations to dictionaries (e.g., SNOMED_CT or RxNorm)	<cTAKES_HOME>/desc/ctakes-dictionary-lookup/desc/analysis_engine/TestAggregateTAE.xml	NA
Relation Extractor	Annotate certain relations between certain Event, Entity, and Modifier annotations	<cTAKES_HOME>/desc/ctakes-relation-extractor/desc/analysis_engine/RelationExtractorAggregate.xml	N/A
Smoking Status	The annotator to obtain document or patient-level smoking status	<cTAKES_HOME>/desc/ctakes-smoking-status/desc/analysis_engine/SimulatedProdSmokingTAE.xml	<cTAKES_HOME>/desc/ctakes-smoking-status/desc/collection_processing_engine/Sample_SmokingStatus_output_flatfile.xml
Side Effect	The annotator to find side effect mentions and sentences from clinical documents	<cTAKES_HOME>/desc/ctakes-side-effect/desc/analysis_engine/SideEffectAggregateTAE_UMLS.xml	<cTAKES_HOME>/desc/ctakes-side-effect/desc/collection_processing_engine/SideEffectCPE.xml
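
Descriptor names can differ between cTAKES releases, so it may be worth listing what your installation actually contains before relying on the table. The sketch below assumes the `desc/<module>/desc/analysis_engine/` layout used by the paths above; `list_ae_descriptors` is a hypothetical helper name.

```shell
#!/bin/sh
# List every analysis engine descriptor under <cTAKES_HOME>/desc, one per line.
# list_ae_descriptors is a hypothetical helper; it assumes the
# desc/<module>/desc/analysis_engine/ layout used in the table above.
list_ae_descriptors() {
    home=$1
    find "$home/desc" -type f -path '*/desc/analysis_engine/*.xml' | sort
}
```

Replacing `analysis_engine` with `collection_processing_engine` in the pattern lists the CPE descriptors instead.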

...

Before you go on to process text in production, consider which dictionaries and models you will use. If you have not yet obtained the rights to the UMLS resources and models, you will want to do so. Be aware that the models were trained on data that may not match your own data well enough to be effective. In some cases you may want to modify the dictionaries and train models on your own data.
