Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: refer to GUI in 1st sentence under Building Your Own Dictionary

...

  • An RxNorm_index database (a Lucene index): Contains drug names from RxNorm.
  • The OrangeBook: If you are not using the drug NER pipeline, the Orange Book is used to filter out what it found in RxNorm so that only things in both RxNorm and Orange Book are annotated. If you use Drug NER, Orange Book filtering is bypassed.
  • UMLS database (using two hsqldb tables): Contains terms for anatomical sites, procedures, signs/symptoms, and disorders/diseases from SNOMED-CT, NCI Thesaurus, MeSH, and ICD-9 (umls_ms_2011ab) which have been tokenized by cTAKES.
  • The full LVG: From the lexical tools provided by the NLM for word normalization. Used to match similar words, for example the plural and singular forms of a word.

 

Building Your Own Dictionaries

See the cTAKES GUI component for building your own dictionary for the Fast Dictionary Lookup.

The UMLS dictionaries within the ctakes-resources archive might not match your underlying data completely. You might require other local terms, etc. To create customized dictionaries for RxNorm, SNOMED-CT, or other vocabularies that are available through the UMLS, you may use one of the dictionary tools currently in development (dictionary-gui and dictionarytool), that tools that can be found in the cTAKES GUI component.

...