
Overview of LVG

This annotator wraps the National Library of Medicine (NLM) SPECIALIST lexical tools. It generates a canonical form for each word; this normalization step helps dictionary lookup.* It also generates a list of lemma entries with Penn Treebank tags. These tags could be useful to a part-of-speech (POS) tagger. For the OpenNLP POS tagger, however, we use a tag dictionary rather than lemma information.

Refer to the documentation for the POS tagger annotator.

*Note: LVG adds variants that the dictionary lookup uses in an attempt to find terms whose form in the text is not present in the dictionary database, e.g., the singular variants of plural forms and capitalization variants. While LVG often increases the number of dictionary terms found in the text, it also risks introducing false positive matches from the dictionary.
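
The trade-off described in the note can be illustrated with a toy sketch. This is not the LVG algorithm or its API: the helper names (toy_variants, lookup), the naive suffix rules, and the dictionary entries are all hypothetical, chosen only to show how added variants can both recover a term and introduce a false positive.

```python
# Toy illustration only: this is NOT the LVG algorithm or its API.
# All names, rules, and dictionary entries below are hypothetical.

def toy_variants(token):
    """Return a small set of lookup variants for a token."""
    lower = token.lower()
    variants = {token, lower}  # original form plus a capitalization variant
    # Naive singularization, standing in for LVG's lexicon-driven handling.
    if lower.endswith("ies") and len(lower) > 4:
        variants.add(lower[:-3] + "y")
    elif lower.endswith("s") and not lower.endswith("ss") and len(lower) > 3:
        variants.add(lower[:-1])
    return variants


def lookup(token, dictionary):
    """Return the dictionary entries matched by any variant of the token."""
    return sorted(toy_variants(token) & dictionary)


DICTIONARY = {"biopsy", "fracture", "aid"}  # hypothetical dictionary entries

print(lookup("Fractures", DICTIONARY))  # plural, capitalized form still matches
print(lookup("biopsies", DICTIONARY))   # "-ies" plural reduced to its singular
print(lookup("AIDS", DICTIONARY))       # matches "aid": a false positive
```

In this sketch the first two lookups succeed only because variants were added, while the third shows the risk: the acronym is reduced to an unrelated dictionary entry.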

Analysis engines (annotator) - LvgAnnotator.xml

...