Overview of LVG
This annotator wraps the National Library of Medicine (NLM) SPECIALIST lexical tools. For each word, it generates a canonical form and a list of lemma entries with Penn Treebank tags. These tags could be useful to a part-of-speech (POS) tagger; however, for the OpenNLP POS tagger, we use a tag dictionary rather than lemma information.
For details, refer to the documentation for the POS tagger annotator.
Analysis engines (annotator) - LvgAnnotator.xml
Parameters
UseSegments
Controls whether this annotator processes only certain sections (segments) of a document rather than the whole document.
...
LemmaCacheFileFrequencyCutoff
(cutoff value)
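As a rough illustration, parameters like these are normally assigned in the configurationParameterSettings element of a UIMA analysis engine descriptor such as LvgAnnotator.xml. The fragment below is a sketch only: the Boolean and Integer value types and the values false and 20 are assumptions chosen for illustration, not settings confirmed by this page. Check the descriptor shipped with your installation for the actual types and defaults.

```xml
<!-- Sketch of parameter settings inside an analysis engine descriptor. -->
<!-- Value types and values are illustrative assumptions. -->
<configurationParameterSettings>
  <nameValuePair>
    <name>UseSegments</name>
    <value>
      <!-- assumed: false means all sections are annotated -->
      <boolean>false</boolean>
    </value>
  </nameValuePair>
  <nameValuePair>
    <name>LemmaCacheFileFrequencyCutoff</name>
    <value>
      <!-- assumed cutoff value, for illustration only -->
      <integer>20</integer>
    </value>
  </nameValuePair>
</configurationParameterSettings>
```

Each nameValuePair must correspond to a parameter declared in the same descriptor's configurationParameters element, with a matching type.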
Resources
- lvg.properties - The LVG configuration file, resources/lvg/data/config/lvg.properties, defines the location and attributes of the LVG database and the JDBC driver used to access it.
- LVG database - The database engine is HSQLDB. The included database file is only a sample. Refer to the LVG section of the install instructions for details on how to replace it.