...
Named Entity Recognition is supported in tika-parsers v1.12 (TIKA-1787). This page describes the steps required to configure and activate the NamedEntityParser.
Contents
Activate Named Entity Parser
...
The following table shows types of entities and the paths to place the model file.
Entity Type | Path for model | URL to get |
PERSON | org/apache/tika/parser/ner/opennlp/ner-person.bin | |
LOCATION | org/apache/tika/parser/ner/opennlp/ner-location.bin | http://opennlp.sourceforge.net/models-1.5/en-ner-location.bin |
ORAGANIZATION | org/apache/tika/parser/ner/opennlp/ner-organization.bin | http://opennlp.sourceforge.net/models-1.5/en-ner-organization.bin |
DATE | org/apache/tika/parser/ner/opennlp/ner-date.bin | |
TIME | org/apache/tika/parser/ner/opennlp/ner-time.bin | |
PERCENT | org/apache/tika/parser/ner/opennlp/ner-percentage.bin | http://opennlp.sourceforge.net/models-1.5/en-ner-percentage.bin |
MONEY | org/apache/tika/parser/ner/opennlp/ner-money.bin |
Notes:
- You can use any combination of the models. If you are interested in only the LOCATION names, then skip other NER models save LOCATION.
NER Models for other languages are also available http://opennlp.sourceforge.net/models-1.5/ . If you choose to use different language, use those URLs in the below script.
...