You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

Introduction and Goal

Now that OpenNLP has a langdetect model available for download it would be useful to distribute this model as a Maven dependency. Having the model available as a Maven dependency can make the model easier to acquire, use, and promote OpenNLP.  Any work done for this task is captured by the task  Unable to render Jira issues macro, execution error. .

The langdetect model is built from the OpenNLP data repository in SVN at https://svn.apache.org/repos/bigdata/opennlp/trunk. It would be ideal to automate whatever process is chosen as much as possible to take the models built from that corpus and release them as Maven artifacts. At the time of writing, the langdetect model is the only model available for download but the process chosen should be able to support other types (sentence, token, namefinder, etc.) of models and languages of those models.

 

  • No labels