Introduction
This guide explains how to run a labeling project with the Corpus Server and the Apache UIMA Cas Editor with the OpenNLP plugins.
Installing
Corpus Server
The Corpus Server runs inside an OSGi runtime container. This tutorial uses Apache Karaf.
Install Apache Karaf
To install Apache Karaf, follow these steps:
- Download it from http://karaf.apache.org/
(this guide was tested with version 2.2.7)
- Unpack it
- Start it up with bin/karaf
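The three steps above can be sketched as a small shell script. This is only an illustration: the archive URL is an assumption based on the usual layout of the Apache archive and should be checked against http://karaf.apache.org/, the names run and install_karaf are made up for this sketch, and curl and tar are assumed to be installed.

```shell
#!/bin/sh
# Sketch of the install steps. Set DRY_RUN=1 to print the commands
# instead of running them.

KARAF_VERSION="2.2.7"   # the version this guide was tested with
# Assumed download location; verify it on http://karaf.apache.org/ first.
KARAF_URL="https://archive.apache.org/dist/karaf/${KARAF_VERSION}/apache-karaf-${KARAF_VERSION}.tar.gz"

# Hypothetical helper: run a command, or only print it in dry-run mode.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Hypothetical helper: download, unpack, and start Karaf, stopping at the first failure.
install_karaf() {
    run curl -fLO "$KARAF_URL" &&
    run tar -xzf "apache-karaf-${KARAF_VERSION}.tar.gz" &&
    run "apache-karaf-${KARAF_VERSION}/bin/karaf"
}
```

Calling install_karaf with DRY_RUN=1 set previews the commands; without it, the last step opens the Karaf console in the current terminal.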
Build the Corpus Server
Check out the Corpus Server from svn with these two commands:
- svn co https://svn.apache.org/repos/asf/opennlp/sandbox/corpus-server
- svn co https://svn.apache.org/repos/asf/opennlp/sandbox/corpus-server-impl
This creates a folder for each Maven module.
First go to the corpus-server folder and run mvn install there,
then do the same in the corpus-server-impl folder.
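The build order matters because corpus-server-impl is built after corpus-server has been installed into the local Maven repository. A minimal sketch of the checkout and build as a script follows. The names run, build_module and build_corpus_server are invented for this example, and it assumes each module has its pom.xml at the top of its checkout folder.

```shell
#!/bin/sh
# Sketch: check out and build both modules in the order the guide gives.
# Set DRY_RUN=1 to print the commands instead of running them.

REPO_BASE="https://svn.apache.org/repos/asf/opennlp/sandbox"

# Hypothetical helper: run a command, or only print it in dry-run mode.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Hypothetical helper: check out one module, then build it.
build_module() {
    module=$1
    # mvn -f points Maven at the module's pom.xml (assumed location),
    # which has the same effect as running "mvn install" inside the folder.
    run svn co "${REPO_BASE}/${module}" &&
    run mvn -f "${module}/pom.xml" install
}

# corpus-server first, corpus-server-impl second; stop on the first failure.
build_corpus_server() {
    build_module corpus-server &&
    build_module corpus-server-impl
}
```

Run build_corpus_server from the directory that should hold the two checkouts.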
Install the Corpus Server
Start Apache Karaf via bin/karaf to open the Karaf console.
The following commands install all dependencies and the Corpus Server:
- features:refreshUrl file:///home/xyz/dev/opennlp/sandbox/corpus-server/feature.xml
- features:install opennlp-corpus-server
The corpus-server itself is just an interface layer that exposes an actual
Corpus Server implementation via its REST API to various tools that know this API.
To use the server, an actual implementation must be installed in the OSGi runtime as
well, e.g. the default implementation corpus-server-impl or a self-made one.
This can be done with these commands:
- features:refreshUrl file:///home/xyz/dev/opennlp/sandbox/corpus-server-impl/feature.xml
- features:install opennlp-corpus-server-impl
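The path /home/xyz/dev/opennlp/sandbox in the commands above is only an example and has to match the directory that holds your own checkouts. As a convenience, a small shell function can print the console commands for a given location so they can be pasted into the Karaf console. The function name karaf_feature_commands is hypothetical and not part of Karaf or OpenNLP.

```shell
#!/bin/sh
# Sketch: print the Karaf console commands for one module.
# karaf_feature_commands is a hypothetical helper for this guide only.
karaf_feature_commands() {
    sandbox_dir=$1   # absolute path that contains the checkouts
    module=$2        # corpus-server or corpus-server-impl
    echo "features:refreshUrl file://${sandbox_dir}/${module}/feature.xml"
    echo "features:install opennlp-${module}"
}

# Interface layer first, then the implementation, as in the guide.
for module in corpus-server corpus-server-impl; do
    karaf_feature_commands /home/xyz/dev/opennlp/sandbox "$module"
done
```

With the example path, the script prints the same four commands listed in this section, in the same order.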