Section |
---|
Column |
---|
| Panel |
---|
title | Contents of this Page |
---|
| |
|
Column |
---|
Include Page |
---|
| CTAKES:Menu cTAKES 3.1 0 to IncludeCTAKES: |
---|
| Menu cTAKES 3.1 0 to Include |
---|
|
|
|
These are instructions for installation of cTAKES for developers. With these instructions you can set up your development environment with cTAKES code, change or extend the code, compile the code, and deploy. If you simply want to be a user of the software, refer to the cTAKES 3.1 0 User Install InstructionsGuide.
Knowledge about what the cTAKES components do is not supplied by the install instructions. This is found in the cTAKES 3.1 0 Component Use Guide. There is no training or documentation (except for code comments) on the code itself. You must familiarize yourself with the components and then study the code on your own to be able to extend it.
In order to modify/compile the source code for a cTAKES component, developers must utilize either an IDE, such as Eclipse, or another editor of your choice. Compiles are then performed in the IDE or with Ant. Follow the appropriate sections here depending upon your developer preferences.
Once you have compiled the code you can process documents with the cTAKES components. The documents upon which you can run cTAKES will take many forms. An example of doing this is covered in the Processing Documents section.
Prerequisites
Preparing Java
...
Step
...
Example
...
...
Code Block |
---|
|
C:\>java -version
java version "1.6.0_20"
Java(TM) SE Runtime Environment (build 1.6.0_20-b02)
Java HotSpot(TM) Client VM (build 16.3-b01, mixed mode, sharing)
|
...
2. It is possible that some commands and programs can find the Java
runtime that you want to be used but it is best to set the JAVA_HOME
environment variable. Set the value of JAVA_HOME to the absolute path of
the root of the Java Runtime environment that you want UIMA to use.
Windows:
Right-click on Computer> Properties> *Advanced*tab > *Environment Variables*button > *New* button for System variables. Keep clicking *OK* until you are out of the dialog series.
Linux:
Code Block |
---|
|
export JAVA_HOME=<path>
|
...
Image Removed
Preparing Eclipse
If you are going to use Eclipse for development then follow these instructions.
...
Step
...
Example
...
No example
...
Code Block |
---|
|
http://www.apache.org/dist/uima/eclipse-update-site
|
\
Click OK.
...
Image Removed
...
Image Removed
Note |
---|
|
These install instructions depend upon the installation of these plug-ins. |
...
Image Removed
Preparing Command Line Tools
If you are going to use command line only to compile then you will need these tools.
...
Step
...
Example
...
1. Navigate to the Ant download site on apache.org and install Ant 1.7.1+
...
Image Removed
...
2. Download Ant 1.7.1+. Unzip the file you downloaded to a local directory. We will call this <ANT_HOME> Follow the instructions for installing Ant on apache.org. This will include changing the PATH and ANT_HOME environment variables.
...
Compile the latest stable release in Eclipse
A tested and stable release is delivered as a ZIP file. The cTAKES source code is included in the file as well as UIMA.
...
Step
...
Example
...
1. Navigate to the source downloads for areleased version on SourceForge.net
...
Even without the full LVG data, cTAKES is about 790 MB when compressed.
...
Image Removed
3.
Unzip the file you downloaded into a directory that you want to be the
cTAKES install location. It will expand to about 1.4GB.
Windows:
...
Code Block |
---|
|
/usr/bin/cTAKES-2.5 |
...
Image Removed|
...
Code Block |
---|
|
export UIMA_HOME=/usr/bin/cTAKES-2.5
|
...
Image Removed
...
5. Edit PATH. This will be used for any command line access to binaries.
Windows:
Right-click on Computer> Properties> Advanced> *Environment Variables*button. Edit the *Path*environment variable adding *;<cTAKES_HOME>\bin*to the end. Keep clicking *OK*until you are out of the dialog series.
Linux:
Use the command export PATH=%PATH%:<cTAKES_HOME>/bin for example
Code Block |
---|
|
export PATH=%PATH%:/usr/bin/cTAKES-2.5/bin
|
...
Image Removed
...
Image Removed
Image Removed
...
Image Removed
...
Image Removed
...
Image Removed
...
Image Removed
Compile with commands only
Info |
---|
The UIMA command to generate the type system through the command line (JCasGen) is not shipped with cTAKES at this time. The common type system has already been generated for you. If you need, for some reason, to generate this then you will need to use the method described for Eclipse or install the entire UIMA SDK. |
...
Step
...
Example
...
1. Follow the first steps of the "Compile the latest stable release in Eclipse" (which do not require Eclipse) until you get to the part about creating a new project in Eclipse.
...
No example
...
No example
3. To compile cTAKES, change to the <cTAKES_HOME> directory and simply run:
...
No example
Compile a development release from SVN in Eclipse
If you know what you are doing with the cTAKES code and you must get the latest code currently under development (potentially unstable), then you need to use an SVN connection to retrieve the code.
...
Step
...
Example
...
1. Install subversion or a suitable plug-in for your IDE.
...
No example
2. Check-out the code to a local directory, such as:
...
Code Block |
---|
|
svn checkout https://ohnlp.svn.sourceforge.net/svnroot/ohnlp/trunk/cTAKES |
Note |
---|
If you are checking out via Subclipse in Eclipse, make sure to check out each project separately. |
...
Code Block |
---|
|
...
A cTAKES\PAD term spotter\desc\type_system
A cTAKES\PAD term spotter\desc\type_system\PADSiteAndTerm.xml
A cTAKES\PAD term spotter\desc\analysis_engine
A cTAKES\PAD term spotter\desc\analysis_engine\Radiology_TermSpotterAnnotatorTAEStyleMap.xml
A cTAKES\PAD term spotter\desc\analysis_engine\Radiology_TermSpotterAnnotatorTAE.xml
A cTAKES\PAD term spotter\desc\analysis_engine\DxStatusAnnotator.xml
A cTAKES\PAD term spotter\desc\analysis_engine\NegationDxAnnotator.xml
A cTAKES\PAD term spotter\desc\analysis_engine\PAD_Hit.xml
A cTAKES\PAD term spotter\desc\analysis_engine\SubSectionBoundaryAnnotator.xml
A cTAKES\PAD term spotter\desc\analysis_engine\Radiology_DictionaryLookupCSVAnnotator.xml
A cTAKES\PAD term spotter\desc\collection_processing_engine
A cTAKES\PAD term spotter\desc\collection_processing_engine\Radiology_sample.xml
A cTAKES\PAD term spotter\desc\collection_reader
A cTAKES\PAD term spotter\desc\collection_reader\RadiologyRecordsCollectionReader.xml
A cTAKES\PAD term spotter\.settings
A cTAKES\PAD term spotter\.settings\org.eclipse.jdt.ui.prefs
Checked out revision 667.
|
The minimal install instructions below are short but require a lot of prerequisite setup on your own. If you need more help then follow the step by step instructions. The step by step instructions for Eclipse assume a Windows or Ubuntu Linux install environment. You will need to extrapolate for any other environments.
Eclipse minimal install instructions
Prerequisites: Java JDK 1.6+, Eclipse IDE 4.2+, subversive plugin (or svn equivalent with appropriate SVN team provider connectors), m2e plugin (or mvn equivalent)
Info |
---|
The following location is the main trunk of cTAKES. See how cTAKES treats the trunk, branches, and tags in the developer FAQs. |
- Import Project > Maven > Checkout Maven Project from SCM and use: svn and https://svn.apache.org/repos/asf/ctakes/trunk
- Select all projects.
- Wait until Eclipse downloads and builds all of your projects (it may take up to 30 minutes depending on the machine).
- The various build helpers should run jcasgen and build the projects for you. There should not be any reason to run mvn install, etc.
- Merge the version-matching resources ZIP file from http://sourceforge.net/projects/ctakesresources/files/ into your ctakes-dictionary-lookup project.
- (Optional) If you would like to launch the UIMA CVD or CPE GUI, run ctakes-clinical-pipeline/resources/launch/UIMA_<CVD | CPE>GUI--clinical_documents pipeline.launch
- (Optional) UIMA plug-ins called "UIMA Eclipse tooling and runtime support" can be installed from update site: http://www.apache.org/dist/uima/eclipse-update-site
Eclipse step by step install instructions
Preparing Java
Include Page |
---|
| Java 1.6 Preparation |
---|
| Java 1.6 Preparation |
---|
|
Preparing Eclipse
If you are going to use Eclipse for development then follow these instructions.
Step | Example |
---|
1. Download and install Eclipse 4.2+.
(optional) It is recommended that you start a new workspace to keep your cTAKES projects separate from other work. | No example |
2. Subversion Eclipse plug-in (based on Subversive site). We will use the one called "Subversive - SVN Team Provider"
Help -> Install New Software... Select the current Eclipse release (in the case of Eclipse 4 it is Juno) for the "Work with" field:
Code Block |
---|
| Juno - http://download.eclipse.org/releases/juno |
Expand the Collaboration category. Select "Subversive SVN Team Provider". Click Next. Click Next. Agree to the license agreement. Click Finish. Restart Eclipse. | Image Added |
3. Subversion team provider connectors 1.7+.
After restarting Eclipse a dialog will ask you to install an SVN Kit for the Team Provider. Select SVN Kit 1.7 or greater. Click Finish. Click Next. Click Next. Agree to the license agreement. Click Finish. Accept the security warning (if one appears). Restart Eclipse. | Image Added Image Added |
4. Maven is already part of Eclipse, but more integration to Maven commands is needed.
Help -> Install New Software... Select the current Eclipse release (in the case of Eclipse 4 it is Juno) for the "Work with" field:
Code Block |
---|
| Juno - http://download.eclipse.org/releases/juno |
Expand the Collaboration category. Select "m2e - Maven Integration for Eclipse". Click Next. Click Finish. Restart Eclipse. | Image Added |
5. Maven SCM connector.
File -> Import ... -> Maven -> Check out Maven Projects from SCM. Click Next. Click the "m2e Marketplace" link. You are now in the m2e Marketplace not the Eclipse Marketplace. Scroll to and select m2e-subversive. Click Finish. Leave the defaults to install the "Maven SCM Handler for Subversive" and click Next. Click Next. Accept the license agreement. Click Finish. Accept the security warning (if one appears). Restart Eclipse. | Image Added Image Added |
Compile a release in Eclipse
Step | Example |
---|
1. Import the cTAKES projects using Maven.
File -> Import ... -> Maven -> Check out Maven Projects from SCM. Click Next. | Image Added |
Info |
---|
The following location is the main trunk of cTAKES. See how cTAKES treats the trunk, branches, and tags in the developer FAQs. |
2. For SCM URL use "svn" in the drop-down and this in the text field Code Block |
---|
| https://svn.apache.org/repos/asf/ctakes/trunk |
Click Finish. Eclipse will download and build all of the cTAKES sub-projects including running jcasgen as needed. Info |
---|
Due to the way Maven and Eclipse work together you will see two copies of the sub-projects in Eclipse. If you look into your workspace directories there is only one set of underlying files. |
| Image Added |
3. Download cTAKES 3.0 Dictionaries and models. Info |
---|
Due to licensing considerations and easy of installability, one download from an external location was established with all the resources you will need. Licensing for these resources is found within the download. |
Info |
---|
Download time will be commensurate with 1GB of data. |
Windows: Go to cTAKES resources and download the ZIP file with a matching version from the ctakesresources project. Unzip the files into a temporary location such as C:\temp.
Linux: Obtain the URL of the version matching ZIP file from cTAKES resources, get the file, and unzip to a temporary location.
| Windows: Image Added
Linux: Code Block |
---|
| cd /tmp
wget http://sourceforge.net/projects/ctakesresources/files/ctakes-resources-3.0.1.zip
sudo unzip ctakes-resources-3.0.1.zip |
|
4. Copy (or move) the resources to cTAKES_HOME. With Eclipse, cTAKES_HOME will be your workspace location followed by the project name "ctakes". Copy the contents of the temporary resources directory (and all sub-directories) to <cTAKES_HOME>/ctakes-dictionary-lookup/resources. Info |
---|
There may be conflicts while taking this action. Overwrite the cTAKES_HOME files with those in the resources download. |
| Windows: Code Block |
---|
| xcopy /s C:\temp\ctakes-resources-3.0.1\resources C:\Users\<userID>\workspace\ctakes\ctakes-dictionary-lookup\resources |
Linux: Code Block |
---|
| cp -R /tmp/resources/* <LINUX_ECLIPSE_WORKSPACE_HOME>/ctakes/ctakes-dictionary-lookup/resources |
|
5. Refresh Eclipse.
You must refresh your Eclipse projects to make sure that Eclipse knows about the new directories and data. | No example |
6. Add ctakes-dictionary-lookup/resources as a folder to the classpath. You will need to repeat this step for EACH project that needs access to those resources (nearly all of them do except for ctakes-relation-extractor). For example:
Open the properties on the top-level project ctakes-clinical-pipeline. For Eclipse, do not select the one under the ctakes project but the sibling to that at the highest level. Select Java Build Path -> Libraries tab -> Add Class Folder ... button. Select the resources directory under the ctakes-dictionary-lookup. Click OK on all dialogs until you are out of the sequence.
| Image Added |
7. UMLS user ID and password. Usually the dictionaries are required to process data. If you plan to utilize the UMLS dictionaries you must pass your UMLS user ID and password to the pipeline. There are several ways to do this - select one. Environment variable - Set or export environment variable
No Format |
---|
ctakes.umlsuser=<username>, ctakes.umlspw=<password>
|
Add the system properties to the Java arguments for a run configuration (shown in the next cell). Navigate to ctakes-clinical-pipeline -> resources -> launch > UIMA_<CVD | CPE>GUI--clinical_documents pipeline.launch. Right-click on the launch file and select Run-As -> Run Configurations... In the Arguments tab enter these parameters in the VM. Click Apply.
No Format |
---|
-Dctakes.umlsuser=<username> -Dctakes.umlspw=<password>
|
- Change the UMLSUser and UMLSPW <nameValuePair> strings in these descriptor files with your UMLS username and password.
| Image Added Image Added
|
Process documents using cTAKES
Step | Example |
---|
1. Launching the UIMA CAS Visual Debugger (CVD) or the Collection Processing Engine (CPE) from Eclipse can now be accomplished in the ctakes-clinical-pipeline project. Navigate to:
Code Block |
---|
| <cTAKES_HOME>
\ctakes-clinical-pipeline
\resources
\launch
\UIMA_<CVD | CPE>GUI--clinical_documents pipeline.launch |
where you must select between CVD and CPE in the command. Right-click on the launch file and select Run-As -> UIMA_<CVD | CPE>GUI-clinical_documents. Info |
---|
Note: Don't forget to set the location of the resources and your UMLS user ID and password if you have not done so for a particular project or run configuration. |
| Image Added
|
2. (Optional) Process data.
Now that you can launch the tools, the User Guide has steps to walk you through processing test data if you want to go there. Note |
---|
The User Guide describes files in the form that they came in the binary distributions. That form does not match the form when you checkout the code as we have done here.
For developers there is only one directory named "desc" in a path. When you use the User Guide you must ignore the first "desc" shown in a path. |
| No example |
(Optional) UIMA tools plug-in
Developers may be interested in the Eclipse plug-ins provided by the UIMA community. They include, for example, a UIMA component descriptor editor.
Step | Example |
---|
1. Find UIMA Eclipse plug-ins.
Help> Install New Software...> Add... button Set a repository name and this site location Code Block |
---|
| http://www.apache.org/dist/uima/eclipse-update-site
|
Click OK.
| Image Added |
2. Install UIMA Eclipse plug-ins.
Select the UIMA Eclipse tooling and runtime support. Click Next. Leave the defaults and click Next. Accept the license agreement. Click Finish. Restart Eclipse. | Image Added |
3. (optional) Verify the installation of the UIMA Plug-ins. Go to Help -> About Eclipse -> Installation Details -> Plug-ins. You will see a dialog such as that i the next cell with plug-in names starting with "UIMA Eclipse:". | Image Added |
Command line minimal install instructions
Prerequisites: Java JDK 1.6+, SVN, Maven 3.0+
Info |
---|
The following location is the main trunk of cTAKES. See how cTAKES treats the trunk, branches, and tags in the developer FAQs. |
- svn co https://svn.apache.org/repos/asf/ctakes/trunk ctakes-3.0
- mvn clean compile package
- Running the mvn package command will generate a binary distribution in /ctakes-distribution/target/ctakes-<release>-bin.tar.gz/zip
- Merge the version-matching resources ZIP file from http://sourceforge.net/projects/ctakesresources/files/ into your ctakes-dictionary-lookup project.
- (Optional) If you would like to launch the UIMA CVD or CPE GUI
- with MAVEN_OPTS="-Xmx2g -Xms1g" run mvn -PrunCVD compile
For further information see the Apache Source Code Repository page.
Command line step by step install instructions
Preparing Java
Include Page |
---|
| Java 1.6 Preparation |
---|
| Java 1.6 Preparation |
---|
|
Preparing command line tools
Step | Example |
---|
1. Install an SVN client. Info |
---|
Run this command to check the setup. svn --version |
Windows: We used a binary package of Apache Subversion from SlikSVN. Apache has a larger list here. Download and run the MSI file. Click Next. Accept the license agreement. Click Next. Click Next. Select Typical. Select Install. Wait Click Finish. Exit and restart any command prompts that were open.
Linux:
Code Block |
---|
| sudo apt-get install subversion |
| Windows: Image Added
Linux:
Code Block |
---|
| The following extra packages will be installed:
libapr1 libaprutil1 libneon27-gnutls libsvn1
Suggested packages:
subversion-tools db4.8-util
The following NEW packages will be installed:
libapr1 libaprutil1 libneon27-gnutls libsvn1 subversion
...
Processing triggers for python-central ...
Processing triggers for man-db ...
Setting up libapr1 (1.4.2-7ubuntu2) ...
Setting up libaprutil1 (1.3.9+dfsg-5ubuntu3) ...
Setting up libneon27-gnutls (0.29.5-3) ...
Setting up libsvn1 (1.6.12dfsg-4ubuntu2) ...
Setting up subversion (1.6.12dfsg-4ubuntu2) ... |
|
2. Install a Maven 3.0+ client.
Windows: We downloaded Apache Maven 3.0.4 from http://archive.apache.org/dist/maven/binaries/ Unzip the file to the root drive. C:\apache-maven-3.0.4 will be your MAVEN_HOME.
Linux: We downloaded Apache Maven 3.0.4 from http://archive.apache.org/dist/maven/binaries/ Unzip the file to /usr/local/apache-maven-3.0.4 which will be your MAVEN_HOME.
Code Block |
---|
| cd /tmp
wget http://archive.apache.org/dist/maven/binaries/apache-maven-3.0.4-bin.tar.gz tar -xvf apache-maven-3.0.4-bin.tar.gz -C /usr/local |
| Windows: Image Added
Linux: Image Added |
3. Set the Maven environment variable values - M2_HOME=<MAVEN_HOME> M2=<MAVEN_HOME>/bin PATH=<existing Path>;<MAVEN_HOME> where MAVEN_HOME is the path you unzipped to. Info |
---|
Run this command to check the setup. mvn --version |
Windows: Right-click on Computer > Properties > Advanced System Settings > Advanced tab > Environment Variables button > New button for User variables. Once the values are entered click OK until you are out of the dialog series.
Linux:
Code Block |
---|
| export M2_HOME=/usr/local/apache-maven-3.0.4
export M2=$M2_HOME/bin
export PATH=$PATH:$M2 |
Note |
---|
It is left to the user to make environment variables permanent in their development environment. |
| Windows: Image Added Image Added Image Added
Linux:
Code Block |
---|
| tbleeker@system:~$ export
...
declare -x M2="/usr/local/apache-maven-3.0.4/bin"
declare -x M2_HOME="/usr/local/apache-maven-3.0.4"
declare -x PATH="/usr/lib/jvm/java-6-sun/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/apache-maven-3.0.4/bin" |
|
Compile a release from command line
Step | Example |
---|
1. Checkout the cTAKES project. Info |
---|
The following location is the main trunk of cTAKES. See how cTAKES treats the trunk, branches, and tags in the developer FAQs. |
Windows/Linux:
Code Block |
---|
| cd /
svn co https://svn.apache.org/repos/asf/ctakes/trunk cTAKES-3.0 |
The parameter on the end will be created as a new directory in your current location. Note |
---|
For Linux, make sure that you have write access to the directory that you are in. |
We will refer to the directory you specify at the end of the checkout command as <cTAKES_HOME>. | Windows: Code Block |
---|
| C:\>cd /
C:\>svn co https://svn.apache.org/repos/asf/ctakes/trunk cTAKES-3.0
...
A ctakes-3.0\ctakes-type-system\pom.xml
A ctakes-3.0\ctakes-type-system\.settings
A ctakes-3.0\ctakes-type-system\.settings\org.eclipse.jdt.core.prefs
A ctakes-3.0\ctakes-type-system\.settings\org.eclipse.core.resources.prefs
A ctakes-3.0\ctakes-type-system\desc
A ctakes-3.0\DISCLAIMER
Checked out revision 1433729.
C:\>cd cTAKES-3.0
C:\cTAKES-3.0> |
Linux: Code Block |
---|
| tbleeker@system:~$ cd /
tbleeker@system:/$ svn co https://svn.apache.org/repos/asf/ctakes/trunk cTAKES-3.0
...
A ctakes-3.0/ctakes-type-system/pom.xml
A ctakes-3.0/ctakes-type-system/.settings
A ctakes-3.0/ctakes-type-system/.settings/org.eclipse.jdt.core.prefs
A ctakes-3.0/ctakes-type-system/.settings/org.eclipse.core.resources.prefs
A ctakes-3.0/ctakes-type-system/desc
A ctakes-3.0/DISCLAIMER
Checked out revision 1434842.
tbleeker@system:/$ cd cTAKES-3.0/
tbleeker@system:/cTAKES-3.0$ |
|
2. Download cTAKES 3.0 Dictionaries and models. Info |
---|
Due to licensing considerations and easy of installability, one download from an external location was established with all the resources you will need. Licensing for these resources is found within the download. |
Info |
---|
Download time will be commensurate with 1GB of data. |
Windows: Go to cTAKES resources and download the ZIP file with a matching version from the ctakesresources project. Unzip the files into a temporary location such as C:\temp.
Linux: Obtain the URL of the version matching ZIP file from cTAKES resources, get the file, and unzip to a temporary location.
| Windows: Image Added
Linux: Code Block |
---|
| cd /tmp
wget http://sourceforge.net/projects/ctakesresources/files/ctakes-resources-3.0.1.zip
sudo unzip ctakes-resources-3.0.1.zip |
|
3. Copy (or move) the resources to cTAKES_HOME. Copy the contents of the temporary resources directory (and all sub-directories) to <cTAKES_HOME>/ctakes-dictionary-lookup/resources. Info |
---|
There may be conflicts while taking this action. Overwrite the cTAKES_HOME files with those in the resources download. |
| Windows: Code Block |
---|
| xcopy /s C:\temp\ctakes-resources-3.0.1\resources C:\cTAKES-3.0\ctakes-dictionary-lookup\resources |
Linux: Code Block |
---|
| sudo cp -R /tmp/resources/* /cTAKES-3.0/ctakes-dictionary-lookup/resources |
|
4. Compile the complete set.
Make sure you are in the proper directory.
Windows/Linux: Code Block |
---|
| cd cTAKES-3.0
mvn clean compile |
Note |
---|
For Linux, make sure you are using the user that has access to the files in your cTAKES directory. |
Info |
---|
Instead of "compile" you can use the maven target called "package" to compile and build all the cTAKES deliverables. Package is convenient in situations like running cTAKES outside of maven with custom processes/scripts because it will bundle up all of the 3rd party and transient dependencies.
|
| Windows/Linux: Code Block |
---|
| ...
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO]
[INFO] Apache cTAKES ..................................... SUCCESS [59.140s]
[INFO] Apache cTAKES common type system .................. SUCCESS [41.856s]
[INFO] Apache cTAKES utils ............................... SUCCESS [6.255s]
[INFO] Apache cTAKES core ................................ SUCCESS [17.940s]
[INFO] Apache cTAKES part-of-speech tagger ............... SUCCESS [5.148s]
[INFO] Apache cTAKES chunker ............................. SUCCESS [3.027s]
[INFO] Apache cTAKES document preprocessor ............... SUCCESS [4.118s]
[INFO] Apache cTAKES dictionary lookup ................... SUCCESS [1:14.740s]
[INFO] Apache cTAKES context dependent tokenizer ......... SUCCESS [5.975s]
[INFO] Apache cTAKES LVG lexical tools ................... SUCCESS [7.831s]
[INFO] Apache cTAKES named entity contexts ............... SUCCESS [4.743s]
[INFO] Apache cTAKES Constituency Parser ................. SUCCESS [9.516s]
[INFO] Apache cTAKES Dependency Parser ................... SUCCESS [32.386s]
[INFO] Apache cTAKES Assertion's zoner ................... SUCCESS [2.152s]
[INFO] Apache cTAKES Assertion ........................... SUCCESS [12.200s]
[INFO] Apache cTAKES ctakes-clinical-pipeline ............ SUCCESS [4.446s]
[INFO] Apache cTAKES Relation Extractor .................. SUCCESS [13.634s]
[INFO] Apache cTAKES CoReference Resolver ................ SUCCESS [8.923s]
[INFO] Apache cTAKES Drug NER ............................ SUCCESS [6.958s]
[INFO] Apache cTAKES Side Effects ........................ SUCCESS [7.566s]
[INFO] Apache cTAKES Smoking Status ...................... SUCCESS [8.377s]
[INFO] Apache cTAKES Pad Term Spotter .................... SUCCESS [9.048s]
[INFO] Apache cTAKES Temporal Information Extraction ..... SUCCESS [33.993s]
[INFO] Apache cTAKES Distribution ........................ SUCCESS [17:59.809s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 24:22.120s
[INFO] Finished at: Wed Jan 16 17:44:35 CST 2013
[INFO] Final Memory: 41M/181M
[INFO] ------------------------------------------------------------------------
... |
|
5. Add the resources as a folder to the classpath. Make sure the current path or dot (.) is in your CLASSPATH environment variable accessible to the process maven is running in. | No example |
6. UMLS user ID and password. Usually the dictionaries are required to process data. If you plan to utilize the UMLS dictionaries you must pass your UMLS user ID and password to the pipeline. There are several ways to do this - select one. - Environment variable - Set or export environment variables.
Refer to the Eclipse documentation above for more information. Add the system properties to the Java arguments for the maven environment. Add these parameters to the MAVEN_OPTS environment variable in the next section as you run the commands to process documents. No Format |
---|
-Dctakes.umlsuser=<username> -Dctakes.umlspw=<password>
|
Make the ID and password specific to you. - Change the UMLSUser and UMLSPW <nameValuePair> strings in these descriptor files with your UMLS username and password.
Refer to the Eclipse documentation above for more information.
| No example |
Process documents using cTAKES
Step | Example |
---|
1. Launching the UIMA CAS Visual Debugger (CVD) or the Collection Processing Engine (CPE) can now be accomplished in the ctakes-clinical-pipeline project:
Windows: Code Block |
---|
| cd \cTAKES-3.0
set MAVEN_OPTS=-Xmx2g -Xms1g
mvn -Prun<CVD|CPE> compile |
Linux: Code Block |
---|
| cd /cTAKES-3.0
export MAVEN_OPTS="-Xmx2g -Xms1g"
mvn -Prun<CVD|CPE> compile |
where you must select between CVD and CPE in the command. | Image Added |
2. (Optional) Process data.
Now that you can launch the tools, the User Guide has steps to walk you through processing test data if you want to go there. Note |
---|
The User Guide describes files in the form that they came in the binary distributions. That form does not match the form when you checkout the code as we have done here.
For developers there is only one directory named "desc" in a path. When you use the User Guide you must ignore the first "desc" shown in a path. |
| No example |
Next Steps
The cTAKES 3.0 Component Use Guide will help you to understand, in great detail, each of the cTAKES components that have been installed. In some cases you can learn how to improve the components.
Also, before you go on to process text in production you will need to consider dictionaries and models. cTAKES does not distribute from Apache a complete dictionary capable of annotating production data. The models provided have been trained on data that may not match your data well enough to be effective. In most cases, you will need to modify the dictionaries and train models on your own data to be effective.
...
Note |
---|
It may be necessary to import each project one by one! |
...
Image Removed
...
4. Install UIMA 2.4. Make note of UIMA_HOME.
...
No example
...
5. Add UIMA JARs to the build path configuration in Eclipse. Add all JARs from:
Code Block |
---|
<UIMA_HOME>/lib |
...
- jVinci.jar
- uima-adapter-soap.jar
- uima-adapter-vinci.jar
- uima-core.jar
- uima-cpe.jar
- uima-documentation-annotation.jar
- uima-examples.jar
- uima-bootstrap.jar
- uima-tools.jar
...
Image Removed
Process documents using cTAKES
You can now launch or debug the cTAKES components that you have built. You could run commands from a command prompt, as found in the user install instructions, but you can launch them from within Eclipse now instead. Launching the CAS Visual Debugger (CVD) and the Collection processing engine (CPE) from Eclipse is as simple as following this step.
...
Step
...
Example
...