NLP is a combination of tools used to get useful information from natural language sources such as web pages and text documents.
Apache OpenNLP is an open-source Java library used to perform natural language text. Here you can build effective text processing service using this library.
OpenNLP offers various services like tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and co-reference resolution, etc.
Following are the notable features of OpenNLP −
The Apache OpenNLP library offers various classes and interfaces to perform various tasks of natural language processing such as sentence detection, tokenization, finding a name, tagging the parts of speech, chunking a sentence, parsing, co-reference resolution, and document categorization.
Addition to this you can also train and evaluate our own models for any of these tasks.
Other than above library, OpenNLP also provides a Command Line Interface (CLI), useful to train and evaluate models. Let’s discuss this topic in detail in the last chapter of this tutorial.
You can find different types of predefined models to perform various NLP tasks. This set may includes models for different languages.
Let’s follow below mentioned steps to download the predefined models provided by OpenNLP.
Step 1 − Open the index page of OpenNLP models by clicking the following link − http://opennlp.sourceforge.net/models-1.5/.
Step 2 – After licking on the given link, you will get to see a list of components of various languages and the links to download them. Here, you can get the list of all the predefined models provided by OpenNLP.
Now, download all these models to the folder C:/OpenNLP_models/>, by clicking on their respective links. While, all these models are language dependent and while using these. But make sure that the model language matches with the language of the input text.
All rights reserved © 2018 Wisdom IT Services India Pvt. Ltd
Wisdomjobs.com is one of the best job search sites in India.