openNLP command line – Part 2

openNLP command line – Part 2. openNLP provides a Command Line Interface (CLI) to carry out different operations through the command line. This is part 2 of the command line description. Part 1 is here

Contains

The Command Line

The missing parts

The Tokenizer

the Parser

Other NLP AI Contentserver Articles

Starting Page

Example Linguistic Features of NLP- Part1

Example Linguistic Features of NLP- Part2

The buildin NLP categorizer

Detecting the language

a name finder example

a name and location finder example

Content Server NLP application examples

Content Server Autocategorizer

The Command line

Type “openNLP” to see the complete list of commands

These are runnable jars and can be used directly from the command line.

The missing parts of openNLP command line – Part 2

The Tokenizer

The Tokenizer Group has 7 members

  • SimpleTokenizer – Character class tokenizer
  • TokenizerME – Learnable tokenizer
  • TokenizerTrainer – Trainer for the learnable tokenizer
  • TokenizerMEEvaluator – Measures the performance of the learnable tokenizer
  • TokenizerCrossValidator – K-fold cross validator for the learnable tokenizer
  • TokenizerConverter – Converts external data formats (irishsentencebank,ad,pos,masc,conllx,namefinder,parse,conllu) to native OpenNLP format
  • DictionaryDetokenizer – Normalizes tokenized text according to rules defined in a detokenizer dictionary

The Parser

The Parser Group has 7 members

Parser – Performs full syntactic parsing
ParserTrainer – Trains the learnable parser
ParserEvaluator – Measures the performance of the Parser model with the reference data
ParserConverter – Converts external data formats (ontonotes,frenchtreebank) to native OpenNLP format
BuildModelUpdater – Trains and updates the build model in a parser model
CheckModelUpdater – Trains and updates the check model in a parser model
TaggerModelReplacer – Replaces the tagger model in a parser model