BreakIteratorTokenizer |
A tokenizer implementation based on Java's BreakIterator class.
|
Lemmatizers |
Factory class for creating/retrieving lemmatizers for a given language.
|
PartOfSpeech |
Interface defining a part-of-speech.
|
PartOfSpeechConverter |
|
PennTreeBank |
Part-of-speech tags defined by the Penn Treebank.
|
StandardTokenizer |
This class is a scanner generated by JFlex 1.5.0-SNAPSHOT from the specification file /home/ik/prj/gengoai/hermes-pom/core/src/main/jflex/StandardTokenizer.jflex
|
Stemmers |
Factory class for creating/retrieving stemmers for a given language.
|
StopWords |
Defines how to determine whether an HString or String is a stopword for a given language.
|
StopWords.NoOptStopWords |
StopWords implementation that treats everything as a content word.
|
Tokenizer.Token |
An internal token
|
Tokenizers |
|
TokenType |
Defines the type for a given token.
|
UniversalFeatureSet |
|
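
The BreakIteratorTokenizer entry above refers to Java's standard java.text.BreakIterator. As a rough illustration of that underlying technique only, the sketch below splits a string at word boundaries and drops whitespace-only segments. The class name BreakIteratorSketch and the method tokenize are hypothetical and are not part of this package; the actual BreakIteratorTokenizer may behave differently (for example, in how it assigns token types or offsets).

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper for illustration; not the library's BreakIteratorTokenizer.
public class BreakIteratorSketch {

    // Splits text at the word boundaries reported by java.text.BreakIterator
    // and keeps every segment that is not purely whitespace.
    static List<String> tokenize(String text, Locale locale) {
        BreakIterator boundaries = BreakIterator.getWordInstance(locale);
        boundaries.setText(text);

        List<String> tokens = new ArrayList<>();
        int start = boundaries.first();
        for (int end = boundaries.next();
             end != BreakIterator.DONE;
             start = end, end = boundaries.next()) {
            String segment = text.substring(start, end);
            if (!segment.trim().isEmpty()) {
                tokens.add(segment);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints the tokens found in a short English sentence.
        System.out.println(tokenize("Hello world", Locale.ENGLISH));
    }
}
```

Passing a Locale matters because BreakIterator's boundary rules are locale-sensitive, which fits the per-language design suggested by the Stemmers, Lemmatizers, and StopWords factories listed above.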