SegTokenFilter
SegToken
CharType
constant of a given character.SegToken
representing the best segmentation of a sentenceSegToken
by converting full-width latin to half-width, then lowercasing latin.Set
of stopwords.SentenceTokenizer
WordTokenFilter
TokenFilter
that breaks sentences into words.WordType
of the textCopyright © 2000-2013 Apache Software Foundation. All Rights Reserved.