Package org.apache.lucene.analysis.util
Utility functions for text analysis.
-
Class Summary Class Description CharArrayIterator A CharacterIterator used internally for use withBreakIterator
CharTokenizer An abstract base class for simple, character-oriented tokenizers.ElisionFilter Removes elisions from aTokenStream
.ElisionFilterFactory Factory forElisionFilter
.FilesystemResourceLoader SimpleResourceLoader
that opens resource files from the local file system, optionally resolving against a base directory.OpenStringBuilder A StringBuilder that allows one to access the array.RollingCharBuffer Acts like a forever growing char[] as you read characters into it from the provided reader, but internally it uses a circular buffer to only hold the characters that haven't been freed yet.SegmentingTokenizerBase Breaks text into sentences with aBreakIterator
and allows subclasses to decompose these sentences into words.StemmerUtil Some commonly-used stemming functionsUnicodeProps This file contains unicode properties used by variousCharTokenizer
s.