Package org.apache.lucene.analysis.util
package org.apache.lucene.analysis.util
Utility functions for text analysis.
-
ClassDescriptionA CharacterIterator used internally for use with
BreakIterator
An abstract base class for simple, character-oriented tokenizers.Removes elisions from aTokenStream
.Factory forElisionFilter
.SimpleResourceLoader
that opens resource files from the local file system, optionally resolving against a base directory.A StringBuilder that allows one to access the array.Acts like a forever growing char[] as you read characters into it from the provided reader, but internally it uses a circular buffer to only hold the characters that haven't been freed yet.Breaks text into sentences with aBreakIterator
and allows subclasses to decompose these sentences into words.Some commonly-used stemming functionsThis file contains unicode properties used by variousCharTokenizer
s.