Uses of Package
org.apache.lucene.analysis
Packages that use org.apache.lucene.analysis
Package
Description
Text analysis.
Fast, general-purpose grammar-based tokenizer
StandardTokenizer
implements the Word Break rules from the
Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.Codecs API: API for customization of the encoding and structure of the index.
The logical representation of a
Document
for indexing and
searching.Code to maintain and access indices.
Some utility classes.
Utility classes for working with token streams as graphs.
-
Classes in org.apache.lucene.analysis used by org.apache.lucene.analysisClassDescriptionAbstract parent class for analysis factories
TokenizerFactory
,TokenFilterFactory
andCharFilterFactory
.An Analyzer builds TokenStreams, which analyze text.Strategy defining how TokenStreamComponents are reused per call toAnalyzer.tokenStream(String, java.io.Reader)
.This class encapsulates the outer components of a token stream.Extension toAnalyzer
suitable for Analyzers which wrap other Analyzers.A simple IO buffer to use withCharacterUtils.fill(CharacterBuffer, Reader)
.A simple class that stores key Strings as char[]'s in a hash table.public iterator class so efficient methods are exposed to userspublic EntrySet class so efficient methods are exposed to usersA simple class that stores Strings as char[]'s in a hash table.Abstract parent class for analysis factories that createCharFilter
instances.Abstract base class for TokenFilters that may remove tokens.A TokenFilter is a TokenStream whose input is another TokenStream.Abstract parent class for analysis factories that createTokenFilter
instances.A Tokenizer is a TokenStream whose input is a Reader.Abstract parent class for analysis factories that createTokenizer
instances. -
Classes in org.apache.lucene.analysis used by org.apache.lucene.analysis.standardClassDescriptionAbstract parent class for analysis factories
TokenizerFactory
,TokenFilterFactory
andCharFilterFactory
.An Analyzer builds TokenStreams, which analyze text.This class encapsulates the outer components of a token stream.A simple class that stores Strings as char[]'s in a hash table.Base class for Analyzers that need to make use of stopword sets.A Tokenizer is a TokenStream whose input is a Reader.Abstract parent class for analysis factories that createTokenizer
instances. -
Classes in org.apache.lucene.analysis used by org.apache.lucene.codecsClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
Classes in org.apache.lucene.analysis used by org.apache.lucene.documentClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
Classes in org.apache.lucene.analysis used by org.apache.lucene.indexClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
Classes in org.apache.lucene.analysis used by org.apache.lucene.utilClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
Classes in org.apache.lucene.analysis used by org.apache.lucene.util.graphClassDescription