Uses of Package
org.apache.lucene.analysis
Package
Description
Text analysis.
Fast, general-purpose grammar-based tokenizer
StandardTokenizer
implements the Word Break rules from the
Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.Codecs API: API for customization of the encoding and structure of the index.
The logical representation of a
Document
for indexing and
searching.Code to maintain and access indices.
Some utility classes.
Utility classes for working with token streams as graphs.
-
ClassDescriptionAbstract parent class for analysis factories
TokenizerFactory
,TokenFilterFactory
andCharFilterFactory
.An Analyzer builds TokenStreams, which analyze text.Strategy defining how TokenStreamComponents are reused per call toAnalyzer.tokenStream(String, java.io.Reader)
.This class encapsulates the outer components of a token stream.Extension toAnalyzer
suitable for Analyzers which wrap other Analyzers.A simple IO buffer to use withCharacterUtils.fill(CharacterBuffer, Reader)
.A simple class that stores key Strings as char[]'s in a hash table.public iterator class so efficient methods are exposed to userspublic EntrySet class so efficient methods are exposed to usersA simple class that stores Strings as char[]'s in a hash table.Abstract parent class for analysis factories that createCharFilter
instances.Abstract base class for TokenFilters that may remove tokens.A TokenFilter is a TokenStream whose input is another TokenStream.Abstract parent class for analysis factories that createTokenFilter
instances.A Tokenizer is a TokenStream whose input is a Reader.Abstract parent class for analysis factories that createTokenizer
instances. -
ClassDescriptionAbstract parent class for analysis factories
TokenizerFactory
,TokenFilterFactory
andCharFilterFactory
.An Analyzer builds TokenStreams, which analyze text.This class encapsulates the outer components of a token stream.A simple class that stores Strings as char[]'s in a hash table.Base class for Analyzers that need to make use of stopword sets.A Tokenizer is a TokenStream whose input is a Reader.Abstract parent class for analysis factories that createTokenizer
instances. -
ClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
ClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
ClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
ClassDescriptionAn Analyzer builds TokenStreams, which analyze text.
-
ClassDescription