Provides various convenience classes for creating payloads on Tokens.
- org.apache.lucene.analysis.position - package org.apache.lucene.analysis.position
- Filter for assigning position increments.
- org.apache.lucene.analysis.pt - package org.apache.lucene.analysis.pt
- Analyzer for Portuguese.
- org.apache.lucene.analysis.query - package org.apache.lucene.analysis.query
- Automatically filter high-frequency stopwords.
- org.apache.lucene.analysis.reverse - package org.apache.lucene.analysis.reverse
- Filter to reverse token text.
- org.apache.lucene.analysis.ro - package org.apache.lucene.analysis.ro
- Analyzer for Romanian.
- org.apache.lucene.analysis.ru - package org.apache.lucene.analysis.ru
- Analyzer for Russian.
- org.apache.lucene.analysis.shingle - package org.apache.lucene.analysis.shingle
- Word n-gram filters
- org.apache.lucene.analysis.sinks - package org.apache.lucene.analysis.sinks
- Implementations of the SinkTokenizer that might be useful.
- org.apache.lucene.analysis.snowball - package org.apache.lucene.analysis.snowball
- TokenFilter and Analyzer implementations that use Snowball stemmers.
- org.apache.lucene.analysis.sv - package org.apache.lucene.analysis.sv
- Analyzer for Swedish.
- org.apache.lucene.analysis.th - package org.apache.lucene.analysis.th
- Analyzer for Thai.
- org.apache.lucene.analysis.tr - package org.apache.lucene.analysis.tr
- Analyzer for Turkish.
- org.apache.lucene.analysis.util - package org.apache.lucene.analysis.util
-
- org.apache.lucene.analysis.wikipedia - package org.apache.lucene.analysis.wikipedia
- Tokenizer that is aware of Wikipedia syntax.
- org.tartarus.snowball - package org.tartarus.snowball
-
- org.tartarus.snowball.ext - package org.tartarus.snowball.ext
-
- out_grouping(char[], int, int) -
Method in class org.tartarus.snowball.SnowballProgram
-
- out_grouping_b(char[], int, int) -
Method in class org.tartarus.snowball.SnowballProgram
-
- out_range(int, int) -
Method in class org.tartarus.snowball.SnowballProgram
-
- out_range_b(int, int) -
Method in class org.tartarus.snowball.SnowballProgram
-
P
- packValues(String) -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- Packs the values by storing them in 4 bits each, two values per byte. The
value range is 0 to 9.
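The 4-bit packing described for packValues(String) can be illustrated with a small standalone sketch. It is not the HyphenationTree code: the class name NibblePackingSketch, the high-nibble-first order, and the choice to return a fresh byte array are assumptions made only for illustration.

```java
// Illustrative sketch only; not the HyphenationTree.packValues implementation.
final class NibblePackingSketch {

    // Pack a string of decimal digits, two digits per byte (high nibble first).
    static byte[] pack(String digits) {
        byte[] out = new byte[(digits.length() + 1) / 2];
        for (int i = 0; i < digits.length(); i++) {
            int v = digits.charAt(i) - '0';
            if (v < 0 || v > 9) {
                throw new IllegalArgumentException("not a digit: " + digits.charAt(i));
            }
            if ((i & 1) == 0) {
                out[i / 2] = (byte) (v << 4); // even index: upper 4 bits
            } else {
                out[i / 2] |= (byte) v;       // odd index: lower 4 bits
            }
        }
        return out;
    }

    // Recover the first `count` digits from a packed array.
    static String unpack(byte[] packed, int count) {
        StringBuilder sb = new StringBuilder(count);
        for (int i = 0; i < count; i++) {
            int b = packed[i / 2] & 0xFF;
            int v = ((i & 1) == 0) ? (b >>> 4) : (b & 0x0F);
            sb.append((char) ('0' + v));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        byte[] packed = pack("01234");
        System.out.println(packed.length + " bytes hold " + unpack(packed, 5));
    }
}
```

Because every value fits in the range 0 to 9, each one needs at most 4 bits, so five values occupy three bytes instead of five.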
- parse(String) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- Parses a hyphenation pattern file.
- parse(File) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- Parses a hyphenation pattern file.
- parse(InputSource) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- Parses a hyphenation pattern file.
- parse(Class<? extends RSLPStemmerBase>, String) -
Static method in class org.apache.lucene.analysis.pt.RSLPStemmerBase
- Parse a resource file into an RSLP stemmer description.
- PathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
- Take something like:
- PathHierarchyTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PatternAnalyzer - Class in org.apache.lucene.analysis.miscellaneous
- Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a Reader, that can flexibly separate text into terms via a regular expression Pattern (with behaviour identical to String.split(String)), and that combines the functionality of LetterTokenizer, LowerCaseTokenizer, WhitespaceTokenizer, StopFilter into a single efficient multi-purpose class.
- PatternAnalyzer(Version, Pattern, boolean, Set<?>) -
Constructor for class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
- Constructs a new instance with the given parameters.
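The combination that the PatternAnalyzer entry describes (regex splitting, optional lower-casing, stop-word removal) can be approximated with the JDK alone. The sketch below is not PatternAnalyzer itself: the class PatternSplitSketch, its analyze method, and the decision to drop empty pieces are assumptions for illustration, and the parameters only loosely mirror the (Pattern, boolean, Set) constructor arguments.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Pattern;

// Illustrative sketch only; it does not use or reproduce Lucene's PatternAnalyzer.
final class PatternSplitSketch {

    // Split text on the pattern, optionally lower-case, then drop stop words.
    static List<String> analyze(String text, Pattern pattern,
                                boolean toLowerCase, Set<String> stopWords) {
        List<String> terms = new ArrayList<>();
        for (String piece : pattern.split(text)) {
            if (piece.isEmpty()) {
                continue; // assumption: empty pieces are not useful as terms
            }
            String term = toLowerCase ? piece.toLowerCase(Locale.ROOT) : piece;
            if (stopWords != null && stopWords.contains(term)) {
                continue;
            }
            terms.add(term);
        }
        return terms;
    }

    public static void main(String[] args) {
        Set<String> stop = new java.util.HashSet<>(java.util.Arrays.asList("the"));
        System.out.println(analyze("The quick, brown fox", Pattern.compile("\\W+"), true, stop));
    }
}
```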
- PatternConsumer - Interface in org.apache.lucene.analysis.compound.hyphenation
- This interface is used to connect the XML pattern file parser to the
hyphenation tree.
- PatternParser - Class in org.apache.lucene.analysis.compound.hyphenation
- A SAX document handler to read and parse hyphenation patterns from an XML
file.
- PatternParser() -
Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PatternParser(PatternConsumer) -
Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PayloadEncoder - Interface in org.apache.lucene.analysis.payloads
- Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to Payload.
- PayloadHelper - Class in org.apache.lucene.analysis.payloads
- Utility methods for encoding payloads.
- PayloadHelper() -
Constructor for class org.apache.lucene.analysis.payloads.PayloadHelper
-
- permutationIterator() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
- Deprecated.
- PersianAnalyzer - Class in org.apache.lucene.analysis.fa
- Analyzer for Persian.
- PersianAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Builds an analyzer with the default stop words: PersianAnalyzer.DEFAULT_STOPWORD_FILE.
- PersianAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Builds an analyzer with the given stop words
- PersianAnalyzer(Version, String...) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Deprecated. Use PersianAnalyzer.PersianAnalyzer(Version, Set) instead.
- PersianAnalyzer(Version, Hashtable<?, ?>) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Deprecated. Use PersianAnalyzer.PersianAnalyzer(Version, Set) instead.
- PersianAnalyzer(Version, File) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Deprecated. Use PersianAnalyzer.PersianAnalyzer(Version, Set) instead.
- PersianCharFilter - Class in org.apache.lucene.analysis.fa
- CharFilter that replaces instances of Zero-width non-joiner with an
ordinary space.
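The substitution described for PersianCharFilter amounts to rewriting one character in a buffer. The following is a minimal sketch of that idea, not the filter's source: the class ZwnjToSpaceSketch and its replaceInPlace method are hypothetical names, and the method works on an already-filled char array rather than wrapping a CharStream.

```java
// Illustrative sketch only; not the PersianCharFilter implementation.
final class ZwnjToSpaceSketch {

    // U+200C ZERO WIDTH NON-JOINER
    static final char ZWNJ = '\u200C';

    // Replace every ZWNJ in buf[off .. off+len) with a space; return how many were replaced.
    static int replaceInPlace(char[] buf, int off, int len) {
        int replaced = 0;
        for (int i = off; i < off + len; i++) {
            if (buf[i] == ZWNJ) {
                buf[i] = ' ';
                replaced++;
            }
        }
        return replaced;
    }

    public static void main(String[] args) {
        char[] buf = ("a" + ZWNJ + "b").toCharArray();
        int n = replaceInPlace(buf, 0, buf.length);
        System.out.println(n + " replaced: [" + new String(buf) + "]");
    }
}
```

Since one character is swapped for exactly one other character, offsets into the text stay unchanged, which keeps the replacement simple.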
- PersianCharFilter(CharStream) -
Constructor for class org.apache.lucene.analysis.fa.PersianCharFilter
-
- PersianNormalizationFilter - Class in org.apache.lucene.analysis.fa
- A TokenFilter that applies PersianNormalizer to normalize the orthography.
- PersianNormalizationFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilter
-
- PersianNormalizer - Class in org.apache.lucene.analysis.fa
- Normalizer for Persian.
- PersianNormalizer() -
Constructor for class org.apache.lucene.analysis.fa.PersianNormalizer
-
- PorterStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- PorterStemmer() -
Constructor for class org.tartarus.snowball.ext.PorterStemmer
-
- PortugueseAnalyzer - Class in org.apache.lucene.analysis.pt
- Analyzer for Portuguese.
- PortugueseAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
- Builds an analyzer with the default stop words: PortugueseAnalyzer.DEFAULT_STOPWORD_FILE.
- PortugueseAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
- Builds an analyzer with the given stop words.
- PortugueseAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
- Builds an analyzer with the given stop words.
- PortugueseLightStemFilter - Class in org.apache.lucene.analysis.pt
- A TokenFilter that applies PortugueseLightStemmer to stem Portuguese words.
- PortugueseLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
-
- PortugueseLightStemmer - Class in org.apache.lucene.analysis.pt
- Light Stemmer for Portuguese
- PortugueseLightStemmer() -
Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- PortugueseMinimalStemFilter - Class in org.apache.lucene.analysis.pt
- A TokenFilter that applies PortugueseMinimalStemmer to stem Portuguese words.
- PortugueseMinimalStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
-
- PortugueseMinimalStemmer - Class in org.apache.lucene.analysis.pt
- Minimal Stemmer for Portuguese
- PortugueseMinimalStemmer() -
Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- PortugueseStemFilter - Class in org.apache.lucene.analysis.pt
- A TokenFilter that applies PortugueseStemmer to stem Portuguese words.
- PortugueseStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilter
-
- PortugueseStemmer - Class in org.apache.lucene.analysis.pt
- Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa)
algorithm.
- PortugueseStemmer() -
Constructor for class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- PortugueseStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- PortugueseStemmer() -
Constructor for class org.tartarus.snowball.ext.PortugueseStemmer
-
- PositionFilter - Class in org.apache.lucene.analysis.position
- Sets the positionIncrement of all tokens to the configured "positionIncrement" value,
except the first returned token, which retains its original positionIncrement value.
- PositionFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.position.PositionFilter
- Constructs a PositionFilter that assigns a position increment of zero to
all but the first token from the given input stream.
- PositionFilter(TokenStream, int) -
Constructor for class org.apache.lucene.analysis.position.PositionFilter
- Constructs a PositionFilter that assigns the given position increment to
all but the first token from the given input stream.
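The effect described for PositionFilter can be modelled on a plain array of position increments. This is a sketch under stated assumptions rather than the filter's code: PositionIncrementSketch and apply are hypothetical names, and the token stream is reduced to an int array holding one increment per token.

```java
import java.util.Arrays;

// Illustrative sketch only; models PositionFilter's documented behaviour on an int array.
final class PositionIncrementSketch {

    // Keep the first increment as-is; every later token gets `positionIncrement`.
    static int[] apply(int[] originalIncrements, int positionIncrement) {
        int[] out = new int[originalIncrements.length];
        for (int i = 0; i < out.length; i++) {
            out[i] = (i == 0) ? originalIncrements[0] : positionIncrement;
        }
        return out;
    }

    public static void main(String[] args) {
        // With an increment of zero, all tokens after the first share the first token's position.
        System.out.println(Arrays.toString(apply(new int[] {1, 1, 1}, 0)));
    }
}
```

An increment of zero corresponds to the one-argument constructor described above, where every token after the first is stacked on the same position.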
- postBreak -
Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- preBreak -
Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- PrefixAndSuffixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
- Links two PrefixAwareTokenFilter instances.
- PrefixAndSuffixAwareTokenFilter(TokenStream, TokenStream, TokenStream) -
Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- PrefixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
- Joins two token streams and leaves the last token of the first stream available
to be used when updating the token values in the second stream based on that token.
- PrefixAwareTokenFilter(TokenStream, TokenStream) -
Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- prefixes -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- printStats() -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
- printStats() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- PUA_EC00_MARKER -
Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Example marker character: U+EC00 (PRIVATE USE AREA: EC00)
- put(int, byte) -
Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
- put(int, char) -
Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
Q
- QueryAutoStopWordAnalyzer - Class in org.apache.lucene.analysis.query
- An Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries.
- QueryAutoStopWordAnalyzer(Version, Analyzer) -
Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- Initializes this analyzer with the Analyzer object that actually produces the tokens
R
- read(char[], int, int) -
Method in class org.apache.lucene.analysis.fa.PersianCharFilter
-
- readToken(StringBuffer) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- replace(char[], int) -
Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- replace_s(int, int, CharSequence) -
Method in class org.tartarus.snowball.SnowballProgram
-
- replace_s(int, int, String) -
Method in class org.tartarus.snowball.SnowballProgram
- Deprecated. for binary back compat. Will be removed in Lucene 4.0
- replacement -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- reserve(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() -
Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
- Deprecated.
- reset(Reader) -
Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
- Deprecated.
- reset() -
Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
- reset() -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- reset() -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- reset() -
Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.position.PositionFilter
-
- reset() -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
- reset() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- reset() -
Method in class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
-
- reset() -
Method in class org.apache.lucene.analysis.th.ThaiWordFilter
-
- reset() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() -
Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- resize(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- resolveEntity(String, String) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- result -
Variable in class org.tartarus.snowball.Among
-
- reusableTokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
- reusableTokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- reusableTokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Returns a (possibly reused) StandardTokenizer filtered by a StandardFilter, a LowerCaseFilter, a StopFilter, and a SnowballFilter
- reverse(String) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. Use ReverseStringFilter.reverse(Version, String) instead. This method will be removed in Lucene 4.0
- reverse(Version, String) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Reverses the given input string
- reverse(char[]) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. Use ReverseStringFilter.reverse(Version, char[]) instead. This method will be removed in Lucene 4.0
- reverse(Version, char[]) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Reverses the given input buffer in-place
- reverse(char[], int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. Use ReverseStringFilter.reverse(Version, char[], int) instead. This method will be removed in Lucene 4.0
- reverse(Version, char[], int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Partially reverses the given input buffer in-place from offset 0
up to the given length.
- reverse(char[], int, int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. Use ReverseStringFilter.reverse(Version, char[], int, int) instead. This method will be removed in Lucene 4.0
- reverse(Version, char[], int, int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Partially reverses the given input buffer in-place from the given offset
up to the given length.
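The in-place partial reversal described by the reverse(...) entries can be sketched with a two-pointer swap. This is not the ReverseStringFilter source: PartialReverseSketch is a hypothetical class, it takes no Version argument, and it swaps individual UTF-16 code units, so it makes no attempt to keep surrogate pairs intact.

```java
// Illustrative sketch only; not the ReverseStringFilter.reverse implementation.
final class PartialReverseSketch {

    // Reverse buffer[start .. start+len) in place by swapping from both ends inward.
    static void reverse(char[] buffer, int start, int len) {
        int i = start;
        int j = start + len - 1;
        while (i < j) {
            char tmp = buffer[i];
            buffer[i] = buffer[j];
            buffer[j] = tmp;
            i++;
            j--;
        }
    }

    // Reverse the first `len` characters, i.e. from offset 0 up to the given length.
    static void reverse(char[] buffer, int len) {
        reverse(buffer, 0, len);
    }

    public static void main(String[] args) {
        char[] buf = "country".toCharArray();
        reverse(buf, buf.length);
        System.out.println(new String(buf)); // prints "yrtnuoc"
    }
}
```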
- ReversePathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
- Take something like:
- ReversePathHierarchyTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReverseStringFilter - Class in org.apache.lucene.analysis.reverse
- Reverse token string, for example "country" => "yrtnuoc".
- ReverseStringFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. Use ReverseStringFilter.ReverseStringFilter(Version, TokenStream) instead. This constructor will be removed in Lucene 4.0
- ReverseStringFilter(TokenStream, char) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. Use ReverseStringFilter.ReverseStringFilter(Version, TokenStream, char) instead. This constructor will be removed in Lucene 4.0
- ReverseStringFilter(Version, TokenStream) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Create a new ReverseStringFilter that reverses all tokens in the supplied TokenStream.
- ReverseStringFilter(Version, TokenStream, char) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Create a new ReverseStringFilter that reverses and marks all tokens in the supplied TokenStream.
- rewind() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
-
- RomanianAnalyzer - Class in org.apache.lucene.analysis.ro
- Analyzer for Romanian.
- RomanianAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
- Builds an analyzer with the default stop words: RomanianAnalyzer.DEFAULT_STOPWORD_FILE.
- RomanianAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
- Builds an analyzer with the given stop words.
- RomanianAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
- Builds an analyzer with the given stop words.
- RomanianStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- RomanianStemmer() -
Constructor for class org.tartarus.snowball.ext.RomanianStemmer
-
- root -
Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- RSLPStemmerBase - Class in org.apache.lucene.analysis.pt
- Base class for stemmers that use a set of RSLP-like stemming steps.
- RSLPStemmerBase() -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
- RSLPStemmerBase.Rule - Class in org.apache.lucene.analysis.pt
- A basic rule, with no exceptions.
- RSLPStemmerBase.Rule(String, int, String) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- Create a rule.
- RSLPStemmerBase.RuleWithSetExceptions - Class in org.apache.lucene.analysis.pt
- A rule with a set of whole-word exceptions.
- RSLPStemmerBase.RuleWithSetExceptions(String, int, String, String[]) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
-
- RSLPStemmerBase.RuleWithSuffixExceptions - Class in org.apache.lucene.analysis.pt
- A rule with a set of exceptional suffixes.
- RSLPStemmerBase.RuleWithSuffixExceptions(String, int, String, String[]) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
-
- RSLPStemmerBase.Step - Class in org.apache.lucene.analysis.pt
- A step containing a list of rules.
- RSLPStemmerBase.Step(String, RSLPStemmerBase.Rule[], int, String[]) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- Create a new step
- RTL_DIRECTION_MARKER -
Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Example marker character: U+200F (RIGHT-TO-LEFT MARK)
- rules -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- RussianAnalyzer - Class in org.apache.lucene.analysis.ru
- Analyzer for Russian language.
- RussianAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianAnalyzer(Version, String...) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Deprecated. Use RussianAnalyzer.RussianAnalyzer(Version, Set) instead.
- RussianAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Map<?, ?>) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Deprecated. Use RussianAnalyzer.RussianAnalyzer(Version, Set) instead.
- RussianLetterTokenizer - Class in org.apache.lucene.analysis.ru
- Deprecated. Use StandardTokenizer instead, which has the same functionality. This filter will be removed in Lucene 5.0
- RussianLetterTokenizer(Version, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Construct a new RussianLetterTokenizer.
- RussianLetterTokenizer(Version, AttributeSource, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Construct a new RussianLetterTokenizer using a given AttributeSource.
- RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Construct a new RussianLetterTokenizer using a given AttributeSource.AttributeFactory.
- RussianLetterTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Use RussianLetterTokenizer.RussianLetterTokenizer(Version, Reader) instead. This will be removed in Lucene 4.0.
- RussianLetterTokenizer(AttributeSource, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Use RussianLetterTokenizer.RussianLetterTokenizer(Version, AttributeSource, Reader) instead. This will be removed in Lucene 4.0.
- RussianLetterTokenizer(AttributeSource.AttributeFactory, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Use RussianLetterTokenizer.RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) instead. This will be removed in Lucene 4.0.
- RussianLightStemFilter - Class in org.apache.lucene.analysis.ru
- A TokenFilter that applies RussianLightStemmer to stem Russian words.
- RussianLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilter
-
- RussianLightStemmer - Class in org.apache.lucene.analysis.ru
- Light Stemmer for Russian.
- RussianLightStemmer() -
Constructor for class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- RussianLowerCaseFilter - Class in org.apache.lucene.analysis.ru
- Deprecated. Use LowerCaseFilter instead, which has the same functionality. This filter will be removed in Lucene 4.0
- RussianLowerCaseFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.ru.RussianLowerCaseFilter
- Deprecated.
- RussianStemFilter - Class in org.apache.lucene.analysis.ru
- Deprecated. Use SnowballFilter with RussianStemmer instead, which has the same functionality. This filter will be removed in Lucene 4.0
- RussianStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.ru.RussianStemFilter
- Deprecated.
- RussianStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- RussianStemmer() -
Constructor for class org.tartarus.snowball.ext.RussianStemmer
-
S
- s -
Variable in class org.tartarus.snowball.Among
-
- s_size -
Variable in class org.tartarus.snowball.Among
-
- sameRow -
Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
- Deprecated.
- sc -
Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- The character stored in this node: splitchar.
- searchPatterns(char[], int, byte[]) -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- Search for all possible partial matches of word starting at index and update
interletter values.
- set(char[], int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setArticles(Version, Set<?>) -
Method in class org.apache.lucene.analysis.fr.ElisionFilter
- Deprecated. Use ElisionFilter.ElisionFilter(Version, TokenStream, Set) instead.
- setArticles(Set<?>) -
Method in class org.apache.lucene.analysis.fr.ElisionFilter
- Deprecated. Use ElisionFilter.setArticles(Version, Set) instead.
- setCharAt(int, char) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setConsumer(PatternConsumer) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- setCurrent(String) -
Method in class org.tartarus.snowball.SnowballProgram
- Set the current string.
- setCurrent(char[], int) -
Method in class org.tartarus.snowball.SnowballProgram
- Set the current string.
- setExclusionSet(Set<?>) -
Method in class org.apache.lucene.analysis.de.GermanStemFilter
- Deprecated. Use KeywordAttribute with KeywordMarkerFilter instead.
- setExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
- Deprecated. Use KeywordAttribute with KeywordMarkerFilter instead.
- setExclusionTable(HashSet<?>) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Deprecated. Use KeywordAttribute with KeywordMarkerFilter instead.
- setFirst(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- setIgnoringSinglePrefixOrSuffixShingle(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setLast(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- setLength(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setMatrix(ShingleMatrixFilter.Matrix) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setMaximumShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setMaxShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Set the maximum size of output shingles (default: 2)
- setMaxShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Set the max shingle size (default: 2)
- setMinimumShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setMinShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Set the min shingle size (default: 2).
- setMinShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Set the min shingle size (default: 2).
- setOutputUnigrams(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Shall the filter pass the original tokens (the "unigrams") to the output
stream?
- setOutputUnigrams(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Shall the output stream contain the input tokens (unigrams) as well as
shingles? (default: true.)
- setOutputUnigramsIfNoShingles(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Shall we override the behavior of outputUnigrams==false for those
times when no shingles are available (because there are fewer than
minShingleSize tokens in the input stream)? (default: false.)
- setOutputUnigramsIfNoShingles(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Shall we override the behavior of outputUnigrams==false for those
times when no shingles are available (because there are fewer than
minShingleSize tokens in the input stream)? (default: false.)
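How the minimum size, maximum size, and unigram options described in the ShingleFilter and ShingleAnalyzerWrapper setters interact can be modelled on a list of strings. The sketch is a simplified model, not the ShingleFilter algorithm: ShingleSketch and its shingles method are hypothetical, tokens are joined with a single space, and position increments and filler tokens are ignored.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; a simplified model of word n-gram ("shingle") generation.
final class ShingleSketch {

    static List<String> shingles(List<String> tokens, int minSize, int maxSize,
                                 boolean outputUnigrams, boolean outputUnigramsIfNoShingles) {
        if (minSize < 2 || maxSize < minSize) {
            throw new IllegalArgumentException("need 2 <= minSize <= maxSize");
        }
        // No shingle can be built when there are fewer than minSize tokens.
        boolean anyShingle = tokens.size() >= minSize;
        boolean emitUnigrams = outputUnigrams || (outputUnigramsIfNoShingles && !anyShingle);

        List<String> out = new ArrayList<>();
        for (int i = 0; i < tokens.size(); i++) {
            if (emitUnigrams) {
                out.add(tokens.get(i));
            }
            // Emit every shingle of size minSize..maxSize that starts at token i.
            for (int size = minSize; size <= maxSize && i + size <= tokens.size(); size++) {
                out.add(String.join(" ", tokens.subList(i, i + size)));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> tokens = java.util.Arrays.asList("please", "divide", "this");
        System.out.println(shingles(tokens, 2, 2, true, false));
    }
}
```

In this model the outputUnigramsIfNoShingles flag only matters when outputUnigrams is false and the input is too short to form any shingle, which matches the wording of the setter descriptions above.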
- setPrefix(TokenStream) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setSpacerCharacter(Character) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setStemDictionary(File) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Reads a stem dictionary file that overrules the stemming algorithm.
This is a text file that contains, per line,
word\tstem, i.e. two tab-separated words
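The word\tstem file layout described for setStemDictionary(File) is simple to parse. The sketch below only illustrates that layout and is not DutchAnalyzer's loader: StemDictionarySketch and parse are hypothetical names, the input is passed as a String rather than a File, and rejecting malformed lines is an assumption. The sample words are made up for the example.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative sketch only; not the DutchAnalyzer stem dictionary loader.
final class StemDictionarySketch {

    // Parse "word<TAB>stem" lines into a word -> stem map, skipping blank lines.
    static Map<String, String> parse(String content) {
        Map<String, String> dict = new LinkedHashMap<>();
        for (String line : content.split("\r?\n")) {
            if (line.trim().isEmpty()) {
                continue;
            }
            String[] parts = line.split("\t");
            if (parts.length != 2) {
                throw new IllegalArgumentException("expected word<TAB>stem: " + line);
            }
            dict.put(parts[0], parts[1]);
        }
        return dict;
    }

    public static void main(String[] args) {
        // Hypothetical dictionary content: each line overrides the stem for one word.
        System.out.println(parse("fietsen\tfiets\nboeken\tboek\n"));
    }
}
```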
- setStemDictionary(HashMap<?, ?>) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Deprecated. Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
- setStemExclusionTable(String...) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Deprecated. Use BrazilianAnalyzer.BrazilianAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Deprecated. Use BrazilianAnalyzer.BrazilianAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Deprecated. Use BrazilianAnalyzer.BrazilianAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(String[]) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Deprecated. Use GermanAnalyzer.GermanAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Deprecated. Use GermanAnalyzer.GermanAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Deprecated. Use GermanAnalyzer.GermanAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(String...) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Deprecated. Use FrenchAnalyzer.FrenchAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Deprecated. Use FrenchAnalyzer.FrenchAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Deprecated. Use FrenchAnalyzer.FrenchAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(String...) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Deprecated. Use DutchAnalyzer.DutchAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(HashSet<?>) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Deprecated. Use DutchAnalyzer.DutchAnalyzer(Version, Set, Set) instead.
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Deprecated. Use DutchAnalyzer.DutchAnalyzer(Version, Set, Set) instead.
- setStemmer(GermanStemmer) -
Method in class org.apache.lucene.analysis.de.GermanStemFilter
- Set an alternative/custom GermanStemmer for this filter.
- setStemmer(FrenchStemmer) -
Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
- Deprecated. Set an alternative/custom FrenchStemmer for this filter.
- setStemmer(DutchStemmer) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Deprecated. Set an alternative/custom DutchStemmer for this filter.
- setStemmer(RussianStemmer) -
Method in class org.apache.lucene.analysis.ru.RussianStemFilter
- Deprecated. Set an alternative/custom RussianStemmer for this filter.
- setSuffix(TokenStream) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setToken(Token) -
Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
- Deprecated.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
- Deprecated. Sets the TokenPositioner as token flags int value.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
- Deprecated. Sets information on how a Token is to be inserted to a ShingleMatrixFilter.Matrix.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
- Deprecated.
- setTokens(List<Token>) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
- Deprecated.
- setTokenSeparator(String) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Sets the string to use when joining adjacent tokens to form a shingle
- setTokenSeparator(String) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Sets the string to use when joining adjacent tokens to form a shingle
- setTokenType(String) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Set the type of the shingle tokens produced by this filter.
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
- Deprecated.
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
- Deprecated. Stores a 32 bit float in the payload, or set it to null if 1f;
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
- Deprecated. Have this method do nothing in order to 'disable' weights.
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
- Deprecated.
- SHADDA -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- ShingleAnalyzerWrapper - Class in org.apache.lucene.analysis.shingle
- A ShingleAnalyzerWrapper wraps a ShingleFilter around another Analyzer.
- ShingleAnalyzerWrapper(Analyzer) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Version) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Wraps StandardAnalyzer.
- ShingleAnalyzerWrapper(Version, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Wraps StandardAnalyzer.
- ShingleFilter - Class in org.apache.lucene.analysis.shingle
- A ShingleFilter constructs shingles (token n-grams) from a token stream.
- ShingleFilter(TokenStream, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Constructs a ShingleFilter with the specified shingle size from the TokenStream input.
- ShingleFilter(TokenStream, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Constructs a ShingleFilter with the specified shingle size from the TokenStream input.
- ShingleFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Construct a ShingleFilter with default shingle size: 2.
- ShingleFilter(TokenStream, String) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Construct a ShingleFilter with the specified token type for shingle tokens
and the default shingle size: 2
- ShingleMatrixFilter - Class in org.apache.lucene.analysis.shingle
- Deprecated. Will be removed in Lucene 4.0. This filter is unmaintained and might not behave
correctly if used with custom Attributes, i.e. Attributes other than
the ones located in org.apache.lucene.analysis.tokenattributes. It also uses
hardcoded payload encoders, which makes it not easily adaptable to other use cases.
- ShingleMatrixFilter(ShingleMatrixFilter.Matrix, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter based on a user defined matrix.
- ShingleMatrixFilter(TokenStream, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter using the default ShingleMatrixFilter.TokenSettingsCodec.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter with ad hoc parameter settings.
- ShingleMatrixFilter.Matrix - Class in org.apache.lucene.analysis.shingle
- Deprecated. A column focused matrix in three dimensions:
- ShingleMatrixFilter.Matrix() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
- Deprecated.
- ShingleMatrixFilter.Matrix.Column - Class in org.apache.lucene.analysis.shingle
- Deprecated.
- ShingleMatrixFilter.Matrix.Column(Token) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- ShingleMatrixFilter.Matrix.Column() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row - Class in org.apache.lucene.analysis.shingle
- Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
- Deprecated.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. Using this codec makes a ShingleMatrixFilter act like ShingleFilter.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
- Deprecated.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. A full featured codec not to be used for something serious.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
- Deprecated.
- ShingleMatrixFilter.TokenPositioner - Class in org.apache.lucene.analysis.shingle
- Deprecated. Used to describe how a Token is to be inserted to a ShingleMatrixFilter.Matrix.
- ShingleMatrixFilter.TokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. Strategy used to code and decode metadata of the tokens from the input stream
regarding how to position the tokens in the matrix, set and retrieve weight, etc.
- ShingleMatrixFilter.TokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
- Deprecated.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. A codec that creates a two dimensional matrix
by treating tokens from the input stream with 0 position increment
as new rows to the current column.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
- Deprecated.
- SingleTokenTokenStream - Class in org.apache.lucene.analysis.miscellaneous
- A TokenStream containing a single token.
- SingleTokenTokenStream(Token) -
Constructor for class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- size() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- size() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- slice_check() -
Method in class org.tartarus.snowball.SnowballProgram
-
- slice_del() -
Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(CharSequence) -
Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(String) -
Method in class org.tartarus.snowball.SnowballProgram
- Deprecated. Kept for binary backward compatibility. Will be removed in Lucene 4.0.
- slice_from(StringBuilder) -
Method in class org.tartarus.snowball.SnowballProgram
- Deprecated. Kept for binary backward compatibility. Will be removed in Lucene 4.0.
- slice_to(StringBuilder) -
Method in class org.tartarus.snowball.SnowballProgram
-
- SnowballAnalyzer - Class in org.apache.lucene.analysis.snowball
- Deprecated. Use the language-specific analyzer in contrib/analyzers instead.
This analyzer will be removed in Lucene 5.0
- SnowballAnalyzer(Version, String) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Builds the named analyzer with no stop words.
- SnowballAnalyzer(Version, String, String[]) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Use SnowballAnalyzer.SnowballAnalyzer(Version, String, Set) instead.
- SnowballAnalyzer(Version, String, Set<?>) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Builds the named analyzer with the given stop words.
- SnowballFilter - Class in org.apache.lucene.analysis.snowball
- A filter that stems words using a Snowball-generated stemmer.
- SnowballFilter(TokenStream, SnowballProgram) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
- SnowballFilter(TokenStream, String) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
- Construct the named stemming filter.
- SnowballProgram - Class in org.tartarus.snowball
- This is rev 502 of the Snowball SVN trunk, with modifications:
the class was made abstract and an abstract method stem was introduced to avoid expensive reflection in the filter class.
- SnowballProgram() -
Constructor for class org.tartarus.snowball.SnowballProgram
-
- SpanishAnalyzer - Class in org.apache.lucene.analysis.es
- Analyzer for Spanish.
- SpanishAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
- Builds an analyzer with the default stop words: SpanishAnalyzer.DEFAULT_STOPWORD_FILE.
- SpanishAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
- Builds an analyzer with the given stop words.
- SpanishAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
- Builds an analyzer with the given stop words.
- SpanishLightStemFilter - Class in org.apache.lucene.analysis.es
- A TokenFilter that applies SpanishLightStemmer to stem Spanish words.
- SpanishLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilter
-
- SpanishLightStemmer - Class in org.apache.lucene.analysis.es
- Light Stemmer for Spanish
- SpanishLightStemmer() -
Constructor for class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- SpanishStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- SpanishStemmer() -
Constructor for class org.tartarus.snowball.ext.SpanishStemmer
-
- START_OF_HEADING_MARKER -
Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Example marker character: U+0001 (START OF HEADING)
- startElement(String, String, String, Attributes) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- startsWith(char[], int, String) -
Static method in class org.apache.lucene.analysis.util.StemmerUtil
- Returns true if the character array starts with the prefix.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.ar.ArabicStemmer
- Stem an input buffer of Arabic text.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.bg.BulgarianStemmer
- Stem an input buffer of Bulgarian text.
- stem(String) -
Method in class org.apache.lucene.analysis.br.BrazilianStemmer
- Stems the given term to a unique discriminator.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.cz.CzechStemmer
- Stem an input buffer of Czech text.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.de.GermanLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.de.GermanMinimalStemmer
-
- stem(String) -
Method in class org.apache.lucene.analysis.de.GermanStemmer
- Stems the given term to a unique discriminator.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.el.GreekStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.en.EnglishMinimalStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.fi.FinnishLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.fr.FrenchLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
-
- stem(String) -
Method in class org.apache.lucene.analysis.fr.FrenchStemmer
- Deprecated. Stems the given term to a unique discriminator.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.gl.GalicianStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.hi.HindiStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.hu.HungarianLightStemmer
-
- stem(char[], int, boolean) -
Method in class org.apache.lucene.analysis.id.IndonesianStemmer
- Stem a term (returning its new length).
- stem(char[], int) -
Method in class org.apache.lucene.analysis.it.ItalianLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.lv.LatvianStemmer
- Stem a Latvian word.
- stem(String) -
Method in class org.apache.lucene.analysis.nl.DutchStemmer
- Deprecated.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.ArmenianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.BasqueStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.CatalanStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.DanishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.DutchStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.EnglishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.FinnishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.FrenchStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.German2Stemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.GermanStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.HungarianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.ItalianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.KpStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.LovinsStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.NorwegianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.PorterStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.PortugueseStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.RomanianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.RussianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.SpanishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.SwedishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.TurkishStemmer
-
- stem() -
Method in class org.tartarus.snowball.SnowballProgram
-
- StemmerOverrideFilter - Class in org.apache.lucene.analysis.miscellaneous
- Provides the ability to override any KeywordAttribute-aware stemmer
with custom dictionary-based stemming.
- StemmerOverrideFilter(Version, TokenStream, Map<?, String>) -
Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
- Create a new StemmerOverrideFilter, performing dictionary-based stemming
with the provided dictionary.
- StemmerUtil - Class in org.apache.lucene.analysis.util
- Some commonly-used stemming functions
- StemmerUtil() -
Constructor for class org.apache.lucene.analysis.util.StemmerUtil
-
- stemPrefix(char[], int) -
Method in class org.apache.lucene.analysis.ar.ArabicStemmer
- Stem a prefix off an Arabic word.
- stemSuffix(char[], int) -
Method in class org.apache.lucene.analysis.ar.ArabicStemmer
- Stem suffix(es) off an Arabic word.
- STOP_WORDS -
Static variable in class org.apache.lucene.analysis.cjk.CJKAnalyzer
- Deprecated. Use CJKAnalyzer.getDefaultStopSet() instead.
- STOP_WORDS -
Static variable in class org.apache.lucene.analysis.cn.ChineseFilter
- Deprecated.
- stoplist -
Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- This map stores hyphenation exceptions
- STOPWORDS_COMMENT -
Static variable in class org.apache.lucene.analysis.ar.ArabicAnalyzer
- Deprecated. Use WordlistLoader.getWordSet(File, String) directly.
- STOPWORDS_COMMENT -
Static variable in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
- Deprecated. Use WordlistLoader.getWordSet(File, String) directly.
- STOPWORDS_COMMENT -
Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
- The comment character in the stopwords file.
- strcmp(char[], int, char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- Compares two null-terminated char arrays.
- strcmp(String, char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- Compares a string with a null-terminated char array.
- strcpy(char[], int, char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[]) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- SUB_HEADING -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- SUB_HEADING_ID -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- subSequence(int, int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- substring_i -
Variable in class org.tartarus.snowball.Among
-
- suffix -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- suffixes -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- suffixes -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- SUKUN -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- SwedishAnalyzer - Class in org.apache.lucene.analysis.sv
- Analyzer for Swedish.
- SwedishAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
- Builds an analyzer with the default stop words: SwedishAnalyzer.DEFAULT_STOPWORD_FILE.
- SwedishAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
- Builds an analyzer with the given stop words.
- SwedishAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
- Builds an analyzer with the given stop words.
- SwedishLightStemFilter - Class in org.apache.lucene.analysis.sv
- A TokenFilter that applies SwedishLightStemmer to stem Swedish words.
- SwedishLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilter
-
- SwedishLightStemmer - Class in org.apache.lucene.analysis.sv
- Light Stemmer for Swedish.
- SwedishLightStemmer() -
Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- SwedishStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- SwedishStemmer() -
Constructor for class org.tartarus.snowball.ext.SwedishStemmer
-
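The ShingleFilter entries in this section describe shingles as token n-grams formed by joining adjacent tokens with a token separator. The following is a minimal plain-Java sketch of that idea only. It does not use the Lucene classes, and the class name ShingleSketch, the method shingles, and its parameters are hypothetical names chosen for illustration. It also ignores details the real filter handles, such as unigram output and position increments.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration class; not part of Lucene.
public class ShingleSketch {

    // Joins every run of minSize..maxSize adjacent tokens with the separator.
    static List<String> shingles(List<String> tokens, int minSize, int maxSize, String separator) {
        List<String> out = new ArrayList<>();
        for (int start = 0; start < tokens.size(); start++) {
            // Grow the window from minSize up to maxSize while it still fits.
            for (int size = minSize; size <= maxSize && start + size <= tokens.size(); size++) {
                out.add(String.join(separator, tokens.subList(start, start + size)));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("please", "divide", "this", "sentence");
        // Bigrams only, joined with a single space.
        System.out.println(shingles(tokens, 2, 2, " "));
    }
}
```

With minSize and maxSize both set to 2, the sketch yields only bigrams, matching the default shingle size of 2 listed for ShingleFilter(TokenStream).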
T
- TATWEEL -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- TEH -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- TEH_MARBUTA -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- TEH_MARBUTA -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- termAtt -
Variable in class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
-
- TernaryTree - Class in org.apache.lucene.analysis.compound.hyphenation
- Ternary Search Tree.
- TernaryTree.Iterator - Class in org.apache.lucene.analysis.compound.hyphenation
-
- TernaryTree.Iterator() -
Constructor for class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
-
- TestApp - Class in org.tartarus.snowball
-
- TestApp() -
Constructor for class org.tartarus.snowball.TestApp
-
- ThaiAnalyzer - Class in org.apache.lucene.analysis.th
- Analyzer for Thai language.
- ThaiAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.th.ThaiAnalyzer
-
- ThaiWordFilter - Class in org.apache.lucene.analysis.th
- TokenFilter that uses BreakIterator to break each
Token that is Thai into separate Token(s) for each Thai word.
- ThaiWordFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.th.ThaiWordFilter
- Deprecated. Use the constructor with matchVersion instead.
- ThaiWordFilter(Version, TokenStream) -
Constructor for class org.apache.lucene.analysis.th.ThaiWordFilter
- Creates a new ThaiWordFilter with the specified match version.
- toCharArray() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- TOKEN_SEPARATOR -
Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
- The default string to use when joining adjacent tokens to form a shingle
- TOKEN_TYPES -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- String token types that correspond to token type int constants
- TokenOffsetPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
- Adds the Token.setStartOffset(int) and Token.setEndOffset(int) First 4 bytes are the start
- TokenOffsetPayloadTokenFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilter
-
- TokenRangeSinkFilter - Class in org.apache.lucene.analysis.sinks
- Counts the tokens as they go by and saves to the internal list those in the range from lower to upper, exclusive of upper.
- TokenRangeSinkFilter(int, int) -
Constructor for class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
-
- tokens -
Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
- TOKENS_ONLY -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Only output tokens
- tokenStream(String, String) -
Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
- Creates a token stream that tokenizes the given string into token terms
(aka words).
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
- Creates a token stream that tokenizes all the text in the given Reader.
This implementation forwards to tokenStream(String, String) and is
less efficient than tokenStream(String, String).
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Constructs a StandardTokenizer filtered by a StandardFilter, a LowerCaseFilter, a StopFilter,
and a SnowballFilter.
- TokenTypeSinkFilter - Class in org.apache.lucene.analysis.sinks
-
- TokenTypeSinkFilter(String) -
Constructor for class org.apache.lucene.analysis.sinks.TokenTypeSinkFilter
-
- toString() -
Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- toString() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
- Deprecated.
- toString() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- toString() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
- Deprecated.
- toString() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- trimToSize() -
Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
- trimToSize() -
Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
- trimToSize() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- Each node stores a character (splitchar) which is part of some key(s).
- TurkishAnalyzer - Class in org.apache.lucene.analysis.tr
- Analyzer for Turkish.
- TurkishAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
- Builds an analyzer with the default stop words: TurkishAnalyzer.DEFAULT_STOPWORD_FILE.
- TurkishAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
- Builds an analyzer with the given stop words.
- TurkishAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
- Builds an analyzer with the given stop words.
- TurkishLowerCaseFilter - Class in org.apache.lucene.analysis.tr
- Normalizes Turkish token text to lower case.
- TurkishLowerCaseFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.tr.TurkishLowerCaseFilter
- Create a new TurkishLowerCaseFilter, that normalizes Turkish token text
to lower case.
- TurkishStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- TurkishStemmer() -
Constructor for class org.tartarus.snowball.ext.TurkishStemmer
-
- TypeAsPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
- Makes the Token.type() a payload.
- TypeAsPayloadTokenFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilter
-
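The TokenRangeSinkFilter entry in this section describes counting tokens as they pass and keeping those in a range that excludes upper. The following plain-Java sketch illustrates that selection rule under the assumption that lower is inclusive, which the entry does not state. It does not use the Lucene classes; TokenRangeSketch and tokenRange are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration class; not part of Lucene.
public class TokenRangeSketch {

    // Keeps tokens whose zero-based count falls in [lower, upper).
    // Treating lower as inclusive is an assumption made for this sketch.
    static List<String> tokenRange(List<String> tokens, int lower, int upper) {
        List<String> kept = new ArrayList<>();
        int count = 0;
        for (String token : tokens) {
            if (count >= lower && count < upper) {
                kept.add(token);
            }
            count++; // every token is counted, whether kept or not
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("a", "b", "c", "d", "e");
        System.out.println(tokenRange(tokens, 1, 3));
    }
}
```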
U
- unpackValues(int) -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
- unsafeWrite(char) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- unsafeWrite(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- unsafeWrite(char[], int, int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- UNTOKENIZED_ONLY -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Only output untokenized tokens, which are tokens that would normally be split into several tokens
- UNTOKENIZED_TOKEN_FLAG -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- This flag is used to indicate that the produced "Token" would, if WikipediaTokenizer.TOKENS_ONLY was used, produce multiple tokens.
- updateInputToken(Token, Token) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- updateSuffixToken(Token, Token) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- updateSuffixToken(Token, Token) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- The default implementation adds last prefix token end offset to the suffix token start and end offsets.
- updateToken(Token, List<Token>, int, List<ShingleMatrixFilter.Matrix.Column.Row>, List<Token>) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Final touch of a shingle token before it is passed on to the consumer from method ShingleMatrixFilter.incrementToken().
V
- valueOf(String) -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
- Returns the enum constant of this type with the specified name.
- valueOf(String) -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
- Returns the enum constant of this type with the specified name.
- values() -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
- Returns an array containing the constants of this enum type, in
the order they are declared.
- values() -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
- Returns an array containing the constants of this enum type, in
the order they are declared.
- vspace -
Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- value space: stores the interletter values
W
- WAW -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- WHITESPACE_PATTERN -
Static variable in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
- "\\s+"; Divides text at whitespaces (Character.isWhitespace(c))
- WikipediaTokenizer - Class in org.apache.lucene.analysis.wikipedia
- Extension of StandardTokenizer that is aware of Wikipedia syntax.
- WikipediaTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the WikipediaTokenizer.
- WikipediaTokenizer(Reader, int, Set<String>) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the WikipediaTokenizer.
- WikipediaTokenizer(AttributeSource.AttributeFactory, Reader, int, Set<String>) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the WikipediaTokenizer.
- WikipediaTokenizer(AttributeSource, Reader, int, Set<String>) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the WikipediaTokenizer.
- write(char) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(char[]) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(char[], int, int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(OpenStringBuilder) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(String) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
Y
- YEH -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- YEH -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- YEH -
Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
-
- YEH_BARREE -
Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
-
Copyright © 2000-2011 Apache Software Foundation. All Rights Reserved.