Provides various convenience classes for creating payloads on Tokens.
org.apache.lucene.analysis.position - package org.apache.lucene.analysis.positionFilter for assigning position increments.org.apache.lucene.analysis.pt - package org.apache.lucene.analysis.ptAnalyzer for Portuguese.org.apache.lucene.analysis.query - package org.apache.lucene.analysis.queryAutomatically filter high-frequency stopwords.org.apache.lucene.analysis.reverse - package org.apache.lucene.analysis.reverseFilter to reverse token text.org.apache.lucene.analysis.ro - package org.apache.lucene.analysis.roAnalyzer for Romanian.org.apache.lucene.analysis.ru - package org.apache.lucene.analysis.ruAnalyzer for Russian.org.apache.lucene.analysis.shingle - package org.apache.lucene.analysis.shingleWord n-gram filtersorg.apache.lucene.analysis.sinks - package org.apache.lucene.analysis.sinksImplementations of the SinkTokenizer that might be useful.
org.apache.lucene.analysis.snowball - package org.apache.lucene.analysis.snowballTokenFilter and Analyzer implementations that use Snowball
stemmers.org.apache.lucene.analysis.sv - package org.apache.lucene.analysis.svAnalyzer for Swedish.org.apache.lucene.analysis.synonym - package org.apache.lucene.analysis.synonymAnalysis components for Synonyms.org.apache.lucene.analysis.th - package org.apache.lucene.analysis.thAnalyzer for Thai.org.apache.lucene.analysis.tr - package org.apache.lucene.analysis.trAnalyzer for Turkish.org.apache.lucene.analysis.util - package org.apache.lucene.analysis.util org.apache.lucene.analysis.wikipedia - package org.apache.lucene.analysis.wikipediaTokenizer that is aware of Wikipedia syntax.org.tartarus.snowball - package org.tartarus.snowball org.tartarus.snowball.ext - package org.tartarus.snowball.ext out_grouping(char[], int, int) -
Method in class org.tartarus.snowball.SnowballProgram
out_grouping_b(char[], int, int) -
Method in class org.tartarus.snowball.SnowballProgram
out_range(int, int) -
Method in class org.tartarus.snowball.SnowballProgram
out_range_b(int, int) -
Method in class org.tartarus.snowball.SnowballProgram
P
- packValues(String) -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- Packs the values by storing them in 4 bits, two values into a byte Values
range is from 0 to 9.
- parse(String) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- Parses a hyphenation pattern file.
- parse(File) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- Parses a hyphenation pattern file.
- parse(InputSource) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- Parses a hyphenation pattern file.
- parse(Class<? extends RSLPStemmerBase>, String) -
Static method in class org.apache.lucene.analysis.pt.RSLPStemmerBase
- Parse a resource file into an RSLP stemmer description.
- PathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
- Take something like:
- PathHierarchyTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PatternAnalyzer - Class in org.apache.lucene.analysis.miscellaneous
- Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a
Reader, that can flexibly separate text into terms via a regular expression Pattern
(with behaviour identical to String.split(String)),
and that combines the functionality of
LetterTokenizer,
LowerCaseTokenizer,
WhitespaceTokenizer,
StopFilter into a single efficient
multi-purpose class. - PatternAnalyzer(Version, Pattern, boolean, Set<?>) -
Constructor for class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
- Constructs a new instance with the given parameters.
- PatternConsumer - Interface in org.apache.lucene.analysis.compound.hyphenation
- This interface is used to connect the XML pattern file parser to the
hyphenation tree.
- PatternParser - Class in org.apache.lucene.analysis.compound.hyphenation
- A SAX document handler to read and parse hyphenation patterns from a XML
file.
- PatternParser() -
Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PatternParser(PatternConsumer) -
Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PayloadEncoder - Interface in org.apache.lucene.analysis.payloads
- Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to Payload.
- PayloadHelper - Class in org.apache.lucene.analysis.payloads
- Utility methods for encoding payloads.
- PayloadHelper() -
Constructor for class org.apache.lucene.analysis.payloads.PayloadHelper
-
- permutationIterator() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
- Deprecated.
- PersianAnalyzer - Class in org.apache.lucene.analysis.fa
Analyzer for Persian.- PersianAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Builds an analyzer with the default stop words:
PersianAnalyzer.DEFAULT_STOPWORD_FILE.
- PersianAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Builds an analyzer with the given stop words
- PersianAnalyzer(Version, String...) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Deprecated. use
PersianAnalyzer.PersianAnalyzer(Version, Set) instead
- PersianAnalyzer(Version, Hashtable<?, ?>) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Deprecated. use
PersianAnalyzer.PersianAnalyzer(Version, Set) instead
- PersianAnalyzer(Version, File) -
Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
- Deprecated. use
PersianAnalyzer.PersianAnalyzer(Version, Set) instead
- PersianCharFilter - Class in org.apache.lucene.analysis.fa
- CharFilter that replaces instances of Zero-width non-joiner with an
ordinary space.
- PersianCharFilter(CharStream) -
Constructor for class org.apache.lucene.analysis.fa.PersianCharFilter
-
- PersianNormalizationFilter - Class in org.apache.lucene.analysis.fa
- A
TokenFilter that applies PersianNormalizer to normalize the
orthography. - PersianNormalizationFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilter
-
- PersianNormalizer - Class in org.apache.lucene.analysis.fa
- Normalizer for Persian.
- PersianNormalizer() -
Constructor for class org.apache.lucene.analysis.fa.PersianNormalizer
-
- PorterStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- PorterStemmer() -
Constructor for class org.tartarus.snowball.ext.PorterStemmer
-
- PortugueseAnalyzer - Class in org.apache.lucene.analysis.pt
Analyzer for Portuguese.- PortugueseAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
- Builds an analyzer with the default stop words:
PortugueseAnalyzer.DEFAULT_STOPWORD_FILE.
- PortugueseAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
- Builds an analyzer with the given stop words.
- PortugueseAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
- Builds an analyzer with the given stop words.
- PortugueseLightStemFilter - Class in org.apache.lucene.analysis.pt
- A
TokenFilter that applies PortugueseLightStemmer to stem
Portuguese words. - PortugueseLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
-
- PortugueseLightStemmer - Class in org.apache.lucene.analysis.pt
- Light Stemmer for Portuguese
- PortugueseLightStemmer() -
Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- PortugueseMinimalStemFilter - Class in org.apache.lucene.analysis.pt
- A
TokenFilter that applies PortugueseMinimalStemmer to stem
Portuguese words. - PortugueseMinimalStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
-
- PortugueseMinimalStemmer - Class in org.apache.lucene.analysis.pt
- Minimal Stemmer for Portuguese
- PortugueseMinimalStemmer() -
Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- PortugueseStemFilter - Class in org.apache.lucene.analysis.pt
- A
TokenFilter that applies PortugueseStemmer to stem
Portuguese words. - PortugueseStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilter
-
- PortugueseStemmer - Class in org.apache.lucene.analysis.pt
- Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa)
algorithm.
- PortugueseStemmer() -
Constructor for class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- PortugueseStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- PortugueseStemmer() -
Constructor for class org.tartarus.snowball.ext.PortugueseStemmer
-
- PositionFilter - Class in org.apache.lucene.analysis.position
- Set the positionIncrement of all tokens to the "positionIncrement",
except the first return token which retains its original positionIncrement value.
- PositionFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.position.PositionFilter
- Constructs a PositionFilter that assigns a position increment of zero to
all but the first token from the given input stream.
- PositionFilter(TokenStream, int) -
Constructor for class org.apache.lucene.analysis.position.PositionFilter
- Constructs a PositionFilter that assigns the given position increment to
all but the first token from the given input stream.
- postBreak -
Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- preBreak -
Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- PrefixAndSuffixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
- Links two
PrefixAwareTokenFilter. - PrefixAndSuffixAwareTokenFilter(TokenStream, TokenStream, TokenStream) -
Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- PrefixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
- Joins two token streams and leaves the last token of the first stream available
to be used when updating the token values in the second stream based on that token.
- PrefixAwareTokenFilter(TokenStream, TokenStream) -
Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- prefixes -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- previous() -
Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
- printStats() -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
- printStats() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- PUA_EC00_MARKER -
Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Example marker character: U+EC00 (PRIVATE USE AREA: EC00)
- put(int, byte) -
Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
- put(int, char) -
Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
Q
- QueryAutoStopWordAnalyzer - Class in org.apache.lucene.analysis.query
- An
Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection
which prevents very common words from being passed into queries. - QueryAutoStopWordAnalyzer(Version, Analyzer) -
Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- Deprecated. Stopwords should be calculated at instantiation using one of the other constructors
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader) -
Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency percentage greater than
QueryAutoStopWordAnalyzer.defaultMaxDocFreqPercent
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, int) -
Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency greater than the given
maxDocFreq
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, float) -
Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency percentage greater than
the given maxPercentDocs
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, Collection<String>, float) -
Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the
given selection of fields from terms with a document frequency percentage
greater than the given maxPercentDocs
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, Collection<String>, int) -
Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the
given selection of fields from terms with a document frequency greater than
the given maxDocFreq
R
- read(char[], int, int) -
Method in class org.apache.lucene.analysis.fa.PersianCharFilter
-
- readToken(StringBuffer) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- replace(char[], int) -
Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- replace_s(int, int, CharSequence) -
Method in class org.tartarus.snowball.SnowballProgram
-
- replace_s(int, int, String) -
Method in class org.tartarus.snowball.SnowballProgram
- Deprecated. for binary back compat. Will be removed in Lucene 4.0
- replacement -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- reserve(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() -
Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
- Deprecated.
- reset(Reader) -
Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
- Deprecated.
- reset() -
Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
- reset() -
Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilter
-
- reset() -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- reset() -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- reset() -
Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- reset() -
Method in class org.apache.lucene.analysis.position.PositionFilter
-
- reset() -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
- reset() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- reset() -
Method in class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
-
- reset() -
Method in class org.apache.lucene.analysis.synonym.SynonymFilter
-
- reset() -
Method in class org.apache.lucene.analysis.th.ThaiWordFilter
-
- reset() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() -
Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- reset(Reader) -
Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- resize(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- resolveEntity(String, String) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- result -
Variable in class org.tartarus.snowball.Among
-
- reusableTokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
- reusableTokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- reusableTokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Returns a (possibly reused)
StandardTokenizer filtered by a
StandardFilter, a LowerCaseFilter,
a StopFilter, and a SnowballFilter
- reverse(String) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. use
ReverseStringFilter.reverse(Version, String) instead. This method
will be removed in Lucene 4.0
- reverse(Version, String) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Reverses the given input string
- reverse(char[]) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. use
ReverseStringFilter.reverse(Version, char[]) instead. This
method will be removed in Lucene 4.0
- reverse(Version, char[]) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Reverses the given input buffer in-place
- reverse(char[], int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. use
ReverseStringFilter.reverse(Version, char[], int) instead. This
method will be removed in Lucene 4.0
- reverse(Version, char[], int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Partially reverses the given input buffer in-place from offset 0
up to the given length.
- reverse(char[], int, int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. use
ReverseStringFilter.reverse(Version, char[], int, int) instead. This
method will be removed in Lucene 4.0
- reverse(Version, char[], int, int) -
Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Partially reverses the given input buffer in-place from the given offset
up to the given length.
- ReversePathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
- Take something like:
- ReversePathHierarchyTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char, int) -
Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReverseStringFilter - Class in org.apache.lucene.analysis.reverse
- Reverse token string, for example "country" => "yrtnuoc".
- ReverseStringFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. use
ReverseStringFilter.ReverseStringFilter(Version, TokenStream)
instead. This constructor will be removed in Lucene 4.0
- ReverseStringFilter(TokenStream, char) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Deprecated. use
ReverseStringFilter.ReverseStringFilter(Version, TokenStream, char)
instead. This constructor will be removed in Lucene 4.0
- ReverseStringFilter(Version, TokenStream) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Create a new ReverseStringFilter that reverses all tokens in the
supplied
TokenStream.
- ReverseStringFilter(Version, TokenStream, char) -
Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Create a new ReverseStringFilter that reverses and marks all tokens in the
supplied
TokenStream.
- rewind() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
-
- RomanianAnalyzer - Class in org.apache.lucene.analysis.ro
Analyzer for Romanian.- RomanianAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
- Builds an analyzer with the default stop words:
RomanianAnalyzer.DEFAULT_STOPWORD_FILE.
- RomanianAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
- Builds an analyzer with the given stop words.
- RomanianAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
- Builds an analyzer with the given stop words.
- RomanianStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- RomanianStemmer() -
Constructor for class org.tartarus.snowball.ext.RomanianStemmer
-
- root -
Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- RSLPStemmerBase - Class in org.apache.lucene.analysis.pt
- Base class for stemmers that use a set of RSLP-like stemming steps.
- RSLPStemmerBase() -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
- RSLPStemmerBase.Rule - Class in org.apache.lucene.analysis.pt
- A basic rule, with no exceptions.
- RSLPStemmerBase.Rule(String, int, String) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- Create a rule.
- RSLPStemmerBase.RuleWithSetExceptions - Class in org.apache.lucene.analysis.pt
- A rule with a set of whole-word exceptions.
- RSLPStemmerBase.RuleWithSetExceptions(String, int, String, String[]) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
-
- RSLPStemmerBase.RuleWithSuffixExceptions - Class in org.apache.lucene.analysis.pt
- A rule with a set of exceptional suffixes.
- RSLPStemmerBase.RuleWithSuffixExceptions(String, int, String, String[]) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
-
- RSLPStemmerBase.Step - Class in org.apache.lucene.analysis.pt
- A step containing a list of rules.
- RSLPStemmerBase.Step(String, RSLPStemmerBase.Rule[], int, String[]) -
Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- Create a new step
- RTL_DIRECTION_MARKER -
Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Example marker character: U+200F (RIGHT-TO-LEFT MARK)
- rules -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- RussianAnalyzer - Class in org.apache.lucene.analysis.ru
Analyzer for Russian language.- RussianAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianAnalyzer(Version, String...) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Deprecated. use
RussianAnalyzer.RussianAnalyzer(Version, Set) instead
- RussianAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Map<?, ?>) -
Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- Deprecated. use
RussianAnalyzer.RussianAnalyzer(Version, Set) instead
- RussianLetterTokenizer - Class in org.apache.lucene.analysis.ru
- Deprecated. Use
StandardTokenizer instead, which has the same functionality.
This filter will be removed in Lucene 5.0 - RussianLetterTokenizer(Version, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Construct a new RussianLetterTokenizer.
- RussianLetterTokenizer(Version, AttributeSource, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Construct a new RussianLetterTokenizer using a given
AttributeSource.
- RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory.
- RussianLetterTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. use
RussianLetterTokenizer.RussianLetterTokenizer(Version, Reader) instead. This will
be removed in Lucene 4.0.
- RussianLetterTokenizer(AttributeSource, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. use
RussianLetterTokenizer.RussianLetterTokenizer(Version, AttributeSource, Reader)
instead. This will be removed in Lucene 4.0.
- RussianLetterTokenizer(AttributeSource.AttributeFactory, Reader) -
Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
- Deprecated. use
RussianLetterTokenizer.RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader)
instead. This will be removed in Lucene 4.0.
- RussianLightStemFilter - Class in org.apache.lucene.analysis.ru
- A
TokenFilter that applies RussianLightStemmer to stem Russian
words. - RussianLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilter
-
- RussianLightStemmer - Class in org.apache.lucene.analysis.ru
- Light Stemmer for Russian.
- RussianLightStemmer() -
Constructor for class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- RussianLowerCaseFilter - Class in org.apache.lucene.analysis.ru
- Deprecated. Use
LowerCaseFilter instead, which has the same
functionality. This filter will be removed in Lucene 4.0 - RussianLowerCaseFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.ru.RussianLowerCaseFilter
- Deprecated.
- RussianStemFilter - Class in org.apache.lucene.analysis.ru
- Deprecated. Use
SnowballFilter with
RussianStemmer instead, which has the
same functionality. This filter will be removed in Lucene 4.0 - RussianStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.ru.RussianStemFilter
- Deprecated.
- RussianStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- RussianStemmer() -
Constructor for class org.tartarus.snowball.ext.RussianStemmer
-
S
- s -
Variable in class org.tartarus.snowball.Among
-
- s_size -
Variable in class org.tartarus.snowball.Among
-
- sameRow -
Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
- Deprecated.
- sc -
Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
The character stored in this node: splitchar.
- searchPatterns(char[], int, byte[]) -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Search for all possible partial matches of word starting at index an update
interletter values.
- set(char[], int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setAppend(String) -
Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
- Sets the append defined for the affix
- setAppendFlags(char[]) -
Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
- Sets the flags defined for the affix append
- setArticles(Version, Set<?>) -
Method in class org.apache.lucene.analysis.fr.ElisionFilter
- Deprecated. use
ElisionFilter.ElisionFilter(Version, TokenStream, Set) instead
- setArticles(Set<?>) -
Method in class org.apache.lucene.analysis.fr.ElisionFilter
- Deprecated. use
ElisionFilter.setArticles(Version, Set) instead
- setCharAt(int, char) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setCondition(String, String) -
Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
- Sets the condition that must be met before the affix can be applied
- setConsumer(PatternConsumer) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- setCrossProduct(boolean) -
Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
- Sets whether the affix is defined as cross product
- setCurrent(String) -
Method in class org.tartarus.snowball.SnowballProgram
- Set the current string.
- setCurrent(char[], int) -
Method in class org.tartarus.snowball.SnowballProgram
- Set the current string.
- setExclusionSet(Set<?>) -
Method in class org.apache.lucene.analysis.de.GermanStemFilter
- Deprecated. use
KeywordAttribute with KeywordMarkerFilter instead.
- setExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
- Deprecated. use
KeywordAttribute with KeywordMarkerFilter instead.
- setExclusionTable(HashSet<?>) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Deprecated. use
KeywordAttribute with KeywordMarkerFilter instead.
- setFirst(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- setFlag(char) -
Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
- Sets the affix flag
- setIgnoringSinglePrefixOrSuffixShingle(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setIndex(int) -
Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
- setLast(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- setLength(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setMatrix(ShingleMatrixFilter.Matrix) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setMaximumShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setMaxShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Deprecated. Setting maxShingleSize after Analyzer instantiation prevents reuse.
Confgure maxShingleSize during construction.
- setMaxShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Set the max shingle size (default: 2)
- setMinimumShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setMinShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Deprecated. Setting minShingleSize after Analyzer instantiation prevents reuse.
Confgure minShingleSize during construction.
- setMinShingleSize(int) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Set the min shingle size (default: 2).
- setOutputUnigrams(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Deprecated. Setting outputUnigrams after Analyzer instantiation prevents reuse.
Confgure outputUnigrams during construction.
- setOutputUnigrams(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Shall the output stream contain the input tokens (unigrams) as well as
shingles? (default: true.)
- setOutputUnigramsIfNoShingles(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Deprecated. Setting outputUnigramsIfNoShingles after Analyzer instantiation prevents reuse.
Confgure outputUnigramsIfNoShingles during construction.
- setOutputUnigramsIfNoShingles(boolean) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Shall we override the behavior of outputUnigrams==false for those
times when no shingles are available (because there are fewer than
minShingleSize tokens in the input stream)? (default: false.)
- setPrefix(TokenStream) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setSpacerCharacter(Character) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated.
- setStemDictionary(File) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Deprecated. This prevents reuse of TokenStreams. If you wish to use a custom
stem dictionary, create your own Analyzer with
StemmerOverrideFilter
- setStemDictionary(HashMap<?, ?>) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Deprecated. Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
- setStemExclusionTable(String...) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Deprecated. use
BrazilianAnalyzer.BrazilianAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Deprecated. use
BrazilianAnalyzer.BrazilianAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- Deprecated. use
BrazilianAnalyzer.BrazilianAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(String[]) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Deprecated. use
GermanAnalyzer.GermanAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Deprecated. use
GermanAnalyzer.GermanAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- Deprecated. use
GermanAnalyzer.GermanAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(String...) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Deprecated. use
FrenchAnalyzer.FrenchAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(Map<?, ?>) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Deprecated. use
FrenchAnalyzer.FrenchAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- Deprecated. use
FrenchAnalyzer.FrenchAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(String...) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Deprecated. use
DutchAnalyzer.DutchAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(HashSet<?>) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Deprecated. use
DutchAnalyzer.DutchAnalyzer(Version, Set, Set) instead
- setStemExclusionTable(File) -
Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- Deprecated. use
DutchAnalyzer.DutchAnalyzer(Version, Set, Set) instead
- setStemmer(GermanStemmer) -
Method in class org.apache.lucene.analysis.de.GermanStemFilter
- Set a alternative/custom
GermanStemmer for this filter.
- setStemmer(FrenchStemmer) -
Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
- Deprecated. Set a alternative/custom
FrenchStemmer for this filter.
- setStemmer(DutchStemmer) -
Method in class org.apache.lucene.analysis.nl.DutchStemFilter
- Deprecated. Set a alternative/custom
DutchStemmer for this filter.
- setStemmer(RussianStemmer) -
Method in class org.apache.lucene.analysis.ru.RussianStemFilter
- Deprecated. Set a alternative/custom
RussianStemmer for this filter.
- setStrip(String) -
Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
- Sets the stripping characters defined for the affix
- setSuffix(TokenStream) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setText(char[], int, int) -
Method in class org.apache.lucene.analysis.util.CharArrayIterator
- Set a new region of text to be examined by this iterator
- setToken(Token) -
Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
- Deprecated.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
- Deprecated. Sets the TokenPositioner as token flags int value.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
- Deprecated. Sets information on how a
Token is to be inserted to a ShingleMatrixFilter.Matrix.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
- Deprecated.
- setTokens(List<Token>) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
- Deprecated.
- setTokenSeparator(String) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Deprecated. Setting tokenSeparator after Analyzer instantiation prevents reuse.
Confgure tokenSeparator during construction.
- setTokenSeparator(String) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Sets the string to use when joining adjacent tokens to form a shingle
- setTokenType(String) -
Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- Set the type of the shingle tokens produced by this filter.
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
- Deprecated.
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
- Deprecated. Stores a 32 bit float in the payload, or set it to null if 1f;
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
- Deprecated. Have this method do nothing in order to 'disable' weights.
- setWeight(Token, float) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
- Deprecated.
- SHADDA -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- ShingleAnalyzerWrapper - Class in org.apache.lucene.analysis.shingle
- A ShingleAnalyzerWrapper wraps a
ShingleFilter around another Analyzer. - ShingleAnalyzerWrapper(Analyzer) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int, int, String, boolean, boolean) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Creates a new ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Version) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Wraps
StandardAnalyzer.
- ShingleAnalyzerWrapper(Version, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- Wraps
StandardAnalyzer.
- ShingleFilter - Class in org.apache.lucene.analysis.shingle
- A ShingleFilter constructs shingles (token n-grams) from a token stream.
- ShingleFilter(TokenStream, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Constructs a ShingleFilter with the specified shingle size from the
TokenStream input
- ShingleFilter(TokenStream, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Constructs a ShingleFilter with the specified shingle size from the
TokenStream input
- ShingleFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Construct a ShingleFilter with default shingle size: 2.
- ShingleFilter(TokenStream, String) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
- Construct a ShingleFilter with the specified token type for shingle tokens
and the default shingle size: 2
- ShingleMatrixFilter - Class in org.apache.lucene.analysis.shingle
- Deprecated. Will be removed in Lucene 4.0. This filter is unmaintained and might not behave
correctly if used with custom Attributes, i.e. Attributes other than
the ones located in
org.apache.lucene.analysis.tokenattributes. It also uses
hardcoded payload encoders which makes it not easily adaptable to other use-cases. - ShingleMatrixFilter(ShingleMatrixFilter.Matrix, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter based on a user defined matrix.
- ShingleMatrixFilter(TokenStream, int, int) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter using the default
ShingleMatrixFilter.TokenSettingsCodec.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Creates a shingle filter with ad hoc parameter settings.
- ShingleMatrixFilter.Matrix - Class in org.apache.lucene.analysis.shingle
- Deprecated. A column focused matrix in three dimensions:
- ShingleMatrixFilter.Matrix() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
- Deprecated.
- ShingleMatrixFilter.Matrix.Column - Class in org.apache.lucene.analysis.shingle
- Deprecated.
- ShingleMatrixFilter.Matrix.Column(Token) -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- ShingleMatrixFilter.Matrix.Column() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row - Class in org.apache.lucene.analysis.shingle
- Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
- Deprecated.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. Using this codec makes a
ShingleMatrixFilter act like ShingleFilter. - ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
- Deprecated.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. A full featured codec not to be used for something serious.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
- Deprecated.
- ShingleMatrixFilter.TokenPositioner - Class in org.apache.lucene.analysis.shingle
- Deprecated. Used to describe how a
Token is to be inserted to a ShingleMatrixFilter.Matrix. - ShingleMatrixFilter.TokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. Strategy used to code and decode meta data of the tokens from the input stream
regarding how to position the tokens in the matrix, set and retreive weight, et c.
- ShingleMatrixFilter.TokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
- Deprecated.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
- Deprecated. A codec that creates a two dimensional matrix
by treating tokens from the input stream with 0 position increment
as new rows to the current column.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec() -
Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
- Deprecated.
- SingleTokenTokenStream - Class in org.apache.lucene.analysis.miscellaneous
- A
TokenStream containing a single token. - SingleTokenTokenStream(Token) -
Constructor for class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- size() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- size() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- slice_check() -
Method in class org.tartarus.snowball.SnowballProgram
-
- slice_del() -
Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(CharSequence) -
Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(String) -
Method in class org.tartarus.snowball.SnowballProgram
- Deprecated. for binary back compat. Will be removed in Lucene 4.0
- slice_from(StringBuilder) -
Method in class org.tartarus.snowball.SnowballProgram
- Deprecated. for binary back compat. Will be removed in Lucene 4.0
- slice_to(StringBuilder) -
Method in class org.tartarus.snowball.SnowballProgram
-
- SnowballAnalyzer - Class in org.apache.lucene.analysis.snowball
- Deprecated. Use the language-specific analyzer in contrib/analyzers instead.
This analyzer will be removed in Lucene 5.0
- SnowballAnalyzer(Version, String) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Builds the named analyzer with no stop words.
- SnowballAnalyzer(Version, String, String[]) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Use
SnowballAnalyzer.SnowballAnalyzer(Version, String, Set) instead.
- SnowballAnalyzer(Version, String, Set<?>) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Builds the named analyzer with the given stop words.
- SnowballFilter - Class in org.apache.lucene.analysis.snowball
- A filter that stems words using a Snowball-generated stemmer.
- SnowballFilter(TokenStream, SnowballProgram) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
- SnowballFilter(TokenStream, String) -
Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
- Construct the named stemming filter.
- SnowballProgram - Class in org.tartarus.snowball
- This is the rev 502 of the Snowball SVN trunk,
but modified:
made abstract and introduced abstract method stem to avoid expensive reflection in filter class.
- SnowballProgram() -
Constructor for class org.tartarus.snowball.SnowballProgram
-
- SolrSynonymParser - Class in org.apache.lucene.analysis.synonym
- Parser for the Solr synonyms format.
- SolrSynonymParser(boolean, boolean, Analyzer) -
Constructor for class org.apache.lucene.analysis.synonym.SolrSynonymParser
-
- SpanishAnalyzer - Class in org.apache.lucene.analysis.es
Analyzer for Spanish.- SpanishAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
- Builds an analyzer with the default stop words:
SpanishAnalyzer.DEFAULT_STOPWORD_FILE.
- SpanishAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
- Builds an analyzer with the given stop words.
- SpanishAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
- Builds an analyzer with the given stop words.
- SpanishLightStemFilter - Class in org.apache.lucene.analysis.es
- A
TokenFilter that applies SpanishLightStemmer to stem Spanish
words. - SpanishLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilter
-
- SpanishLightStemmer - Class in org.apache.lucene.analysis.es
- Light Stemmer for Spanish
- SpanishLightStemmer() -
Constructor for class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- SpanishStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- SpanishStemmer() -
Constructor for class org.tartarus.snowball.ext.SpanishStemmer
-
- START_OF_HEADING_MARKER -
Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- Example marker character: U+0001 (START OF HEADING)
- startElement(String, String, String, Attributes) -
Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- startOffset -
Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
-
- startsWith(char[], int, String) -
Static method in class org.apache.lucene.analysis.util.StemmerUtil
- Returns true if the character array starts with the suffix.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.ar.ArabicStemmer
- Stem an input buffer of Arabic text.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.bg.BulgarianStemmer
- Stem an input buffer of Bulgarian text.
- stem(String) -
Method in class org.apache.lucene.analysis.br.BrazilianStemmer
- Stems the given term to an unique discriminator.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.cz.CzechStemmer
- Stem an input buffer of Czech text.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.de.GermanLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.de.GermanMinimalStemmer
-
- stem(String) -
Method in class org.apache.lucene.analysis.de.GermanStemmer
- Stemms the given term to an unique discriminator.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.el.GreekStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.en.EnglishMinimalStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.fi.FinnishLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.fr.FrenchLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
-
- stem(String) -
Method in class org.apache.lucene.analysis.fr.FrenchStemmer
- Deprecated. Stems the given term to a unique discriminator.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.gl.GalicianStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.hi.HindiStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.hu.HungarianLightStemmer
-
- stem(String) -
Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
- Find the stem(s) of the provided word
- stem(char[], int) -
Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
- Find the stem(s) of the provided word
- stem(char[], int, boolean) -
Method in class org.apache.lucene.analysis.id.IndonesianStemmer
- Stem a term (returning its new length).
- stem(char[], int) -
Method in class org.apache.lucene.analysis.it.ItalianLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.lv.LatvianStemmer
- Stem a latvian word.
- stem(String) -
Method in class org.apache.lucene.analysis.nl.DutchStemmer
- Deprecated.
- stem(char[], int) -
Method in class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- stem(char[], int) -
Method in class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.ArmenianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.BasqueStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.CatalanStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.DanishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.DutchStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.EnglishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.FinnishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.FrenchStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.German2Stemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.GermanStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.HungarianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.ItalianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.KpStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.LovinsStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.NorwegianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.PorterStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.PortugueseStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.RomanianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.RussianStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.SpanishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.SwedishStemmer
-
- stem() -
Method in class org.tartarus.snowball.ext.TurkishStemmer
-
- stem() -
Method in class org.tartarus.snowball.SnowballProgram
-
- StemmerOverrideFilter - Class in org.apache.lucene.analysis.miscellaneous
- Provides the ability to override any
KeywordAttribute aware stemmer
with custom dictionary-based stemming. - StemmerOverrideFilter(Version, TokenStream, Map<?, String>) -
Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
- Create a new StemmerOverrideFilter, performing dictionary-based stemming
with the provided
dictionary.
- StemmerUtil - Class in org.apache.lucene.analysis.util
- Some commonly-used stemming functions
- StemmerUtil() -
Constructor for class org.apache.lucene.analysis.util.StemmerUtil
-
- stemPrefix(char[], int) -
Method in class org.apache.lucene.analysis.ar.ArabicStemmer
- Stem a prefix off an Arabic word.
- stemSuffix(char[], int) -
Method in class org.apache.lucene.analysis.ar.ArabicStemmer
- Stem suffix(es) off an Arabic word.
- STOP_WORDS -
Static variable in class org.apache.lucene.analysis.cjk.CJKAnalyzer
- Deprecated. use
CJKAnalyzer.getDefaultStopSet() instead
- STOP_WORDS -
Static variable in class org.apache.lucene.analysis.cn.ChineseFilter
- Deprecated.
- stoplist -
Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- This map stores hyphenation exceptions
- STOPWORDS_COMMENT -
Static variable in class org.apache.lucene.analysis.ar.ArabicAnalyzer
- Deprecated. use
WordlistLoader.getWordSet(Reader, String, Version) directly
- STOPWORDS_COMMENT -
Static variable in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
- Deprecated. use
WordlistLoader.getWordSet(Reader, String, Version) directly
- STOPWORDS_COMMENT -
Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
- The comment character in the stopwords file.
- strcmp(char[], int, char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- Compares 2 null terminated char arrays
- strcmp(String, char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- Compares a string with null terminated char array
- strcpy(char[], int, char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[], int) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[]) -
Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- SUB_HEADING -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- SUB_HEADING_ID -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- subSequence(int, int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- substring_i -
Variable in class org.tartarus.snowball.Among
-
- suffix -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- suffixes -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- suffixes -
Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- SUKUN -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- SwedishAnalyzer - Class in org.apache.lucene.analysis.sv
Analyzer for Swedish.- SwedishAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
- Builds an analyzer with the default stop words:
SwedishAnalyzer.DEFAULT_STOPWORD_FILE.
- SwedishAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
- Builds an analyzer with the given stop words.
- SwedishAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
- Builds an analyzer with the given stop words.
- SwedishLightStemFilter - Class in org.apache.lucene.analysis.sv
- A
TokenFilter that applies SwedishLightStemmer to stem Swedish
words. - SwedishLightStemFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilter
-
- SwedishLightStemmer - Class in org.apache.lucene.analysis.sv
- Light Stemmer for Swedish.
- SwedishLightStemmer() -
Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- SwedishStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- SwedishStemmer() -
Constructor for class org.tartarus.snowball.ext.SwedishStemmer
-
- SynonymFilter - Class in org.apache.lucene.analysis.synonym
- Matches single or multi word synonyms in a token stream.
- SynonymFilter(TokenStream, SynonymMap, boolean) -
Constructor for class org.apache.lucene.analysis.synonym.SynonymFilter
-
- SynonymMap - Class in org.apache.lucene.analysis.synonym
- A map of synonyms, keys and values are phrases.
- SynonymMap(FST<BytesRef>, BytesRefHash, int) -
Constructor for class org.apache.lucene.analysis.synonym.SynonymMap
-
- SynonymMap.Builder - Class in org.apache.lucene.analysis.synonym
- Builds an FSTSynonymMap.
- SynonymMap.Builder(boolean) -
Constructor for class org.apache.lucene.analysis.synonym.SynonymMap.Builder
- If dedup is true then identical rules (same input,
same output) will be added only once.
T
- TATWEEL -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- TEH -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- TEH_MARBUTA -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- TEH_MARBUTA -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- termAtt -
Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
- termAtt -
Variable in class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
-
- TernaryTree - Class in org.apache.lucene.analysis.compound.hyphenation
- Ternary Search Tree.
- TernaryTree.Iterator - Class in org.apache.lucene.analysis.compound.hyphenation
-
- TernaryTree.Iterator() -
Constructor for class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
-
- TestApp - Class in org.tartarus.snowball
-
- TestApp() -
Constructor for class org.tartarus.snowball.TestApp
-
- ThaiAnalyzer - Class in org.apache.lucene.analysis.th
Analyzer for Thai language.- ThaiAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.th.ThaiAnalyzer
-
- ThaiWordFilter - Class in org.apache.lucene.analysis.th
TokenFilter that use BreakIterator to break each
Token that is Thai into separate Token(s) for each Thai word.- ThaiWordFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.th.ThaiWordFilter
- Deprecated. Use the ctor with
matchVersion instead!
- ThaiWordFilter(Version, TokenStream) -
Constructor for class org.apache.lucene.analysis.th.ThaiWordFilter
- Creates a new ThaiWordFilter with the specified match version.
- toCharArray() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- TOKEN_SEPARATOR -
Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
- The default string to use when joining adjacent tokens to form a shingle
- TOKEN_TYPES -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- String token types that correspond to token type int constants
- TokenOffsetPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
- Adds the
Token.setStartOffset(int)
and Token.setEndOffset(int)
First 4 bytes are the start - TokenOffsetPayloadTokenFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilter
-
- TokenRangeSinkFilter - Class in org.apache.lucene.analysis.sinks
- Counts the tokens as they go by and saves to the internal list those between the range of lower and upper, exclusive of upper
- TokenRangeSinkFilter(int, int) -
Constructor for class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
-
- tokens -
Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
- TOKENS_ONLY -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Only output tokens
- tokenStream(String, String) -
Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
- Creates a token stream that tokenizes the given string into token terms
(aka words).
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
- Creates a token stream that tokenizes all the text in the given Reader;
This implementation forwards to
tokenStream(String, String) and is
less efficient than tokenStream(String, String).
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- tokenStream(String, Reader) -
Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
- Deprecated. Constructs a
StandardTokenizer filtered by a StandardFilter, a LowerCaseFilter, a StopFilter,
and a SnowballFilter
- TokenTypeSinkFilter - Class in org.apache.lucene.analysis.sinks
-
- TokenTypeSinkFilter(String) -
Constructor for class org.apache.lucene.analysis.sinks.TokenTypeSinkFilter
-
- toString() -
Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- toString() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
- Deprecated.
- toString() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
- Deprecated.
- toString() -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
- Deprecated.
- toString() -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- trimToSize() -
Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
- trimToSize() -
Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
- trimToSize() -
Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- Each node stores a character (splitchar) which is part of some key(s).
- TurkishAnalyzer - Class in org.apache.lucene.analysis.tr
Analyzer for Turkish.- TurkishAnalyzer(Version) -
Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
- Builds an analyzer with the default stop words:
TurkishAnalyzer.DEFAULT_STOPWORD_FILE.
- TurkishAnalyzer(Version, Set<?>) -
Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
- Builds an analyzer with the given stop words.
- TurkishAnalyzer(Version, Set<?>, Set<?>) -
Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
- Builds an analyzer with the given stop words.
- TurkishLowerCaseFilter - Class in org.apache.lucene.analysis.tr
- Normalizes Turkish token text to lower case.
- TurkishLowerCaseFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.tr.TurkishLowerCaseFilter
- Create a new TurkishLowerCaseFilter, that normalizes Turkish token text
to lower case.
- TurkishStemmer - Class in org.tartarus.snowball.ext
- Generated class implementing code defined by a snowball script.
- TurkishStemmer() -
Constructor for class org.tartarus.snowball.ext.TurkishStemmer
-
- txt -
Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
-
- TYPE_SYNONYM -
Static variable in class org.apache.lucene.analysis.synonym.SynonymFilter
-
- TypeAsPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
- Makes the
Token.type() a payload. - TypeAsPayloadTokenFilter(TokenStream) -
Constructor for class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilter
-
U
- uniqueStems(char[], int) -
Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
- Find the unique stem(s) of the provided word
- unpackValues(int) -
Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
- unsafeWrite(char) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- unsafeWrite(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- unsafeWrite(char[], int, int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- UNTOKENIZED_ONLY -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Only output untokenized tokens, which are tokens that would normally be split into several tokens
- UNTOKENIZED_TOKEN_FLAG -
Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- This flag is used to indicate that the produced "Token" would, if
WikipediaTokenizer.TOKENS_ONLY was used, produce multiple tokens.
- updateInputToken(Token, Token) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- updateSuffixToken(Token, Token) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- updateSuffixToken(Token, Token) -
Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- The default implementation adds last prefix token end offset to the suffix token start and end offsets.
- updateToken(Token, List<Token>, int, List<ShingleMatrixFilter.Matrix.Column.Row>, List<Token>) -
Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
- Deprecated. Final touch of a shingle token before it is passed on to the consumer from method
ShingleMatrixFilter.incrementToken().
V
- valueOf(String) -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
- Returns the enum constant of this type with the specified name.
- valueOf(String) -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
- Returns the enum constant of this type with the specified name.
- values() -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
- Returns an array containing the constants of this enum type, in
the order they are declared.
- values() -
Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
- Returns an array containing the constants of this enum type, in
the order they are declared.
- vspace -
Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- value space: stores the interletter values
W
- WAW -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- WHITESPACE_PATTERN -
Static variable in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
"\\s+"; Divides text at whitespaces (Character.isWhitespace(c))
- WikipediaTokenizer - Class in org.apache.lucene.analysis.wikipedia
- Extension of StandardTokenizer that is aware of Wikipedia syntax.
- WikipediaTokenizer(Reader) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the
WikipediaTokenizer.
- WikipediaTokenizer(Reader, int, Set<String>) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the
WikipediaTokenizer.
- WikipediaTokenizer(AttributeSource.AttributeFactory, Reader, int, Set<String>) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the
WikipediaTokenizer.
- WikipediaTokenizer(AttributeSource, Reader, int, Set<String>) -
Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Creates a new instance of the
WikipediaTokenizer.
- WORD_SEPARATOR -
Static variable in class org.apache.lucene.analysis.synonym.SynonymMap
- for multiword support, you must separate words with this separator
- WordnetSynonymParser - Class in org.apache.lucene.analysis.synonym
- Parser for wordnet prolog format
- WordnetSynonymParser(boolean, boolean, Analyzer) -
Constructor for class org.apache.lucene.analysis.synonym.WordnetSynonymParser
-
- words -
Variable in class org.apache.lucene.analysis.synonym.SynonymMap
- map
- write(char) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(char[]) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(char[], int, int) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(OpenStringBuilder) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- write(String) -
Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
Y
- YEH -
Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- YEH -
Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- YEH -
Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
-
- YEH_BARREE -
Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
-
A B C D E F G H I J K L M N O P Q R S T U V W Y
Copyright © 2000-2011 Apache Software Foundation. All Rights Reserved.