Implementations of the SinkTokenizer that might be useful.
- packValues(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Packs the values by storing each one in 4 bits, two values per byte.
The value range is 0 to 9.
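As an illustration of the packing idea only, here is a minimal sketch that stores two decimal digits per byte, the first in the high nibble. The class and method names are hypothetical, and the real packValues may differ in details such as value offsets or terminators.

```java
// Hypothetical illustration of nibble packing; not the HyphenationTree code.
final class NibblePackingSketch {

    // Packs a string of digits '0'..'9' into bytes, two digits per byte.
    static byte[] pack(String digits) {
        int n = digits.length();
        byte[] out = new byte[(n + 1) / 2];
        for (int i = 0; i < n; i++) {
            int v = digits.charAt(i) - '0';
            if (v < 0 || v > 9) {
                throw new IllegalArgumentException("not a digit: " + digits.charAt(i));
            }
            if ((i & 1) == 0) {
                out[i >> 1] = (byte) (v << 4);   // even index: high nibble
            } else {
                out[i >> 1] |= (byte) v;         // odd index: low nibble
            }
        }
        return out;
    }

    // Recovers the first 'count' digits from a packed array.
    static String unpack(byte[] packed, int count) {
        StringBuilder sb = new StringBuilder(count);
        for (int i = 0; i < count; i++) {
            int b = packed[i >> 1] & 0xFF;
            int v = (i & 1) == 0 ? (b >> 4) : (b & 0x0F);
            sb.append((char) ('0' + v));
        }
        return sb.toString();
    }
}
```

Because a nibble holds 0 to 15, the 0 to 9 range leaves spare codes; this sketch does not use them.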
- parse(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(File) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(InputSource) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(Class<? extends RSLPStemmerBase>, String) - Static method in class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
Parse a resource file into an RSLP stemmer description.
- PathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for path-like hierarchies.
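The idea of tokenizing a path-like hierarchy can be sketched in plain Java: each token is the prefix of the input that ends just before a delimiter, followed by the full input. The class below is a hypothetical stand-in that works on a String, not the Lucene tokenizer itself, and it ignores options such as skip counts and delimiter replacement.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; not the PathHierarchyTokenizer implementation.
final class PathPrefixSketch {

    // Returns every ancestor prefix of the path, shortest first.
    static List<String> tokenize(String path, char delimiter) {
        List<String> tokens = new ArrayList<>();
        // Start at 1 so a leading delimiter stays attached to the first token.
        for (int i = 1; i < path.length(); i++) {
            if (path.charAt(i) == delimiter) {
                tokens.add(path.substring(0, i));
            }
        }
        if (!path.isEmpty()) {
            tokens.add(path);
        }
        return tokens;
    }
}
```

With this sketch, "/usr/local/lib" and delimiter '/' yield "/usr", "/usr/local", and "/usr/local/lib".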
- PathHierarchyTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PatternAnalyzer - Class in org.apache.lucene.analysis.miscellaneous
-
Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a
Reader, that can flexibly separate text into terms via a regular expression Pattern
(with behaviour identical to String.split(String)), and that combines the functionality of
LetterTokenizer, LowerCaseTokenizer, WhitespaceTokenizer, StopFilter into a single efficient
multi-purpose class.
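The combination described here (regex splitting, optional lowercasing, stop word removal) can be approximated with the standard library alone. The sketch below is a hypothetical helper, not the PatternAnalyzer API; it assumes that empty fragments produced by the split are dropped.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Pattern;

// Hypothetical illustration; not the PatternAnalyzer implementation.
final class PatternSplitSketch {

    // Splits text on the separator pattern, then lowercases and filters stop words.
    static List<String> analyze(String text, Pattern separator,
                                boolean toLowerCase, Set<String> stopWords) {
        List<String> terms = new ArrayList<>();
        for (String term : separator.split(text)) {
            if (term.isEmpty()) {
                continue;                       // assumption: skip empty fragments
            }
            if (toLowerCase) {
                term = term.toLowerCase(Locale.ROOT);
            }
            if (!stopWords.contains(term)) {
                terms.add(term);
            }
        }
        return terms;
    }
}
```

Lowercasing happens before the stop word check here, so the stop set is expected to hold lowercase entries when toLowerCase is true.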
- PatternAnalyzer(Version, Pattern, boolean, Set<?>) - Constructor for class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
Constructs a new instance with the given parameters.
- PatternConsumer - Interface in org.apache.lucene.analysis.compound.hyphenation
-
This interface is used to connect the XML pattern file parser to the
hyphenation tree.
- PatternParser - Class in org.apache.lucene.analysis.compound.hyphenation
-
A SAX document handler to read and parse hyphenation patterns from an XML
file.
- PatternParser() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PatternParser(PatternConsumer) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PayloadEncoder - Interface in org.apache.lucene.analysis.payloads
-
Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to Payload.
- PayloadHelper - Class in org.apache.lucene.analysis.payloads
-
Utility methods for encoding payloads.
- PayloadHelper() - Constructor for class org.apache.lucene.analysis.payloads.PayloadHelper
-
- permutationIterator() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- PersianAnalyzer - Class in org.apache.lucene.analysis.fa
-
Analyzer for Persian.
- PersianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Builds an analyzer with the given stop words
- PersianAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianAnalyzer(Version, Hashtable<?, ?>) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianCharFilter - Class in org.apache.lucene.analysis.fa
-
CharFilter that replaces instances of Zero-width non-joiner with an
ordinary space.
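On a plain String, the replacement amounts to the following sketch. The class name is hypothetical, and the real filter works on a character stream rather than a String. The zero-width non-joiner is U+200C.

```java
// Hypothetical illustration; not the PersianCharFilter implementation.
final class ZwnjToSpaceSketch {

    static final char ZWNJ = '\u200C';  // ZERO WIDTH NON-JOINER

    // Replaces every zero-width non-joiner with an ordinary space.
    static String replaceZwnj(String text) {
        return text.replace(ZWNJ, ' ');
    }
}
```

The replacement is one character for one character, so the text length does not change.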
- PersianCharFilter(CharStream) - Constructor for class org.apache.lucene.analysis.fa.PersianCharFilter
-
- PersianNormalizationFilter - Class in org.apache.lucene.analysis.fa
-
- PersianNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilter
-
- PersianNormalizer - Class in org.apache.lucene.analysis.fa
-
Normalizer for Persian.
- PersianNormalizer() - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizer
-
- PorterStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- PorterStemmer() - Constructor for class org.tartarus.snowball.ext.PorterStemmer
-
- PortugueseAnalyzer - Class in org.apache.lucene.analysis.pt
-
Analyzer for Portuguese.
- PortugueseAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
- PortugueseAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseLightStemFilter - Class in org.apache.lucene.analysis.pt
-
- PortugueseLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
-
- PortugueseLightStemmer - Class in org.apache.lucene.analysis.pt
-
Light Stemmer for Portuguese
This stemmer implements the "UniNE" algorithm in:
Light Stemming Approaches for the French, Portuguese, German and Hungarian Languages
Jacques Savoy
- PortugueseLightStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- PortugueseMinimalStemFilter - Class in org.apache.lucene.analysis.pt
-
- PortugueseMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
-
- PortugueseMinimalStemmer - Class in org.apache.lucene.analysis.pt
-
Minimal Stemmer for Portuguese
This follows the "RSLP-S" algorithm presented in:
A study on the Use of Stemming for Monolingual Ad-Hoc Portuguese
Information Retrieval (Orengo, et al)
which is just the plural reduction step of the RSLP
algorithm from A Stemming Algorithm for the Portuguese Language,
Orengo et al.
- PortugueseMinimalStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- PortugueseStemFilter - Class in org.apache.lucene.analysis.pt
-
- PortugueseStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilter
-
- PortugueseStemmer - Class in org.apache.lucene.analysis.pt
-
Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa)
algorithm.
- PortugueseStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- PortugueseStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- PortugueseStemmer() - Constructor for class org.tartarus.snowball.ext.PortugueseStemmer
-
- PositionFilter - Class in org.apache.lucene.analysis.position
-
Sets the positionIncrement of all tokens to the given "positionIncrement",
except the first returned token, which retains its original positionIncrement value.
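The rule can be shown on a bare array of position increments, one per token. This is a hypothetical helper for illustration; the real filter operates on a TokenStream.

```java
// Hypothetical illustration; not the PositionFilter implementation.
final class PositionIncrementSketch {

    // Keeps the first increment and overwrites all later ones.
    static int[] apply(int[] increments, int positionIncrement) {
        int[] out = increments.clone();
        for (int i = 1; i < out.length; i++) {
            out[i] = positionIncrement;
        }
        return out;
    }
}
```

With an increment of zero, every token after the first is placed at the same position as the first one.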
- PositionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.position.PositionFilter
-
Constructs a PositionFilter that assigns a position increment of zero to
all but the first token from the given input stream.
- PositionFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.position.PositionFilter
-
Constructs a PositionFilter that assigns the given position increment to
all but the first token from the given input stream.
- postBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- preBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- PrefixAndSuffixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
- PrefixAndSuffixAwareTokenFilter(TokenStream, TokenStream, TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- PrefixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Joins two token streams and leaves the last token of the first stream available
to be used when updating the token values in the second stream based on that token.
- PrefixAwareTokenFilter(TokenStream, TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- prefixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- previous() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
- printStats() - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
- printStats() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- PUA_EC00_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+EC00 (PRIVATE USE AREA: EC00)
- put(int, byte) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
- put(int, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
- read() - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
-
- read(char[], int, int) - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
-
- read(char[], int, int) - Method in class org.apache.lucene.analysis.fa.PersianCharFilter
-
- readToken(StringBuffer) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- replace(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- replace_s(int, int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
-
- replace_s(int, int, String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.
for binary back compat. Will be removed in Lucene 4.0
- replacement - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- reserve(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() - Method in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
- reset() - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- reset(Reader) - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- reset(Reader) - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
- reset() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilter
-
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- reset() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.position.PositionFilter
-
- reset() - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
- reset() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
-
- reset() - Method in class org.apache.lucene.analysis.synonym.SynonymFilter
-
- reset() - Method in class org.apache.lucene.analysis.th.ThaiWordFilter
-
- reset() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- reset(Reader) - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- resize(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- resolveEntity(String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- result - Variable in class org.tartarus.snowball.Among
-
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.
Returns a (possibly reused) StandardTokenizer filtered by a StandardFilter,
a LowerCaseFilter, a StopFilter, and a SnowballFilter
- reverse(String) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, String) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input string
- reverse(char[]) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, char[]) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input buffer in-place
- reverse(char[], int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, char[], int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from offset 0
up to the given length.
- reverse(char[], int, int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, char[], int, int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from the given offset
up to the given length.
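A partial in-place reversal of this kind can be sketched with a two-pointer swap. The names below are hypothetical. The sketch assumes the last argument is a character count starting at the offset, and it swaps individual chars without any special handling of surrogate pairs.

```java
// Hypothetical illustration; not the ReverseStringFilter implementation.
final class ReverseSketch {

    // Reverses 'len' chars of the buffer in place, starting at 'offset'.
    static void reverse(char[] buffer, int offset, int len) {
        for (int i = offset, j = offset + len - 1; i < j; i++, j--) {
            char tmp = buffer[i];
            buffer[i] = buffer[j];
            buffer[j] = tmp;
        }
    }

    // Convenience wrapper that reverses a whole string.
    static String reverse(String input) {
        char[] chars = input.toCharArray();
        reverse(chars, 0, chars.length);
        return new String(chars);
    }
}
```

For example, reversing "country" with this sketch gives "yrtnuoc".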
- ReversePathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for domain-like hierarchies.
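For a domain-like input the hierarchy runs from right to left, so each token is a suffix of the input that starts just after a delimiter. The sketch below is a hypothetical String-based stand-in, not the tokenizer itself. The delimiter is passed explicitly, and the output order is only illustrative.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; not the ReversePathHierarchyTokenizer implementation.
final class DomainSuffixSketch {

    // Returns the full input followed by each shorter suffix.
    static List<String> tokenize(String input, char delimiter) {
        List<String> tokens = new ArrayList<>();
        if (input.isEmpty()) {
            return tokens;
        }
        tokens.add(input);
        // Stop before the last char so a trailing delimiter adds no empty token.
        for (int i = 0; i < input.length() - 1; i++) {
            if (input.charAt(i) == delimiter) {
                tokens.add(input.substring(i + 1));
            }
        }
        return tokens;
    }
}
```

With this sketch, "www.site.co.uk" and delimiter '.' yield "www.site.co.uk", "site.co.uk", "co.uk", and "uk".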
- ReversePathHierarchyTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReverseStringFilter - Class in org.apache.lucene.analysis.reverse
-
Reverse token string, for example "country" => "yrtnuoc".
- ReverseStringFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- ReverseStringFilter(TokenStream, char) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- ReverseStringFilter(Version, TokenStream) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses all tokens in the
supplied TokenStream.
- ReverseStringFilter(Version, TokenStream, char) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses and marks all tokens in the
supplied TokenStream.
- rewind() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
-
- RomanianAnalyzer - Class in org.apache.lucene.analysis.ro
-
Analyzer for Romanian.
- RomanianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
- RomanianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- RomanianStemmer() - Constructor for class org.tartarus.snowball.ext.RomanianStemmer
-
- root - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- RSLPStemmerBase - Class in org.apache.lucene.analysis.pt
-
Base class for stemmers that use a set of RSLP-like stemming steps.
- RSLPStemmerBase() - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
- RSLPStemmerBase.Rule - Class in org.apache.lucene.analysis.pt
-
A basic rule, with no exceptions.
- RSLPStemmerBase.Rule(String, int, String) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
Create a rule.
- RSLPStemmerBase.RuleWithSetExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of whole-word exceptions.
- RSLPStemmerBase.RuleWithSetExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
-
- RSLPStemmerBase.RuleWithSuffixExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of exceptional suffixes.
- RSLPStemmerBase.RuleWithSuffixExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
-
- RSLPStemmerBase.Step - Class in org.apache.lucene.analysis.pt
-
A step containing a list of rules.
- RSLPStemmerBase.Step(String, RSLPStemmerBase.Rule[], int, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
Create a new step
- RTL_DIRECTION_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+200F (RIGHT-TO-LEFT MARK)
- rules - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- RussianAnalyzer - Class in org.apache.lucene.analysis.ru
-
Analyzer for Russian language.
- RussianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianLetterTokenizer - Class in org.apache.lucene.analysis.ru
-
Deprecated.
Use StandardTokenizer instead, which has the same functionality.
This filter will be removed in Lucene 5.0
- RussianLetterTokenizer(Version, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.
Construct a new RussianLetterTokenizer.
Parameter matchVersion: Lucene version to match. See above.
- RussianLetterTokenizer(Version, AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.
Construct a new RussianLetterTokenizer using a given AttributeSource.
- RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory.
Parameter matchVersion: Lucene version to match. See above.
- RussianLetterTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
- RussianLetterTokenizer(AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
- RussianLetterTokenizer(AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
- RussianLightStemFilter - Class in org.apache.lucene.analysis.ru
-
- RussianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilter
-
- RussianLightStemmer - Class in org.apache.lucene.analysis.ru
-
Light Stemmer for Russian.
- RussianLightStemmer() - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- RussianLowerCaseFilter - Class in org.apache.lucene.analysis.ru
-
Deprecated.
Use LowerCaseFilter instead, which has the same
functionality. This filter will be removed in Lucene 4.0
- RussianLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianLowerCaseFilter
-
Deprecated.
- RussianStemFilter - Class in org.apache.lucene.analysis.ru
-
- RussianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianStemFilter
-
Deprecated.
- RussianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- RussianStemmer() - Constructor for class org.tartarus.snowball.ext.RussianStemmer
-
- s - Variable in class org.tartarus.snowball.Among
-
- s_size - Variable in class org.tartarus.snowball.Among
-
- sameRow - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
-
Deprecated.
- sc - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
The character stored in this node: splitchar.
- searchPatterns(char[], int, byte[]) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Search for all possible partial matches of word starting at index and update
interletter values.
- set(char[], int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setAppend(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the append defined for the affix
- setAppendFlags(char[]) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the flags defined for the affix append
- setArticles(Version, Set<?>) - Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
- setArticles(Set<?>) - Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
- setCharAt(int, char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setCondition(String, String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the condition that must be met before the affix can be applied
- setConsumer(PatternConsumer) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- setCrossProduct(boolean) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets whether the affix is defined as cross product
- setCurrent(String) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setCurrent(char[], int) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setExclusionSet(Set<?>) - Method in class org.apache.lucene.analysis.de.GermanStemFilter
-
Deprecated.
use KeywordAttribute with KeywordMarkerFilter instead.
- setExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.
use KeywordAttribute with KeywordMarkerFilter instead.
- setExclusionTable(HashSet<?>) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
use KeywordAttribute with KeywordMarkerFilter instead.
- setFirst(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- setFlag(char) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the affix flag
- setIgnoringSinglePrefixOrSuffixShingle(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setIndex(int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
- setLast(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- setLength(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setMatrix(ShingleMatrixFilter.Matrix) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMaximumShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMaxShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting maxShingleSize after Analyzer instantiation prevents reuse.
Configure maxShingleSize during construction.
- setMaxShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the max shingle size (default: 2)
- setMinimumShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMinShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting minShingleSize after Analyzer instantiation prevents reuse.
Configure minShingleSize during construction.
- setMinShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the min shingle size (default: 2).
- setOutputUnigrams(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting outputUnigrams after Analyzer instantiation prevents reuse.
Configure outputUnigrams during construction.
- setOutputUnigrams(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall the output stream contain the input tokens (unigrams) as well as
shingles?
- setOutputUnigramsIfNoShingles(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting outputUnigramsIfNoShingles after Analyzer instantiation prevents reuse.
Configure outputUnigramsIfNoShingles during construction.
- setOutputUnigramsIfNoShingles(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall we override the behavior of outputUnigrams==false for those
times when no shingles are available (because there are fewer than
minShingleSize tokens in the input stream)?
- setPrefix(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setSpacerCharacter(Character) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setStemDictionary(File) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.
This prevents reuse of TokenStreams. If you wish to use a custom
stem dictionary, create your own Analyzer with StemmerOverrideFilter
- setStemDictionary(HashMap<?, ?>) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
Set dictionary for stemming. This dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
- setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
- setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
- setStemExclusionTable(String[]) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
- setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
- setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
- setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
- setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
- setStemExclusionTable(HashSet<?>) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
- setStemmer(GermanStemmer) - Method in class org.apache.lucene.analysis.de.GermanStemFilter
-
- setStemmer(FrenchStemmer) - Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.
- setStemmer(DutchStemmer) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
- setStemmer(RussianStemmer) - Method in class org.apache.lucene.analysis.ru.RussianStemFilter
-
Deprecated.
Set an alternative/custom RussianStemmer for this filter.
- setStrip(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the stripping characters defined for the affix
- setSuffix(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setText(char[], int, int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Set a new region of text to be examined by this iterator
- setToken(Token) - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
Sets the TokenPositioner as token flags int value.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- setTokens(List<Token>) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- setTokenSeparator(String) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting tokenSeparator after Analyzer instantiation prevents reuse.
Configure tokenSeparator during construction.
- setTokenSeparator(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Sets the string to use when joining adjacent tokens to form a shingle
- setTokenType(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the type of the shingle tokens produced by this filter.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
Stores a 32-bit float in the payload, or sets it to null if the weight is 1f.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.
Have this method do nothing in order to 'disable' weights.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- SHADDA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- ShingleAnalyzerWrapper - Class in org.apache.lucene.analysis.shingle
-
A ShingleAnalyzerWrapper wraps a
ShingleFilter around another
Analyzer.
- ShingleAnalyzerWrapper(Analyzer) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int, int, String, boolean, boolean) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Creates a new ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Version) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps StandardAnalyzer.
- ShingleAnalyzerWrapper(Version, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps StandardAnalyzer.
- ShingleFilter - Class in org.apache.lucene.analysis.shingle
-
A ShingleFilter constructs shingles (token n-grams) from a token stream.
- ShingleFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the
TokenStream input
- ShingleFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the
TokenStream input
- ShingleFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with default shingle size: 2.
- ShingleFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with the specified token type for shingle tokens
and the default shingle size: 2
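As an illustration of what "shingles (token n-grams)" means, here is a minimal sketch in plain Java. It does not use Lucene: the class `ShingleSketch` and its `shingles` method are hypothetical names invented for this example, and the input is assumed to be already tokenized. The real filter works on a TokenStream and handles details (such as token attributes and whether single tokens are also emitted) that are not modelled here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Lucene: builds token n-grams ("shingles")
// from an already tokenized input.
class ShingleSketch {

    // Returns every run of minSize..maxSize adjacent tokens, joined by the
    // separator, ordered by starting position and then by size.
    static List<String> shingles(List<String> tokens, int minSize, int maxSize, String separator) {
        if (minSize < 1 || maxSize < minSize) {
            throw new IllegalArgumentException("need 1 <= minSize <= maxSize");
        }
        List<String> out = new ArrayList<>();
        for (int start = 0; start < tokens.size(); start++) {
            for (int size = minSize; size <= maxSize && start + size <= tokens.size(); size++) {
                out.add(String.join(separator, tokens.subList(start, start + size)));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("please", "divide", "this", "sentence");
        // Shingle size 2 with a single space as the token separator.
        System.out.println(shingles(tokens, 2, 2, " "));
    }
}
```

With a shingle size of 2, each output entry is a pair of adjacent tokens joined by the separator.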
- ShingleMatrixFilter - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
Will be removed in Lucene 4.0. This filter is unmaintained and might not behave
correctly if used with custom Attributes, i.e. Attributes other than
the ones located in org.apache.lucene.analysis.tokenattributes. It also uses
hardcoded payload encoders which makes it not easily adaptable to other use-cases.
- ShingleMatrixFilter(ShingleMatrixFilter.Matrix, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter based on a user defined matrix.
- ShingleMatrixFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter with ad hoc parameter settings.
- ShingleMatrixFilter.Matrix - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
A column focused matrix in three dimensions:
Token[column][row][z-axis] {
{{hello}, {greetings, and, salutations}},
{{world}, {earth}, {tellus}}
};
TODO: consider row groups
to indicate that shingles are only to contain permutations with texts in that same row group.
- ShingleMatrixFilter.Matrix() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column(Token) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
A full featured codec not to be used for something serious.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
- ShingleMatrixFilter.TokenPositioner - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.TokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
Strategy used to code and decode meta data of the tokens from the input stream
regarding how to position the tokens in the matrix, set and retrieve weight, etc.
- ShingleMatrixFilter.TokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
A codec that creates a two dimensional matrix
by treating tokens from the input stream with 0 position increment
as new rows to the current column.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- SINGLE_TYPE - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
When we emit a unigram, it is then marked as this type.
- SingleTokenTokenStream - Class in org.apache.lucene.analysis.miscellaneous
-
A TokenStream containing a single token.
- SingleTokenTokenStream(Token) - Constructor for class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- size() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- size() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- slice_check() - Method in class org.tartarus.snowball.SnowballProgram
-
- slice_del() - Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.
for binary back compat. Will be removed in Lucene 4.0
- slice_from(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.
for binary back compat. Will be removed in Lucene 4.0
- slice_to(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
- SnowballAnalyzer - Class in org.apache.lucene.analysis.snowball
-
Deprecated.
Use the language-specific analyzer in contrib/analyzers instead.
This analyzer will be removed in Lucene 5.0
- SnowballAnalyzer(Version, String) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.
Builds the named analyzer with no stop words.
- SnowballAnalyzer(Version, String, String[]) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
- SnowballAnalyzer(Version, String, Set<?>) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.
Builds the named analyzer with the given stop words.
- SnowballFilter - Class in org.apache.lucene.analysis.snowball
-
A filter that stems words using a Snowball-generated stemmer.
- SnowballFilter(TokenStream, SnowballProgram) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
- SnowballFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
Construct the named stemming filter.
- SnowballProgram - Class in org.tartarus.snowball
-
This is the rev 502 of the Snowball SVN trunk,
but modified:
made abstract and introduced abstract method stem to avoid expensive reflection in filter class.
- SnowballProgram() - Constructor for class org.tartarus.snowball.SnowballProgram
-
- SolrSynonymParser - Class in org.apache.lucene.analysis.synonym
-
Parser for the Solr synonyms format.
- SolrSynonymParser(boolean, boolean, Analyzer) - Constructor for class org.apache.lucene.analysis.synonym.SolrSynonymParser
-
- SpanishAnalyzer - Class in org.apache.lucene.analysis.es
-
Analyzer for Spanish.
- SpanishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
- SpanishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishLightStemFilter - Class in org.apache.lucene.analysis.es
-
- SpanishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilter
-
- SpanishLightStemmer - Class in org.apache.lucene.analysis.es
-
Light Stemmer for Spanish
This stemmer implements the algorithm described in:
Report on CLEF-2001 Experiments
Jacques Savoy
- SpanishLightStemmer() - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- SpanishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- SpanishStemmer() - Constructor for class org.tartarus.snowball.ext.SpanishStemmer
-
- START_OF_HEADING_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+0001 (START OF HEADING)
- startElement(String, String, String, Attributes) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- startOffset - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
-
- startsWith(char[], int, String) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array starts with the prefix.
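A check of this kind, on a char buffer with an explicit valid length, can be sketched as follows. This is an illustrative stand-alone version, not the Lucene source: the class name `PrefixSketch` is hypothetical, and the assumption is that only the first `len` characters of the buffer are meaningful.

```java
// Hypothetical stand-alone sketch, not the Lucene implementation.
class PrefixSketch {

    // Returns true if the first len chars of s begin with the given string.
    // Characters at index >= len are treated as unused buffer space.
    static boolean startsWith(char[] s, int len, String prefix) {
        int prefixLen = prefix.length();
        if (prefixLen > len) {
            return false; // the candidate is longer than the valid text
        }
        for (int i = 0; i < prefixLen; i++) {
            if (s[i] != prefix.charAt(i)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        char[] buffer = "walking".toCharArray();
        System.out.println(startsWith(buffer, buffer.length, "walk"));
    }
}
```

Passing the length separately lets a stemmer shorten a term by returning a smaller length while reusing the same buffer.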
- stem(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem an input buffer of Arabic text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.bg.BulgarianStemmer
-
Stem an input buffer of Bulgarian text.
- stem(String) - Method in class org.apache.lucene.analysis.br.BrazilianStemmer
-
Stems the given term to a unique discriminator.
- stem(char[], int) - Method in class org.apache.lucene.analysis.cz.CzechStemmer
-
Stem an input buffer of Czech text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanMinimalStemmer
-
- stem(String) - Method in class org.apache.lucene.analysis.de.GermanStemmer
-
Stems the given term to a unique discriminator.
- stem(char[], int) - Method in class org.apache.lucene.analysis.el.GreekStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.en.EnglishMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.fi.FinnishLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
-
- stem(String) - Method in class org.apache.lucene.analysis.fr.FrenchStemmer
-
Deprecated.
Stems the given term to a unique discriminator.
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.hi.HindiStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.hu.HungarianLightStemmer
-
- stem(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Find the stem(s) of the provided word
- stem(char[], int) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Find the stem(s) of the provided word
- stem(char[], int, boolean) - Method in class org.apache.lucene.analysis.id.IndonesianStemmer
-
Stem a term (returning its new length).
- stem(char[], int) - Method in class org.apache.lucene.analysis.it.ItalianLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.lv.LatvianStemmer
-
Stem a Latvian word. Returns the new adjusted length.
- stem(String) - Method in class org.apache.lucene.analysis.nl.DutchStemmer
-
Deprecated.
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.ArmenianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.BasqueStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.CatalanStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.DanishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.DutchStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.EnglishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.FinnishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.FrenchStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.German2Stemmer
-
- stem() - Method in class org.tartarus.snowball.ext.GermanStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.HungarianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.IrishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.ItalianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.KpStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.LovinsStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.NorwegianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.PorterStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.PortugueseStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.RomanianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.RussianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.SpanishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.SwedishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.TurkishStemmer
-
- stem() - Method in class org.tartarus.snowball.SnowballProgram
-
- StemmerOverrideFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Provides the ability to override any KeywordAttribute aware stemmer
with custom dictionary-based stemming.
- StemmerOverrideFilter(Version, TokenStream, Map<?, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
-
Create a new StemmerOverrideFilter, performing dictionary-based stemming
with the provided dictionary.
- StemmerUtil - Class in org.apache.lucene.analysis.util
-
Some commonly-used stemming functions
- StemmerUtil() - Constructor for class org.apache.lucene.analysis.util.StemmerUtil
-
- stemPrefix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem a prefix off an Arabic word.
- stemSuffix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem suffix(es) off an Arabic word.
- STOP_WORDS - Static variable in class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
- STOP_WORDS - Static variable in class org.apache.lucene.analysis.cn.ChineseFilter
-
Deprecated.
- stoplist - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
This map stores hyphenation exceptions
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Deprecated.
use WordlistLoader.getWordSet(Reader, String, Version) directly
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Deprecated.
use WordlistLoader.getWordSet(Reader, String, Version) directly
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
The comment character in the stopwords file.
- strcmp(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares two null-terminated char arrays.
- strcmp(String, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares a string with a null-terminated char array.
- strcpy(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[]) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
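The C-style helpers listed above operate on char arrays in which a `'\0'` character marks the end of the text. A minimal sketch of that convention follows. It is not the TernaryTree source: the class name `NullTerminatedSketch` is hypothetical, and treating the physical end of the array as an implicit terminator is an assumption made here for safety.

```java
// Hypothetical stand-alone sketch of C-style string helpers on char arrays.
class NullTerminatedSketch {

    // Number of chars from start up to, but not including, the first '\0'.
    // Assumption: the end of the array also terminates the string.
    static int strlen(char[] a, int start) {
        int len = 0;
        for (int i = start; i < a.length && a[i] != 0; i++) {
            len++;
        }
        return len;
    }

    // Negative, zero, or positive result, following the C strcmp convention.
    static int strcmp(char[] a, int startA, char[] b, int startB) {
        int i = startA;
        int j = startB;
        while (i < a.length && j < b.length && a[i] == b[j]) {
            if (a[i] == 0) {
                return 0; // both strings ended at the same point
            }
            i++;
            j++;
        }
        char ca = i < a.length ? a[i] : 0;
        char cb = j < b.length ? b[j] : 0;
        return ca - cb;
    }

    public static void main(String[] args) {
        char[] x = {'a', 'b', 0, 'z'};
        char[] y = {'a', 'c', 0};
        System.out.println(strlen(x, 0));
        System.out.println(strcmp(x, 0, y, 0) < 0);
    }
}
```

The start offsets make it possible to keep many strings in one shared array and address each by its starting index.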
- SUB_HEADING - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- SUB_HEADING_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- subSequence(int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- substring_i - Variable in class org.tartarus.snowball.Among
-
- suffix - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- suffixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- suffixes - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- SUKUN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- SwedishAnalyzer - Class in org.apache.lucene.analysis.sv
-
Analyzer for Swedish.
- SwedishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
- SwedishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishLightStemFilter - Class in org.apache.lucene.analysis.sv
-
- SwedishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilter
-
- SwedishLightStemmer - Class in org.apache.lucene.analysis.sv
-
Light Stemmer for Swedish.
- SwedishLightStemmer() - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- SwedishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- SwedishStemmer() - Constructor for class org.tartarus.snowball.ext.SwedishStemmer
-
- SynonymFilter - Class in org.apache.lucene.analysis.synonym
-
Matches single or multi word synonyms in a token stream.
- SynonymFilter(TokenStream, SynonymMap, boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymFilter
-
- SynonymMap - Class in org.apache.lucene.analysis.synonym
-
A map of synonyms; keys and values are phrases.
- SynonymMap(FST<BytesRef>, BytesRefHash, int) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap
-
- SynonymMap.Builder - Class in org.apache.lucene.analysis.synonym
-
Builds an FSTSynonymMap.
- SynonymMap.Builder(boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
If dedup is true then identical rules (same input,
same output) will be added only once.