Implementations of the SinkTokenizer that might be useful.
- packValues(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Packs the values by storing them in 4 bits, two values into a byte Values
range is from 0 to 9.
- parse(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(File) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(InputSource) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(Class<? extends RSLPStemmerBase>, String) - Static method in class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
Parse a resource file into an RSLP stemmer description.
- PathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for path-like hierarchies.
- PathHierarchyTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PathHierarchyTokenizer(Reader, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- PatternAnalyzer - Class in org.apache.lucene.analysis.miscellaneous
-
Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a
Reader
, that can flexibly separate text into terms via a regular expression
Pattern
(with behaviour identical to
String.split(String)
),
and that combines the functionality of
LetterTokenizer
,
LowerCaseTokenizer
,
WhitespaceTokenizer
,
StopFilter
into a single efficient
multi-purpose class.
- PatternAnalyzer(Version, Pattern, boolean, Set<?>) - Constructor for class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
Constructs a new instance with the given parameters.
- PatternConsumer - Interface in org.apache.lucene.analysis.compound.hyphenation
-
This interface is used to connect the XML pattern file parser to the
hyphenation tree.
- PatternParser - Class in org.apache.lucene.analysis.compound.hyphenation
-
A SAX document handler to read and parse hyphenation patterns from a XML
file.
- PatternParser() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PatternParser(PatternConsumer) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- PayloadEncoder - Interface in org.apache.lucene.analysis.payloads
-
Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to Payload.
- PayloadHelper - Class in org.apache.lucene.analysis.payloads
-
Utility methods for encoding payloads.
- PayloadHelper() - Constructor for class org.apache.lucene.analysis.payloads.PayloadHelper
-
- permutationIterator() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- PersianAnalyzer - Class in org.apache.lucene.analysis.fa
-
Analyzer
for Persian.
- PersianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Builds an analyzer with the given stop words
- PersianAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianAnalyzer(Version, Hashtable<?, ?>) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
- PersianCharFilter - Class in org.apache.lucene.analysis.fa
-
CharFilter that replaces instances of Zero-width non-joiner with an
ordinary space.
- PersianCharFilter(CharStream) - Constructor for class org.apache.lucene.analysis.fa.PersianCharFilter
-
- PersianNormalizationFilter - Class in org.apache.lucene.analysis.fa
-
- PersianNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilter
-
- PersianNormalizer - Class in org.apache.lucene.analysis.fa
-
Normalizer for Persian.
- PersianNormalizer() - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizer
-
- PorterStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- PorterStemmer() - Constructor for class org.tartarus.snowball.ext.PorterStemmer
-
- PortugueseAnalyzer - Class in org.apache.lucene.analysis.pt
-
Analyzer
for Portuguese.
- PortugueseAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
- PortugueseAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseLightStemFilter - Class in org.apache.lucene.analysis.pt
-
- PortugueseLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
-
- PortugueseLightStemmer - Class in org.apache.lucene.analysis.pt
-
Light Stemmer for Portuguese
This stemmer implements the "UniNE" algorithm in:
Light Stemming Approaches for the French, Portuguese, German and Hungarian Languages
Jacques Savoy
- PortugueseLightStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- PortugueseMinimalStemFilter - Class in org.apache.lucene.analysis.pt
-
- PortugueseMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
-
- PortugueseMinimalStemmer - Class in org.apache.lucene.analysis.pt
-
Minimal Stemmer for Portuguese
This follows the "RSLP-S" algorithm presented in:
A study on the Use of Stemming for Monolingual Ad-Hoc Portuguese
Information Retrieval (Orengo, et al)
which is just the plural reduction step of the RSLP
algorithm from A Stemming Algorithm for the Portuguese Language,
Orengo et al.
- PortugueseMinimalStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- PortugueseStemFilter - Class in org.apache.lucene.analysis.pt
-
- PortugueseStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilter
-
- PortugueseStemmer - Class in org.apache.lucene.analysis.pt
-
Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa)
algorithm.
- PortugueseStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- PortugueseStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- PortugueseStemmer() - Constructor for class org.tartarus.snowball.ext.PortugueseStemmer
-
- PositionFilter - Class in org.apache.lucene.analysis.position
-
Set the positionIncrement of all tokens to the "positionIncrement",
except the first return token which retains its original positionIncrement value.
- PositionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.position.PositionFilter
-
Constructs a PositionFilter that assigns a position increment of zero to
all but the first token from the given input stream.
- PositionFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.position.PositionFilter
-
Constructs a PositionFilter that assigns the given position increment to
all but the first token from the given input stream.
- postBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- preBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
-
- PrefixAndSuffixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
- PrefixAndSuffixAwareTokenFilter(TokenStream, TokenStream, TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- PrefixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Joins two token streams and leaves the last token of the first stream available
to be used when updating the token values in the second stream based on that token.
- PrefixAwareTokenFilter(TokenStream, TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- prefixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- previous() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
- printStats() - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
- printStats() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- PUA_EC00_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+EC00 (PRIVATE USE AREA: EC00)
- put(int, byte) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
- put(int, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
- read() - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
-
- read(char[], int, int) - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
-
- read(char[], int, int) - Method in class org.apache.lucene.analysis.fa.PersianCharFilter
-
- readToken(StringBuffer) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- replace(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- replace_s(int, int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
-
- replace_s(int, int, String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.
for binary back compat. Will be removed in Lucene 4.0
- replacement - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- reserve(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() - Method in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
- reset() - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- reset(Reader) - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- reset(Reader) - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
- reset() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- reset() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- reset() - Method in class org.apache.lucene.analysis.position.PositionFilter
-
- reset() - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
- reset() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
-
- reset() - Method in class org.apache.lucene.analysis.synonym.SynonymFilter
-
- reset() - Method in class org.apache.lucene.analysis.th.ThaiWordFilter
-
- reset() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- reset() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- reset(Reader) - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- resize(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- resolveEntity(String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- result - Variable in class org.tartarus.snowball.Among
-
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.
Returns a (possibly reused)
StandardTokenizer
filtered by a
StandardFilter
, a
LowerCaseFilter
,
a
StopFilter
, and a
SnowballFilter
- reverse(String) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, String) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input string
- reverse(char[]) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, char[]) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input buffer in-place
- reverse(char[], int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, char[], int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from offset 0
up to the given length.
- reverse(char[], int, int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- reverse(Version, char[], int, int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from the given offset
up to the given length.
- ReversePathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for domain-like hierarchies.
- ReversePathHierarchyTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReversePathHierarchyTokenizer(Reader, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
-
- ReverseStringFilter - Class in org.apache.lucene.analysis.reverse
-
Reverse token string, for example "country" => "yrtnuoc".
- ReverseStringFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- ReverseStringFilter(TokenStream, char) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
- ReverseStringFilter(Version, TokenStream) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses all tokens in the
supplied TokenStream
.
- ReverseStringFilter(Version, TokenStream, char) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses and marks all tokens in the
supplied TokenStream
.
- rewind() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
-
- RomanianAnalyzer - Class in org.apache.lucene.analysis.ro
-
Analyzer
for Romanian.
- RomanianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
- RomanianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- RomanianStemmer() - Constructor for class org.tartarus.snowball.ext.RomanianStemmer
-
- root - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- RSLPStemmerBase - Class in org.apache.lucene.analysis.pt
-
Base class for stemmers that use a set of RSLP-like stemming steps.
- RSLPStemmerBase() - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
- RSLPStemmerBase.Rule - Class in org.apache.lucene.analysis.pt
-
A basic rule, with no exceptions.
- RSLPStemmerBase.Rule(String, int, String) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
Create a rule.
- RSLPStemmerBase.RuleWithSetExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of whole-word exceptions.
- RSLPStemmerBase.RuleWithSetExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
-
- RSLPStemmerBase.RuleWithSuffixExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of exceptional suffixes.
- RSLPStemmerBase.RuleWithSuffixExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
-
- RSLPStemmerBase.Step - Class in org.apache.lucene.analysis.pt
-
A step containing a list of rules.
- RSLPStemmerBase.Step(String, RSLPStemmerBase.Rule[], int, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
Create a new step
- RTL_DIRECTION_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+200F (RIGHT-TO-LEFT MARK)
- rules - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- RussianAnalyzer - Class in org.apache.lucene.analysis.ru
-
Analyzer
for Russian language.
- RussianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
- RussianLetterTokenizer - Class in org.apache.lucene.analysis.ru
-
Deprecated.
Use StandardTokenizer
instead, which has the same functionality.
This filter will be removed in Lucene 5.0
- RussianLetterTokenizer(Version, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.
Construct a new RussianLetterTokenizer. * @param matchVersion Lucene version
to match See
above
- RussianLetterTokenizer(Version, AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.
Construct a new RussianLetterTokenizer using a given AttributeSource
.
- RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory
. * @param
matchVersion Lucene version to match See
above
- RussianLetterTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
- RussianLetterTokenizer(AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
- RussianLetterTokenizer(AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
- RussianLightStemFilter - Class in org.apache.lucene.analysis.ru
-
- RussianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilter
-
- RussianLightStemmer - Class in org.apache.lucene.analysis.ru
-
Light Stemmer for Russian.
- RussianLightStemmer() - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- RussianLowerCaseFilter - Class in org.apache.lucene.analysis.ru
-
Deprecated.
Use LowerCaseFilter
instead, which has the same
functionality. This filter will be removed in Lucene 4.0
- RussianLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianLowerCaseFilter
-
Deprecated.
- RussianStemFilter - Class in org.apache.lucene.analysis.ru
-
- RussianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianStemFilter
-
Deprecated.
- RussianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- RussianStemmer() - Constructor for class org.tartarus.snowball.ext.RussianStemmer
-
- s - Variable in class org.tartarus.snowball.Among
-
- s_size - Variable in class org.tartarus.snowball.Among
-
- sameRow - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
-
Deprecated.
- sc - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
The character stored in this node: splitchar.
- searchPatterns(char[], int, byte[]) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Search for all possible partial matches of word starting at index an update
interletter values.
- set(char[], int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setAppend(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the append defined for the affix
- setAppendFlags(char[]) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the flags defined for the affix append
- setArticles(Version, Set<?>) - Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
- setArticles(Set<?>) - Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
- setCharAt(int, char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setCondition(String, String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the condition that must be met before the affix can be applied
- setConsumer(PatternConsumer) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- setCrossProduct(boolean) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets whether the affix is defined as cross product
- setCurrent(String) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setCurrent(char[], int) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setExclusionSet(Set<?>) - Method in class org.apache.lucene.analysis.de.GermanStemFilter
-
Deprecated.
use KeywordAttribute
with KeywordMarkerFilter
instead.
- setExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.
use KeywordAttribute
with KeywordMarkerFilter
instead.
- setExclusionTable(HashSet<?>) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
use KeywordAttribute
with KeywordMarkerFilter
instead.
- setFirst(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- setFlag(char) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the affix flag
- setIgnoringSinglePrefixOrSuffixShingle(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setIndex(int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
- setLast(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- setLength(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- setMatrix(ShingleMatrixFilter.Matrix) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMaximumShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMaxShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting maxShingleSize after Analyzer instantiation prevents reuse.
Confgure maxShingleSize during construction.
- setMaxShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the max shingle size (default: 2)
- setMinimumShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMinShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting minShingleSize after Analyzer instantiation prevents reuse.
Confgure minShingleSize during construction.
- setMinShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the min shingle size (default: 2).
- setOutputUnigrams(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting outputUnigrams after Analyzer instantiation prevents reuse.
Confgure outputUnigrams during construction.
- setOutputUnigrams(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall the output stream contain the input tokens (unigrams) as well as
shingles?
- setOutputUnigramsIfNoShingles(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting outputUnigramsIfNoShingles after Analyzer instantiation prevents reuse.
Confgure outputUnigramsIfNoShingles during construction.
- setOutputUnigramsIfNoShingles(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall we override the behavior of outputUnigrams==false for those
times when no shingles are available (because there are fewer than
minShingleSize tokens in the input stream)?
- setPrefix(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setSpacerCharacter(Character) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setStemDictionary(File) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.
This prevents reuse of TokenStreams. If you wish to use a custom
stem dictionary, create your own Analyzer with StemmerOverrideFilter
- setStemDictionary(HashMap<?, ?>) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
- setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
- setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
- setStemExclusionTable(String[]) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
- setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
- setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
- setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
- setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
- setStemExclusionTable(HashSet<?>) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
- setStemmer(GermanStemmer) - Method in class org.apache.lucene.analysis.de.GermanStemFilter
-
- setStemmer(FrenchStemmer) - Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.
- setStemmer(DutchStemmer) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
- setStemmer(RussianStemmer) - Method in class org.apache.lucene.analysis.ru.RussianStemFilter
-
Deprecated.
Set a alternative/custom RussianStemmer
for this filter.
- setStrip(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the stripping characters defined for the affix
- setSuffix(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
- setText(char[], int, int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Set a new region of text to be examined by this iterator
- setToken(Token) - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
Sets the TokenPositioner as token flags int value.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- setTokens(List<Token>) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- setTokenSeparator(String) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.
Setting tokenSeparator after Analyzer instantiation prevents reuse.
Confgure tokenSeparator during construction.
- setTokenSeparator(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Sets the string to use when joining adjacent tokens to form a shingle
- setTokenType(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the type of the shingle tokens produced by this filter.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
Stores a 32 bit float in the payload, or set it to null if 1f;
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.
Have this method do nothing in order to 'disable' weights.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- SHADDA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- ShingleAnalyzerWrapper - Class in org.apache.lucene.analysis.shingle
-
A ShingleAnalyzerWrapper wraps a
ShingleFilter
around another
Analyzer
.
- ShingleAnalyzerWrapper(Analyzer) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
- ShingleAnalyzerWrapper(Analyzer, int, int, String, boolean, boolean) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Creates a new ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Version) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps StandardAnalyzer
.
- ShingleAnalyzerWrapper(Version, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps StandardAnalyzer
.
- ShingleFilter - Class in org.apache.lucene.analysis.shingle
-
A ShingleFilter constructs shingles (token n-grams) from a token stream.
- ShingleFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the
TokenStream
input
- ShingleFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the
TokenStream
input
- ShingleFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with default shingle size: 2.
- ShingleFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with the specified token type for shingle tokens
and the default shingle size: 2
- ShingleMatrixFilter - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
Will be removed in Lucene 4.0. This filter is unmaintained and might not behave
correctly if used with custom Attributes, i.e. Attributes other than
the ones located in org.apache.lucene.analysis.tokenattributes
. It also uses
hardcoded payload encoders which makes it not easily adaptable to other use-cases.
- ShingleMatrixFilter(ShingleMatrixFilter.Matrix, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter based on a user defined matrix.
- ShingleMatrixFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
Creates a shingle filter with ad hoc parameter settings.
- ShingleMatrixFilter.Matrix - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
A column focused matrix in three dimensions:
Token[column][row][z-axis] {
{{hello}, {greetings, and, salutations}},
{{world}, {earth}, {tellus}}
};
todo consider row groups
to indicate that shingles is only to contain permutations with texts in that same row group.
- ShingleMatrixFilter.Matrix() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column(Token) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
A full featured codec not to be used for something serious.
- ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
- ShingleMatrixFilter.TokenPositioner - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.TokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
Strategy used to code and decode meta data of the tokens from the input stream
regarding how to position the tokens in the matrix, set and retreive weight, et c.
- ShingleMatrixFilter.TokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
A codec that creates a two dimensional matrix
by treating tokens from the input stream with 0 position increment
as new rows to the current column.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- SINGLE_TYPE - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
when we emit a unigram, its then marked as this type
- SingleTokenTokenStream - Class in org.apache.lucene.analysis.miscellaneous
-
A TokenStream
containing a single token.
- SingleTokenTokenStream(Token) - Constructor for class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
-
- size() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- size() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- slice_check() - Method in class org.tartarus.snowball.SnowballProgram
-
- slice_del() - Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
-
- slice_from(String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.
for binary back compat. Will be removed in Lucene 4.0
- slice_from(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.
for binary back compat. Will be removed in Lucene 4.0
- slice_to(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
- SnowballAnalyzer - Class in org.apache.lucene.analysis.snowball
-
Deprecated.
Use the language-specific analyzer in contrib/analyzers instead.
This analyzer will be removed in Lucene 5.0
- SnowballAnalyzer(Version, String) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.
Builds the named analyzer with no stop words.
- SnowballAnalyzer(Version, String, String[]) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
- SnowballAnalyzer(Version, String, Set<?>) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.
Builds the named analyzer with the given stop words.
- SnowballFilter - Class in org.apache.lucene.analysis.snowball
-
A filter that stems words using a Snowball-generated stemmer.
- SnowballFilter(TokenStream, SnowballProgram) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
- SnowballFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
Construct the named stemming filter.
- SnowballProgram - Class in org.tartarus.snowball
-
This is the rev 502 of the Snowball SVN trunk,
but modified:
made abstract and introduced abstract method stem to avoid expensive reflection in filter class.
- SnowballProgram() - Constructor for class org.tartarus.snowball.SnowballProgram
-
- SolrSynonymParser - Class in org.apache.lucene.analysis.synonym
-
Parser for the Solr synonyms format.
- SolrSynonymParser(boolean, boolean, Analyzer) - Constructor for class org.apache.lucene.analysis.synonym.SolrSynonymParser
-
- SpanishAnalyzer - Class in org.apache.lucene.analysis.es
-
Analyzer
for Spanish.
- SpanishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
- SpanishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishLightStemFilter - Class in org.apache.lucene.analysis.es
-
- SpanishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilter
-
- SpanishLightStemmer - Class in org.apache.lucene.analysis.es
-
Light Stemmer for Spanish
This stemmer implements the algorithm described in:
Report on CLEF-2001 Experiments
Jacques Savoy
- SpanishLightStemmer() - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- SpanishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- SpanishStemmer() - Constructor for class org.tartarus.snowball.ext.SpanishStemmer
-
- START_OF_HEADING_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+0001 (START OF HEADING)
- startElement(String, String, String, Attributes) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
- startOffset - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
-
- startsWith(char[], int, String) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array starts with the suffix.
- stem(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem an input buffer of Arabic text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.bg.BulgarianStemmer
-
Stem an input buffer of Bulgarian text.
- stem(String) - Method in class org.apache.lucene.analysis.br.BrazilianStemmer
-
Stems the given term to an unique discriminator.
- stem(char[], int) - Method in class org.apache.lucene.analysis.cz.CzechStemmer
-
Stem an input buffer of Czech text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanMinimalStemmer
-
- stem(String) - Method in class org.apache.lucene.analysis.de.GermanStemmer
-
Stemms the given term to an unique discriminator.
- stem(char[], int) - Method in class org.apache.lucene.analysis.el.GreekStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.en.EnglishMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.es.SpanishLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.fi.FinnishLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
-
- stem(String) - Method in class org.apache.lucene.analysis.fr.FrenchStemmer
-
Deprecated.
Stems the given term to a unique discriminator.
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.hi.HindiStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.hu.HungarianLightStemmer
-
- stem(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Find the stem(s) of the provided word
- stem(char[], int) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Find the stem(s) of the provided word
- stem(char[], int, boolean) - Method in class org.apache.lucene.analysis.id.IndonesianStemmer
-
Stem a term (returning its new length).
- stem(char[], int) - Method in class org.apache.lucene.analysis.it.ItalianLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.lv.LatvianStemmer
-
Stem a latvian word. returns the new adjusted length.
- stem(String) - Method in class org.apache.lucene.analysis.nl.DutchStemmer
-
Deprecated.
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.ru.RussianLightStemmer
-
- stem(char[], int) - Method in class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.ArmenianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.BasqueStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.CatalanStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.DanishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.DutchStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.EnglishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.FinnishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.FrenchStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.German2Stemmer
-
- stem() - Method in class org.tartarus.snowball.ext.GermanStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.HungarianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.IrishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.ItalianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.KpStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.LovinsStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.NorwegianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.PorterStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.PortugueseStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.RomanianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.RussianStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.SpanishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.SwedishStemmer
-
- stem() - Method in class org.tartarus.snowball.ext.TurkishStemmer
-
- stem() - Method in class org.tartarus.snowball.SnowballProgram
-
- StemmerOverrideFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Provides the ability to override any KeywordAttribute
aware stemmer
with custom dictionary-based stemming.
- StemmerOverrideFilter(Version, TokenStream, Map<?, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
-
Create a new StemmerOverrideFilter, performing dictionary-based stemming
with the provided dictionary
.
- StemmerUtil - Class in org.apache.lucene.analysis.util
-
Some commonly-used stemming functions
- StemmerUtil() - Constructor for class org.apache.lucene.analysis.util.StemmerUtil
-
- stemPrefix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem a prefix off an Arabic word.
- stemSuffix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem suffix(es) off an Arabic word.
- STOP_WORDS - Static variable in class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
- STOP_WORDS - Static variable in class org.apache.lucene.analysis.cn.ChineseFilter
-
Deprecated.
- stoplist - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
This map stores hyphenation exceptions
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Deprecated.
use WordlistLoader.getWordSet(Reader, String, Version)
directly
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Deprecated.
use WordlistLoader.getWordSet(Reader, String, Version)
directly
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
The comment character in the stopwords file.
- strcmp(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares 2 null terminated char arrays
- strcmp(String, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares a string with null terminated char array
- strcpy(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- strlen(char[]) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
- SUB_HEADING - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- SUB_HEADING_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
- subSequence(int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
-
- substring_i - Variable in class org.tartarus.snowball.Among
-
- suffix - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
- suffixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
-
- suffixes - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
- SUKUN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
- SwedishAnalyzer - Class in org.apache.lucene.analysis.sv
-
Analyzer
for Swedish.
- SwedishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
- SwedishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishLightStemFilter - Class in org.apache.lucene.analysis.sv
-
- SwedishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilter
-
- SwedishLightStemmer - Class in org.apache.lucene.analysis.sv
-
Light Stemmer for Swedish.
- SwedishLightStemmer() - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemmer
-
- SwedishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- SwedishStemmer() - Constructor for class org.tartarus.snowball.ext.SwedishStemmer
-
- SynonymFilter - Class in org.apache.lucene.analysis.synonym
-
Matches single or multi word synonyms in a token stream.
- SynonymFilter(TokenStream, SynonymMap, boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymFilter
-
- SynonymMap - Class in org.apache.lucene.analysis.synonym
-
A map of synonyms, keys and values are phrases.
- SynonymMap(FST<BytesRef>, BytesRefHash, int) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap
-
- SynonymMap.Builder - Class in org.apache.lucene.analysis.synonym
-
Builds an FSTSynonymMap.
- SynonymMap.Builder(boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
If dedup is true then identical rules (same input,
same output) will be added only once.