| Class | Description | 
| AbstractEncoder | Base class for payload encoders. | 
| AbstractWordsFileFilterFactory | Abstract parent class for analysis factories that accept a stopwords file as input. | 
| AffixedWord | An object representing the analysis result of a simple (non-compound) word | 
| AffixedWord.Affix | An object representing a prefix or a suffix applied to a word stem | 
| Among | Internal class used by Snowball stemmers | 
| ApostropheFilter | Strips all characters after an apostrophe (including the apostrophe itself). | 
| ApostropheFilterFactory |  | 
| ArabicAnalyzer |  | 
| ArabicNormalizationFilter |  | 
| ArabicNormalizationFilterFactory |  | 
| ArabicNormalizer | Normalizer for Arabic. | 
| ArabicStemFilter |  | 
| ArabicStemFilterFactory |  | 
| ArabicStemmer | Stemmer for Arabic. | 
| ArabicStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| ArmenianAnalyzer |  | 
| ArmenianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| ASCIIFoldingFilter | This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists. |
| ASCIIFoldingFilterFactory |  | 
| BaseCharFilter |  | 
| BasqueAnalyzer |  | 
| BasqueStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| BengaliAnalyzer | Analyzer for Bengali. | 
| BengaliNormalizationFilter |  | 
| BengaliNormalizationFilterFactory |  | 
| BengaliNormalizer | Normalizer for Bengali. | 
| BengaliStemFilter |  | 
| BengaliStemFilterFactory |  | 
| BengaliStemmer | Stemmer for Bengali. | 
| BrazilianAnalyzer | Analyzer for Brazilian Portuguese language. | 
| BrazilianStemFilter |  | 
| BrazilianStemFilterFactory |  | 
| BrazilianStemmer | A stemmer for Brazilian Portuguese words. | 
| BulgarianAnalyzer |  | 
| BulgarianStemFilter |  | 
| BulgarianStemFilterFactory |  | 
| BulgarianStemmer | Light Stemmer for Bulgarian. | 
| ByteVector | This class implements a simple byte vector with access to the underlying array. | 
| CapitalizationFilter | A filter to apply normal capitalization rules to Tokens. | 
| CapitalizationFilterFactory |  | 
| CatalanAnalyzer |  | 
| CatalanStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| CharArrayIterator |  | 
| CharTokenizer | An abstract base class for simple, character-oriented tokenizers. | 
| CharVector | This class implements a simple char vector with access to the underlying array. | 
| CJKAnalyzer |  | 
| CJKBigramFilter | Forms bigrams of CJK terms that are generated from StandardTokenizer or ICUTokenizer. | 
| CJKBigramFilterFactory |  | 
| CJKWidthCharFilter | A CharFilter that normalizes CJK width differences: folds fullwidth ASCII variants into the equivalent Basic Latin; folds halfwidth Katakana variants into the equivalent kana. |
| CJKWidthCharFilterFactory |  | 
| CJKWidthFilter | A TokenFilter that normalizes CJK width differences: folds fullwidth ASCII variants into the equivalent Basic Latin; folds halfwidth Katakana variants into the equivalent kana. |
| CJKWidthFilterFactory |  | 
| ClassicAnalyzer |  | 
| ClassicFilter |  | 
| ClassicFilterFactory |  | 
| ClassicTokenizer | A grammar-based tokenizer constructed with JFlex | 
| ClassicTokenizerFactory |  | 
| CodepointCountFilter | Removes words that are too long or too short from the stream. | 
| CodepointCountFilterFactory |  | 
| CollatedTermAttributeImpl | Extension of CharTermAttributeImpl that encodes the term text as a binary Unicode collation key instead of as UTF-8 bytes. |
| CollationAttributeFactory | Converts each token into its CollationKey, and then encodes the bytes as an index term. |
| CollationDocValuesField |  | 
| CollationKeyAnalyzer |  | 
| CommonGramsFilter | Construct bigrams for frequently occurring terms while indexing. | 
| CommonGramsFilterFactory |  | 
| CommonGramsQueryFilter | Wraps a CommonGramsFilter to optimize phrase queries by returning single words only when they are not a member of a bigram. |
| CommonGramsQueryFilterFactory |  | 
| CompoundWordTokenFilterBase | Base class for decomposition token filters. | 
| ConcatenateGraphFilter | Concatenates/Joins every incoming token with a separator into one output token for every path through the token stream (which is a graph). |
| ConcatenateGraphFilter.BytesRefBuilderTermAttribute | Attribute providing access to the term builder and UTF-16 conversion | 
| ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl |  | 
| ConcatenateGraphFilterFactory |  | 
| ConcatenatingTokenStream | A TokenStream that takes an array of input TokenStreams as sources, and concatenates them together. |
| ConditionalTokenFilter | Allows skipping TokenFilters based on the current set of attributes. | 
| ConditionalTokenFilterFactory |  | 
| CSVUtil | Utility class for parsing CSV text | 
| CustomAnalyzer | A general-purpose Analyzer that can be created with a builder-style API. | 
| CustomAnalyzer.Builder |  | 
| CustomAnalyzer.ConditionBuilder |  | 
| CzechAnalyzer |  | 
| CzechStemFilter |  | 
| CzechStemFilterFactory |  | 
| CzechStemmer | Light Stemmer for Czech. | 
| DanishAnalyzer |  | 
| DanishStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| DateRecognizerFilter | Filters all tokens that cannot be parsed to a date, using the provided  DateFormat. | 
| DateRecognizerFilterFactory |  | 
| DecimalDigitFilter | Folds all Unicode digits in [:General_Category=Decimal_Number:] to Basic Latin digits (0-9). |
| DecimalDigitFilterFactory |  | 
| DelimitedBoostTokenFilter | Characters before the delimiter are the "token", those after are the boost. | 
| DelimitedBoostTokenFilterFactory |  | 
| DelimitedPayloadTokenFilter | Characters before the delimiter are the "token", those after are the payload. | 
| DelimitedPayloadTokenFilterFactory |  | 
| DelimitedTermFrequencyTokenFilter | Characters before the delimiter are the "token", the textual integer after is the term frequency. | 
| DelimitedTermFrequencyTokenFilterFactory |  | 
| DictEntries | An object representing homonym dictionary entries. | 
| DictEntry | An object representing *.dic file entry with its word, flags and morphological data. | 
| Dictionary | In-memory structure for the dictionary (.dic) and affix (.aff) data of a hunspell dictionary. | 
| DictionaryCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages. |
| DictionaryCompoundWordTokenFilterFactory |  | 
| Dl4jModelReader | Reads the file generated by the Deeplearning4j library and provides a Word2VecModel with normalized vectors. |
| DropIfFlaggedFilter | Allows Tokens with a given combination of flags to be dropped. | 
| DropIfFlaggedFilterFactory | Provides a filter that will drop tokens matching a set of flags. | 
| DutchAnalyzer |  | 
| DutchStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| EdgeNGramFilterFactory |  | 
| EdgeNGramTokenFilter | Tokenizes the given token into n-grams of given size(s). | 
| EdgeNGramTokenizer | Tokenizes the input from an edge into n-grams of given size(s). | 
| EdgeNGramTokenizerFactory |  | 
| ElisionFilter |  | 
| ElisionFilterFactory |  | 
| EmptyTokenStream | An always exhausted token stream. | 
| EnglishAnalyzer |  | 
| EnglishMinimalStemFilter |  | 
| EnglishMinimalStemFilterFactory |  | 
| EnglishMinimalStemmer | Minimal plural stemmer for English. | 
| EnglishPossessiveFilter | TokenFilter that removes possessives (trailing 's) from words. | 
| EnglishPossessiveFilterFactory |  | 
| EnglishStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| EntrySuggestion |  | 
| EstonianAnalyzer |  | 
| EstonianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| FilesystemResourceLoader | Simple ResourceLoader that opens resource files from the local file system, optionally resolving against a base directory. |
| FingerprintFilter | Outputs a single token that is the concatenation of the sorted and de-duplicated set of input tokens. |
| FingerprintFilterFactory |  | 
| FinnishAnalyzer |  | 
| FinnishLightStemFilter |  | 
| FinnishLightStemFilterFactory |  | 
| FinnishLightStemmer | Light Stemmer for Finnish. | 
| FinnishStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| FixBrokenOffsetsFilter | Deprecated. | 
| FixBrokenOffsetsFilterFactory | Deprecated. | 
| FixedShingleFilter | A FixedShingleFilter constructs shingles (token n-grams) from a token stream. | 
| FixedShingleFilterFactory |  | 
| FlattenGraphFilter | Converts an incoming graph token stream, such as one from SynonymGraphFilter, into a flat form so that all nodes form a single linear chain with no side paths. |
| FlattenGraphFilterFactory |  | 
| FloatEncoder | Encode a character array Float as a  BytesRef. | 
| FragmentChecker | An oracle for quickly checking that a specific part of a word can never be a valid word. | 
| FrenchAnalyzer |  | 
| FrenchLightStemFilter |  | 
| FrenchLightStemFilterFactory |  | 
| FrenchLightStemmer | Light Stemmer for French. | 
| FrenchMinimalStemFilter |  | 
| FrenchMinimalStemFilterFactory |  | 
| FrenchMinimalStemmer | Light Stemmer for French. | 
| FrenchStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| GalicianAnalyzer |  | 
| GalicianMinimalStemFilter |  | 
| GalicianMinimalStemFilterFactory |  | 
| GalicianMinimalStemmer | Minimal Stemmer for Galician | 
| GalicianStemFilter |  | 
| GalicianStemFilterFactory |  | 
| GalicianStemmer | Galician stemmer implementing "Regras do lematizador para o galego". | 
| German2Stemmer | This class implements the stemming algorithm defined by a snowball script. | 
| GermanAnalyzer |  | 
| GermanLightStemFilter |  | 
| GermanLightStemFilterFactory |  | 
| GermanLightStemmer | Light Stemmer for German. | 
| GermanMinimalStemFilter |  | 
| GermanMinimalStemFilterFactory |  | 
| GermanMinimalStemmer | Minimal Stemmer for German. | 
| GermanNormalizationFilter |  | 
| GermanNormalizationFilterFactory |  | 
| GermanStemFilter |  | 
| GermanStemFilterFactory |  | 
| GermanStemmer | A stemmer for German words. | 
| GermanStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| GreekAnalyzer |  | 
| GreekLowerCaseFilter | Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma. |
| GreekLowerCaseFilterFactory |  | 
| GreekStemFilter |  | 
| GreekStemFilterFactory |  | 
| GreekStemmer | A stemmer for Greek words, according to: "Development of a Stemmer for the Greek Language", Georgios Ntais. |
| GreekStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| HindiAnalyzer | Analyzer for Hindi. | 
| HindiNormalizationFilter |  | 
| HindiNormalizationFilterFactory |  | 
| HindiNormalizer | Normalizer for Hindi. | 
| HindiStemFilter |  | 
| HindiStemFilterFactory |  | 
| HindiStemmer | Light Stemmer for Hindi. | 
| HindiStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| HTMLStripCharFilter | A CharFilter that wraps another Reader and attempts to strip out HTML constructs. | 
| HTMLStripCharFilterFactory |  | 
| HungarianAnalyzer |  | 
| HungarianLightStemFilter |  | 
| HungarianLightStemFilterFactory |  | 
| HungarianLightStemmer | Light Stemmer for Hungarian. | 
| HungarianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| Hunspell | A spell checker based on Hunspell dictionaries. | 
| HunspellStemFilter | TokenFilter that uses hunspell affix rules and words to stem tokens. | 
| HunspellStemFilterFactory |  | 
| Hyphen | This class represents a hyphen. | 
| HyphenatedWordsFilter | When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines. |
| HyphenatedWordsFilterFactory |  | 
| Hyphenation | This class represents a hyphenated word. | 
| HyphenationCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages. |
| HyphenationCompoundWordTokenFilterFactory |  | 
| HyphenationTree | This tree structure stores the hyphenation patterns in an efficient way for fast lookup. | 
| IdentityEncoder | Does nothing other than convert the char array to a byte array using the specified encoding. | 
| IndicNormalizationFilter |  | 
| IndicNormalizationFilterFactory |  | 
| IndicNormalizer | Normalizes the Unicode representation of text in Indian languages. | 
| IndonesianAnalyzer | Analyzer for Indonesian (Bahasa) | 
| IndonesianStemFilter |  | 
| IndonesianStemFilterFactory |  | 
| IndonesianStemmer | Stemmer for Indonesian. | 
| IndonesianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| IntegerEncoder | Encode a character array Integer as a  BytesRef. | 
| IrishAnalyzer |  | 
| IrishLowerCaseFilter | Normalises token text to lower case, handling t-prothesis and n-eclipsis (i.e., that 'nAthair' should become 'n-athair'). |
| IrishLowerCaseFilterFactory |  | 
| IrishStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| ItalianAnalyzer |  | 
| ItalianLightStemFilter |  | 
| ItalianLightStemFilterFactory |  | 
| ItalianLightStemmer | Light Stemmer for Italian. | 
| ItalianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| KeepWordFilter | A TokenFilter that only keeps tokens with text contained in the required words. | 
| KeepWordFilterFactory |  | 
| KeywordAnalyzer | "Tokenizes" the entire stream as a single token. | 
| KeywordMarkerFilter |  | 
| KeywordMarkerFilterFactory |  | 
| KeywordRepeatFilter | This TokenFilter emits each incoming token twice, once as keyword and once as non-keyword; in other words, once with KeywordAttribute.setKeyword(boolean) set to true and once set to false. |
| KeywordRepeatFilterFactory |  | 
| KeywordTokenizer | Emits the entire input as a single token. | 
| KeywordTokenizerFactory |  | 
| KpStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| KStemFilter | A high-performance kstem filter for english. | 
| KStemFilterFactory |  | 
| KStemmer | This class implements the Kstem algorithm | 
| LatvianAnalyzer |  | 
| LatvianStemFilter |  | 
| LatvianStemFilterFactory |  | 
| LatvianStemmer | Light stemmer for Latvian. | 
| LengthFilter | Removes words that are too long or too short from the stream. | 
| LengthFilterFactory |  | 
| LetterTokenizer | A LetterTokenizer is a tokenizer that divides text at non-letters. | 
| LetterTokenizerFactory |  | 
| LimitTokenCountAnalyzer | This Analyzer limits the number of tokens while indexing. | 
| LimitTokenCountFilter | This TokenFilter limits the number of tokens while indexing. | 
| LimitTokenCountFilterFactory |  | 
| LimitTokenOffsetFilter | Lets tokens pass through as long as their start offset is <= a configured limit; the first token past the limit does not pass and ends the stream. |
| LimitTokenOffsetFilterFactory |  | 
| LimitTokenPositionFilter | This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit. |
| LimitTokenPositionFilterFactory |  | 
| LithuanianAnalyzer |  | 
| LithuanianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| LovinsStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| LowerCaseFilter | Normalizes token text to lower case. | 
| LowerCaseFilterFactory |  | 
| MappingCharFilter | Simplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the character stream and corrects the resulting changes to the offsets. |
| MappingCharFilterFactory |  | 
| MinHashFilter | Generate min hash tokens from an incoming stream of tokens. | 
| MinHashFilterFactory |  | 
| NepaliAnalyzer | Analyzer for Nepali. | 
| NepaliStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| NGramFilterFactory |  | 
| NGramFragmentChecker | A FragmentChecker based on all character n-grams possible in a certain language, keeping them in a relatively memory-efficient, but probabilistic data structure. |
| NGramFragmentChecker.NGramConsumer | A callback for n-gram ranges in words | 
| NGramTokenFilter | Tokenizes the input into n-grams of the given size(s). | 
| NGramTokenizer | Tokenizes the input into n-grams of the given size(s). | 
| NGramTokenizerFactory |  | 
| NormalizeCharMap |  | 
| NormalizeCharMap.Builder | Builds a NormalizeCharMap. |
| NorwegianAnalyzer |  | 
| NorwegianLightStemFilter |  | 
| NorwegianLightStemFilterFactory |  | 
| NorwegianLightStemmer | Light Stemmer for Norwegian. | 
| NorwegianMinimalStemFilter |  | 
| NorwegianMinimalStemFilterFactory |  | 
| NorwegianMinimalStemmer | Minimal Stemmer for Norwegian Bokmål (no-nb) and Nynorsk (no-nn) | 
| NorwegianNormalizationFilter | This filter normalizes use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (ae, oe, aa) by transforming them to åÅæÆøØ. |
| NorwegianNormalizationFilterFactory |  | 
| NorwegianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| NumericPayloadTokenFilter |  | 
| NumericPayloadTokenFilterFactory |  | 
| OpenStringBuilder | A StringBuilder that allows one to access the array. | 
| PathHierarchyTokenizer | Tokenizer for path-like hierarchies. | 
| PathHierarchyTokenizerFactory |  | 
| PatternCaptureGroupFilterFactory |  | 
| PatternCaptureGroupTokenFilter | CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns. |
| PatternConsumer | This interface is used to connect the XML pattern file parser to the hyphenation tree. | 
| PatternKeywordMarkerFilter |  | 
| PatternParser | A SAX document handler to read and parse hyphenation patterns from an XML file. |
| PatternReplaceCharFilter | CharFilter that uses a regular expression for the target of replace string. | 
| PatternReplaceCharFilterFactory |  | 
| PatternReplaceFilter | A TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string. |
| PatternReplaceFilterFactory |  | 
| PatternTokenizer | This tokenizer uses regex pattern matching to construct distinct tokens for the input stream. | 
| PatternTokenizerFactory |  | 
| PatternTypingFilter | Sets a type attribute to a parameterized value when tokens are matched by any of several regex patterns. |
| PatternTypingFilter.PatternTypingRule | Value holding class for pattern typing rules. | 
| PatternTypingFilterFactory | Provides a filter that will analyze tokens with the analyzer from an arbitrary field type. | 
| PayloadEncoder | Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to  BytesRef. | 
| PayloadHelper | Utility methods for encoding payloads. | 
| PerFieldAnalyzerWrapper | This analyzer is used to facilitate scenarios where different fields require different analysis techniques. |
| PersianAnalyzer |  | 
| PersianCharFilter | CharFilter that replaces instances of Zero-width non-joiner with an ordinary space. | 
| PersianCharFilterFactory |  | 
| PersianNormalizationFilter |  | 
| PersianNormalizationFilterFactory |  | 
| PersianNormalizer | Normalizer for Persian. | 
| PersianStemFilter |  | 
| PersianStemFilterFactory |  | 
| PersianStemmer | Stemmer for Persian. | 
| PorterStemFilter | Transforms the token stream as per the Porter stemming algorithm. | 
| PorterStemFilterFactory |  | 
| PorterStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| PortugueseAnalyzer |  | 
| PortugueseLightStemFilter |  | 
| PortugueseLightStemFilterFactory |  | 
| PortugueseLightStemmer | Light Stemmer for Portuguese | 
| PortugueseMinimalStemFilter |  | 
| PortugueseMinimalStemFilterFactory |  | 
| PortugueseMinimalStemmer | Minimal Stemmer for Portuguese | 
| PortugueseStemFilter |  | 
| PortugueseStemFilterFactory |  | 
| PortugueseStemmer | Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa) algorithm. | 
| PortugueseStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| ProtectedTermFilter | A ConditionalTokenFilter that only applies its wrapped filters to tokens that are not contained in a protected set. |
| ProtectedTermFilterFactory |  | 
| QueryAutoStopWordAnalyzer | An Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries. |
| RemoveDuplicatesTokenFilter | A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream. |
| RemoveDuplicatesTokenFilterFactory |  | 
| ReversePathHierarchyTokenizer | Tokenizer for domain-like hierarchies. | 
| ReverseStringFilter | Reverse token string, for example "country" => "yrtnuoc". | 
| ReverseStringFilterFactory |  | 
| RollingCharBuffer | Acts like a forever growing char[] as you read characters into it from the provided reader, but internally it uses a circular buffer to only hold the characters that haven't been freed yet. |
| RomanianAnalyzer |  | 
| RomanianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| RSLPStemmerBase | Base class for stemmers that use a set of RSLP-like stemming steps. | 
| RSLPStemmerBase.Rule | A basic rule, with no exceptions. | 
| RSLPStemmerBase.RuleWithSetExceptions | A rule with a set of whole-word exceptions. | 
| RSLPStemmerBase.RuleWithSuffixExceptions | A rule with a set of exceptional suffixes. | 
| RSLPStemmerBase.Step | A step containing a list of rules. | 
| RussianAnalyzer |  | 
| RussianLightStemFilter |  | 
| RussianLightStemFilterFactory |  | 
| RussianLightStemmer | Light Stemmer for Russian. | 
| RussianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| ScandinavianFoldingFilter | This filter folds Scandinavian characters åÅäæÄÆ->a and öÖøØ->o. | 
| ScandinavianFoldingFilterFactory |  | 
| ScandinavianNormalizationFilter | This filter normalizes use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ. |
| ScandinavianNormalizationFilterFactory |  | 
| ScandinavianNormalizer | This Normalizer does the heavy lifting for a set of Scandinavian normalization filters, normalizing use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ. |
| ScandinavianNormalizer.Foldings | List of possible foldings that can be used when configuring the filter | 
| SegmentingTokenizerBase | Breaks text into sentences with a BreakIterator and allows subclasses to decompose these sentences into words. |
| SerbianAnalyzer |  | 
| SerbianNormalizationFilter | Normalizes Serbian Cyrillic and Latin characters to "bald" Latin. | 
| SerbianNormalizationFilterFactory |  | 
| SerbianNormalizationRegularFilter | Normalizes Serbian Cyrillic to Latin. | 
| SerbianStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| SetKeywordMarkerFilter |  | 
| ShingleAnalyzerWrapper |  | 
| ShingleFilter | A ShingleFilter constructs shingles (token n-grams) from a token stream. | 
| ShingleFilterFactory |  | 
| SimpleAnalyzer |  | 
| SimplePatternSplitTokenizer | This tokenizer uses a Lucene  RegExp or (expert usage) a pre-built determinized  Automaton, to locate tokens. | 
| SimplePatternSplitTokenizerFactory |  | 
| SimplePatternTokenizer | This tokenizer uses a Lucene  RegExp or (expert usage) a pre-built determinized  Automaton, to locate tokens. | 
| SimplePatternTokenizerFactory |  | 
| SnowballFilter | A filter that stems words using a Snowball-generated stemmer. | 
| SnowballPorterFilterFactory |  | 
| SnowballProgram | Base class for a snowball stemmer | 
| SnowballStemmer | Parent class of all snowball stemmers, which must implement stem | 
| SolrSynonymParser | Parser for the Solr synonyms format. | 
| SoraniAnalyzer |  | 
| SoraniNormalizationFilter |  | 
| SoraniNormalizationFilterFactory |  | 
| SoraniNormalizer | Normalizes the Unicode representation of Sorani text. | 
| SoraniStemFilter |  | 
| SoraniStemFilterFactory |  | 
| SoraniStemmer | Light stemmer for Sorani | 
| SortingStrategy | The strategy defining how a Hunspell dictionary should be loaded, with different tradeoffs. | 
| SpanishAnalyzer |  | 
| SpanishLightStemFilter |  | 
| SpanishLightStemFilterFactory |  | 
| SpanishLightStemmer | Light Stemmer for Spanish | 
| SpanishMinimalStemFilter | Deprecated. | 
| SpanishMinimalStemFilterFactory | Deprecated. | 
| SpanishMinimalStemmer | Deprecated. | 
| SpanishPluralStemFilter |  | 
| SpanishPluralStemFilterFactory |  | 
| SpanishPluralStemmer | Plural Stemmer for Spanish | 
| SpanishStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| StemmerOverrideFilter | Provides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming. |
| StemmerOverrideFilter.Builder |  | 
| StemmerOverrideFilter.StemmerOverrideMap | A read-only 4-byte FST backed map that allows fast case-insensitive key value lookups for StemmerOverrideFilter. |
| StemmerOverrideFilterFactory |  | 
| StemmerUtil | Some commonly-used stemming functions | 
| StopAnalyzer |  | 
| StopFilter | Removes stop words from a token stream. | 
| StopFilterFactory |  | 
| Suggester | A generator for misspelled word corrections based on Hunspell flags. | 
| SuggestionTimeoutException |  | 
| SwedishAnalyzer |  | 
| SwedishLightStemFilter |  | 
| SwedishLightStemFilterFactory |  | 
| SwedishLightStemmer | Light Stemmer for Swedish. | 
| SwedishMinimalStemFilter |  | 
| SwedishMinimalStemFilterFactory |  | 
| SwedishMinimalStemmer | Minimal Stemmer for Swedish. | 
| SwedishStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| SynonymFilter | Deprecated. | 
| SynonymFilterFactory | Deprecated. | 
| SynonymGraphFilter | Applies single- or multi-token synonyms from a  SynonymMap to an incoming  TokenStream, producing a fully correct graph output. | 
| SynonymGraphFilterFactory |  | 
| SynonymMap | A map of synonyms, keys and values are phrases. | 
| SynonymMap.Builder | Builds an FSTSynonymMap. | 
| SynonymMap.Parser | Abstraction for parsing synonym files. | 
| TamilAnalyzer | Analyzer for Tamil. | 
| TamilStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| TeeSinkTokenFilter | This TokenFilter provides the ability to set aside attribute states that have already been analyzed. |
| TeeSinkTokenFilter.SinkTokenStream | TokenStream output from a tee. | 
| TeluguAnalyzer | Analyzer for Telugu. | 
| TeluguNormalizationFilter |  | 
| TeluguNormalizationFilterFactory |  | 
| TeluguNormalizer | Normalizer for Telugu. | 
| TeluguStemFilter |  | 
| TeluguStemFilterFactory |  | 
| TeluguStemmer | Stemmer for Telugu. | 
| TermAndBoost | Wraps a term and boost | 
| TernaryTree | Ternary Search Tree. | 
| ThaiAnalyzer |  | 
| ThaiTokenizer |  | 
| ThaiTokenizerFactory |  | 
| TimeoutPolicy | A strategy determining what to do when Hunspell API calls take too much time | 
| TokenOffsetPayloadTokenFilter |  | 
| TokenOffsetPayloadTokenFilterFactory |  | 
| TrimFilter | Trims leading and trailing whitespace from Tokens in the stream. | 
| TrimFilterFactory |  | 
| TruncateTokenFilter | A token filter for truncating the terms into a specific length. | 
| TruncateTokenFilterFactory |  | 
| TurkishAnalyzer |  | 
| TurkishLowerCaseFilter | Normalizes Turkish token text to lower case. | 
| TurkishLowerCaseFilterFactory |  | 
| TurkishStemmer | This class implements the stemming algorithm defined by a snowball script. | 
| TypeAsPayloadTokenFilter |  | 
| TypeAsPayloadTokenFilterFactory |  | 
| TypeAsSynonymFilter |  | 
| TypeAsSynonymFilterFactory |  | 
| TypeTokenFilter | Removes tokens whose types appear in a set of blocked types from a token stream. | 
| TypeTokenFilterFactory |  | 
| UAX29URLEmailAnalyzer |  | 
| UAX29URLEmailTokenizer | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29. URLs and email addresses are also tokenized according to the relevant RFCs. |
| UAX29URLEmailTokenizerFactory |  | 
| UAX29URLEmailTokenizerImpl | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29. URLs and email addresses are also tokenized according to the relevant RFCs. |
| UnicodeProps | This file contains unicode properties used by various  CharTokenizers. | 
| UnicodeWhitespaceAnalyzer |  | 
| UnicodeWhitespaceTokenizer | A UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace. | 
| UpperCaseFilter | Normalizes token text to UPPER CASE. | 
| UpperCaseFilterFactory |  | 
| WhitespaceAnalyzer |  | 
| WhitespaceTokenizer |  | 
| WhitespaceTokenizerFactory |  | 
| WikipediaTokenizer | Extension of StandardTokenizer that is aware of Wikipedia syntax. | 
| WikipediaTokenizerFactory |  | 
| Word2VecModel | Represents a parsed Word2Vec model containing the vectors for each word in the dictionary. |
| Word2VecSynonymFilter | Applies single-token synonyms from a Word2Vec trained network to an incoming  TokenStream. | 
| Word2VecSynonymFilterFactory |  | 
| Word2VecSynonymProvider | The Word2VecSynonymProvider generates the list of synonyms of a term. |
| Word2VecSynonymProviderFactory | Supplies a cache of Word2VecSynonymProvider instances, so that multiple instances of Word2VecSynonymFilterFactory do not instantiate multiple instances of the same SynonymProvider. |
| WordDelimiterFilter | Deprecated. | 
| WordDelimiterFilterFactory | Deprecated. | 
| WordDelimiterGraphFilter | Splits words into subwords and performs optional transformations on subword groups, producing a correct token graph. |
| WordDelimiterGraphFilterFactory |  | 
| WordDelimiterIterator | A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterGraphFilter rules. |
| WordFormGenerator |  | 
| WordnetSynonymParser | Parser for wordnet prolog format | 
| YiddishStemmer | This class implements the stemming algorithm defined by a snowball script. |
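
Several one-line descriptions in the table name a technique without showing it: shingles (token n-grams, as in FixedShingleFilter), edge n-grams (EdgeNGramTokenFilter) and string reversal (ReverseStringFilter, "country" => "yrtnuoc"). The following is a minimal plain-Java sketch of those three ideas. It deliberately does not use the Lucene API: the class `TokenGramSketch` and its method names are hypothetical, and the real filters operate on a TokenStream and also track offsets and positions, which this sketch ignores.

```java
import java.util.ArrayList;
import java.util.List;

/** Plain-Java illustration only; hypothetical names, not the Lucene API. */
final class TokenGramSketch {

    /** Word-level n-grams ("shingles") of exactly size tokens, joined by a single space. */
    static List<String> shingles(List<String> tokens, int size) {
        List<String> out = new ArrayList<>();
        // Slide a window of `size` tokens across the input.
        for (int i = 0; i + size <= tokens.size(); i++) {
            out.add(String.join(" ", tokens.subList(i, i + size)));
        }
        return out;
    }

    /**
     * Character n-grams anchored at the front edge of a term.
     * Assumes 1 <= minGram <= maxGram; lengths beyond the term length are skipped.
     */
    static List<String> edgeNGrams(String term, int minGram, int maxGram) {
        List<String> out = new ArrayList<>();
        int limit = Math.min(maxGram, term.length());
        for (int len = minGram; len <= limit; len++) {
            out.add(term.substring(0, len));
        }
        return out;
    }

    /** Reverses the characters of a term (by UTF-16 code units, surrogate pairs kept intact). */
    static String reverse(String term) {
        return new StringBuilder(term).reverse().toString();
    }

    public static void main(String[] args) {
        System.out.println(shingles(List.of("please", "divide", "this"), 2)); // [please divide, divide this]
        System.out.println(edgeNGrams("lucene", 1, 3));                       // [l, lu, luc]
        System.out.println(reverse("country"));                               // yrtnuoc
    }
}
```

The sketch emits only shingles of one fixed size; whether unigrams or a range of sizes are also produced depends on the specific filter and its configuration, so consult the individual class documentation before relying on this behaviour.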