Index
All Classes and Interfaces|All Packages|Constant Field Values
A
- add(CharSequence, CharSequence) - Method in class org.egothor.stemmer.MultiTrie
-
Add an element to this structure consisting of the given key and patch command.
- add(CharSequence, CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
-
Add an element to this structure consisting of the given key and patch command.
- Algorithm and implementation - Search tag in Overview
- Section
- apply(StringBuilder, CharSequence) - Static method in class org.egothor.stemmer.Diff
-
Apply the given patch string diff to the given string dest.
B
- Background - Search tag in Overview
- Section
- Bibliography - Search tag in Overview
- Section
C
- Compile - Class in org.egothor.stemmer
-
The Compile class is used to compile a stemmer table.
- Corpus - Search tag in Overview
- Section
- create(TokenStream) - Method in class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
- createComponents(String) - Method in class org.apache.lucene.analysis.pl.PolishAnalyzer
-
Creates a Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
D
- decompose(CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
-
Break the given patch command into its constituent pieces.
- DEFAULT_MIN_LENGTH - Static variable in class org.apache.lucene.analysis.stempel.StempelFilter
-
Minimum length of input words to be processed.
- DEFAULT_STEMMER_FILE - Static variable in class org.apache.lucene.analysis.pl.PolishAnalyzer
-
File containing default Polish stemmer table.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.pl.PolishAnalyzer
-
File containing default Polish stopwords.
- Diff - Class in org.egothor.stemmer
-
The Diff object generates a patch string.
- Diff() - Constructor for class org.egothor.stemmer.Diff
-
Constructor for the Diff object.
- Diff(int, int, int, int) - Constructor for class org.egothor.stemmer.Diff
-
Constructor for the Diff object
- DiffIt - Class in org.egothor.stemmer
-
The DiffIt class is a means to generate patch commands from an already prepared stemmer table.
E
- eat(Row, int[]) - Method in class org.egothor.stemmer.Gener
-
Test whether the given Row of Cells in a Trie should be included in an optimized Trie.
- exec(String, String) - Method in class org.egothor.stemmer.Diff
-
Construct a patch string that transforms a to b.
G
- Gener - Class in org.egothor.stemmer
-
The Gener object helps in the discarding of nodes which break the reduction effort and defend the structure against large reductions.
- Gener() - Constructor for class org.egothor.stemmer.Gener
-
Constructor for the Gener object.
- getAll(CharSequence) - Method in class org.egothor.stemmer.Trie
-
Gets the all attribute of the Trie object
- getCells() - Method in class org.egothor.stemmer.Row
-
Return the number of cells in use.
- getCells() - Method in class org.egothor.stemmer.Trie
-
Return the number of cells in this Trie object.
- getCellsPnt() - Method in class org.egothor.stemmer.Row
-
Return the number of references (how many transitions) to other rows.
- getCellsPnt() - Method in class org.egothor.stemmer.Trie
-
Gets the cellsPnt attribute of the Trie object
- getCellsVal() - Method in class org.egothor.stemmer.Row
-
Return the number of patch commands saved in this Row.
- getCellsVal() - Method in class org.egothor.stemmer.Trie
-
Gets the cellsVal attribute of the Trie object
- getCmd(char) - Method in class org.egothor.stemmer.Row
-
Return the command in the Cell associated with the given character.
- getCnt(char) - Method in class org.egothor.stemmer.Row
-
Return the number of patch commands that were in the Cell associated with the given character before the Trie containing this Row was reduced.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.pl.PolishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultTable() - Static method in class org.apache.lucene.analysis.pl.PolishAnalyzer
-
Returns an unmodifiable instance of the default stemmer table.
- getFully(CharSequence) - Method in class org.egothor.stemmer.MultiTrie
-
Return the element that is stored in a cell associated with the given key.
- getFully(CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
-
Return the element that is stored in a cell associated with the given key.
- getFully(CharSequence) - Method in class org.egothor.stemmer.Trie
-
Return the element that is stored in a cell associated with the given key.
- getLastOnPath(CharSequence) - Method in class org.egothor.stemmer.MultiTrie
-
Return the element that is stored as last on a path belonging to the given key.
- getLastOnPath(CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
-
Return the element that is stored as last on a path belonging to the given key.
- getLastOnPath(CharSequence) - Method in class org.egothor.stemmer.Trie
-
Return the element that is stored as last on a path associated with the given key.
- getRef(char) - Method in class org.egothor.stemmer.Row
-
Return the reference to the next Row in the Cell associated with the given character.
I
- incrementToken() - Method in class org.apache.lucene.analysis.stempel.StempelFilter
-
Returns the next input Token, after being stemmed
- Introduction - Search tag in Overview
- Section
L
- Lift - Class in org.egothor.stemmer
-
The Lift class is a data structure that is a variation of a Patricia trie.
- Lift(boolean) - Constructor for class org.egothor.stemmer.Lift
-
Constructor for the Lift object.
- liftUp(Row, List<Row>) - Method in class org.egothor.stemmer.Lift
-
Reduce the trie using Lift-Up reduction.
- load(InputStream) - Static method in class org.apache.lucene.analysis.stempel.StempelStemmer
-
Load a stemmer table from an inputstream.
M
- main(String[]) - Static method in class org.egothor.stemmer.Compile
-
Entry point to the Compile application.
- main(String[]) - Static method in class org.egothor.stemmer.DiffIt
-
Entry point to the DiffIt application.
- merge(Cell, Cell) - Method in class org.egothor.stemmer.Optimizer
-
Merge the given Cells and return the resulting Cell.
- merge(Cell, Cell) - Method in class org.egothor.stemmer.Optimizer2
-
Merge the given Cells and return the resulting Cell.
- merge(Row, Row) - Method in class org.egothor.stemmer.Optimizer
-
Merge the given rows and return the resulting Row.
- MultiTrie - Class in org.egothor.stemmer
-
The MultiTrie is a Trie of Tries.
- MultiTrie(boolean) - Constructor for class org.egothor.stemmer.MultiTrie
-
Constructor for the MultiTrie object
- MultiTrie(DataInput) - Constructor for class org.egothor.stemmer.MultiTrie
-
Constructor for the MultiTrie object.
- MultiTrie2 - Class in org.egothor.stemmer
-
The MultiTrie is a Trie of Tries.
- MultiTrie2(boolean) - Constructor for class org.egothor.stemmer.MultiTrie2
-
Constructor for the MultiTrie2 object
- MultiTrie2(DataInput) - Constructor for class org.egothor.stemmer.MultiTrie2
-
Constructor for the MultiTrie2 object.
N
- NAME - Static variable in class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
-
SPI name
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.pl.PolishAnalyzer
O
- optimize(Trie) - Method in class org.egothor.stemmer.Gener
-
Return a Trie with infrequent values occurring in the given Trie removed.
- optimize(Trie) - Method in class org.egothor.stemmer.Lift
-
Optimize (eliminate rows with no content) the given Trie and return the reduced Trie.
- optimize(Trie) - Method in class org.egothor.stemmer.Optimizer
-
Optimize (remove empty rows) from the given Trie and return the resulting Trie.
- optimize(Trie) - Method in class org.egothor.stemmer.Reduce
-
Optimize (remove holes in the rows) the given Trie and return the restructured Trie.
- Optimizer - Class in org.egothor.stemmer
-
The Optimizer class is a Trie that will be reduced (have empty rows removed).
- Optimizer() - Constructor for class org.egothor.stemmer.Optimizer
-
Constructor for the Optimizer object.
- Optimizer2 - Class in org.egothor.stemmer
-
The Optimizer class is a Trie that will be reduced (have empty rows removed).
- Optimizer2() - Constructor for class org.egothor.stemmer.Optimizer2
-
Constructor for the Optimizer2 object.
- org.apache.lucene.analysis.pl - package org.apache.lucene.analysis.pl
-
Analyzer for Polish.
- org.apache.lucene.analysis.stempel - package org.apache.lucene.analysis.stempel
-
Stempel: Algorithmic Stemmer
- org.egothor.stemmer - package org.egothor.stemmer
-
Egothor stemmer API.
P
- PolishAnalyzer - Class in org.apache.lucene.analysis.pl
-
Analyzer for Polish.
- PolishAnalyzer() - Constructor for class org.apache.lucene.analysis.pl.PolishAnalyzer
-
Builds an analyzer with the default stop words: PolishAnalyzer.DEFAULT_STOPWORD_FILE.
- PolishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.pl.PolishAnalyzer
-
Builds an analyzer with the given stop words.
- PolishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.pl.PolishAnalyzer
-
Builds an analyzer with the given stop words.
- print(PrintStream) - Method in class org.egothor.stemmer.Row
-
Write the contents of this Row to the printstream.
- printInfo(PrintStream, CharSequence) - Method in class org.egothor.stemmer.MultiTrie
-
Print the given prefix and the position(s) in the Trie where it appears.
- printInfo(PrintStream, CharSequence) - Method in class org.egothor.stemmer.Trie
-
Write debugging info to the printstream.
R
- reduce(Reduce) - Method in class org.egothor.stemmer.MultiTrie
-
Remove empty rows from the given Trie and return the newly reduced Trie.
- reduce(Reduce) - Method in class org.egothor.stemmer.MultiTrie2
-
Remove empty rows from the given Trie and return the newly reduced Trie.
- reduce(Reduce) - Method in class org.egothor.stemmer.Trie
-
Remove empty rows from the given Trie and return the newly reduced Trie.
- Reduce - Class in org.egothor.stemmer
-
The Reduce object is used to remove gaps in a Trie which stores a dictionary.
- Reduce() - Constructor for class org.egothor.stemmer.Reduce
-
Constructor for the Reduce object.
- Row - Class in org.egothor.stemmer
-
The Row class represents a row in a matrix representation of a trie.
- Row() - Constructor for class org.egothor.stemmer.Row
-
The default constructor for the Row object.
- Row(DataInput) - Constructor for class org.egothor.stemmer.Row
-
Construct a Row object from input carried in via the given input stream.
- Row(Row) - Constructor for class org.egothor.stemmer.Row
-
Construct a Row using the cells of the given Row.
S
- setCmd(char, int) - Method in class org.egothor.stemmer.Row
-
Set the command in the Cell of the given character to the given integer.
- setRef(char, int) - Method in class org.egothor.stemmer.Row
-
Set the reference to the next row in the Cell of the given character to the given integer.
- stem(CharSequence) - Method in class org.apache.lucene.analysis.stempel.StempelStemmer
-
Stem a word.
- Stempel - Algorithmic Stemmer for Polish Language - Search tag in Overview
- Section
- StempelFilter - Class in org.apache.lucene.analysis.stempel
-
Transforms the token stream as per the stemming algorithm.
- StempelFilter(TokenStream, StempelStemmer) - Constructor for class org.apache.lucene.analysis.stempel.StempelFilter
-
Create filter using the supplied stemming table.
- StempelFilter(TokenStream, StempelStemmer, int) - Constructor for class org.apache.lucene.analysis.stempel.StempelFilter
-
Create filter using the supplied stemming table.
- StempelPolishStemFilterFactory - Class in org.apache.lucene.analysis.stempel
-
Factory for StempelFilter using a Polish stemming table.
- StempelPolishStemFilterFactory() - Constructor for class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
-
Default ctor for compatibility with SPI
- StempelPolishStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
-
Creates a new StempelPolishStemFilterFactory
- StempelStemmer - Class in org.apache.lucene.analysis.stempel
-
Stemmer class is a convenient facade for other stemmer-related classes.
- StempelStemmer(InputStream) - Constructor for class org.apache.lucene.analysis.stempel.StempelStemmer
-
Create a Stemmer using selected stemmer table
- StempelStemmer(Trie) - Constructor for class org.apache.lucene.analysis.stempel.StempelStemmer
-
Create a Stemmer using pre-loaded stemmer table
- store(DataOutput) - Method in class org.egothor.stemmer.MultiTrie
-
Write this data structure to the given output stream.
- store(DataOutput) - Method in class org.egothor.stemmer.MultiTrie2
-
Write this data structure to the given output stream.
- store(DataOutput) - Method in class org.egothor.stemmer.Row
-
Write the contents of this Row to the given output stream.
- store(DataOutput) - Method in class org.egothor.stemmer.Trie
-
Write this Trie to the given output stream.
- Summary - Search tag in Overview
- Section
T
- Terminology - Search tag in Overview
- Section
- Testing - Search tag in Overview
- Section
- Testing procedure - Search tag in Overview
- Section
- Test results - Search tag in Overview
- Section
- Trie - Class in org.egothor.stemmer
-
A Trie is used to store a dictionary of words and their stems.
- Trie(boolean) - Constructor for class org.egothor.stemmer.Trie
-
Constructor for the Trie object.
- Trie(boolean, int, List<CharSequence>, List<Row>) - Constructor for class org.egothor.stemmer.Trie
-
Constructor for the Trie object.
- Trie(DataInput) - Constructor for class org.egothor.stemmer.Trie
-
Constructor for the Trie object.
U
- uniformCmd(boolean) - Method in class org.egothor.stemmer.Row
-
Return the number of identical Cells (containing patch commands) in this Row.