Index

A B C D E G I L M N O P R S T U 
All Classes and Interfaces|All Packages|Constant Field Values

A

add(CharSequence, CharSequence) - Method in class org.egothor.stemmer.MultiTrie
Add an element to this structure consisting of the given key and patch command.
add(CharSequence, CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
Add an element to this structure consisting of the given key and patch command.
Algorithm and implementation - Search tag in Overview
Section
apply(StringBuilder, CharSequence) - Static method in class org.egothor.stemmer.Diff
Apply the given patch string diff to the given string dest.

B

Background - Search tag in Overview
Section
Bibliography - Search tag in Overview
Section

C

Compile - Class in org.egothor.stemmer
The Compile class is used to compile a stemmer table.
Corpus - Search tag in Overview
Section
create(TokenStream) - Method in class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
 
createComponents(String) - Method in class org.apache.lucene.analysis.pl.PolishAnalyzer
Creates a Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.

D

decompose(CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
Break the given patch command into its constituent pieces.
DEFAULT_MIN_LENGTH - Static variable in class org.apache.lucene.analysis.stempel.StempelFilter
Minimum length of input words to be processed.
DEFAULT_STEMMER_FILE - Static variable in class org.apache.lucene.analysis.pl.PolishAnalyzer
File containing default Polish stemmer table.
DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.pl.PolishAnalyzer
File containing default Polish stopwords.
Diff - Class in org.egothor.stemmer
The Diff object generates a patch string.
Diff() - Constructor for class org.egothor.stemmer.Diff
Constructor for the Diff object.
Diff(int, int, int, int) - Constructor for class org.egothor.stemmer.Diff
Constructor for the Diff object
DiffIt - Class in org.egothor.stemmer
The DiffIt class is a means generate patch commands from an already prepared stemmer table.

E

eat(Row, int[]) - Method in class org.egothor.stemmer.Gener
Test whether the given Row of Cells in a Trie should be included in an optimized Trie.
exec(String, String) - Method in class org.egothor.stemmer.Diff
Construct a patch string that transforms a to b.

G

Gener - Class in org.egothor.stemmer
The Gener object helps in the discarding of nodes which break the reduction effort and defend the structure against large reductions.
Gener() - Constructor for class org.egothor.stemmer.Gener
Constructor for the Gener object.
getAll(CharSequence) - Method in class org.egothor.stemmer.Trie
Gets the all attribute of the Trie object
getCells() - Method in class org.egothor.stemmer.Row
Return the number of cells in use.
getCells() - Method in class org.egothor.stemmer.Trie
Return the number of cells in this Trie object.
getCellsPnt() - Method in class org.egothor.stemmer.Row
Return the number of references (how many transitions) to other rows.
getCellsPnt() - Method in class org.egothor.stemmer.Trie
Gets the cellsPnt attribute of the Trie object
getCellsVal() - Method in class org.egothor.stemmer.Row
Return the number of patch commands saved in this Row.
getCellsVal() - Method in class org.egothor.stemmer.Trie
Gets the cellsVal attribute of the Trie object
getCmd(char) - Method in class org.egothor.stemmer.Row
Return the command in the Cell associated with the given character.
getCnt(char) - Method in class org.egothor.stemmer.Row
Return the number of patch commands were in the Cell associated with the given character before the Trie containing this Row was reduced.
getDefaultStopSet() - Static method in class org.apache.lucene.analysis.pl.PolishAnalyzer
Returns an unmodifiable instance of the default stop words set.
getDefaultTable() - Static method in class org.apache.lucene.analysis.pl.PolishAnalyzer
Returns an unmodifiable instance of the default stemmer table.
getFully(CharSequence) - Method in class org.egothor.stemmer.MultiTrie
Return the element that is stored in a cell associated with the given key.
getFully(CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
Return the element that is stored in a cell associated with the given key.
getFully(CharSequence) - Method in class org.egothor.stemmer.Trie
Return the element that is stored in a cell associated with the given key.
getLastOnPath(CharSequence) - Method in class org.egothor.stemmer.MultiTrie
Return the element that is stored as last on a path belonging to the given key.
getLastOnPath(CharSequence) - Method in class org.egothor.stemmer.MultiTrie2
Return the element that is stored as last on a path belonging to the given key.
getLastOnPath(CharSequence) - Method in class org.egothor.stemmer.Trie
Return the element that is stored as last on a path associated with the given key.
getRef(char) - Method in class org.egothor.stemmer.Row
Return the reference to the next Row in the Cell associated with the given character.

I

incrementToken() - Method in class org.apache.lucene.analysis.stempel.StempelFilter
Returns the next input Token, after being stemmed
Introduction - Search tag in Overview
Section

L

Lift - Class in org.egothor.stemmer
The Lift class is a data structure that is a variation of a Patricia trie.
Lift(boolean) - Constructor for class org.egothor.stemmer.Lift
Constructor for the Lift object.
liftUp(Row, List<Row>) - Method in class org.egothor.stemmer.Lift
Reduce the trie using Lift-Up reduction.
load(InputStream) - Static method in class org.apache.lucene.analysis.stempel.StempelStemmer
Load a stemmer table from an inputstream.

M

main(String[]) - Static method in class org.egothor.stemmer.Compile
Entry point to the Compile application.
main(String[]) - Static method in class org.egothor.stemmer.DiffIt
Entry point to the DiffIt application.
merge(Cell, Cell) - Method in class org.egothor.stemmer.Optimizer
Merge the given Cells and return the resulting Cell.
merge(Cell, Cell) - Method in class org.egothor.stemmer.Optimizer2
Merge the given Cells and return the resulting Cell.
merge(Row, Row) - Method in class org.egothor.stemmer.Optimizer
Merge the given rows and return the resulting Row.
MultiTrie - Class in org.egothor.stemmer
The MultiTrie is a Trie of Tries.
MultiTrie(boolean) - Constructor for class org.egothor.stemmer.MultiTrie
Constructor for the MultiTrie object
MultiTrie(DataInput) - Constructor for class org.egothor.stemmer.MultiTrie
Constructor for the MultiTrie object.
MultiTrie2 - Class in org.egothor.stemmer
The MultiTrie is a Trie of Tries.
MultiTrie2(boolean) - Constructor for class org.egothor.stemmer.MultiTrie2
Constructor for the MultiTrie2 object
MultiTrie2(DataInput) - Constructor for class org.egothor.stemmer.MultiTrie2
Constructor for the MultiTrie object.

N

NAME - Static variable in class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
SPI name
normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.pl.PolishAnalyzer
 

O

optimize(Trie) - Method in class org.egothor.stemmer.Gener
Return a Trie with infrequent values occurring in the given Trie removed.
optimize(Trie) - Method in class org.egothor.stemmer.Lift
Optimize (eliminate rows with no content) the given Trie and return the reduced Trie.
optimize(Trie) - Method in class org.egothor.stemmer.Optimizer
Optimize (remove empty rows) from the given Trie and return the resulting Trie.
optimize(Trie) - Method in class org.egothor.stemmer.Reduce
Optimize (remove holes in the rows) the given Trie and return the restructured Trie.
Optimizer - Class in org.egothor.stemmer
The Optimizer class is a Trie that will be reduced (have empty rows removed).
Optimizer() - Constructor for class org.egothor.stemmer.Optimizer
Constructor for the Optimizer object.
Optimizer2 - Class in org.egothor.stemmer
The Optimizer class is a Trie that will be reduced (have empty rows removed).
Optimizer2() - Constructor for class org.egothor.stemmer.Optimizer2
Constructor for the Optimizer2 object.
org.apache.lucene.analysis.pl - package org.apache.lucene.analysis.pl
Analyzer for Polish.
org.apache.lucene.analysis.stempel - package org.apache.lucene.analysis.stempel
Stempel: Algorithmic Stemmer
org.egothor.stemmer - package org.egothor.stemmer
Egothor stemmer API.

P

PolishAnalyzer - Class in org.apache.lucene.analysis.pl
Analyzer for Polish.
PolishAnalyzer() - Constructor for class org.apache.lucene.analysis.pl.PolishAnalyzer
Builds an analyzer with the default stop words: PolishAnalyzer.DEFAULT_STOPWORD_FILE.
PolishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.pl.PolishAnalyzer
Builds an analyzer with the given stop words.
PolishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.pl.PolishAnalyzer
Builds an analyzer with the given stop words.
print(PrintStream) - Method in class org.egothor.stemmer.Row
Write the contents of this Row to the printstream.
printInfo(PrintStream, CharSequence) - Method in class org.egothor.stemmer.MultiTrie
Print the given prefix and the position(s) in the Trie where it appears.
printInfo(PrintStream, CharSequence) - Method in class org.egothor.stemmer.Trie
writes debugging info to the printstream

R

reduce(Reduce) - Method in class org.egothor.stemmer.MultiTrie
Remove empty rows from the given Trie and return the newly reduced Trie.
reduce(Reduce) - Method in class org.egothor.stemmer.MultiTrie2
Remove empty rows from the given Trie and return the newly reduced Trie.
reduce(Reduce) - Method in class org.egothor.stemmer.Trie
Remove empty rows from the given Trie and return the newly reduced Trie.
Reduce - Class in org.egothor.stemmer
The Reduce object is used to remove gaps in a Trie which stores a dictionary.
Reduce() - Constructor for class org.egothor.stemmer.Reduce
Constructor for the Reduce object.
Row - Class in org.egothor.stemmer
The Row class represents a row in a matrix representation of a trie.
Row() - Constructor for class org.egothor.stemmer.Row
The default constructor for the Row object.
Row(DataInput) - Constructor for class org.egothor.stemmer.Row
Construct a Row object from input carried in via the given input stream.
Row(Row) - Constructor for class org.egothor.stemmer.Row
Construct a Row using the cells of the given Row.

S

setCmd(char, int) - Method in class org.egothor.stemmer.Row
Set the command in the Cell of the given character to the given integer.
setRef(char, int) - Method in class org.egothor.stemmer.Row
Set the reference to the next row in the Cell of the given character to the given integer.
stem(CharSequence) - Method in class org.apache.lucene.analysis.stempel.StempelStemmer
Stem a word.
Stempel - Algorithmic Stemmer for Polish Language - Search tag in Overview
Section
StempelFilter - Class in org.apache.lucene.analysis.stempel
Transforms the token stream as per the stemming algorithm.
StempelFilter(TokenStream, StempelStemmer) - Constructor for class org.apache.lucene.analysis.stempel.StempelFilter
Create filter using the supplied stemming table.
StempelFilter(TokenStream, StempelStemmer, int) - Constructor for class org.apache.lucene.analysis.stempel.StempelFilter
Create filter using the supplied stemming table.
StempelPolishStemFilterFactory - Class in org.apache.lucene.analysis.stempel
Factory for StempelFilter using a Polish stemming table.
StempelPolishStemFilterFactory() - Constructor for class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
Default ctor for compatibility with SPI
StempelPolishStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.stempel.StempelPolishStemFilterFactory
Creates a new StempelPolishStemFilterFactory
StempelStemmer - Class in org.apache.lucene.analysis.stempel
Stemmer class is a convenient facade for other stemmer-related classes.
StempelStemmer(InputStream) - Constructor for class org.apache.lucene.analysis.stempel.StempelStemmer
Create a Stemmer using selected stemmer table
StempelStemmer(Trie) - Constructor for class org.apache.lucene.analysis.stempel.StempelStemmer
Create a Stemmer using pre-loaded stemmer table
store(DataOutput) - Method in class org.egothor.stemmer.MultiTrie
Write this data structure to the given output stream.
store(DataOutput) - Method in class org.egothor.stemmer.MultiTrie2
Write this data structure to the given output stream.
store(DataOutput) - Method in class org.egothor.stemmer.Row
Write the contents of this Row to the given output stream.
store(DataOutput) - Method in class org.egothor.stemmer.Trie
Write this Trie to the given output stream.
Summary - Search tag in Overview
Section

T

Terminology - Search tag in Overview
Section
Testing - Search tag in Overview
Section
Testing procedure - Search tag in Overview
Section
Test results - Search tag in Overview
Section
Trie - Class in org.egothor.stemmer
A Trie is used to store a dictionary of words and their stems.
Trie(boolean) - Constructor for class org.egothor.stemmer.Trie
Constructor for the Trie object.
Trie(boolean, int, List<CharSequence>, List<Row>) - Constructor for class org.egothor.stemmer.Trie
Constructor for the Trie object.
Trie(DataInput) - Constructor for class org.egothor.stemmer.Trie
Constructor for the Trie object.

U

uniformCmd(boolean) - Method in class org.egothor.stemmer.Row
Return the number of identical Cells (containing patch commands) in this Row.
A B C D E G I L M N O P R S T U 
All Classes and Interfaces|All Packages|Constant Field Values