Package org.tartarus.snowball
Class SnowballProgram
java.lang.Object
org.tartarus.snowball.SnowballProgram
- All Implemented Interfaces:
Serializable
- Direct Known Subclasses:
SnowballStemmer
Base class for a snowball stemmer
- See Also:
-
Field Summary
Modifier and TypeFieldDescriptionprotected int
protected int
protected int
protected int
protected int
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionprotected void
protected void
copy_from
(SnowballProgram other) protected boolean
eq_s
(CharSequence s) protected boolean
protected int
find_among
(Among[] v) protected int
find_among_b
(Among[] v) Get the current string.char[]
Get the current buffer containing the stem.int
Get the valid length of the character array ingetCurrentBuffer()
.protected boolean
in_grouping
(char[] s, int min, int max) protected boolean
in_grouping_b
(char[] s, int min, int max) protected void
insert
(int c_bra, int c_ket, CharSequence s) protected boolean
out_grouping
(char[] s, int min, int max) protected boolean
out_grouping_b
(char[] s, int min, int max) protected int
replace_s
(int c_bra, int c_ket, CharSequence s) void
setCurrent
(char[] text, int length) Set the current string.void
setCurrent
(String value) Set the current string.protected void
protected void
protected void
protected void
-
Field Details
-
cursor
protected int cursor -
limit
protected int limit -
limit_backward
protected int limit_backward -
bra
protected int bra -
ket
protected int ket
-
-
Constructor Details
-
SnowballProgram
protected SnowballProgram() -
SnowballProgram
-
-
Method Details
-
setCurrent
Set the current string. -
getCurrent
Get the current string. -
setCurrent
public void setCurrent(char[] text, int length) Set the current string.- Parameters:
text
- character array containing inputlength
- valid length of text.
-
getCurrentBuffer
public char[] getCurrentBuffer()Get the current buffer containing the stem.NOTE: this may be a reference to a different character array than the one originally provided with setCurrent, in the exceptional case that stemming produced a longer intermediate or result string.
It is necessary to use
getCurrentBufferLength()
to determine the valid length of the returned buffer. For example, many words are stemmed simply by subtracting from the length to remove suffixes.- See Also:
-
getCurrentBufferLength
public int getCurrentBufferLength()Get the valid length of the character array ingetCurrentBuffer()
.- Returns:
- valid length of the array.
-
copy_from
-
in_grouping
protected boolean in_grouping(char[] s, int min, int max) -
in_grouping_b
protected boolean in_grouping_b(char[] s, int min, int max) -
out_grouping
protected boolean out_grouping(char[] s, int min, int max) -
out_grouping_b
protected boolean out_grouping_b(char[] s, int min, int max) -
eq_s
-
eq_s_b
-
find_among
-
find_among_b
-
replace_s
-
slice_check
protected void slice_check() -
slice_from
-
slice_del
protected void slice_del() -
insert
-
slice_to
-
assign_to
-