@Deprecated public final class Lucene47WordDelimiterFilter extends TokenFilter
WordDelimiterFilterAttributeSource.State| Modifier and Type | Field and Description |
|---|---|
static int |
ALPHA
Deprecated.
|
static int |
ALPHANUM
Deprecated.
|
static int |
CATENATE_ALL
Deprecated.
Causes all subword parts to be catenated:
|
static int |
CATENATE_NUMBERS
Deprecated.
Causes maximum runs of word parts to be catenated:
|
static int |
CATENATE_WORDS
Deprecated.
Causes maximum runs of word parts to be catenated:
|
static int |
DIGIT
Deprecated.
|
static int |
GENERATE_NUMBER_PARTS
Deprecated.
Causes number subwords to be generated:
|
static int |
GENERATE_WORD_PARTS
Deprecated.
Causes parts of words to be generated:
|
static int |
LOWER
Deprecated.
|
static int |
PRESERVE_ORIGINAL
Deprecated.
Causes original words are preserved and added to the subword list (Defaults to false)
|
static int |
SPLIT_ON_CASE_CHANGE
Deprecated.
If not set, causes case changes to be ignored (subwords will only be generated
given SUBWORD_DELIM tokens)
|
static int |
SPLIT_ON_NUMERICS
Deprecated.
If not set, causes numeric changes to be ignored (subwords will only be generated
given SUBWORD_DELIM tokens).
|
static int |
STEM_ENGLISH_POSSESSIVE
Deprecated.
Causes trailing "'s" to be removed for each subword
|
static int |
SUBWORD_DELIM
Deprecated.
|
static int |
UPPER
Deprecated.
|
inputDEFAULT_TOKEN_ATTRIBUTE_FACTORY| Constructor and Description |
|---|
Lucene47WordDelimiterFilter(TokenStream in,
byte[] charTypeTable,
int configurationFlags,
CharArraySet protWords)
Deprecated.
Creates a new WordDelimiterFilter
|
Lucene47WordDelimiterFilter(TokenStream in,
int configurationFlags,
CharArraySet protWords)
Deprecated.
Creates a new WordDelimiterFilter using
WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE
as its charTypeTable |
| Modifier and Type | Method and Description |
|---|---|
boolean |
incrementToken()
Deprecated.
|
void |
reset()
Deprecated.
|
close, endaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toStringpublic static final int LOWER
public static final int UPPER
public static final int DIGIT
public static final int SUBWORD_DELIM
public static final int ALPHA
public static final int ALPHANUM
public static final int GENERATE_WORD_PARTS
"PowerShot" => "Power" "Shot"
public static final int GENERATE_NUMBER_PARTS
"500-42" => "500" "42"
public static final int CATENATE_WORDS
"wi-fi" => "wifi"
public static final int CATENATE_NUMBERS
"wi-fi" => "wifi"
public static final int CATENATE_ALL
"wi-fi-4000" => "wifi4000"
public static final int PRESERVE_ORIGINAL
"500-42" => "500" "42" "500-42"
public static final int SPLIT_ON_CASE_CHANGE
public static final int SPLIT_ON_NUMERICS
public static final int STEM_ENGLISH_POSSESSIVE
"O'Neil's" => "O", "Neil"
public Lucene47WordDelimiterFilter(TokenStream in, byte[] charTypeTable, int configurationFlags, CharArraySet protWords)
in - TokenStream to be filteredcharTypeTable - table containing character typesconfigurationFlags - Flags configuring the filterprotWords - If not null is the set of tokens to protect from being delimitedpublic Lucene47WordDelimiterFilter(TokenStream in, int configurationFlags, CharArraySet protWords)
WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE
as its charTypeTablein - TokenStream to be filteredconfigurationFlags - Flags configuring the filterprotWords - If not null is the set of tokens to protect from being delimitedpublic boolean incrementToken()
throws IOException
incrementToken in class TokenStreamIOExceptionpublic void reset()
throws IOException
reset in class TokenFilterIOExceptionCopyright © 2000-2016 Apache Software Foundation. All Rights Reserved.