public final class EdgeNGramTokenFilter extends TokenFilter
This TokenFilter create n-grams from the beginning edge or ending edge of a input token.
As of Lucene 4.4, this filter does not support
EdgeNGramTokenFilter.Side.BACK (you can use ReverseStringFilter up-front and
afterward to get the same behavior), handles supplementary characters
correctly and does not update offsets anymore.
| Modifier and Type | Class and Description |
|---|---|
static class |
EdgeNGramTokenFilter.Side
Specifies which side of the input the n-gram should be generated from
|
AttributeSource.AttributeFactory, AttributeSource.State| Modifier and Type | Field and Description |
|---|---|
static int |
DEFAULT_MAX_GRAM_SIZE |
static int |
DEFAULT_MIN_GRAM_SIZE |
static EdgeNGramTokenFilter.Side |
DEFAULT_SIDE |
input| Constructor and Description |
|---|
EdgeNGramTokenFilter(Version version,
TokenStream input,
EdgeNGramTokenFilter.Side side,
int minGram,
int maxGram)
Deprecated.
|
EdgeNGramTokenFilter(Version version,
TokenStream input,
int minGram,
int maxGram)
Creates EdgeNGramTokenFilter that can generate n-grams in the sizes of the given range
|
EdgeNGramTokenFilter(Version version,
TokenStream input,
String sideLabel,
int minGram,
int maxGram)
Deprecated.
|
| Modifier and Type | Method and Description |
|---|---|
boolean |
incrementToken() |
void |
reset() |
close, endaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreStatepublic static final EdgeNGramTokenFilter.Side DEFAULT_SIDE
public static final int DEFAULT_MAX_GRAM_SIZE
public static final int DEFAULT_MIN_GRAM_SIZE
@Deprecated public EdgeNGramTokenFilter(Version version, TokenStream input, EdgeNGramTokenFilter.Side side, int minGram, int maxGram)
version - the Lucene match versioninput - TokenStream holding the input to be tokenizedside - the EdgeNGramTokenFilter.Side from which to chop off an n-gramminGram - the smallest n-gram to generatemaxGram - the largest n-gram to generate@Deprecated public EdgeNGramTokenFilter(Version version, TokenStream input, String sideLabel, int minGram, int maxGram)
version - the Lucene match versioninput - TokenStream holding the input to be tokenizedsideLabel - the name of the EdgeNGramTokenFilter.Side from which to chop off an n-gramminGram - the smallest n-gram to generatemaxGram - the largest n-gram to generatepublic EdgeNGramTokenFilter(Version version, TokenStream input, int minGram, int maxGram)
version - the Lucene match versioninput - TokenStream holding the input to be tokenizedminGram - the smallest n-gram to generatemaxGram - the largest n-gram to generatepublic final boolean incrementToken()
throws IOException
incrementToken in class TokenStreamIOExceptionpublic void reset()
throws IOException
reset in class TokenFilterIOExceptionCopyright © 2000-2013 Apache Software Foundation. All Rights Reserved.