public final class StopFilter extends FilteringTokenFilter
You must specify the required Version
compatibility when creating StopFilter:
AttributeSource.AttributeFactory, AttributeSource.Stateinput| Constructor and Description |
|---|
StopFilter(Version matchVersion,
TokenStream in,
CharArraySet stopWords)
Constructs a filter which removes words from the input TokenStream that are
named in the Set.
|
| Modifier and Type | Method and Description |
|---|---|
protected boolean |
accept()
Returns the next input Token whose term() is not a stop word.
|
static CharArraySet |
makeStopSet(Version matchVersion,
List<?> stopWords)
Builds a Set from an array of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
makeStopSet(Version matchVersion,
List<?> stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword list.
|
static CharArraySet |
makeStopSet(Version matchVersion,
String... stopWords)
Builds a Set from an array of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
makeStopSet(Version matchVersion,
String[] stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword array.
|
getEnablePositionIncrements, incrementToken, reset, setEnablePositionIncrementsclose, endaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreStatepublic StopFilter(Version matchVersion, TokenStream in, CharArraySet stopWords)
matchVersion - Lucene version to enable correct Unicode 4.0 behavior in the stop
set if Version > 3.0. See above for details.in - Input streamstopWords - A CharArraySet representing the stopwords.makeStopSet(Version, java.lang.String...)public static CharArraySet makeStopSet(Version matchVersion, String... stopWords)
matchVersion - Lucene version to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0stopWords - An array of stopwordspassing false to ignoreCasepublic static CharArraySet makeStopSet(Version matchVersion, List<?> stopWords)
matchVersion - Lucene version to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0stopWords - A List of Strings or char[] or any other toString()-able list representing the stopwordsCharArraySet) containing the wordspassing false to ignoreCasepublic static CharArraySet makeStopSet(Version matchVersion, String[] stopWords, boolean ignoreCase)
matchVersion - Lucene version to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0stopWords - An array of stopwordsignoreCase - If true, all words are lower cased first.public static CharArraySet makeStopSet(Version matchVersion, List<?> stopWords, boolean ignoreCase)
matchVersion - Lucene version to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0stopWords - A List of Strings or char[] or any other toString()-able list representing the stopwordsignoreCase - if true, all words are lower cased firstCharArraySet) containing the wordsprotected boolean accept()
accept in class FilteringTokenFilterCopyright © 2000-2013 Apache Software Foundation. All Rights Reserved.