public class SmartChineseWordTokenFilterFactory extends TokenFilterFactory
Factory for the SmartChineseAnalyzer WordTokenFilter.
Note: this class currently emits tokens for punctuation. To remove them, either add
a WordDelimiterFilter after it (with concatenate off), or use the SmartChinese
stoplist with a StopFilterFactory via:
words="org/apache/lucene/analysis/cn/smart/stopwords.txt"
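The stoplist option above could be wired into a Solr schema roughly as follows. This is a minimal sketch, not a configuration taken from this page: the field type name `text_smartcn` is hypothetical, and the sketch assumes the smartcn analysis jar is on Solr's classpath and that SmartChineseSentenceTokenizerFactory from the same package supplies the sentence tokens this filter consumes.

```xml
<!-- Hypothetical field type; the name "text_smartcn" is illustrative only. -->
<fieldType name="text_smartcn" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Split the input into sentences (assumed upstream tokenizer). -->
    <tokenizer class="org.apache.lucene.analysis.cn.smart.SmartChineseSentenceTokenizerFactory"/>
    <!-- Segment each sentence into words; punctuation tokens are still emitted here. -->
    <filter class="org.apache.lucene.analysis.cn.smart.SmartChineseWordTokenFilterFactory"/>
    <!-- Drop punctuation and other stop tokens using the SmartChinese stoplist. -->
    <filter class="solr.StopFilterFactory"
            words="org/apache/lucene/analysis/cn/smart/stopwords.txt"/>
  </analyzer>
</fieldType>
```

The StopFilterFactory is placed after the word filter because the punctuation tokens only exist once the sentences have been segmented.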
Fields inherited from class AbstractAnalysisFactory: args, luceneMatchVersion
Constructor and Description |
---|
SmartChineseWordTokenFilterFactory() |
Modifier and Type | Method and Description |
---|---|
TokenFilter | create(TokenStream input) |
Methods inherited from class TokenFilterFactory: availableTokenFilters, forName, lookupClass, reloadTokenFilters
Methods inherited from class AbstractAnalysisFactory: assureMatchVersion, getArgs, getBoolean, getBoolean, getInt, getInt, getInt, getLines, getLuceneMatchVersion, getPattern, getSnowballWordSet, getWordSet, init, setLuceneMatchVersion, splitFileNames
public TokenFilter create(TokenStream input)
Specified by: create in class TokenFilterFactory
Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.