Deprecated. Use HMMChineseTokenizerFactory instead.

@Deprecated public class SmartChineseWordTokenFilterFactory extends TokenFilterFactory

Factory for WordTokenFilter.
Note: this class currently emits tokens for punctuation. You should therefore either
add a WordDelimiterFilter after it to remove them (with concatenate off), or use the
SmartChinese stoplist with a StopFilterFactory via:
words="org/apache/lucene/analysis/cn/smart/stopwords.txt"
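To illustrate why a stoplist removes the punctuation tokens mentioned in the note above, here is a minimal plain-Java sketch. It does not call any Lucene class: `PunctuationStopSketch`, `SAMPLE_STOP_SET`, and `dropStopped` are hypothetical names, and the two stop entries are assumed samples rather than the actual contents of stopwords.txt.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Plain-Java illustration only; no Lucene API is used here.
final class PunctuationStopSketch {

    // Hypothetical sample entries standing in for a stoplist:
    // U+FF0C (fullwidth comma) and U+3002 (ideographic full stop).
    static final Set<String> SAMPLE_STOP_SET =
            new HashSet<>(Arrays.asList("\uFF0C", "\u3002"));

    // Returns the tokens that are not in the stop set, preserving order.
    static List<String> dropStopped(List<String> tokens, Set<String> stopSet) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!stopSet.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Assumed tokenizer output that still contains punctuation tokens:
        // two word tokens, each followed by a punctuation token.
        List<String> tokens =
                Arrays.asList("\u4E2D\u56FD", "\uFF0C", "\u4EBA", "\u3002");
        System.out.println(dropStopped(tokens, SAMPLE_STOP_SET));
    }
}
```

In an actual analysis chain the same effect would come from a StopFilterFactory configured with the `words` resource shown above; the sketch only mirrors the set-membership check.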
Fields inherited from class AbstractAnalysisFactory: LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion

| Constructor and Description |
|---|
| SmartChineseWordTokenFilterFactory(Map<String,String> args): Deprecated. Creates a new SmartChineseWordTokenFilterFactory. |

| Modifier and Type | Method and Description |
|---|---|
| TokenFilter | create(TokenStream input): Deprecated. |

Methods inherited from class TokenFilterFactory: availableTokenFilters, forName, lookupClass, reloadTokenFilters

Methods inherited from class AbstractAnalysisFactory: get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitFileNames

public TokenFilter create(TokenStream input)

Specified by: create in class TokenFilterFactory

Copyright © 2000-2015 Apache Software Foundation. All Rights Reserved.