HMMChineseTokenizerFactory
instead@Deprecated public class SmartChineseWordTokenFilterFactory extends TokenFilterFactory
WordTokenFilter
Note: this class will currently emit tokens for punctuation. So you should either add
a WordDelimiterFilter after to remove these (with concatenate off), or use the
SmartChinese stoplist with a StopFilterFactory via:
words="org/apache/lucene/analysis/cn/smart/stopwords.txt"
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
Constructor and Description |
---|
SmartChineseWordTokenFilterFactory(Map<String,String> args)
Deprecated.
Creates a new SmartChineseWordTokenFilterFactory
|
Modifier and Type | Method and Description |
---|---|
TokenFilter |
create(TokenStream input)
Deprecated.
|
availableTokenFilters, forName, lookupClass, reloadTokenFilters
get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitFileNames
public TokenFilter create(TokenStream input)
create
in class TokenFilterFactory
Copyright © 2000-2015 Apache Software Foundation. All Rights Reserved.