public class WhitespaceTokenizerFactory extends TokenizerFactory
Factory for WhitespaceTokenizer.
<fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory" rule="unicode" maxTokenLen="256"/>
  </analyzer>
</fieldType>
Options:
- rule: either "java" for WhitespaceTokenizer or "unicode" for UnicodeWhitespaceTokenizer
- maxTokenLen: maximum token length; defaults to CharTokenizer::DEFAULT_MAX_TOKEN_LEN

| Modifier and Type | Field and Description |
|---|---|
| static String | RULE_JAVA |
| static String | RULE_UNICODE |
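
As a rough illustration of what the two rule values and maxTokenLen control, here is a JDK-only sketch that does not use Lucene at all. It assumes the "java" rule treats a character as a separator when `Character.isWhitespace` returns true, that the "unicode" rule uses the Unicode White_Space property instead, and that a token reaching maxTokenLen is emitted and the remainder continues as a new token. The names `WhitespaceRuleSketch`, `tokenize`, `JAVA_RULE`, and `UNICODE_RULE` are hypothetical and exist only for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;
import java.util.regex.Pattern;

// Hypothetical sketch; not part of Lucene and not its implementation.
final class WhitespaceRuleSketch {

    // Assumed stand-in for rule="java": whitespace as defined by the JDK.
    static final IntPredicate JAVA_RULE = cp -> Character.isWhitespace(cp);

    // Assumed stand-in for rule="unicode": the Unicode White_Space property.
    private static final Pattern UNICODE_WS = Pattern.compile("\\p{IsWhite_Space}");
    static final IntPredicate UNICODE_RULE =
        cp -> UNICODE_WS.matcher(new String(Character.toChars(cp))).matches();

    // Splits text on separator code points; a token that reaches
    // maxTokenLen chars is emitted and the rest starts a new token.
    static List<String> tokenize(String text, IntPredicate isSeparator, int maxTokenLen) {
        if (maxTokenLen <= 0) {
            throw new IllegalArgumentException("maxTokenLen must be greater than 0");
        }
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isSeparator.test(cp)) {
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.appendCodePoint(cp);
                if (current.length() >= maxTokenLen) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "a\u00A0b c"; // U+00A0 is a no-break space
        System.out.println(tokenize(text, JAVA_RULE, 255));
        System.out.println(tokenize(text, UNICODE_RULE, 255));
        System.out.println(tokenize("abcdefg hi", JAVA_RULE, 3));
    }
}
```

The no-break space is the interesting case: `Character.isWhitespace` does not count it as whitespace, while the Unicode White_Space property does, so the two predicates split the same input differently.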
Fields inherited from AbstractAnalysisFactory: LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion

| Constructor and Description |
|---|
| WhitespaceTokenizerFactory(Map<String,String> args): Creates a new WhitespaceTokenizerFactory |
| Modifier and Type | Method and Description |
|---|---|
| Tokenizer | create(AttributeFactory factory): Creates a TokenStream of the specified input using the given AttributeFactory |
Methods inherited from TokenizerFactory: availableTokenizers, create, forName, lookupClass, reloadTokenizers

Methods inherited from AbstractAnalysisFactory: get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames

public static final String RULE_JAVA
public static final String RULE_UNICODE
public Tokenizer create(AttributeFactory factory)
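Description copied from class: TokenizerFactory. Creates a TokenStream of the specified input using the given AttributeFactory.
Specified by: create in class TokenizerFactory

Copyright © 2000-2019 Apache Software Foundation. All Rights Reserved.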