public class WikipediaTokenizerFactory extends TokenizerFactory
WikipediaTokenizer
.
<fieldType name="text_wiki" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.WikipediaTokenizerFactory"/> </analyzer> </fieldType>
Modifier and Type | Field and Description |
---|---|
static String |
NAME
SPI name
|
static String |
TOKEN_OUTPUT |
protected int |
tokenOutput |
static String |
UNTOKENIZED_TYPES |
protected Set<String> |
untokenizedTypes |
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
Constructor and Description |
---|
WikipediaTokenizerFactory(Map<String,String> args)
Creates a new WikipediaTokenizerFactory
|
Modifier and Type | Method and Description |
---|---|
WikipediaTokenizer |
create(AttributeFactory factory)
Creates a TokenStream of the specified input using the given AttributeFactory
|
availableTokenizers, create, findSPIName, forName, lookupClass, reloadTokenizers
get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames
public static final String NAME
public static final String TOKEN_OUTPUT
public static final String UNTOKENIZED_TYPES
protected final int tokenOutput
public WikipediaTokenizer create(AttributeFactory factory)
TokenizerFactory
create
in class TokenizerFactory
Copyright © 2000-2020 Apache Software Foundation. All Rights Reserved.