public class HyphenationCompoundWordTokenFilterFactory extends TokenFilterFactory implements ResourceLoaderAware
Factory for HyphenationCompoundWordTokenFilter.
This factory accepts the following parameters:
hyphenator (mandatory): path to the FOP XML hyphenation pattern file.
See http://offo.sourceforge.net/hyphenation/.
encoding (optional): encoding of the XML hyphenation file. Defaults to UTF-8.
dictionary (optional): dictionary of words. Defaults to no dictionary.
minWordSize (optional): minimum length of a word that gets decomposed. Defaults to 5.
minSubwordSize (optional): minimum length of subwords. Defaults to 2.
maxSubwordSize (optional): maximum length of subwords. Defaults to 15.
onlyLongestMatch (optional): if true, adds only the longest matching subword to the stream. Defaults to false.
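
The interaction of the size limits and onlyLongestMatch can be easier to see in code. The following is a minimal, standard-library-only sketch and not the Lucene implementation: it assumes every character position is a candidate boundary and checks candidates against a dictionary, whereas the real filter takes its candidate boundaries from the hyphenation pattern file. The class name CompoundParamsSketch, the method decompose, and the reading of "longest match" as "longest match per start position" are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Hypothetical illustration of the documented parameters; not a Lucene class. */
final class CompoundParamsSketch {

    static List<String> decompose(String word, Set<String> dictionary,
                                  int minWordSize, int minSubwordSize,
                                  int maxSubwordSize, boolean onlyLongestMatch) {
        List<String> out = new ArrayList<>();
        out.add(word); // the original token is kept in this sketch
        if (word.length() < minWordSize) {
            return out; // too short: not decomposed at all
        }
        for (int start = 0; start + minSubwordSize <= word.length(); start++) {
            String longest = null;
            for (int len = minSubwordSize;
                 len <= maxSubwordSize && start + len <= word.length(); len++) {
                String candidate = word.substring(start, start + len);
                // Assumption: dictionary entries are lower case.
                if (!dictionary.contains(candidate.toLowerCase(Locale.ROOT))) {
                    continue;
                }
                if (onlyLongestMatch) {
                    longest = candidate; // len only grows, so the last hit is the longest
                } else {
                    out.add(candidate);
                }
            }
            if (longest != null) {
                out.add(longest);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> dict = new HashSet<>(Arrays.asList("fuss", "ball", "fussball", "pumpe"));
        System.out.println(decompose("Fussballpumpe", dict, 5, 2, 15, false));
        // prints [Fussballpumpe, Fuss, Fussball, ball, pumpe]
        System.out.println(decompose("Fussballpumpe", dict, 5, 2, 15, true));
        // prints [Fussballpumpe, Fussball, ball, pumpe]
    }
}
```

With onlyLongestMatch set to true, "Fuss" is dropped because "Fussball" starts at the same position and is longer. Lowering maxSubwordSize to 4 would instead drop "Fussball" and "pumpe", since both exceed the limit.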
<fieldType name="text_hyphncomp" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.HyphenationCompoundWordTokenFilterFactory" hyphenator="hyphenator.xml" encoding="UTF-8"
            dictionary="dictionary.txt" minWordSize="5" minSubwordSize="2" maxSubwordSize="15" onlyLongestMatch="false"/>
  </analyzer>
</fieldType>

See Also: HyphenationCompoundWordTokenFilter

Fields inherited from class AbstractAnalysisFactory: args, luceneMatchVersion

| Constructor and Description |
|---|
| HyphenationCompoundWordTokenFilterFactory() |

| Modifier and Type | Method and Description |
|---|---|
| HyphenationCompoundWordTokenFilter | create(TokenStream input): Transform the specified input TokenStream. |
| void | inform(ResourceLoader loader): Initializes this component with the provided ResourceLoader (used for loading classes, files, etc). |
| void | init(Map<String,String> args): Initialize this factory via a set of key-value pairs. |
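
As a rough illustration of what "a set of key-value pairs" means for init, the sketch below reads the documented parameters from a Map<String,String> and applies the documented defaults. It uses only the Java standard library and is not the Lucene source: the class HyphenationArgsSketch, the helper intArg, and the choice of IllegalArgumentException for a missing hyphenator are assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical holder for the documented parameters; not a Lucene class. */
final class HyphenationArgsSketch {
    final String hyphenator;
    final String encoding;
    final String dictionary; // null means "no dictionary"
    final int minWordSize;
    final int minSubwordSize;
    final int maxSubwordSize;
    final boolean onlyLongestMatch;

    HyphenationArgsSketch(Map<String, String> args) {
        hyphenator = args.get("hyphenator");
        if (hyphenator == null) {
            // Assumption: a missing mandatory parameter is rejected eagerly.
            throw new IllegalArgumentException("Missing required parameter: hyphenator");
        }
        encoding = args.getOrDefault("encoding", "UTF-8");
        dictionary = args.get("dictionary");
        minWordSize = intArg(args, "minWordSize", 5);
        minSubwordSize = intArg(args, "minSubwordSize", 2);
        maxSubwordSize = intArg(args, "maxSubwordSize", 15);
        onlyLongestMatch = Boolean.parseBoolean(args.getOrDefault("onlyLongestMatch", "false"));
    }

    /** Returns the integer value stored under key, or def when the key is absent. */
    private static int intArg(Map<String, String> args, String key, int def) {
        String value = args.get(key);
        return value == null ? def : Integer.parseInt(value.trim());
    }

    public static void main(String[] unused) {
        Map<String, String> args = new HashMap<>();
        args.put("hyphenator", "hyphenator.xml");
        args.put("minWordSize", "7");
        HyphenationArgsSketch parsed = new HyphenationArgsSketch(args);
        System.out.println(parsed.minWordSize + " " + parsed.maxSubwordSize + " " + parsed.encoding);
        // prints 7 15 UTF-8
    }
}
```

Only hyphenator has to be supplied; every other value falls back to the default listed in the parameter description.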
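Methods inherited from class TokenFilterFactory: availableTokenFilters, forName, lookupClass, reloadTokenFilters

Methods inherited from class AbstractAnalysisFactory: assureMatchVersion, getArgs, getBoolean, getBoolean, getInt, getInt, getInt, getLines, getLuceneMatchVersion, getPattern, getSnowballWordSet, getWordSet, setLuceneMatchVersion, splitFileNames

public HyphenationCompoundWordTokenFilterFactory()

public void init(Map<String,String> args)
Description copied from class: AbstractAnalysisFactory
Initialize this factory via a set of key-value pairs.
Overrides: init in class AbstractAnalysisFactory

public void inform(ResourceLoader loader) throws IOException
Description copied from interface: ResourceLoaderAware
Initializes this component with the provided ResourceLoader (used for loading classes, files, etc).
Specified by: inform in interface ResourceLoaderAware
Throws: IOException

public HyphenationCompoundWordTokenFilter create(TokenStream input)
Description copied from class: TokenFilterFactory
Transform the specified input TokenStream.
Specified by: create in class TokenFilterFactory

Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.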