public interface TokenizerFactory
A TokenizerFactory breaks up a stream of characters into tokens.
TokenizerFactories are registered for FieldTypes with the IndexSchema through the schema.xml file.
Example schema.xml entry to register a TokenizerFactory implementation to tokenize fields of type "cool":
<fieldtype name="cool" class="solr.TextField">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    ...
A single instance of any registered TokenizerFactory is created via the default constructor and is reused for each FieldType.
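The lifecycle described above (instantiate through the default constructor, call init once, then reuse the same instance) can be sketched as follows. This is a minimal illustration and not Solr's actual loading code: `SketchTokenizerFactory` is a stand-in for the documented interface so that the example compiles without Solr on the classpath, its `create` returns a `List<String>` instead of a Lucene `Tokenizer`, and `WhitespaceSketchFactory`, `FactoryLifecycleSketch`, and the `mode` argument are hypothetical names.

```java
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Scanner;

// Stand-in for the documented interface (assumption: simplified so the
// sketch compiles without Solr; the real create returns a Tokenizer).
interface SketchTokenizerFactory {
    void init(Map<String, String> args);
    Map<String, String> getArgs();
    List<String> create(Reader input);
}

// Hypothetical implementation that splits its input on whitespace.
class WhitespaceSketchFactory implements SketchTokenizerFactory {
    private Map<String, String> args = new HashMap<>();
    int initCalls = 0; // counts init invocations, for demonstration only

    // Instantiation by class name relies on a public no-argument constructor.
    public WhitespaceSketchFactory() {}

    @Override
    public void init(Map<String, String> args) {
        this.args = new HashMap<>(args);
        initCalls++;
    }

    @Override
    public Map<String, String> getArgs() {
        return args;
    }

    @Override
    public List<String> create(Reader input) {
        List<String> tokens = new ArrayList<>();
        Scanner scanner = new Scanner(input); // default delimiter is whitespace
        while (scanner.hasNext()) {
            tokens.add(scanner.next());
        }
        return tokens;
    }
}

class FactoryLifecycleSketch {
    // Mimics what a schema loader might do: create the factory by class name
    // through its default constructor, then call init exactly once.
    static SketchTokenizerFactory load(String className, Map<String, String> args) {
        try {
            SketchTokenizerFactory factory = (SketchTokenizerFactory)
                    Class.forName(className).getDeclaredConstructor().newInstance();
            factory.init(args);
            return factory;
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException("Cannot instantiate " + className, e);
        }
    }

    public static void main(String[] argv) {
        Map<String, String> args = new HashMap<>();
        args.put("mode", "demo"); // hypothetical user-level parameter
        SketchTokenizerFactory factory =
                load(WhitespaceSketchFactory.class.getName(), args);
        // The same factory instance serves every input.
        System.out.println(factory.create(new StringReader("a cool field")));
        System.out.println(factory.create(new StringReader("another value")));
    }
}
```

Because one instance is shared, any state set up in init should be treated as read-only configuration, and per-input state belongs in whatever create returns.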
Modifier and Type | Method and Description |
---|---|
Tokenizer | create(Reader input): Creates a TokenStream of the specified input. |
Map<String,String> | getArgs(): Accessor method for reporting the args used to initialize this factory. |
void | init(Map<String,String> args): init will be called just once, immediately after creation. |
void init(Map<String,String> args)
init will be called just once, immediately after creation.
The args are user-level initialization parameters that may be specified when declaring the factory in schema.xml.
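As an illustration of how an implementation might consume these parameters, the sketch below reads one optional setting in init and falls back to a default when it is absent. The class name `LengthLimitedFactorySketch`, the parameter name `maxTokenLength`, and the default of 255 are assumptions made for the example, not values taken from this page; only init is shown and the other interface methods are omitted.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical factory fragment: shows init only, not a full TokenizerFactory.
class LengthLimitedFactorySketch {
    static final int DEFAULT_MAX_TOKEN_LENGTH = 255; // assumed default
    private int maxTokenLength = DEFAULT_MAX_TOKEN_LENGTH;

    // Called once, right after construction, with the user-level parameters.
    public void init(Map<String, String> args) {
        String raw = args.get("maxTokenLength"); // hypothetical parameter name
        if (raw != null) {
            // Values arrive as strings, so numeric settings must be parsed.
            maxTokenLength = Integer.parseInt(raw.trim());
        }
    }

    int getMaxTokenLength() {
        return maxTokenLength;
    }

    public static void main(String[] argv) {
        Map<String, String> args = new HashMap<>();
        args.put("maxTokenLength", "10");
        LengthLimitedFactorySketch factory = new LengthLimitedFactorySketch();
        factory.init(args);
        System.out.println(factory.getMaxTokenLength());
    }
}
```

Parsing in init rather than in create means a malformed value fails once at load time instead of on every tokenized field.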
Map<String,String> getArgs()
Implementations are strongly encouraged to return the contents of the Map passed to the init method.
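One way to follow this recommendation is to keep a copy of the map in init and hand it back from getArgs. The sketch below assumes a hypothetical helper base class, `ArgsRetainingFactorySketch`, and a hypothetical subclass `CoolFactorySketch`; neither name comes from this page, and returning a read-only copy is a design choice of the example rather than a requirement stated here.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

// Hypothetical base class that remembers the init arguments.
abstract class ArgsRetainingFactorySketch {
    private Map<String, String> args = Collections.emptyMap();

    public void init(Map<String, String> args) {
        // Copy the map so later changes by the caller do not alter what is
        // reported, and wrap it so callers of getArgs cannot modify it.
        this.args = Collections.unmodifiableMap(new HashMap<>(args));
    }

    public Map<String, String> getArgs() {
        return args;
    }
}

// Hypothetical concrete factory; tokenizing logic is omitted.
class CoolFactorySketch extends ArgsRetainingFactorySketch {
    public static void main(String[] argv) {
        Map<String, String> declared = new HashMap<>();
        declared.put("class", "solr.StandardTokenizerFactory");
        CoolFactorySketch factory = new CoolFactorySketch();
        factory.init(declared);
        System.out.println(factory.getArgs());
    }
}
```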