org.apache.lucene.analysis.in
Class IndicTokenizer

java.lang.Object
  extended by org.apache.lucene.util.AttributeSource
      extended by org.apache.lucene.analysis.TokenStream
          extended by org.apache.lucene.analysis.Tokenizer
              extended by org.apache.lucene.analysis.util.CharTokenizer
                  extended by org.apache.lucene.analysis.in.IndicTokenizer
All Implemented Interfaces:
Closeable

Deprecated. (3.6) Use StandardTokenizer instead.

@Deprecated
public final class IndicTokenizer
extends CharTokenizer

Simple Tokenizer for text in Indian Languages.


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
AttributeSource.AttributeFactory, AttributeSource.State
 
Field Summary
 
Fields inherited from class org.apache.lucene.analysis.Tokenizer
input
 
Constructor Summary
IndicTokenizer(Version matchVersion, AttributeSource.AttributeFactory factory, Reader input)
          Deprecated.  
IndicTokenizer(Version matchVersion, Reader input)
          Deprecated.  
 
Method Summary
protected  boolean isTokenChar(int c)
          Deprecated. Returns true iff a codepoint should be included in a token.
 
Methods inherited from class org.apache.lucene.analysis.util.CharTokenizer
end, incrementToken, normalize, reset
 
Methods inherited from class org.apache.lucene.analysis.Tokenizer
close, correctOffset, setReader
 
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

IndicTokenizer

public IndicTokenizer(Version matchVersion,
                      AttributeSource.AttributeFactory factory,
                      Reader input)
Deprecated. 

IndicTokenizer

public IndicTokenizer(Version matchVersion,
                      Reader input)
Deprecated. 
Method Detail

isTokenChar

protected boolean isTokenChar(int c)
Deprecated. 
Description copied from class: CharTokenizer
Returns true iff a codepoint should be included in a token. This tokenizer generates as tokens adjacent sequences of codepoints which satisfy this predicate. Codepoints for which this is false are used to define token boundaries and are not included in tokens.

Specified by:
isTokenChar in class CharTokenizer


Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.