RussianLetterTokenizer (Lucene 4.10.0 API)

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method

java.lang.Object
- org.apache.lucene.util.AttributeSource
- - org.apache.lucene.analysis.TokenStream
  - - org.apache.lucene.analysis.Tokenizer
    - - org.apache.lucene.analysis.util.CharTokenizer
      - org.apache.lucene.analysis.ru.RussianLetterTokenizer

All Implemented Interfaces:

Closeable, AutoCloseable

Deprecated.
(3.1) Use StandardTokenizer instead, which has the same functionality. This filter will be removed in Lucene 5.0
```
@Deprecated
public class RussianLetterTokenizer
extends CharTokenizer
```
A RussianLetterTokenizer is a Tokenizer that extends LetterTokenizer by also allowing the basic Latin digits 0-9.
You must specify the required Version compatibility when creating RussianLetterTokenizer:
- As of 3.1, CharTokenizer uses an int based API to normalize and detect token characters. See CharTokenizer.isTokenChar(int) and CharTokenizer.normalize(int) for details.

Nested Class Summary
- Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
  AttributeSource.State

Field Summary
- Fields inherited from class org.apache.lucene.analysis.Tokenizer
  input
- Fields inherited from class org.apache.lucene.analysis.TokenStream
  DEFAULT_TOKEN_ATTRIBUTE_FACTORY
- Fields inherited from class org.apache.lucene.util.AttributeSource
  DEFAULT_ATTRIBUTE_FACTORY

Constructor Summary

Constructors
Constructor and Description
`RussianLetterTokenizer(Version matchVersion, AttributeFactory factory, Reader in)` Deprecated. Construct a new RussianLetterTokenizer using a given `AttributeFactory`.
`RussianLetterTokenizer(Version matchVersion, Reader in)` Deprecated. Construct a new RussianLetterTokenizer.

Method Summary

Methods
Modifier and Type Method and Description

protected boolean isTokenChar(int c)
Deprecated.

Collects only characters which satisfy Character.isLetter(int).
- Methods inherited from class org.apache.lucene.analysis.util.CharTokenizer
  end, incrementToken, normalize, reset
- Methods inherited from class org.apache.lucene.analysis.Tokenizer
  close, correctOffset, setReader
- Methods inherited from class org.apache.lucene.util.AttributeSource
  addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
- Methods inherited from class java.lang.Object
  clone, finalize, getClass, notify, notifyAll, wait, wait, wait

- Constructor Detail
  - RussianLetterTokenizer
```
public RussianLetterTokenizer(Version matchVersion,
                      Reader in)
```
    Deprecated.
    
    Construct a new RussianLetterTokenizer. * @param matchVersion Lucene version to match See above
    
    Parameters:
    in - the input to split up into tokens
  - RussianLetterTokenizer
```
public RussianLetterTokenizer(Version matchVersion,
                      AttributeFactory factory,
                      Reader in)
```
    Deprecated.
    
    Construct a new RussianLetterTokenizer using a given AttributeFactory. * @param matchVersion Lucene version to match See above
    
    Parameters:
    factory - the attribute factory to use for this Tokenizer
    in - the input to split up into tokens
- Method Detail
  - isTokenChar
```
protected boolean isTokenChar(int c)
```
    Deprecated.
    
    Collects only characters which satisfy Character.isLetter(int).
    
    Specified by:
    
    isTokenChar in class CharTokenizer

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method

Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.