SentenceTokenizer (Lucene 4.10.3 API)

java.lang.Object
- org.apache.lucene.util.AttributeSource
  - org.apache.lucene.analysis.TokenStream
    - org.apache.lucene.analysis.Tokenizer
      - org.apache.lucene.analysis.cn.smart.SentenceTokenizer

All Implemented Interfaces:

Closeable, AutoCloseable

Deprecated.
Use HMMChineseTokenizer instead.
```
@Deprecated
public final class SentenceTokenizer
extends Tokenizer
```
Tokenizes input text into sentences, emitting one token per sentence.
The output tokens can then be broken into words with WordTokenFilter.

WARNING: This API is experimental and might change in incompatible ways in the next release.
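
To make "tokenizes input text into sentences" concrete, here is a minimal sketch of punctuation-based splitting that uses only the Java standard library. It is not the Lucene implementation: the class `SentenceSplitSketch`, the `Sentence` holder, and the delimiter set are assumptions chosen for illustration, and the real tokenizer may handle whitespace and other characters differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; this is NOT Lucene's SentenceTokenizer.
final class SentenceSplitSketch {

    // Assumed delimiter set: full-width and ASCII sentence punctuation
    // (ideographic full stop, full-width comma, !, ?, ; plus , ! ? ;).
    private static final String DELIMITERS = "\u3002\uFF0C\uFF01\uFF1F\uFF1B,!?;";

    /** A sentence plus its character offsets in the original text. */
    static final class Sentence {
        final String text;
        final int start; // inclusive
        final int end;   // exclusive

        Sentence(String text, int start, int end) {
            this.text = text;
            this.start = start;
            this.end = end;
        }
    }

    /** Splits after each delimiter, keeping the delimiter with its sentence. */
    static List<Sentence> split(String input) {
        List<Sentence> out = new ArrayList<>();
        int start = 0;
        for (int i = 0; i < input.length(); i++) {
            if (DELIMITERS.indexOf(input.charAt(i)) >= 0) {
                out.add(new Sentence(input.substring(start, i + 1), start, i + 1));
                start = i + 1;
            }
        }
        if (start < input.length()) { // trailing text without a final delimiter
            out.add(new Sentence(input.substring(start), start, input.length()));
        }
        return out;
    }

    public static void main(String[] args) {
        // Two short Chinese sentences, written with Unicode escapes so the
        // source file stays ASCII-only.
        String text = "\u6211\u662F\u4E2D\u56FD\u4EBA\u3002\u4F60\u597D\u5417\uFF1F";
        for (Sentence s : split(text)) {
            System.out.println(s.start + "-" + s.end + ": " + s.text);
        }
    }
}
```

Keeping the offsets alongside each sentence mirrors why a sentence-level pass is useful before word segmentation: a later stage can split each sentence into words and still report positions relative to the original input.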

- Nested Class Summary
  - Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
    AttributeSource.State
- Field Summary
  - Fields inherited from class org.apache.lucene.analysis.Tokenizer
    input
  - Fields inherited from class org.apache.lucene.analysis.TokenStream
    DEFAULT_TOKEN_ATTRIBUTE_FACTORY
  - Fields inherited from class org.apache.lucene.util.AttributeSource
    DEFAULT_ATTRIBUTE_FACTORY
- Constructor Summary
  
  Constructors
  Constructor and Description
  
  SentenceTokenizer(AttributeFactory factory, Reader reader)
  Deprecated.
  
  SentenceTokenizer(Reader reader)
  Deprecated.
- Method Summary
  
  Methods
  Modifier and Type Method and Description
  
  void end()
  Deprecated.
  
  boolean incrementToken()
  Deprecated.
  
  void reset()
  Deprecated.
  - Methods inherited from class org.apache.lucene.analysis.Tokenizer
    close, correctOffset, setReader
  - Methods inherited from class org.apache.lucene.util.AttributeSource
    addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
  - Methods inherited from class java.lang.Object
    clone, finalize, getClass, notify, notifyAll, wait, wait, wait

Constructor Detail

SentenceTokenizer

public SentenceTokenizer(Reader reader)

Deprecated.

SentenceTokenizer

public SentenceTokenizer(AttributeFactory factory,
                 Reader reader)

Deprecated.

Method Detail
- incrementToken
```
public boolean incrementToken()
                       throws IOException
```
  Deprecated.
  
  Specified by:
  
  incrementToken in class TokenStream
  
  Throws:
  
  IOException
- reset
```
public void reset()
           throws IOException
```
  Deprecated.
  
  Overrides:
  
  reset in class Tokenizer
  
  Throws:
  
  IOException
- end
```
public void end()
         throws IOException
```
  Deprecated.
  
  Overrides:
  
  end in class TokenStream
  
  Throws:
  
  IOException
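
The three methods above are normally called in a fixed order by the consumer of a token stream: `reset()` first, then `incrementToken()` repeatedly until it returns `false`, then `end()`, and finally `close()`. The sketch below models that call order with the standard library only. `MiniTokenStream`, `ListTokenStream`, `ConsumeSketch`, and the `term()` accessor are hypothetical stand-ins, not Lucene types; with the real API the current token is read through attributes rather than a `term()` method.

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for a token stream; it mirrors only the method
// names listed in this section and is not a Lucene interface.
interface MiniTokenStream extends Closeable {
    void reset() throws IOException;
    boolean incrementToken() throws IOException;
    void end() throws IOException;
    String term(); // assumed accessor for the current token's text
}

// Toy stream over a fixed list of tokens that rejects out-of-order calls.
final class ListTokenStream implements MiniTokenStream {
    private final List<String> tokens;
    private Iterator<String> it; // null until reset() has been called
    private String current;

    ListTokenStream(String... tokens) {
        this.tokens = Arrays.asList(tokens);
    }

    @Override public void reset() {
        it = tokens.iterator();
        current = null;
    }

    @Override public boolean incrementToken() {
        if (it == null) {
            throw new IllegalStateException("reset() must be called first");
        }
        if (!it.hasNext()) {
            return false; // stream exhausted
        }
        current = it.next();
        return true;
    }

    @Override public void end() { current = null; }

    @Override public String term() { return current; }

    @Override public void close() { it = null; }
}

final class ConsumeSketch {
    /** Drains a stream in the order reset, incrementToken loop, end, close. */
    static List<String> consume(MiniTokenStream stream) throws IOException {
        List<String> out = new ArrayList<>();
        try (MiniTokenStream ts = stream) { // close() runs even if a call fails
            ts.reset();
            while (ts.incrementToken()) {
                out.add(ts.term());
            }
            ts.end();
        }
        return out;
    }

    public static void main(String[] args) throws IOException {
        System.out.println(consume(new ListTokenStream("first sentence.", "second sentence.")));
    }
}
```

Because the class implements `Closeable` and `AutoCloseable`, a try-with-resources block is a convenient way to guarantee the final `close()` call.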

Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.