org.apache.lucene.analysis.cn
Class ChineseFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.cn.ChineseFilter
- All Implemented Interfaces:
- Closeable
Deprecated. (3.1) Use StopFilter
instead, which has the same functionality.
This filter will be removed in Lucene 5.0
@Deprecated
public final class ChineseFilter
- extends TokenFilter
A TokenFilter
with a stop word table.
- Numeric tokens are removed.
- English tokens must be larger than 1 character.
- One Chinese character as one Chinese word.
TO DO:
- Add Chinese stop words, such as
- Dictionary based Chinese word extraction
- Intelligent Chinese word extraction
Methods inherited from class org.apache.lucene.util.AttributeSource |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState |
STOP_WORDS
public static final String[] STOP_WORDS
- Deprecated.
ChineseFilter
public ChineseFilter(TokenStream in)
- Deprecated.
incrementToken
public boolean incrementToken()
throws IOException
- Deprecated.
- Specified by:
incrementToken
in class TokenStream
- Throws:
IOException
Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.