Package org.apache.lucene.analysis.cn

Analyzer for Chinese, which indexes unigrams (individual chinese characters).

See:
          Description

Class Summary
ChineseAnalyzer Deprecated. (3.1) Use StandardAnalyzer instead, which has the same functionality.
ChineseFilter Deprecated. (3.1) Use StopFilter instead, which has the same functionality.
ChineseFilterFactory Deprecated. Use StopFilterFactory instead.
ChineseTokenizer Deprecated. (3.1) Use StandardTokenizer instead, which has the same functionality.
ChineseTokenizerFactory Deprecated. Use StandardTokenizerFactory instead.
 

Package org.apache.lucene.analysis.cn Description

Analyzer for Chinese, which indexes unigrams (individual chinese characters).

Three analyzers are provided for Chinese, each of which treats Chinese text in a different way.

Example phrase: "我是中国人"
  1. StandardAnalyzer: 我-是-中-国-人
  2. CJKAnalyzer: 我是-是中-中国-国人
  3. SmartChineseAnalyzer: 我-是-中国-人



Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.