public class JapaneseAnalyzer extends StopwordAnalyzerBase

See Also: JapaneseTokenizer

Nested classes inherited from class ReusableAnalyzerBase: ReusableAnalyzerBase.TokenStreamComponents

Fields inherited from class StopwordAnalyzerBase: matchVersion, stopwords

| Constructor and Description |
|---|
| JapaneseAnalyzer(Version matchVersion) |
| JapaneseAnalyzer(Version matchVersion, UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, Set<String> stoptags) |


| Modifier and Type | Method and Description |
|---|---|
| protected ReusableAnalyzerBase.TokenStreamComponents | createComponents(String fieldName, Reader reader) Creates a new ReusableAnalyzerBase.TokenStreamComponents instance for this analyzer. |
| static CharArraySet | getDefaultStopSet() |
| static Set<String> | getDefaultStopTags() |
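
Methods inherited from class StopwordAnalyzerBase: getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet

Methods inherited from class ReusableAnalyzerBase: initReader, reusableTokenStream, tokenStream

Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setPreviousTokenStream

public JapaneseAnalyzer(Version matchVersion)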
public JapaneseAnalyzer(Version matchVersion, UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, Set<String> stoptags)
public static CharArraySet getDefaultStopSet()
protected ReusableAnalyzerBase.TokenStreamComponents createComponents(String fieldName, Reader reader)
Description copied from class: ReusableAnalyzerBase
Creates a new ReusableAnalyzerBase.TokenStreamComponents instance for this analyzer.

Specified by: createComponents in class ReusableAnalyzerBase

Parameters:
fieldName - the name of the field's content passed to the ReusableAnalyzerBase.TokenStreamComponents sink as a reader
reader - the reader passed to the Tokenizer constructor

Returns: the ReusableAnalyzerBase.TokenStreamComponents for this analyzer.
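
The five-argument constructor accepts both a stopwords set and a stoptags set. As a rough illustration of how two such filters could combine, the sketch below drops a token when its part-of-speech tag is in the stop-tag set or its text is in the stopword set. It does not use Lucene: the `StopFilterSketch` class, the `Token` type, the romanized tokens, and the tag names are all hypothetical placeholders, and the assumption that stoptags are matched against part-of-speech tags is not stated on this page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Conceptual sketch only; none of these types belong to Lucene.
final class StopFilterSketch {

    /** Hypothetical stand-in for an analyzed token: its text plus a part-of-speech tag. */
    static final class Token {
        final String surface;
        final String posTag;

        Token(String surface, String posTag) {
            this.surface = surface;
            this.posTag = posTag;
        }
    }

    /**
     * Returns the text of every token that survives both filters.
     * Assumption: stoptags match the token's part-of-speech tag,
     * stopwords match the token's text.
     */
    static List<String> filter(List<Token> tokens, Set<String> stopwords, Set<String> stoptags) {
        List<String> kept = new ArrayList<>();
        for (Token token : tokens) {
            if (stoptags.contains(token.posTag)) {
                continue; // removed by part-of-speech tag
            }
            if (stopwords.contains(token.surface)) {
                continue; // removed by stopword text
            }
            kept.add(token.surface);
        }
        return kept;
    }

    public static void main(String[] args) {
        // Placeholder tokens and tags, chosen for illustration only.
        List<Token> tokens = Arrays.asList(
                new Token("sushi", "noun"),
                new Token("ga", "particle"),
                new Token("tabe", "verb"),
                new Token("tai", "auxiliary-verb"));
        Set<String> stopwords = new HashSet<>(Arrays.asList("tai"));
        Set<String> stoptags = new HashSet<>(Arrays.asList("particle"));

        System.out.println(filter(tokens, stopwords, stoptags));
    }
}
```

With the placeholder input above, `main` prints `[sushi, tabe]`: `ga` is removed by its tag and `tai` by its text. A real analyzer would obtain the tokens and tags from the tokenizer rather than from a hand-built list.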