public class JapaneseAnalyzer extends StopwordAnalyzerBase
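See Also: JapaneseTokenizer
Nested classes inherited from class ReusableAnalyzerBase: ReusableAnalyzerBase.TokenStreamComponents
Fields inherited from class StopwordAnalyzerBase: matchVersion, stopwords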
Constructor and Description |
---|
JapaneseAnalyzer(Version matchVersion) |
JapaneseAnalyzer(Version matchVersion, UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, Set<String> stoptags) |
Modifier and Type | Method and Description |
---|---|
protected ReusableAnalyzerBase.TokenStreamComponents | createComponents(String fieldName, Reader reader): Creates a new ReusableAnalyzerBase.TokenStreamComponents instance for this analyzer. |
static CharArraySet | getDefaultStopSet() |
static Set<String> | getDefaultStopTags() |
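Methods inherited from class StopwordAnalyzerBase: getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet
Methods inherited from class ReusableAnalyzerBase: initReader, reusableTokenStream, tokenStream
Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setPreviousTokenStream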
public JapaneseAnalyzer(Version matchVersion)
public JapaneseAnalyzer(Version matchVersion, UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, Set<String> stoptags)
public static CharArraySet getDefaultStopSet()
protected ReusableAnalyzerBase.TokenStreamComponents createComponents(String fieldName, Reader reader)
Description copied from class: ReusableAnalyzerBase
Creates a new ReusableAnalyzerBase.TokenStreamComponents instance for this analyzer.
Specified by: createComponents in class ReusableAnalyzerBase
Parameters:
fieldName - the name of the field's content passed to the ReusableAnalyzerBase.TokenStreamComponents sink as a reader
reader - the reader passed to the Tokenizer constructor
Returns: the ReusableAnalyzerBase.TokenStreamComponents for this analyzer.
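The five-argument constructor accepts two separate exclusion sets: stopwords (a CharArraySet of surface forms) and stoptags (a Set<String> of part-of-speech tags). This page does not say how the analyzer applies them, so the following is only a conceptual sketch of filtering on both sets. It assumes a token is dropped when it matches either one. StopFilterSketch, Token, and the tag names are hypothetical stand-ins, not Lucene classes or the real tag set.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for illustration only; none of this is Lucene API.
class StopFilterSketch {

    // Assumed token shape: a surface form plus a part-of-speech tag.
    static final class Token {
        final String surface;
        final String posTag;

        Token(String surface, String posTag) {
            this.surface = surface;
            this.posTag = posTag;
        }
    }

    // Returns the surface forms of tokens that match neither exclusion set.
    static List<String> filter(List<Token> tokens,
                               Set<String> stopwords,
                               Set<String> stoptags) {
        List<String> kept = new ArrayList<String>();
        for (Token t : tokens) {
            // Assumption: a match in either set is enough to drop the token.
            if (stopwords.contains(t.surface) || stoptags.contains(t.posTag)) {
                continue;
            }
            kept.add(t.surface);
        }
        return kept;
    }

    public static void main(String[] args) {
        // Romanized tokens with made-up tag names, purely for demonstration.
        List<Token> tokens = Arrays.asList(
            new Token("sushi", "noun"),
            new Token("ga", "particle"),
            new Token("tabe", "verb"),
            new Token("tai", "auxiliary"));
        Set<String> stopwords = new HashSet<String>(Arrays.asList("tai"));
        Set<String> stoptags = new HashSet<String>(Arrays.asList("particle"));

        System.out.println(filter(tokens, stopwords, stoptags)); // prints [sushi, tabe]
    }
}
```

Keeping the two sets separate lets a caller exclude a whole grammatical category through stoptags without having to list every word in it under stopwords.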