|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Field
that is
tokenized
,
not stored
,
termVectorStored with positions
(or
termVectorStored with positions and offsets
),
addField(fieldName, stream, 1.0f)
.
Field
.
maxTokens
tokens from the underlying child analyzer,
ignoring all remaining tokens.
SynonymTokenFilter
.
tokenStream(String fieldName, Reader reader)
if the fieldName has been seen before, altogether ignoring the Reader parameter on cache lookup.
"\\W+"
; Divides text at non-letters (NOT Character.isLetter(c))
Reader
, that can flexibly separate text into terms via a regular expression Pattern
(with behaviour identical to String.split(String)
),
and that combines the functionality of
LetterTokenizer
,
LowerCaseTokenizer
,
WhitespaceTokenizer
,
StopFilter
into a single efficient
multi-purpose class.PatternAnalyzer.PatternAnalyzer(Version, Pattern, boolean, Set)
instead
tokenStream(String, String)
and is
less efficient than tokenStream(String, String)
.
"\\s+"
; Divides text at whitespaces (Character.isWhitespace(c))
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |