Package | Description |
---|---|
org.apache.lucene.analysis.core | Basic, general-purpose analysis components. |
org.apache.lucene.analysis.util | Utility functions for text analysis. |
Modifier and Type | Class and Description |
---|---|
class | LetterTokenizer: A LetterTokenizer is a tokenizer that divides text at non-letters. |
class | UnicodeWhitespaceTokenizer: A UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace. |
class | WhitespaceTokenizer: A tokenizer that divides text at whitespace characters as defined by Character.isWhitespace(int). |
Modifier and Type | Method and Description |
---|---|
static CharTokenizer | CharTokenizer.fromSeparatorCharPredicate(AttributeFactory factory, IntPredicate separatorCharPredicate): Creates a new instance of CharTokenizer with the supplied attribute factory using a custom predicate, supplied as a method reference or lambda expression. |
static CharTokenizer | CharTokenizer.fromSeparatorCharPredicate(IntPredicate separatorCharPredicate): Creates a new instance of CharTokenizer using a custom predicate, supplied as a method reference or lambda expression. |
static CharTokenizer | CharTokenizer.fromTokenCharPredicate(AttributeFactory factory, IntPredicate tokenCharPredicate): Creates a new instance of CharTokenizer with the supplied attribute factory using a custom predicate, supplied as a method reference or lambda expression. |
static CharTokenizer | CharTokenizer.fromTokenCharPredicate(IntPredicate tokenCharPredicate): Creates a new instance of CharTokenizer using a custom predicate, supplied as a method reference or lambda expression. |
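
The factory methods above take an IntPredicate over Unicode code points. The following is a minimal, JDK-only sketch of what such a predicate decides. It is not Lucene's implementation: the class PredicateSplitSketch and its two methods are hypothetical names introduced here for illustration, and the sketch assumes that a separator predicate behaves as the complement of a token predicate. It ignores the offsets, attributes, and buffering that a real tokenizer handles.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

/** Hypothetical illustration only; this is not Lucene's CharTokenizer. */
final class PredicateSplitSketch {

    /** Collects maximal runs of code points accepted by tokenChar. */
    static List<String> splitByTokenChars(String text, IntPredicate tokenChar) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp); // step over surrogate pairs as one code point
            if (tokenChar.test(cp)) {
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                // A non-token code point closes the token in progress.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    /** Assumption: a separator predicate is the negation of a token predicate. */
    static List<String> splitBySeparatorChars(String text, IntPredicate separatorChar) {
        return splitByTokenChars(text, separatorChar.negate());
    }

    public static void main(String[] args) {
        String text = "don't stop,now";
        // Divide at non-letters, in the spirit of LetterTokenizer.
        System.out.println(splitByTokenChars(text, Character::isLetter));
        // prints [don, t, stop, now]
        // Divide at whitespace as defined by Character.isWhitespace(int).
        System.out.println(splitBySeparatorChars(text, Character::isWhitespace));
        // prints [don't, stop,now]
    }
}
```

With Lucene on the classpath, the same predicates would be passed to the listed factories, for example CharTokenizer.fromTokenCharPredicate(Character::isLetter) or CharTokenizer.fromSeparatorCharPredicate(Character::isWhitespace). Either a method reference or a lambda expression is accepted.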
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.