Package | Description |
---|---|
org.apache.lucene.analysis |
API and code to convert text into indexable/searchable tokens.
|
org.apache.lucene.analysis.charfilter |
CharFilters: process text before the Tokenizer
|
org.apache.lucene.analysis.fa |
Analyzer for Persian.
|
Modifier and Type | Class and Description |
---|---|
class |
BaseCharFilter
Base utility class for implementing a
CharFilter . |
class |
CharFilter
Subclasses of CharFilter can be chained to filter CharStream.
|
class |
CharReader
CharReader is a Reader wrapper.
|
class |
MappingCharFilter
Simplistic
CharFilter that applies the mappings
contained in a NormalizeCharMap to the character
stream, and correcting the resulting changes to the
offsets. |
class |
MockCharFilter
the purpose of this charfilter is to send offsets out of bounds
if the analyzer doesn't use correctOffset or does incorrect offset math.
|
Modifier and Type | Field and Description |
---|---|
protected CharStream |
CharFilter.input |
Modifier and Type | Method and Description |
---|---|
static CharStream |
CharReader.get(Reader input) |
Constructor and Description |
---|
BaseCharFilter(CharStream in) |
CharFilter(CharStream in) |
MappingCharFilter(NormalizeCharMap normMap,
CharStream in)
Default constructor that takes a
CharStream . |
Modifier and Type | Class and Description |
---|---|
class |
HTMLStripCharFilter
A CharFilter that wraps another Reader and attempts to strip out HTML constructs.
|
Constructor and Description |
---|
HTMLStripCharFilter(CharStream source) |
HTMLStripCharFilter(CharStream source,
Set<String> escapedTags) |
Modifier and Type | Class and Description |
---|---|
class |
PersianCharFilter
CharFilter that replaces instances of Zero-width non-joiner with an
ordinary space.
|
Constructor and Description |
---|
PersianCharFilter(CharStream in) |