|
|||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||
See:
Description
| Interface Summary | |
|---|---|
| BaseTokenStreamTestCase.CheckClearAttributesAttribute | Attribute that records if it was cleared or not. |
| CannedBinaryTokenStream.BinaryTermAttribute | An attribute extending TermToBytesRefAttribute but exposing CannedBinaryTokenStream.BinaryTermAttribute.setBytesRef(org.apache.lucene.util.BytesRef) method. |
| Class Summary | |
|---|---|
| BaseTokenStreamTestCase | Base class for all Lucene unit tests that use TokenStreams. |
| BaseTokenStreamTestCase.CheckClearAttributesAttributeImpl | Attribute that records if it was cleared or not. |
| CannedBinaryTokenStream | TokenStream from a canned list of binary (BytesRef-based) tokens. |
| CannedBinaryTokenStream.BinaryTermAttributeImpl | Implementation for CannedBinaryTokenStream.BinaryTermAttribute. |
| CannedBinaryTokenStream.BinaryToken | Represents a binary token. |
| CannedTokenStream | TokenStream from a canned list of Tokens. |
| CollationTestBase | Base test class for testing Unicode collation. |
| LookaheadTokenFilter<T extends LookaheadTokenFilter.Position> | An abstract TokenFilter to make it easier to build graph token filters requiring some lookahead. |
| LookaheadTokenFilter.Position | Holds all state for a single position; subclass this to record other state at each position. |
| MockAnalyzer | Analyzer for testing |
| MockBytesAnalyzer | Analyzer for testing that encodes terms as UTF-16 bytes. |
| MockBytesAttributeFactory | Attribute factory that implements CharTermAttribute with
MockUTF16TermAttributeImpl |
| MockCharFilter | the purpose of this charfilter is to send offsets out of bounds if the analyzer doesn't use correctOffset or does incorrect offset math. |
| MockFixedLengthPayloadFilter | TokenFilter that adds random fixed-length payloads. |
| MockGraphTokenFilter | Randomly inserts overlapped (posInc=0) tokens with posLength sometimes > 1. |
| MockHoleInjectingTokenFilter | Randomly injects holes (similar to what a stopfilter would do) |
| MockPayloadAnalyzer | Wraps a whitespace tokenizer with a filter that sets the first token, and odd tokens to posinc=1, and all others to 0, encoding the position as pos: XXX in the payload. |
| MockRandomLookaheadTokenFilter | Uses LookaheadTokenFilter to randomly peek at future tokens. |
| MockReaderWrapper | Wraps a Reader, and can throw random or fixed exceptions, and spoon feed read chars. |
| MockTokenFilter | A tokenfilter for testing that removes terms accepted by a DFA. |
| MockTokenizer | Tokenizer for testing. |
| MockUTF16TermAttributeImpl | Extension of CharTermAttributeImpl that encodes the term
text as UTF-16 bytes instead of as UTF-8 bytes. |
| MockVariableLengthPayloadFilter | TokenFilter that adds random variable-length payloads. |
| TokenStreamToDot | Consumes a TokenStream and outputs the dot (graphviz) string (graph). |
| ValidatingTokenFilter | A TokenFilter that checks consistency of the tokens (eg offsets are consistent with one another). |
| VocabularyAssert | Utility class for doing vocabulary-based stemming tests |
Support for testing analysis components.
The main classes of interest are:
BaseTokenStreamTestCase: Highly recommended
to use its helper methods, (especially in conjunction with
MockAnalyzer or MockTokenizer),
as it contains many assertions and checks to catch bugs. MockTokenizer: Tokenizer for testing.
Tokenizer that serves as a replacement for WHITESPACE, SIMPLE, and KEYWORD
tokenizers. If you are writing a component such as a TokenFilter, its a great idea to test
it wrapping this tokenizer instead for extra checks. MockAnalyzer: Analyzer for testing.
Analyzer that uses MockTokenizer for additional verification. If you are testing a custom
component such as a queryparser or analyzer-wrapper that consumes analysis streams, its a great
idea to test it with this analyzer instead.
|
|||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | ||||||||