A B C E H I N O R S T U W

A

ACRONYM_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
ALPHANUM_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
APOSTROPHE_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

B

BOLD - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
BOLD_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
BOLD_ITALICS - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
BOLD_ITALICS_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
BOTH - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
Output both the untokenized token and the splits

C

CATEGORY - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
CATEGORY_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
CITATION - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
CITATION_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
CJ_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
COMPANY_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

E

EMAIL_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
end() - Method in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
EXTERNAL_LINK - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
EXTERNAL_LINK_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
EXTERNAL_LINK_URL - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
EXTERNAL_LINK_URL_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

H

HEADING - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
HEADING_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
HOST_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

I

incrementToken() - Method in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
INTERNAL_LINK - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
INTERNAL_LINK_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
ITALICS - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
ITALICS_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

N

NUM_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

O

org.apache.lucene.wikipedia.analysis - package org.apache.lucene.wikipedia.analysis
Tokenizer that is aware of Wikipedia syntax.

R

reset() - Method in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
reset(Reader) - Method in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

S

SUB_HEADING - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 
SUB_HEADING_ID - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
 

T

TOKEN_TYPES - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
String token types that correspond to token type int constants
TOKENS_ONLY - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
Only output tokens

U

UNTOKENIZED_ONLY - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
Only output untokenized tokens, which are tokens that would normally be split into several tokens
UNTOKENIZED_TOKEN_FLAG - Static variable in class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
This flag indicates that the produced "Token" would have been split into multiple tokens if TOKENS_ONLY had been used.
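
The relationship between the three output modes (TOKENS_ONLY, UNTOKENIZED_ONLY, BOTH) and the untokenized-token flag can be illustrated with a small standalone model. This is only a sketch and does not call Lucene: the class OutputModeSketch, its emit method, the "*" marker, the numeric values, and the ordering of tokens in BOTH mode are all assumptions made for illustration, not the library's API or constants.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in used only to illustrate the output modes; not part of Lucene. */
public class OutputModeSketch {
    // Illustrative values only; the real constants are defined in WikipediaTokenizer.
    static final int TOKENS_ONLY = 0;
    static final int UNTOKENIZED_ONLY = 1;
    static final int BOTH = 2;

    /** Marker appended to the whole-span token, standing in for UNTOKENIZED_TOKEN_FLAG. */
    static final String FLAG = "*";

    /** Returns the tokens produced for one multi-word span (for example, link text) under the given mode. */
    static List<String> emit(String span, int mode) {
        List<String> out = new ArrayList<>();
        if (mode == UNTOKENIZED_ONLY || mode == BOTH) {
            // The whole span is kept as a single token and marked with the flag.
            out.add(span + FLAG);
        }
        if (mode == TOKENS_ONLY || mode == BOTH) {
            // The span is split on whitespace into its individual tokens.
            for (String part : span.trim().split("\\s+")) {
                if (!part.isEmpty()) {
                    out.add(part);
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String span = "Apache Lucene";
        System.out.println(emit(span, TOKENS_ONLY));      // [Apache, Lucene]
        System.out.println(emit(span, UNTOKENIZED_ONLY)); // [Apache Lucene*]
        System.out.println(emit(span, BOTH));             // [Apache Lucene*, Apache, Lucene]
    }
}
```

In this model, a consumer that sees the flagged token knows that the same text would have yielded several tokens under TOKENS_ONLY, which is the situation UNTOKENIZED_TOKEN_FLAG describes.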

W

WikipediaTokenizer - Class in org.apache.lucene.wikipedia.analysis
Extension of StandardTokenizer that is aware of Wikipedia syntax.
WikipediaTokenizer(Reader) - Constructor for class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
Creates a new instance of the WikipediaTokenizer.
WikipediaTokenizer(Reader, int, Set<String>) - Constructor for class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
Creates a new instance of the WikipediaTokenizer.
WikipediaTokenizer(AttributeSource.AttributeFactory, Reader, int, Set<String>) - Constructor for class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
Creates a new instance of the WikipediaTokenizer.
WikipediaTokenizer(AttributeSource, Reader, int, Set<String>) - Constructor for class org.apache.lucene.wikipedia.analysis.WikipediaTokenizer
Creates a new instance of the WikipediaTokenizer.

Copyright © 2000-2010 Apache Software Foundation. All Rights Reserved.