Package org.apache.lucene.analysis.util
Class ElisionFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.util.ElisionFilter
- All Implemented Interfaces:
Closeable
,AutoCloseable
,Unwrappable<TokenStream>
Removes elisions from a
TokenStream
. For example, "l'avion" (the plane) will be tokenized
as "avion" (plane).- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
AttributeSource.State
-
Field Summary
Fields inherited from class org.apache.lucene.analysis.TokenFilter
input
Fields inherited from class org.apache.lucene.analysis.TokenStream
DEFAULT_TOKEN_ATTRIBUTE_FACTORY
-
Constructor Summary
ConstructorDescriptionElisionFilter
(TokenStream input, CharArraySet articles) Constructs an elision filter with a Set of stop words -
Method Summary
Modifier and TypeMethodDescriptionfinal boolean
Increments theTokenStream
with aCharTermAttribute
without elisioned startMethods inherited from class org.apache.lucene.analysis.TokenFilter
close, end, reset, unwrap
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toString
-
Constructor Details
-
ElisionFilter
Constructs an elision filter with a Set of stop words- Parameters:
input
- the sourceTokenStream
articles
- a set of stopword articles
-
-
Method Details
-
incrementToken
Increments theTokenStream
with aCharTermAttribute
without elisioned start- Specified by:
incrementToken
in classTokenStream
- Throws:
IOException
-