public class TikaEntityProcessor extends EntityProcessorBase
An implementation of EntityProcessor
which reads data from rich docs using Apache Tika
ABORT, cacheSupport, context, CONTINUE, entityName, isFirstInit, ON_ERROR, onError, query, rowIterator, SKIP, SKIP_DOC, TRANSFORM_ROW, TRANSFORMER
Constructor and Description |
---|
TikaEntityProcessor() |
Modifier and Type | Method and Description |
---|---|
protected void |
firstInit(Context context)
first time init call. do one-time operations here
|
Map<String,Object> |
nextRow()
For a simple implementation, this is the only method that the sub-class should implement.
|
destroy, getNext, init, initCache, nextDeletedRowKey, nextModifiedParentRowKey, nextModifiedRowKey
close, postTransform
protected void firstInit(Context context)
EntityProcessorBase
firstInit
in class EntityProcessorBase
public Map<String,Object> nextRow()
EntityProcessorBase
nextRow
in class EntityProcessorBase