org.apache.solr.handler.extraction
Class ExtractingRequestHandler

java.lang.Object
  extended by org.apache.solr.handler.RequestHandlerBase
      extended by org.apache.solr.handler.ContentStreamHandlerBase
          extended by org.apache.solr.handler.extraction.ExtractingRequestHandler
All Implemented Interfaces:
SolrInfoMBean, SolrRequestHandler, SolrCoreAware

public class ExtractingRequestHandler
extends ContentStreamHandlerBase
implements SolrCoreAware

Handler for rich documents like PDF or Word or any other file format that Tika handles that need the text to be extracted first from the document.


Nested Class Summary
 
Nested classes/interfaces inherited from interface org.apache.solr.core.SolrInfoMBean
SolrInfoMBean.Category
 
Field Summary
protected  org.apache.tika.config.TikaConfig config
           
static String CONFIG_LOCATION
           
static String DATE_FORMATS
           
protected  Collection<String> dateFormats
           
protected  SolrContentHandlerFactory factory
           
 
Fields inherited from class org.apache.solr.handler.RequestHandlerBase
appends, defaults, httpCaching, initArgs, invariants
 
Constructor Summary
ExtractingRequestHandler()
           
 
Method Summary
protected  SolrContentHandlerFactory createFactory()
           
 String getDescription()
           
 String getSource()
           
 void inform(SolrCore core)
           
 void init(NamedList args)
           
protected  ContentStreamLoader newLoader(SolrQueryRequest req, UpdateRequestProcessor processor)
           
 
Methods inherited from class org.apache.solr.handler.ContentStreamHandlerBase
handleRequestBody
 
Methods inherited from class org.apache.solr.handler.RequestHandlerBase
getCategory, getDocs, getInitArgs, getName, getStatistics, getVersion, handleRequest
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

CONFIG_LOCATION

public static final String CONFIG_LOCATION
See Also:
Constant Field Values

DATE_FORMATS

public static final String DATE_FORMATS
See Also:
Constant Field Values

config

protected org.apache.tika.config.TikaConfig config

dateFormats

protected Collection<String> dateFormats

factory

protected SolrContentHandlerFactory factory
Constructor Detail

ExtractingRequestHandler

public ExtractingRequestHandler()
Method Detail

init

public void init(NamedList args)
Specified by:
init in interface SolrRequestHandler
Overrides:
init in class ContentStreamHandlerBase

inform

public void inform(SolrCore core)
Specified by:
inform in interface SolrCoreAware

createFactory

protected SolrContentHandlerFactory createFactory()

newLoader

protected ContentStreamLoader newLoader(SolrQueryRequest req,
                                        UpdateRequestProcessor processor)
Specified by:
newLoader in class ContentStreamHandlerBase

getDescription

public String getDescription()
Specified by:
getDescription in interface SolrInfoMBean
Specified by:
getDescription in class RequestHandlerBase

getSource

public String getSource()
Specified by:
getSource in interface SolrInfoMBean
Specified by:
getSource in class RequestHandlerBase


Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.