Daisy Open Source CMS

org.outerj.daisy.textextraction
Interface TextExtractor


public interface TextExtractor

A textextractor extracts text from a certain type of document.

TextExtractor implementations should be thread safe.


Method Summary
 List<String> getMimeTypes()
          Returns the mime types this text extractor can handle.
 String getText(InputStream is)
           
 

Method Detail

getMimeTypes

List<String> getMimeTypes()
Returns the mime types this text extractor can handle.


getText

String getText(InputStream is)
               throws Exception
Throws:
Exception

Daisy Open Source CMS