com.liferay.util.lucene
Class JerichoHTMLTextExtractor

java.lang.Object
  extended by org.apache.jackrabbit.extractor.AbstractTextExtractor
      extended by org.apache.jackrabbit.extractor.HTMLTextExtractor
          extended by com.liferay.util.lucene.JerichoHTMLTextExtractor
All Implemented Interfaces:
org.apache.jackrabbit.extractor.TextExtractor

public class JerichoHTMLTextExtractor
extends org.apache.jackrabbit.extractor.HTMLTextExtractor

Author:
Brian Wing Shun Chan

Constructor Summary
JerichoHTMLTextExtractor()
           
 
Method Summary
 java.io.Reader extractText(java.io.InputStream stream, java.lang.String type, java.lang.String encoding)
           
 
Methods inherited from class org.apache.jackrabbit.extractor.AbstractTextExtractor
getContentTypes
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

JerichoHTMLTextExtractor

public JerichoHTMLTextExtractor()

Method Detail

extractText

public java.io.Reader extractText(java.io.InputStream stream,
                                  java.lang.String type,
                                  java.lang.String encoding)
                           throws java.io.IOException
Specified by:
extractText in interface org.apache.jackrabbit.extractor.TextExtractor
Overrides:
extractText in class org.apache.jackrabbit.extractor.HTMLTextExtractor
Throws:
java.io.IOException
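
Usage sketch:
The signature above takes an input stream, a content type, and an encoding name, and returns a java.io.Reader over the extracted text. The following is a minimal sketch of how a caller might drain that Reader. It is not taken from the Liferay or Jackrabbit sources. The Extractor interface, the STUB implementation, and the ExtractTextSketch class are hypothetical stand-ins so that the example compiles without either library; only the parameter list and the IOException clause mirror the documented method. The stub's tag stripping is a placeholder and does not reflect how JerichoHTMLTextExtractor processes HTML.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

public class ExtractTextSketch {

    // Hypothetical stand-in that mirrors the documented extractText signature,
    // so the sketch compiles without Jackrabbit or Liferay on the classpath.
    interface Extractor {
        Reader extractText(InputStream stream, String type, String encoding)
            throws IOException;
    }

    // Placeholder implementation (assumption): drops anything between '<' and '>'.
    // This is NOT the Jericho-based logic; it only gives the caller text to read.
    static final Extractor STUB = (stream, type, encoding) -> {
        String html = new String(stream.readAllBytes(), encoding);
        return new StringReader(html.replaceAll("<[^>]*>", " ").trim());
    };

    // Drains the Reader returned by extractText into a String and closes it.
    static String readAll(Reader reader) throws IOException {
        StringBuilder sb = new StringBuilder();
        try (Reader r = reader) {
            char[] buf = new char[1024];
            int n;
            while ((n = r.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
        }
        return sb.toString();
    }

    // Feeds an in-memory HTML string to the extractor as UTF-8 bytes.
    // IOException is rethrown unchecked to keep the demo call sites short.
    static String extract(Extractor extractor, String html) {
        InputStream in =
            new ByteArrayInputStream(html.getBytes(StandardCharsets.UTF_8));
        try {
            return readAll(extractor.extractText(in, "text/html", "UTF-8"));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(extract(STUB, "<p>Hello <b>world</b></p>"));
    }
}
```

With the real class on the classpath, a caller would presumably pass a JerichoHTMLTextExtractor instance where STUB appears, adapting the call to the org.apache.jackrabbit.extractor.TextExtractor interface. The caller should close the returned Reader, as readAll does here.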