Textractor API textractor-716 (20091105163204)

textractor.mg4j.document
Class AbstractTextractorDocumentIterator

java.lang.Object
  extended by textractor.mg4j.document.AbstractTextractorDocumentIterator
All Implemented Interfaces:
DocumentIterator, Closeable
Direct Known Subclasses:
TextractorDBDocumentIterator, TextractorQueueDocumentIterator

public abstract class AbstractTextractorDocumentIterator
extends Object
implements DocumentIterator

An iterator over textractor documents.


Field Summary
protected  int filteredSentenceCount
           
protected  SentenceFilter sentenceFilter
           
 
Constructor Summary
AbstractTextractorDocumentIterator(DocumentFactory factory, SentenceFilter filter)
           
 
Method Summary
protected  Document createDocument(Reference2ObjectMap<Enum<?>,Object> metadata, String text)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface it.unimi.dsi.mg4j.document.DocumentIterator
close, nextDocument
 

Field Detail

filteredSentenceCount

protected int filteredSentenceCount

sentenceFilter

protected SentenceFilter sentenceFilter
Constructor Detail

AbstractTextractorDocumentIterator

public AbstractTextractorDocumentIterator(DocumentFactory factory,
                                          SentenceFilter filter)
Method Detail

createDocument

protected final Document createDocument(Reference2ObjectMap<Enum<?>,Object> metadata,
                                        String text)
                                 throws IOException
Throws:
IOException

Textractor API textractor-716 (20091105163204)

Copyright © 2003-2008 Institute for Computational Biomedicine, All Rights Reserved.