Textractor API textractor-738 (20110316124242)

Package textractor.html

Interface Summary
CheckpointCallback Created by IntelliJ IDEA.
 

Class Summary
AbstractHtml2Text Converts HTML to Text.
DatabaseTextConsumer A consumer that submits text to a database.
Html2Text Converts HTML into UTF-8 Text.
Html2Text2DB Converts HTML to Text.
Html2TextNoref Created by IntelliJ IDEA.
Html2TextTagRef  
ParseResult  
PubmedAbstracts2Text2DB Parse XML Pubmed abstracts and load their text into the database.
QueueTextConsumer A consumer of text that stuffs articles and their sentences into a queue that can be read by an indexing task.
SwissProtNames2DB Import SwissProt/Trembl names from XML files in the SigPath format.
TextConsumer A consumer of text.
TextractorTextExtractingVisitor Extracts text from a web page and remembers the numeric position of each kept character in the file.
 


Textractor API textractor-738 (20110316124242)

Copyright © 2003-2008 Institute for Computational Biomedicine, All Rights Reserved.