|
Textractor API textractor-720 (20091120123250) | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objecttextractor.datamodel.TextractorDocument
textractor.datamodel.Sentence
public final class Sentence
| Field Summary | |
|---|---|
static String |
REFERENCE_TAG
|
static String |
REFERENCE_TAG_HTML
|
static int |
TEXT_MAX_LENGTH
|
| Fields inherited from class textractor.datamodel.TextractorDocument |
|---|
ABSTRACT_SECTION, documentNumber, OTHER_SECTION, REFERENCE_SECTION |
| Constructor Summary | |
|---|---|
Sentence()
|
|
Sentence(Article article)
|
|
Sentence(Article article,
String text)
|
|
| Method Summary | |
|---|---|
Article |
getArticle()
|
Reference2ObjectMap<Enum<?>,Object> |
getMetaData()
Get the metadata for this document. |
List<Integer> |
getPositions()
Get the positions of words in the sentence text relative to the original document source. |
String[] |
getPotentialMutations()
Obtains the terms that look like mutations in this sentence. |
MutableString |
getSpaceDelimitedProcessedTerms(DocumentIndexManager docmanager)
Returns the text of this sentence in a format where words have been processed and are delimited by a single space character. |
static MutableString |
getSpaceDelimitedProcessedTerms(DocumentIndexManager docmanager,
String spaceDelimitedTerms)
Process each term with the termProcessor and returns the result. |
MutableString |
getSpaceDelimitedTerms(DocumentIndexManager docmanager)
Returns the text of this sentence in a format where words are delimited by a single space character. |
int |
getTermNumber()
Returns the number of terms in this sentence. |
String |
getText()
Get the text contained in this document. |
boolean |
hasPositons()
Does this sentence have position information associated with the text? |
boolean |
isMaybeProteinMutation()
|
boolean |
isMaybeProteinName()
|
void |
setArticle(Article article)
|
void |
setMaybeProteinMutation(boolean maybeProteinMutation)
|
void |
setMaybeProteinName(boolean maybeProteinName)
|
void |
setPositions(List<Integer> positions)
Set the positions of words in the sentence text relative to the original document source. |
void |
setPotentialMutations(String[] potentialMutations)
Stores the terms that look like mutations in this sentence. |
void |
setText(String newText)
|
| Methods inherited from class textractor.datamodel.TextractorDocument |
|---|
getDocumentNumber, getDocumentSection, setDocumentNumber, setDocumentSection |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final String REFERENCE_TAG
public static final String REFERENCE_TAG_HTML
public static final int TEXT_MAX_LENGTH
| Constructor Detail |
|---|
public Sentence()
public Sentence(Article article)
public Sentence(Article article,
String text)
| Method Detail |
|---|
public String[] getPotentialMutations()
public void setPotentialMutations(String[] potentialMutations)
potentialMutations - An array where each element is the term that is
a potential mutation.public boolean isMaybeProteinMutation()
public void setMaybeProteinMutation(boolean maybeProteinMutation)
public boolean isMaybeProteinName()
public void setMaybeProteinName(boolean maybeProteinName)
public String getText()
TextractorDocument
getText in class TextractorDocumentpublic void setText(String newText)
public Article getArticle()
public void setArticle(Article article)
public List<Integer> getPositions()
public void setPositions(List<Integer> positions)
positions - a list of positionspublic boolean hasPositons()
public int getTermNumber()
public MutableString getSpaceDelimitedTerms(DocumentIndexManager docmanager)
docmanager - DocumentIndexManager used to process terms
public MutableString getSpaceDelimitedProcessedTerms(DocumentIndexManager docmanager)
docmanager - DocumentIndexManager used to process terms
Implementations of this
interface are used to process terms to construct the result.
public static MutableString getSpaceDelimitedProcessedTerms(DocumentIndexManager docmanager,
String spaceDelimitedTerms)
docmanager - DocumentIndexManager used to process termsspaceDelimitedTerms - Input string, where terms are delimited by
single space characters.
public Reference2ObjectMap<Enum<?>,Object> getMetaData()
getMetaData in class TextractorDocument
|
Textractor API textractor-720 (20091120123250) | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||