|
Textractor API textractor-720 (20091120123250) | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectit.unimi.dsi.mg4j.document.AbstractDocumentSequence
it.unimi.dsi.mg4j.document.AbstractDocumentCollection
textractor.mg4j.document.DocStoreDocumentCollection
public final class DocStoreDocumentCollection
The DocStore document collection for all TEXT indicies.
| Field Summary |
|---|
| Fields inherited from interface it.unimi.dsi.mg4j.document.DocumentCollection |
|---|
DEFAULT_EXTENSION |
| Constructor Summary | |
|---|---|
DocStoreDocumentCollection(DocumentIndexManager docmanagerVal)
Creates a document collection based on the documents in the document store associated to a full text index. |
|
| Method Summary | |
|---|---|
void |
close()
Closes this stream and releases any system resources associated with it. |
DocumentCollection |
copy()
Make a copy of this DocStoreDocumentCollection. |
Document |
document(int index)
Obtain the Document for document number index within the "text" index. |
Document |
document(String indexAlias,
int index)
Obtain the Document for document number index within the indexAlias index. |
Document |
documentNoContent(int index)
Obtain the Document for document number index within the "text" index, but do not retrieve the document content. |
Document |
documentNoContent(String indexAlias,
int index)
Obtain the Document for document number index within the indexAlias index, but do not retrieve the document content. |
String |
doi(int index)
Obtain the DOI for document number index within the "text" index. |
DocumentFactory |
factory()
Return the DocumentFactory associated with this DocStoreDocumentCollection. |
DocumentStoreReader |
getDocumentStoreReader()
Returns the current DocumentStoreReader for the "text" alias. |
DocumentStoreReader |
getDocumentStoreReader(String indexAlias)
Returns the current DocumentStoreReader. |
static void |
main(String[] arg)
|
Reference2ObjectMap<Enum<?>,Object> |
metadata(int index)
Creates metadata for the document at index within the "text" index. |
Reference2ObjectMap<Enum<?>,Object> |
metadata(String indexAlias,
int index)
Creates metadata for the document at index within the indexAlias index. |
int |
size()
Returns the number of documents in the document store/ full text index. |
InputStream |
stream(int index)
Obtain the stream for document number index within the "text" index. |
InputStream |
stream(String indexAlias,
int index)
Obtain the stream for document number index within the indexAlias index. |
| Methods inherited from class it.unimi.dsi.mg4j.document.AbstractDocumentCollection |
|---|
ensureDocumentIndex, iterator, printAllDocuments, toString |
| Methods inherited from class it.unimi.dsi.mg4j.document.AbstractDocumentSequence |
|---|
finalize |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, getClass, hashCode, notify, notifyAll, wait, wait, wait |
| Constructor Detail |
|---|
public DocStoreDocumentCollection(DocumentIndexManager docmanagerVal)
throws IOException
docmanagerVal - Manager for the full text index associated with the
document store.
IOException - error opening appropriate files?| Method Detail |
|---|
public DocumentStoreReader getDocumentStoreReader()
public DocumentStoreReader getDocumentStoreReader(String indexAlias)
indexAlias - the index alias to obtain the document store reader for.
public int size()
size in interface DocumentCollection
public Document document(int index)
throws IOException
document in interface DocumentCollectionindex - the document number
IOException - error retrieving the document
public Document document(String indexAlias,
int index)
throws IOException
indexAlias - the indexAlias from which to retrieve the documentindex - the document number
IOException - error retrieving the document
public Document documentNoContent(int index)
throws IOException
index - the document number
IOException - error retrieving the document
public Document documentNoContent(String indexAlias,
int index)
throws IOException
indexAlias - the indexAlias from which to retrieve the documentindex - the document number
IOException - error retrieving the document
public String doi(int index)
throws IOException
index - the document number
IOException - error retrieving the document
public InputStream stream(int index)
throws IOException
stream in interface DocumentCollectionindex - the document number
IOException - error retrieving the document
public InputStream stream(String indexAlias,
int index)
throws IOException
indexAlias - the indexAlias from which to retrieve the streamindex - the document number
IOException - error retrieving the document
public Reference2ObjectMap<Enum<?>,Object> metadata(int index)
throws IOException
metadata in interface DocumentCollectionindex - a document index.
index.
IOException - xx
public Reference2ObjectMap<Enum<?>,Object> metadata(String indexAlias,
int index)
throws IOException
indexAlias - the indexAlias for which to create metadataindex - a document index.
index.
IOException - xxpublic DocumentCollection copy()
copy in interface DocumentCollectioncopy in interface FlyweightPrototype<DocumentCollection>public DocumentFactory factory()
factory in interface DocumentSequence
public void close()
throws IOException
close in interface DocumentSequenceclose in interface Closeableclose in class AbstractDocumentSequenceIOException - if an I/O error occurs
public static void main(String[] arg)
throws IOException,
com.martiansoftware.jsap.JSAPException,
InstantiationException,
IllegalAccessException,
InvocationTargetException,
NoSuchMethodException,
ConfigurationException,
ClassNotFoundException,
URISyntaxException
IOException
com.martiansoftware.jsap.JSAPException
InstantiationException
IllegalAccessException
InvocationTargetException
NoSuchMethodException
ConfigurationException
ClassNotFoundException
URISyntaxException
|
Textractor API textractor-720 (20091120123250) | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||