textractor.mg4j.document
Class SfnDocumentFactory
java.lang.Object
it.unimi.dsi.mg4j.document.AbstractDocumentFactory
it.unimi.dsi.mg4j.document.PropertyBasedDocumentFactory
textractor.mg4j.document.AbstractTextractorDocumentFactory
textractor.mg4j.document.SfnDocumentFactory
- All Implemented Interfaces:
- DocumentFactory, FlyweightPrototype<DocumentFactory>, Serializable
public final class SfnDocumentFactory
- extends AbstractTextractorDocumentFactory
A factory that can produce MG4J documents from
SfnArticles and their associated
Sentences.
- See Also:
- Serialized Form
| Methods inherited from class it.unimi.dsi.mg4j.document.PropertyBasedDocumentFactory |
ensureJustOne, getInstance, getInstance, getInstance, getInstance, parseProperties, parseProperties, resolve, resolve, resolveNotNull, sameKey |
SfnDocumentFactory
public SfnDocumentFactory()
throws ConfigurationException,
ClassNotFoundException,
IllegalAccessException,
InstantiationException
- Construct a new
DocumentFactory.
- Throws:
ConfigurationException - if there is a
problem with the configuration of the factory.
ClassNotFoundException - if the specified WordReader
cannot be found.
IllegalAccessException - if the factory is unable to create an
instance of the specified WordReader.
InstantiationException - if the factory is unable to create an
instance of the specified WordReader.
SfnDocumentFactory
public SfnDocumentFactory(Reference2ObjectMap<Enum<?>,Object> defaultMetadata)
throws ConfigurationException,
ClassNotFoundException,
IllegalAccessException,
InstantiationException
- Construct a new
DocumentFactory.
- Parameters:
defaultMetadata - meta data used to configure this factory
- Throws:
ConfigurationException - if there
is a problem with the configuration of the factory.
ClassNotFoundException - if the specified WordReader
cannot be found.
IllegalAccessException - if the factory is unable to create an
instance of the specified WordReader.
InstantiationException - if the factory is unable to create an
instance of the specified WordReader.
SfnDocumentFactory
public SfnDocumentFactory(Properties properties)
throws ConfigurationException,
ClassNotFoundException,
IllegalAccessException,
InstantiationException
- Construct a new
DocumentFactory.
- Parameters:
properties - properties used to configure this factory
- Throws:
ConfigurationException - if there
is a problem with the configuration of the factory.
ClassNotFoundException - if the specified WordReader
cannot be found.
IllegalAccessException - if the factory is unable to create an
instance of the specified WordReader.
InstantiationException - if the factory is unable to create an
instance of the specified WordReader.
SfnDocumentFactory
public SfnDocumentFactory(String[] property)
throws ConfigurationException,
ClassNotFoundException,
IllegalAccessException,
InstantiationException
- Construct a new
DocumentFactory.
- Parameters:
property - properties used to configure this factory
- Throws:
ConfigurationException - if there
is a problem with the configuration of the factory.
ClassNotFoundException - if the specified WordReader
cannot be found.
IllegalAccessException - if the factory is unable to create an
instance of the specified WordReader.
InstantiationException - if the factory is unable to create an
instance of the specified WordReader.
copy
public TextractorDocumentFactory copy()
- Creates a copy of this factory.
- Returns:
- a copy of this factory.
getDocument
public Document getDocument(InputStream rawContent,
Reference2ObjectMap<Enum<?>,Object> metadata)
- Returns the document obtained by parsing the given byte stream.
- Parameters:
rawContent - the raw content from which the document should be
extracted; it must not be closed, as resource management is a
responsibility of the
DocumentCollection.metadata - a map from enums (e.g., keys taken in
PropertyBasedDocumentFactory) to various kind of objects.
- Returns:
- the document obtained by parsing the given character sequence.
Copyright © 2003-2008 Institute for Computational Biomedicine, All Rights Reserved.