public class Reuters21578TxtReader extends JCasResourceCollectionReader_ImplBase
Reads a Reuters-21578 corpus that has been converted to plain text using ExtractReuters in the lucene-benchmarks project.
The ResourceCollectionReaderBase.PARAM_SOURCE_LOCATION parameter should typically point to the file name pattern reut2-*.txt, preceded by the corpus root directory.
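To illustrate what "a file name pattern preceded by the corpus root directory" selects, here is a minimal sketch that uses only the JDK's glob matcher. It is not the resolver this reader uses internally, whose pattern syntax and matching rules may differ. The class name `SourceLocationSketch`, the helper `matchingFiles`, and the sample file names are hypothetical and chosen purely for illustration.

```java
import java.io.IOException;
import java.nio.file.FileSystems;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.PathMatcher;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class SourceLocationSketch {

    /**
     * Lists the regular files directly under corpusRoot whose file names
     * match the given glob pattern, sorted by name.
     * Assumption: JDK glob semantics, not the reader's own pattern resolver.
     */
    static List<String> matchingFiles(Path corpusRoot, String fileNamePattern) throws IOException {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + fileNamePattern);
        try (Stream<Path> entries = Files.list(corpusRoot)) {
            return entries
                    .filter(Files::isRegularFile)
                    .map(Path::getFileName)      // match on the file name only
                    .filter(matcher::matches)
                    .map(Path::toString)
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a throwaway corpus root with hypothetical file names.
        Path root = Files.createTempDirectory("reuters-sketch");
        String[] names = {"reut2-000.sgm-0.txt", "reut2-001.sgm-3.txt", "README.txt", "reut2-000.sgm"};
        for (String name : names) {
            Files.createFile(root.resolve(name));
        }
        // Only the two reut2-*.txt files are selected.
        System.out.println(matchingFiles(root, "reut2-*.txt"));
    }
}
```

In this sketch the original SGML file and unrelated text files in the same directory are skipped, which is the reason for pointing the source location at the reut2-*.txt pattern rather than at the bare directory.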
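Nested classes inherited from class ResourceCollectionReaderBase: ResourceCollectionReaderBase.Resource
Fields inherited from class ResourceCollectionReaderBase: EXCLUDE_PREFIX, INCLUDE_PREFIX, JAR_PREFIX, KEY_RESOURCE_RESOLVER, PARAM_INCLUDE_HIDDEN, PARAM_LANGUAGE, PARAM_LOG_FREQ, PARAM_PATH, PARAM_PATTERNS, PARAM_SOURCE_LOCATION, PARAM_USE_DEFAULT_EXCLUDES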
Constructor and Description |
---|
Reuters21578TxtReader() |
Modifier and Type | Method and Description |
---|---|
void | getNext(org.apache.uima.jcas.JCas jCas) Subclasses implement this method rather than JCasResourceCollectionReader_ImplBase.getNext(CAS). |
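Methods inherited from class JCasResourceCollectionReader_ImplBase: getNext, initCas, initCas
Methods inherited from class ResourceCollectionReaderBase: getBase, getBase, getDefaultExcludes, getLanguage, getProgress, getResolver, getResourceIterator, getResources, getSourceLocation, hasNext, initCas, initCas, initialize, isSingleLocation, locationToUrl, nextFile, scan
Methods inherited from class CasCollectionReader_ImplBase: close, getLogger, initialize
Methods inherited from class CollectionReader_ImplBase: destroy, getCasInitializer, getProcessingResourceMetaData, initialize, isConsuming, reconfigure, setCasInitializer, typeSystemInit
Methods inherited from class ConfigurableResource_ImplBase: getConfigParameterValue, getConfigParameterValue, setConfigParameterValue, setConfigParameterValue
Methods inherited from class Resource_ImplBase: getCasManager, getMetaData, getRelativePathResolver, getResourceManager, getUimaContext, getUimaContextAdmin, setLogger, setMetaData
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait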
public void getNext(org.apache.uima.jcas.JCas jCas) throws IOException, org.apache.uima.collection.CollectionException
Description copied from class: JCasResourceCollectionReader_ImplBase
Subclasses implement this method rather than JCasResourceCollectionReader_ImplBase.getNext(CAS).
Specified by: getNext in class JCasResourceCollectionReader_ImplBase
Parameters:
jCas - the JCas.
Throws:
IOException - if an I/O error occurs reading the data.
org.apache.uima.collection.CollectionException - if another type of error occurs.

Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.