Class | Description |
---|---|
ExtractReuters | Extract all the documents from a Reuters-21578 corpus in SGML format. |
Reuters21578SgmlReader | Read a Reuters-21578 corpus in SGML format. |
Reuters21578TxtReader | Read a Reuters-21578 corpus that has been transformed into text format using ExtractReuters in the lucene-benchmarks project. |
ReutersDocument | A class that holds text and metadata for a Reuters-21578 document. |

Enum | Description |
---|---|
ReutersDocument.CGISPLIT | |
ReutersDocument.LEWISSPLIT | |
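
The two enums are listed without descriptions. In the Reuters-21578 SGML distribution, `LEWISSPLIT` and `CGISPLIT` are attributes on each opening `<REUTERS>` tag that record which train/test split a document belongs to. The following is a minimal sketch of reading those attributes using only the JDK; it does not use the reader classes from this package, and `ReutersTagSketch` and `parseAttributes` are hypothetical names introduced for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of this package: pulls NAME="value"
// attribute pairs out of a <REUTERS ...> opening tag.
public class ReutersTagSketch {

    // Matches upper-case attribute names followed by a double-quoted value.
    private static final Pattern ATTRIBUTE =
            Pattern.compile("([A-Z]+)=\"([^\"]*)\"");

    static Map<String, String> parseAttributes(String openingTag) {
        Map<String, String> attributes = new LinkedHashMap<>();
        Matcher matcher = ATTRIBUTE.matcher(openingTag);
        while (matcher.find()) {
            attributes.put(matcher.group(1), matcher.group(2));
        }
        return attributes;
    }

    public static void main(String[] args) {
        // Example tag in the shape used by the Reuters-21578 SGML files.
        String tag = "<REUTERS TOPICS=\"YES\" LEWISSPLIT=\"TRAIN\" "
                + "CGISPLIT=\"TRAINING-SET\" OLDID=\"5544\" NEWID=\"1\">";
        Map<String, String> attributes = parseAttributes(tag);
        System.out.println("LEWISSPLIT = " + attributes.get("LEWISSPLIT"));
        System.out.println("CGISPLIT   = " + attributes.get("CGISPLIT"));
    }
}
```

A regular expression is enough here because the attribute syntax on the `<REUTERS>` tag is flat and uniformly quoted; a full SGML parser would be the safer choice for the document body.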
Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.