public class ExtractReuters extends Object
This is an adaptation of the ExtractReuters class in the lucene-benchmarks package.
Constructor and Description |
---|
ExtractReuters() |
Modifier and Type | Method and Description |
---|---|
static List<ReutersDocument> | extract(Path reutersDir) Read all the SGML files in the given directory. |
public static List<ReutersDocument> extract(Path reutersDir) throws IOException, ParseException
Parameters:
reutersDir - the directory that contains the Reuters SGML files.
Returns:
the extracted ReutersDocuments
Throws:
IOException - if any of the files cannot be read.
ParseException
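A caller of extract(Path) has to handle both checked exceptions. The following is a minimal sketch of that calling pattern, not the library's own code. Since this page does not state the package of ExtractReuters, the sketch defines hypothetical stand-ins named ReutersDocument and extract with the same signature so that it compiles on its own; the stand-in only lists files ending in .sgm (an assumed extension) and does no SGML parsing. It also assumes ParseException is java.text.ParseException, which this page does not confirm. The class name ExtractReutersUsage and the helper countDocuments are illustrative.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.text.ParseException;
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Stream;

public class ExtractReutersUsage {

    // Hypothetical stand-in for the library's ReutersDocument type.
    static class ReutersDocument {
        final Path source;

        ReutersDocument(Path source) {
            this.source = source;
        }
    }

    // Hypothetical stand-in with the same signature as ExtractReuters.extract(Path).
    // It creates one placeholder document per *.sgm file and parses nothing.
    // In real code, replace calls to this method with ExtractReuters.extract(reutersDir).
    static List<ReutersDocument> extract(Path reutersDir) throws IOException, ParseException {
        List<ReutersDocument> docs = new ArrayList<>();
        try (Stream<Path> files = Files.list(reutersDir)) {
            files.filter(p -> p.getFileName().toString().endsWith(".sgm"))
                 .sorted()
                 .forEach(p -> docs.add(new ReutersDocument(p)));
        }
        return docs;
    }

    // Returns the number of extracted documents, or -1 if extraction failed.
    static int countDocuments(Path reutersDir) {
        try {
            return extract(reutersDir).size();
        } catch (IOException e) {
            // A file or the directory itself could not be read.
            System.err.println("Cannot read " + reutersDir + ": " + e.getMessage());
            return -1;
        } catch (ParseException e) {
            // A file was readable but its content could not be parsed.
            System.err.println("Cannot parse a file in " + reutersDir + ": " + e.getMessage());
            return -1;
        }
    }

    public static void main(String[] args) {
        // Use the first argument as the directory, or the current directory by default.
        Path dir = Paths.get(args.length > 0 ? args[0] : ".");
        System.out.println("Documents found: " + countDocuments(dir));
    }
}
```

Catching the two exceptions separately lets the caller tell an unreadable directory apart from malformed content; whether a sentinel such as -1 or a rethrow is appropriate depends on the application.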
Copyright © 2007–2016 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.