Interface | Description |
---|---|
IWiktionaryDumpParser |
Parser for Wiktionary dump files obtained from
http://download.wikimedia.org/backup-index.html.
|
IWiktionaryEntryParser |
A parser for separating an article page's text into individual
Wiktionary word entries.
|
IWiktionaryMultistreamDumpParser | |
IWiktionaryPageParser |
Generic interface for parsing XML dumps in a MediaWiki format.
|
IWritableWiktionaryEdition |
Generic interface for writable Wiktionary language editions used by the
parsers to store the extracted entries and information types.
|
MultistreamFilter |
Class | Description |
---|---|
MultistreamFilter.IncludingNames |
A filter which includes only page titles contained in the specified list
|
WiktionaryArticleParser |
Parses a Wiktionary XML dump and stores the parsed information as a
Berkeley DB within a specified directory.
|
WiktionaryDumpParser |
Extension of the
XMLDumpParser that reads the different XML tags
of the Wiktionary XML dump file format and provides hotspots for each
type of information. |
WiktionaryEntryParser |
Base implementation for parsing the textual contents of an article page in
order to construct
IWiktionaryEntry and IWiktionarySense
instances. |
WiktionaryPageParser<PageType extends WiktionaryPage> |
Abstract base class for implementations of the
IWiktionaryPageParser interface. |
WritableBerkeleyDBWiktionaryEdition |
Extends the Berkeley DB implementation by providing the possibility for
modifying the contents.
|
XMLDumpParser |
Implementation of
IWiktionaryDumpParser for processing XML files
downloaded from http://download.wikimedia.org/backup-index.html. |
Copyright © 2011-2016 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.