| Interface | Description |
|---|---|
| IWiktionaryDumpParser |
Parser for Wiktionary dump files obtained from
http://download.wikimedia.org/backup-index.html.
|
| IWiktionaryEntryParser |
A parser for separating an article page's text into individual
Wiktionary word entries.
|
| IWiktionaryMultistreamDumpParser | |
| IWiktionaryPageParser |
Generic interface for parsing XML dumps in a MediaWiki format.
|
| IWritableWiktionaryEdition |
Generic interface for writable Wiktionary language editions used by the
parsers to store the extracted entries and information types.
|
| MultistreamFilter |
| Class | Description |
|---|---|
| MultistreamFilter.IncludingNames |
A filter which includes only page titles contained in the specified list
|
| WiktionaryArticleParser |
Parses a Wiktionary XML dump and stores the parsed information as a
Berkeley DB within a specified directory.
|
| WiktionaryDumpParser |
Extension of the
XMLDumpParser that reads the different XML tags
of the Wiktionary XML dump file format and provides hotspots for each
type of information. |
| WiktionaryEntryParser |
Base implementation for parsing the textual contents of an article page in
order to construct
IWiktionaryEntry and IWiktionarySense
instances. |
| WiktionaryPageParser<PageType extends WiktionaryPage> |
Abstract base class for implementations of the
IWiktionaryPageParser interface. |
| WritableBerkeleyDBWiktionaryEdition |
Extends the Berkeley DB implementation by providing the possibility for
modifying the contents.
|
| XMLDumpParser |
Implementation of
IWiktionaryDumpParser for processing XML files
downloaded from http://download.wikimedia.org/backup-index.html. |
Copyright © 2011-2016 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.