de.tudarmstadt.ukp.jwktl.parser (Java Wiktionary Library (JWKTL) 1.1.0 API)

Interface Summary
Interface	Description
IWiktionaryDumpParser	Parser for Wiktionary dump files obtained from http://download.wikimedia.org/backup-index.html.
IWiktionaryEntryParser	A parser for separating an article page's text into individual Wiktionary word entries.
IWiktionaryMultistreamDumpParser
IWiktionaryPageParser	Generic interface for parsing XML dumps in a MediaWiki format.
IWritableWiktionaryEdition	Generic interface for writable Wiktionary language editions used by the parsers to store the extracted entries and information types.
MultistreamFilter

Class Summary
Class	Description
MultistreamFilter.IncludingNames	A filter which includes only page titles contained in the specified list
WiktionaryArticleParser	Parses a Wiktionary XML dump and stores the parsed information as a Berkeley DB within a specified directory.
WiktionaryDumpParser	Extension of the `XMLDumpParser` that reads the different XML tags of the Wiktionary XML dump file format and provides hotspots for each type of information.
WiktionaryEntryParser	Base implementation for parsing the textual contents of an article page in order to construct `IWiktionaryEntry` and `IWiktionarySense` instances.
WiktionaryPageParser<PageType extends WiktionaryPage>	Abstract base class for implementations of the `IWiktionaryPageParser` interface.
WritableBerkeleyDBWiktionaryEdition	Extends the Berkeley DB implementation by providing the possibility for modifying the contents.
XMLDumpParser	Implementation of `IWiktionaryDumpParser` for processing XML files downloaded from http://download.wikimedia.org/backup-index.html.

Package de.tudarmstadt.ukp.jwktl.parser