Package | Description |
---|---|
de.tudarmstadt.ukp.dkpro.core.clearnlp | |
de.tudarmstadt.ukp.dkpro.core.jtok | |
de.tudarmstadt.ukp.dkpro.core.languagetool | Grammar and style checker based on LanguageTool. |
de.tudarmstadt.ukp.dkpro.core.mecab | Integration of the MeCab part-of-speech and morphological analyzer. |
de.tudarmstadt.ukp.dkpro.core.opennlp | Integration of the Apache OpenNLP tools. |
de.tudarmstadt.ukp.dkpro.core.stanfordnlp | Integration of NLP components from the Stanford CoreNLP suite. |
de.tudarmstadt.ukp.dkpro.core.testing.harness | |
de.tudarmstadt.ukp.dkpro.core.tokit | Collection of tokenization and segmentation components. |

Modifier and Type | Class and Description |
---|---|
class | ClearNlpSegmenter: Tokenizer using Clear NLP. |

Modifier and Type | Class and Description |
---|---|
class | JTokSegmenter: JTok segmenter. |

Modifier and Type | Class and Description |
---|---|
class | LanguageToolSegmenter: Segmenter using LanguageTool to do the heavy lifting. |

Modifier and Type | Class and Description |
---|---|
class | MeCabTagger: Annotator for the MeCab Japanese POS Tagger. |

Modifier and Type | Class and Description |
---|---|
class | OpenNlpSegmenter: Tokenizer and sentence splitter using OpenNLP. |

Modifier and Type | Class and Description |
---|---|
class | StanfordSegmenter |

Modifier and Type | Method and Description |
---|---|
static void | SegmenterHarness.testLaxZoning(Class<? extends SegmenterBase> aSegmenter, String aLanguage) |
static void | SegmenterHarness.testOufOfBoundsZones(Class<? extends SegmenterBase> aSegmenter, String aLanguage) |
static void | SegmenterHarness.testStrictZoning(Class<? extends SegmenterBase> aSegmenter, String aLanguage) |
static void | SegmenterHarness.testZoning(Class<? extends SegmenterBase> aSegmenter) |
static void | SegmenterHarness.testZoning(Class<? extends SegmenterBase> aSegmenter, String aLanguage) |

Modifier and Type | Class and Description |
---|---|
class | BreakIteratorSegmenter: BreakIterator segmenter. |
class | LineBasedSentenceSegmenter: Annotates each line in the source text as a sentence. |
class | RegexTokenizer: This segmenter splits sentences and tokens based on regular expressions that define the sentence and token boundaries. |
class | WhitespaceTokenizer: Deprecated. Use RegexTokenizer. |

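
The descriptions of LineBasedSentenceSegmenter and BreakIteratorSegmenter above are terse, so the following is a minimal JDK-only sketch of the two ideas they name: one sentence per line, and token boundaries taken from `java.text.BreakIterator`. It is not the DKPro Core implementation and does not use UIMA. The names `SegmentationSketch`, `Span`, `lineSentences`, and `wordTokens` are hypothetical and exist only for illustration. The choice to skip blank lines and trim whitespace is also an assumption, not documented behavior of the components listed here.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration class; not part of DKPro Core.
final class SegmentationSketch {

    /** Half-open character span [begin, end) into the source text. */
    static final class Span {
        final int begin;
        final int end;

        Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }

        String coveredText(String text) {
            return text.substring(begin, end);
        }
    }

    /** Treats every non-blank line as one sentence (assumption: whitespace is trimmed). */
    static List<Span> lineSentences(String text) {
        List<Span> sentences = new ArrayList<>();
        int lineStart = 0;
        while (lineStart <= text.length()) {
            int lineEnd = text.indexOf('\n', lineStart);
            if (lineEnd < 0) {
                lineEnd = text.length();
            }
            int begin = lineStart;
            int end = lineEnd;
            // Trim leading and trailing whitespace, including a trailing '\r'.
            while (begin < end && Character.isWhitespace(text.charAt(begin))) {
                begin++;
            }
            while (end > begin && Character.isWhitespace(text.charAt(end - 1))) {
                end--;
            }
            if (begin < end) {
                sentences.add(new Span(begin, end));
            }
            lineStart = lineEnd + 1;
        }
        return sentences;
    }

    /** Splits one sentence into tokens using the JDK word BreakIterator. */
    static List<Span> wordTokens(String text, Span sentence, Locale locale) {
        List<Span> tokens = new ArrayList<>();
        String covered = sentence.coveredText(text);
        BreakIterator words = BreakIterator.getWordInstance(locale);
        words.setText(covered);
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE; start = end, end = words.next()) {
            // The iterator also reports whitespace runs as segments; skip them.
            if (!covered.substring(start, end).trim().isEmpty()) {
                tokens.add(new Span(sentence.begin + start, sentence.begin + end));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "Hello world.\nSecond line here";
        for (Span sentence : lineSentences(text)) {
            System.out.println("Sentence: " + sentence.coveredText(text));
            for (Span token : wordTokens(text, sentence, Locale.ENGLISH)) {
                System.out.println("  Token [" + token.begin + "," + token.end + "): "
                        + token.coveredText(text));
            }
        }
    }
}
```

Token offsets are reported relative to the whole document rather than to the sentence, which is how stand-off annotations are usually addressed.
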
Copyright © 2007–2016 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.