public class CisStemmer extends FeaturePathAnnotatorBase
UIMA wrapper for the CISTEM algorithm.
CISTEM is a stemming algorithm for the German language, developed by Leonie Weißweiler and Alexander Fraser.
Annotation types to be stemmed can be configured by a FeaturePath.
If you use this component in a pipeline that also performs stop word removal, make sure it runs after the stop word removal step, so that only words that are not stop words are stemmed.
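The ordering described above can be illustrated with a minimal, library-free sketch. It does not use UIMA or the CISTEM implementation: `StopWordsThenStem`, `toyStem`, and `stemNonStopWords` are hypothetical names introduced only for this illustration, and `toyStem` is a deliberately crude stand-in that strips a single trailing suffix. The point is only that stop words are filtered out first, so the stemmer never sees them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class StopWordsThenStem {

    // Hypothetical stand-in for a stemmer. This is NOT the CISTEM algorithm:
    // it only strips one trailing "en", "e", "n" or "s" from longer words.
    static String toyStem(String word) {
        if (word.length() > 4 && word.endsWith("en")) {
            return word.substring(0, word.length() - 2);
        }
        if (word.length() > 3
                && (word.endsWith("e") || word.endsWith("n") || word.endsWith("s"))) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // Step 1: drop stop words. Step 2: stem only the tokens that remain.
    static List<String> stemNonStopWords(List<String> tokens, Set<String> stopWords) {
        List<String> stems = new ArrayList<>();
        for (String token : tokens) {
            if (stopWords.contains(token.toLowerCase(Locale.GERMAN))) {
                continue; // stop word: removed before stemming, never stemmed
            }
            stems.add(toyStem(token));
        }
        return stems;
    }

    public static void main(String[] args) {
        List<String> tokens = List.of("die", "Katzen", "und", "die", "Hunde");
        Set<String> stopWords = Set.of("die", "und");
        System.out.println(stemNonStopWords(tokens, stopWords)); // prints [Katz, Hund]
    }
}
```

In an actual pipeline the same ordering would be expressed by placing the stop word removal component before CisStemmer in the engine sequence.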
FeaturePathAnnotatorBase
Modifier and Type | Field and Description |
---|---|
protected boolean | lowerCase |
static String | PARAM_LOWER_CASE: By default, the stemmer runs in case-sensitive mode. |
filterCondition, filterConditionOperator, filterConditionValue, filterFeaturePath, filterFeaturePathInfo, fp, PARAM_FILTER_CONDITION_OPERATOR, PARAM_FILTER_CONDITION_VALUE, PARAM_FILTER_FEATUREPATH, PARAM_PATHS, paths
Constructor and Description |
---|
CisStemmer() |
Modifier and Type | Method and Description |
---|---|
protected void | generateAnnotations(org.apache.uima.jcas.JCas jcas): Method to create annotations. |
protected Set<String> | getDefaultPaths() |
initialize, initializeFeaturePathInfoFrom, process
getRequiredCasInterface, process
getCasInstancesRequired, hasNext, next
public static final String PARAM_LOWER_CASE
protected boolean lowerCase
protected Set<String> getDefaultPaths()
getDefaultPaths in class FeaturePathAnnotatorBase
protected void generateAnnotations(org.apache.uima.jcas.JCas jcas) throws FeaturePathException, org.apache.uima.analysis_engine.AnalysisEngineProcessException
FeaturePathAnnotatorBase
generateAnnotations in class FeaturePathAnnotatorBase
jcas - the JCas.
FeaturePathException - if an error occurs during initialization of the FeaturePathInfo object
org.apache.uima.analysis_engine.AnalysisEngineProcessException - if another error occurs.
Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.