public class PatternBasedTokenSegmenter
extends org.apache.uima.fit.component.JCasAnnotator_ImplBase
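Splits existing Tokens at matches of the configured split patterns. A prefix on each pattern states whether the matched text is itself added as a Token.
If the INCLUDE_PREFIX
precedes the split pattern, the text matched by the pattern is included as a separate Token.
Consequently, text matched by a pattern that follows the EXCLUDE_PREFIX
is not added as a Token.

Modifier and Type | Field and Description |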
---|---|
static String | EXCLUDE_PREFIX |
static String | INCLUDE_PREFIX |
static String | PARAM_DELETE_COVER Whether to remove the original token. |
static String | PARAM_PATTERNS A list of regular expressions, each prefixed with INCLUDE_PREFIX or EXCLUDE_PREFIX. |
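
The include/exclude behaviour described for the split patterns can be illustrated with a small standalone sketch. It is not the annotator's actual implementation: it uses only `java.util.regex`, works on plain strings instead of Token annotations, and the class name `TokenSplitSketch`, the method `split`, and the prefix values `"+"` and `"-"` are assumptions made for illustration. Check the real values of INCLUDE_PREFIX and EXCLUDE_PREFIX before relying on them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration class; not part of the documented API.
final class TokenSplitSketch {
    // Assumed prefix values; the documentation above does not state them.
    static final String INCLUDE_PREFIX = "+";
    static final String EXCLUDE_PREFIX = "-";

    /**
     * Splits the text of one token at every match of a prefixed pattern.
     * With the include prefix, the matched text becomes a part of its own;
     * with the exclude prefix (or no prefix), the matched text is dropped.
     */
    static List<String> split(String tokenText, String prefixedPattern) {
        boolean include = prefixedPattern.startsWith(INCLUDE_PREFIX);
        boolean exclude = !include && prefixedPattern.startsWith(EXCLUDE_PREFIX);

        // Strip the prefix if present; an unprefixed pattern is treated as exclude.
        String regex = prefixedPattern;
        if (include) {
            regex = prefixedPattern.substring(INCLUDE_PREFIX.length());
        } else if (exclude) {
            regex = prefixedPattern.substring(EXCLUDE_PREFIX.length());
        }

        List<String> parts = new ArrayList<>();
        Matcher matcher = Pattern.compile(regex).matcher(tokenText);
        int pos = 0;
        while (matcher.find()) {
            if (matcher.start() == matcher.end()) {
                continue; // ignore empty matches, they cannot split anything
            }
            if (matcher.start() > pos) {
                parts.add(tokenText.substring(pos, matcher.start()));
            }
            if (include) {
                parts.add(matcher.group()); // keep the split text as its own part
            }
            pos = matcher.end();
        }
        if (pos < tokenText.length()) {
            parts.add(tokenText.substring(pos));
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(split("well-known", "+-")); // [well, -, known]
        System.out.println(split("well-known", "--")); // [well, known]
        System.out.println(split("well_known", "_"));  // [well, known]
    }
}
```

Under these assumed prefix values, an unprefixed regular expression that itself begins with `+` or `-` would be read as prefixed, so writing the prefix explicitly is the safer choice in such cases.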
Constructor and Description |
---|
PatternBasedTokenSegmenter() |
Modifier and Type | Method and Description |
---|---|
void | initialize(org.apache.uima.UimaContext aContext) |
void | process(org.apache.uima.jcas.JCas aJCas) |
getRequiredCasInterface, process
getCasInstancesRequired, hasNext, next
public static final String INCLUDE_PREFIX
public static final String EXCLUDE_PREFIX
public static final String PARAM_DELETE_COVER
Whether to remove the original token. Default: true
public static final String PARAM_PATTERNS
A list of regular expressions, each prefixed with INCLUDE_PREFIX or EXCLUDE_PREFIX. If neither prefix is used, EXCLUDE_PREFIX is assumed.
public void initialize(org.apache.uima.UimaContext aContext) throws org.apache.uima.resource.ResourceInitializationException
Specified by: initialize in interface org.apache.uima.analysis_component.AnalysisComponent
Overrides: initialize in class org.apache.uima.fit.component.JCasAnnotator_ImplBase
Throws: org.apache.uima.resource.ResourceInitializationException
public void process(org.apache.uima.jcas.JCas aJCas) throws org.apache.uima.analysis_engine.AnalysisEngineProcessException
Specified by: process in class org.apache.uima.analysis_component.JCasAnnotator_ImplBase
Throws: org.apache.uima.analysis_engine.AnalysisEngineProcessException
Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.