public class PatternBasedTokenSegmenter
extends org.apache.uima.fit.component.JCasAnnotator_ImplBase
Tokens.
If the INCLUDE_PREFIX precedes the split pattern, the pattern is included.
Consequently, patterns following the EXCLUDE_PREFIX, will not be added as a Token.| Modifier and Type | Field and Description |
|---|---|
static String |
EXCLUDE_PREFIX |
static String |
INCLUDE_PREFIX |
static String |
PARAM_DELETE_COVER
Whether to remove the original token.
|
static String |
PARAM_PATTERNS
A list of regular expressions, prefixed with
INCLUDE_PREFIX or
EXCLUDE_PREFIX. |
| Constructor and Description |
|---|
PatternBasedTokenSegmenter() |
| Modifier and Type | Method and Description |
|---|---|
void |
initialize(org.apache.uima.UimaContext aContext) |
void |
process(org.apache.uima.jcas.JCas aJCas) |
getRequiredCasInterface, processgetCasInstancesRequired, hasNext, nextpublic static final String INCLUDE_PREFIX
public static final String EXCLUDE_PREFIX
public static final String PARAM_DELETE_COVER
public static final String PARAM_PATTERNS
INCLUDE_PREFIX or
EXCLUDE_PREFIX. If neither of the prefixes is used, EXCLUDE_PREFIX is
assumed.public void initialize(org.apache.uima.UimaContext aContext)
throws org.apache.uima.resource.ResourceInitializationException
initialize in interface org.apache.uima.analysis_component.AnalysisComponentinitialize in class org.apache.uima.fit.component.JCasAnnotator_ImplBaseorg.apache.uima.resource.ResourceInitializationExceptionpublic void process(org.apache.uima.jcas.JCas aJCas)
throws org.apache.uima.analysis_engine.AnalysisEngineProcessException
process in class org.apache.uima.analysis_component.JCasAnnotator_ImplBaseorg.apache.uima.analysis_engine.AnalysisEngineProcessExceptionCopyright © 2007–2019 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.