public class TreeTaggerChunker
extends org.apache.uima.fit.component.JCasAnnotator_ImplBase
Modifier and Type | Field and Description |
---|---|
protected String | chunkMappingLocation |
protected String | language |
protected String | modelLocation |
static String | PARAM_CHUNK_MAPPING_LOCATION: Location of the mapping file for chunk tags to UIMA types. |
static String | PARAM_EXECUTABLE_PATH: Use this TreeTagger executable instead of trying to locate the executable automatically. |
static String | PARAM_FLUSH_SEQUENCE: A sequence to flush the internal TreeTagger buffer and to force it to output the rest of the completed analysis. |
static String | PARAM_INTERN_TAGS: Use the String.intern() method on tags. |
static String | PARAM_LANGUAGE: Use this language instead of the document language to resolve the model. |
static String | PARAM_MODEL_LOCATION: Load the model from this location instead of locating the model automatically. |
static String | PARAM_PERFORMANCE_MODE: TT4J setting that disables some sanity checks. |
static String | PARAM_PRINT_TAGSET: Log the tag set(s) when a model is loaded. |
static String | PARAM_VARIANT: Override the default variant used to locate the model. |
protected boolean | printTagSet |
protected String | variant |
Constructor and Description |
---|
TreeTaggerChunker() |
Modifier and Type | Method and Description |
---|---|
void | initialize(org.apache.uima.UimaContext aContext) |
void | process(org.apache.uima.jcas.JCas aJCas) |
Inherited methods: getRequiredCasInterface, process, getCasInstancesRequired, hasNext, next
public static final String PARAM_LANGUAGE
protected String language
public static final String PARAM_VARIANT
protected String variant
public static final String PARAM_EXECUTABLE_PATH
public static final String PARAM_MODEL_LOCATION
protected String modelLocation
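The summaries for PARAM_LANGUAGE, PARAM_VARIANT and PARAM_MODEL_LOCATION describe an override pattern: an explicit value wins, otherwise the component falls back to the document language, the default variant, or automatic lookup. A minimal sketch of that precedence follows. The class ModelResolutionSketch, its method names, and the "auto:language/variant" key are hypothetical and only illustrate the ordering; the actual lookup scheme of TreeTaggerChunker is not shown on this page.

```java
public class ModelResolutionSketch {
    /** Returns the override if it is set, otherwise the fallback. */
    static String firstNonNull(String override, String fallback) {
        return override != null ? override : fallback;
    }

    /**
     * Hypothetical resolution order. The returned "auto:..." key is an
     * illustrative placeholder, not the format used by the real component.
     */
    static String resolve(String modelLocation, String language, String variant,
            String documentLanguage, String defaultVariant) {
        if (modelLocation != null) {
            // An explicit model location bypasses automatic lookup entirely.
            return modelLocation;
        }
        // The language parameter takes precedence over the document language.
        String effectiveLanguage = firstNonNull(language, documentLanguage);
        // The variant parameter takes precedence over the default variant.
        String effectiveVariant = firstNonNull(variant, defaultVariant);
        return "auto:" + effectiveLanguage + "/" + effectiveVariant;
    }

    public static void main(String[] args) {
        System.out.println(resolve(null, "de", null, "fr", "default"));
        System.out.println(resolve("/models/custom.par", "de", null, "fr", "default"));
    }
}
```

In this sketch a null parameter means "not configured", which mirrors how an unset optional String parameter would typically look to the component.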
public static final String PARAM_CHUNK_MAPPING_LOCATION
protected String chunkMappingLocation
public static final String PARAM_INTERN_TAGS
Use the String.intern() method on tags. This is usually a good idea to avoid
spamming the heap with thousands of strings representing only a few different tags.
Default: true
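The effect of interning can be seen with the standard library alone. The sketch below is not taken from TreeTaggerChunker: the class InternTagsSketch and the three chunk tags are hypothetical, and `new String(...)` stands in for tags parsed from tagger output, where each occurrence would otherwise be a separate heap object.

```java
import java.util.ArrayList;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;

public class InternTagsSketch {
    /** Counts distinct String objects by identity (==), not by equals(). */
    static int distinctInstances(List<String> tags) {
        Map<String, Boolean> seen = new IdentityHashMap<>();
        for (String tag : tags) {
            seen.put(tag, Boolean.TRUE);
        }
        return seen.size();
    }

    /** Simulates reading tags from tagger output, optionally interning them. */
    static List<String> readTags(int count, boolean intern) {
        String[] tagset = {"NC", "VC", "PC"}; // hypothetical chunk tags
        List<String> tags = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            // new String(...) forces a fresh object, as parsing output would.
            String tag = new String(tagset[i % tagset.length]);
            tags.add(intern ? tag.intern() : tag);
        }
        return tags;
    }

    public static void main(String[] args) {
        System.out.println(distinctInstances(readTags(3000, false))); // 3000
        System.out.println(distinctInstances(readTags(3000, true)));  // 3
    }
}
```

Without interning, 3000 tags occupy 3000 String objects; with interning they collapse to one shared object per distinct tag.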
public static final String PARAM_PRINT_TAGSET
Log the tag set(s) when a model is loaded.
Default: false
protected boolean printTagSet
public static final String PARAM_PERFORMANCE_MODE
public static final String PARAM_FLUSH_SEQUENCE
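A sequence to flush the internal TreeTagger buffer and to force it to output the rest of the
completed analysis. Example: Nous-PRO:PER\n...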
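The example value can be read as a POS-tagged token, a newline, and three full stops. The sketch below builds such a string in Java. It rests on two assumptions that this page does not confirm: that the `\n` is a newline escape, and that the chunker input joins token and POS tag with a hyphen. The class FlushSequenceSketch and its methods are hypothetical helpers, not part of the component.

```java
public class FlushSequenceSketch {
    /** Formats a token with its POS tag as "token-TAG" (assumed input format). */
    static String tagged(String token, String posTag) {
        return token + "-" + posTag;
    }

    /** Builds the tagged token followed by a real newline and three full stops. */
    static String flushSequence(String token, String posTag) {
        return tagged(token, posTag) + "\n" + "...";
    }

    public static void main(String[] args) {
        String sequence = flushSequence("Nous", "PRO:PER");
        // Print with the newline made visible again for inspection.
        System.out.println(sequence.replace("\n", "\\n"));
    }
}
```

A string built this way would be passed as the value of PARAM_FLUSH_SEQUENCE when the default sequence does not flush a particular model.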
public void initialize(org.apache.uima.UimaContext aContext) throws org.apache.uima.resource.ResourceInitializationException
Specified by: initialize in interface org.apache.uima.analysis_component.AnalysisComponent
Overrides: initialize in class org.apache.uima.fit.component.JCasAnnotator_ImplBase
Throws: org.apache.uima.resource.ResourceInitializationException
public void process(org.apache.uima.jcas.JCas aJCas) throws org.apache.uima.analysis_engine.AnalysisEngineProcessException
Specified by: process in class org.apache.uima.analysis_component.JCasAnnotator_ImplBase
Throws: org.apache.uima.analysis_engine.AnalysisEngineProcessException
Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.