public class CoreNlpParser
extends org.apache.uima.fit.component.JCasAnnotator_ImplBase
Modifier and Type | Field and Description |
---|---|
static String | PARAM_CONSTITUENT_MAPPING_LOCATION: Location of the mapping file for constituent tags to UIMA types. |
static String | PARAM_DEPENDENCY_MAPPING_LOCATION: Location of the mapping file for dependency tags to UIMA types. |
static String | PARAM_EXTRA_DEPENDENCIES |
static String | PARAM_INTERN_TAGS: Use the String.intern() method on tags. |
static String | PARAM_KEEP_PUNCTUATION |
static String | PARAM_LANGUAGE: Use this language instead of the document language to resolve the model and tag set mapping. |
static String | PARAM_MAX_SENTENCE_LENGTH |
static String | PARAM_MAX_TIME |
static String | PARAM_MODEL_ENCODING: The character encoding used by the model. |
static String | PARAM_MODEL_LOCATION: Location from which the model is read. |
static String | PARAM_NUM_THREADS |
static String | PARAM_ORIGINAL_DEPENDENCIES |
static String | PARAM_POS_MAPPING_LOCATION: Location of the mapping file for part-of-speech tags to UIMA types. |
static String | PARAM_PRINT_TAGSET: Log the tag set(s) when a model is loaded. |
static String | PARAM_PTB3_ESCAPING: Enable all traditional PTB3 token transforms (like -LRB-, -RRB-). |
static String | PARAM_QUOTE_BEGIN: List of extra token texts (usually single-character strings) that should be treated like opening quotes and escaped accordingly before being sent to the parser. |
static String | PARAM_QUOTE_END: List of extra token texts (usually single-character strings) that should be treated like closing quotes and escaped accordingly before being sent to the parser. |
static String | PARAM_READ_POS: Whether to use existing POS tags. |
static String | PARAM_VARIANT: Variant of the model. |
static String | PARAM_WRITE_CONSTITUENT: Whether to create constituent tags. |
static String | PARAM_WRITE_DEPENDENCY: Whether to create dependency annotations. |
static String | PARAM_WRITE_PENN_TREE: If set to true, each sentence is annotated with a PennTree annotation containing the whole parse tree in Penn Treebank style format. |
static String | PARAM_WRITE_POS: Whether to create POS tags. |
Constructor and Description |
---|
CoreNlpParser() |
Modifier and Type | Method and Description |
---|---|
void | initialize(org.apache.uima.UimaContext aContext) |
void | process(org.apache.uima.jcas.JCas aJCas) |
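Methods inherited from class org.apache.uima.analysis_component.JCasAnnotator_ImplBase: getRequiredCasInterface, process
Methods inherited from class org.apache.uima.analysis_component.Annotator_ImplBase: getCasInstancesRequired, hasNext, next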
public static final String PARAM_PRINT_TAGSET
Default: false
public static final String PARAM_LANGUAGE
public static final String PARAM_VARIANT
public static final String PARAM_MODEL_LOCATION
public static final String PARAM_MODEL_ENCODING
public static final String PARAM_DEPENDENCY_MAPPING_LOCATION
public static final String PARAM_CONSTITUENT_MAPPING_LOCATION
public static final String PARAM_POS_MAPPING_LOCATION
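The three mapping-location parameters each point to a file that maps tags to UIMA types. As a rough illustration of that idea only, the sketch below looks up a type name for a tag and falls back to a default entry. It assumes a `java.util.Properties`-style file with a `*` fallback key; the class `TagMappingSketch`, the `*` key, and the `example.pos.*` type names are hypothetical and are not taken from this page.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

final class TagMappingSketch {
    // Hypothetical fallback key used when a tag has no explicit entry (an assumption).
    private static final String FALLBACK_KEY = "*";

    private final Properties mapping = new Properties();

    private TagMappingSketch() {
    }

    /** Parses mapping content in java.util.Properties syntax (one tag=typeName entry per line). */
    static TagMappingSketch parse(String content) {
        TagMappingSketch sketch = new TagMappingSketch();
        try {
            sketch.mapping.load(new StringReader(content));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sketch;
    }

    /** Returns the type name mapped to the tag, or the fallback entry if the tag is unknown. */
    String typeFor(String tag) {
        return mapping.getProperty(tag, mapping.getProperty(FALLBACK_KEY));
    }

    public static void main(String[] args) {
        // Made-up mapping content; real mapping files ship with the models.
        TagMappingSketch pos = parse("NN=example.pos.Noun\nVB=example.pos.Verb\n*=example.pos.Other\n");
        System.out.println(pos.typeFor("NN")); // explicit entry
        System.out.println(pos.typeFor("FW")); // no entry, so the fallback is used
    }
}
```

A fallback entry keeps unknown tags from failing the lookup, which matters when a model emits tags that the mapping file does not list.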
public static final String PARAM_INTERN_TAGS
Use the String.intern() method on tags. This is usually a good idea to avoid
spamming the heap with thousands of strings representing only a few different tags.
Default: false
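To make the effect of interning concrete, here is a minimal plain-Java sketch. It does not use the parser; `InternTagsSketch` and `normalizeTag` are hypothetical names, and the `new String(...)` calls only simulate tags arriving as separate heap objects.

```java
final class InternTagsSketch {
    /** Returns the pooled canonical instance of the tag when interning is enabled. */
    static String normalizeTag(String tag, boolean internTags) {
        return internTags ? tag.intern() : tag;
    }

    public static void main(String[] args) {
        // new String(...) forces two distinct objects with equal content.
        String first = new String("NN");
        String second = new String("NN");

        // Without interning, both objects stay on the heap.
        System.out.println(normalizeTag(first, false) == normalizeTag(second, false)); // prints "false"

        // With interning, both references resolve to one shared instance.
        System.out.println(normalizeTag(first, true) == normalizeTag(second, true)); // prints "true"
    }
}
```

With interning enabled, thousands of annotations carrying the tag `NN` can share one string object instead of holding one copy each.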
public static final String PARAM_MAX_SENTENCE_LENGTH
public static final String PARAM_NUM_THREADS
public static final String PARAM_MAX_TIME
public static final String PARAM_PTB3_ESCAPING
See Also: PTBEscapingProcessor, Constant Field Values
public static final String PARAM_QUOTE_BEGIN
public static final String PARAM_QUOTE_END
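The following sketch illustrates the kind of token transform that PARAM_PTB3_ESCAPING, PARAM_QUOTE_BEGIN, and PARAM_QUOTE_END describe: brackets become Penn Treebank placeholders such as -LRB- and -RRB-, and extra quote tokens become the PTB quote forms. It is a simplified stand-in, not the behavior of PTBEscapingProcessor; the class `PtbEscapingSketch`, its `escape` method, and the choice of guillemets as extra quotes are assumptions made for illustration.

```java
import java.util.Map;
import java.util.Set;

final class PtbEscapingSketch {
    // Bracket placeholders from the Penn Treebank tokenization convention.
    private static final Map<String, String> BRACKETS = Map.of(
            "(", "-LRB-", ")", "-RRB-",
            "[", "-LSB-", "]", "-RSB-",
            "{", "-LCB-", "}", "-RCB-");

    /** Escapes a single token; tokens that need no escaping are returned unchanged. */
    static String escape(String token, Set<String> quoteBegin, Set<String> quoteEnd) {
        if (quoteBegin.contains(token)) {
            return "``"; // PTB form of an opening quote
        }
        if (quoteEnd.contains(token)) {
            return "''"; // PTB form of a closing quote
        }
        return BRACKETS.getOrDefault(token, token);
    }

    public static void main(String[] args) {
        // Hypothetical extra quote tokens: left and right guillemets.
        Set<String> begin = Set.of("\u00AB");
        Set<String> end = Set.of("\u00BB");
        String[] tokens = {"\u00AB", "Hello", "(", "world", ")", "\u00BB"};
        StringBuilder out = new StringBuilder();
        for (String token : tokens) {
            out.append(escape(token, begin, end)).append(' ');
        }
        System.out.println(out.toString().trim());
    }
}
```

Escaping happens per token before parsing, so the original document text is left untouched and only the parser input changes.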
public static final String PARAM_EXTRA_DEPENDENCIES
public static final String PARAM_WRITE_CONSTITUENT
Default: true
public static final String PARAM_WRITE_PENN_TREE
Default: false
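For readers unfamiliar with the Penn Treebank style format mentioned for PARAM_WRITE_PENN_TREE, the sketch below renders a small hand-built tree as a bracketed string. It is independent of the parser and of the PennTree annotation type; `PennTreeSketch` and its `Node` class are hypothetical helpers, and the example sentence is made up.

```java
import java.util.Arrays;
import java.util.List;

final class PennTreeSketch {
    /** A tree node: a constituent or POS label with children, or a token with none. */
    static final class Node {
        final String label;
        final List<Node> children;

        Node(String label, Node... children) {
            this.label = label;
            this.children = Arrays.asList(children);
        }
    }

    /** Renders the tree in bracketed form: (LABEL child child ...), with tokens as bare leaves. */
    static String toPennString(Node node) {
        if (node.children.isEmpty()) {
            return node.label;
        }
        StringBuilder sb = new StringBuilder("(").append(node.label);
        for (Node child : node.children) {
            sb.append(' ').append(toPennString(child));
        }
        return sb.append(')').toString();
    }

    public static void main(String[] args) {
        Node tree = new Node("ROOT",
                new Node("S",
                        new Node("NP", new Node("PRP", new Node("It"))),
                        new Node("VP", new Node("VBZ", new Node("works")))));
        System.out.println(toPennString(tree)); // prints "(ROOT (S (NP (PRP It)) (VP (VBZ works))))"
    }
}
```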
public static final String PARAM_READ_POS
Default: true
public static final String PARAM_WRITE_POS
Default: false
public static final String PARAM_WRITE_DEPENDENCY
Default: true
public static final String PARAM_ORIGINAL_DEPENDENCIES
public static final String PARAM_KEEP_PUNCTUATION
public void initialize(org.apache.uima.UimaContext aContext) throws org.apache.uima.resource.ResourceInitializationException
Specified by: initialize in interface org.apache.uima.analysis_component.AnalysisComponent
Overrides: initialize in class org.apache.uima.fit.component.JCasAnnotator_ImplBase
Throws: org.apache.uima.resource.ResourceInitializationException
public void process(org.apache.uima.jcas.JCas aJCas) throws org.apache.uima.analysis_engine.AnalysisEngineProcessException
Specified by: process in class org.apache.uima.analysis_component.JCasAnnotator_ImplBase
Throws: org.apache.uima.analysis_engine.AnalysisEngineProcessException
Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.