public final class ComponentParameters extends Object
Modifier and Type | Field and Description |
---|---|
static String |
AUTO_NUM_THREADS
Use smart number of threads if PARAM_NUM_THREADS is set to this value
|
static String |
DEFAULT_ENCODING |
static String |
PARAM_ACCEPTED_TAGS_REGEX
Regex to filter tags, e. g.
|
static String |
PARAM_CHUNK_MAPPING_LOCATION
Location of the mapping file for chunk tags to UIMA types.
|
static String |
PARAM_CHUNK_TAG_SET
Use this chunk tag set to use to resolve the tag set mapping instead of using the
tag set defined as part of the model meta data.
|
static String |
PARAM_CONSTITUENT_MAPPING_LOCATION
Location of the mapping file for constituent tags to UIMA types.
|
static String |
PARAM_CONSTITUENT_TAG_SET |
static String |
PARAM_DELETE_COVER
When splitting an annotation into multiple parts, e.g.
|
static String |
PARAM_DEPENDENCY_MAPPING_LOCATION
Location of the mapping file for dependency tags to UIMA types.
|
static String |
PARAM_FILENAME_EXTENSION
Use this filename extension.
|
static String |
PARAM_INTERN_TAGS
Use the
String.intern() method on tags. |
static String |
PARAM_LANGUAGE
For analysis engines: Use this language instead of the document language to resolve the model
and tag set mapping.
|
static String |
PARAM_MAX_SENTENCE_LENGTH
Maximal sentence length in tokens that is still being processed.
|
static String |
PARAM_MODEL_ENCODING
The character encoding used by the model.
|
static String |
PARAM_MODEL_LOCATION
Location from which the model is read.
|
static String |
PARAM_MORPH_MAPPING_LOCATION
Location of the mapping file for morphological analysis strings to features.
|
static String |
PARAM_MORPH_TAG_SET |
static String |
PARAM_NAMED_ENTITY_MAPPING_LOCATION
Location of the mapping file for named entity tags to UIMA types.
|
static String |
PARAM_NUM_THREADS
The number of threads to use for components that implement multi-threading
|
static String |
PARAM_PATTERNS
Variant of the model.
|
static String |
PARAM_POS_MAPPING_LOCATION
Location of the mapping file for part-of-speech tags to UIMA types.
|
static String |
PARAM_POS_TAG_SET
Use this part-of-speech tag set to use to resolve the tag set mapping instead of using the
tag set defined as part of the model meta data.
|
static String |
PARAM_PRINT_TAGSET
Log the tag set(s) when a model is loaded.
|
static String |
PARAM_READ_CHUNK |
static String |
PARAM_READ_CONSTITUENT |
static String |
PARAM_READ_COREFERENCE |
static String |
PARAM_READ_CPOS |
static String |
PARAM_READ_DEPENDENCY |
static String |
PARAM_READ_FORM |
static String |
PARAM_READ_LEMMA |
static String |
PARAM_READ_MORPH |
static String |
PARAM_READ_NAMED_ENTITY |
static String |
PARAM_READ_PARAGRAPH |
static String |
PARAM_READ_PENN_TREE |
static String |
PARAM_READ_POS |
static String |
PARAM_READ_SENTENCE |
static String |
PARAM_READ_TOKEN |
static String |
PARAM_SEGMENTATION_MODEL_LOCATION
Location from which the segmentation model is read.
|
static String |
PARAM_SOURCE_ENCODING
Character encoding of the input data.
|
static String |
PARAM_SOURCE_LOCATION
Location from which the input is read.
|
static String |
PARAM_STRIP_EXTENSION
Remove the original extension.
|
static String |
PARAM_TARGET_ENCODING
Character encoding of the output data.
|
static String |
PARAM_TARGET_LOCATION
Location to which the output is written.
|
static String |
PARAM_TOKENIZATION_MODEL_LOCATION
Location from which the tokenization model is read.
|
static String |
PARAM_VARIANT
Variant of a model the model.
|
static String |
PARAM_WRITE_CHUNK |
static String |
PARAM_WRITE_CONSTITUENT |
static String |
PARAM_WRITE_COREFERENCE |
static String |
PARAM_WRITE_CPOS |
static String |
PARAM_WRITE_DEPENDENCY |
static String |
PARAM_WRITE_FORM |
static String |
PARAM_WRITE_LEMMA |
static String |
PARAM_WRITE_MORPH |
static String |
PARAM_WRITE_NAMED_ENTITY |
static String |
PARAM_WRITE_PARAGRAPH |
static String |
PARAM_WRITE_PENN_TREE |
static String |
PARAM_WRITE_POS |
static String |
PARAM_WRITE_SENTENCE |
static String |
PARAM_WRITE_TOKEN |
Modifier and Type | Method and Description |
---|---|
static int |
computeNumThreads(int value)
Compute the number of threads to use for components that can make use of multi-threading.
|
public static final String DEFAULT_ENCODING
public static final String PARAM_LANGUAGE
public static final String PARAM_PATTERNS
public static final String PARAM_VARIANT
public static final String PARAM_ACCEPTED_TAGS_REGEX
public static final String PARAM_MODEL_LOCATION
public static final String PARAM_SEGMENTATION_MODEL_LOCATION
public static final String PARAM_TOKENIZATION_MODEL_LOCATION
public static final String PARAM_MODEL_ENCODING
public static final String PARAM_SOURCE_LOCATION
public static final String PARAM_SOURCE_ENCODING
public static final String PARAM_TARGET_LOCATION
public static final String PARAM_TARGET_ENCODING
public static final String PARAM_FILENAME_EXTENSION
public static final String PARAM_STRIP_EXTENSION
public static final String PARAM_PRINT_TAGSET
public static final String PARAM_INTERN_TAGS
String.intern()
method on tags. This is usually a good idea to avoid
spamming the heap with thousands of strings representing only a few different tags.public static final String PARAM_DELETE_COVER
public static final String PARAM_MAX_SENTENCE_LENGTH
public static final String PARAM_NUM_THREADS
public static final String AUTO_NUM_THREADS
public static final String PARAM_READ_PARAGRAPH
public static final String PARAM_READ_SENTENCE
public static final String PARAM_READ_TOKEN
public static final String PARAM_READ_FORM
public static final String PARAM_READ_LEMMA
public static final String PARAM_READ_POS
public static final String PARAM_READ_CPOS
public static final String PARAM_READ_CHUNK
public static final String PARAM_READ_MORPH
public static final String PARAM_READ_CONSTITUENT
public static final String PARAM_READ_COREFERENCE
public static final String PARAM_READ_PENN_TREE
public static final String PARAM_READ_DEPENDENCY
public static final String PARAM_READ_NAMED_ENTITY
public static final String PARAM_WRITE_PARAGRAPH
public static final String PARAM_WRITE_SENTENCE
public static final String PARAM_WRITE_TOKEN
public static final String PARAM_WRITE_FORM
public static final String PARAM_WRITE_LEMMA
public static final String PARAM_WRITE_POS
public static final String PARAM_WRITE_CPOS
public static final String PARAM_WRITE_CHUNK
public static final String PARAM_WRITE_MORPH
public static final String PARAM_WRITE_CONSTITUENT
public static final String PARAM_WRITE_COREFERENCE
public static final String PARAM_WRITE_PENN_TREE
public static final String PARAM_WRITE_DEPENDENCY
public static final String PARAM_WRITE_NAMED_ENTITY
public static final String PARAM_POS_TAG_SET
public static final String PARAM_CHUNK_TAG_SET
public static final String PARAM_CONSTITUENT_TAG_SET
public static final String PARAM_MORPH_TAG_SET
public static final String PARAM_POS_MAPPING_LOCATION
public static final String PARAM_CONSTITUENT_MAPPING_LOCATION
public static final String PARAM_CHUNK_MAPPING_LOCATION
public static final String PARAM_NAMED_ENTITY_MAPPING_LOCATION
public static final String PARAM_MORPH_MAPPING_LOCATION
public static final String PARAM_DEPENDENCY_MAPPING_LOCATION
public static int computeNumThreads(int value)
AUTO_NUM_THREADS
(0): use the number of available CPUs minus one.value
- the user-proposed number of threads (positive, negative, or 0)Copyright © 2007–2018 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.