This document provides detailed information about the DKPro Core type system.

The DKPro Core type system forms the interface between all the integrated components. Components store their data in the UIMA CAS and retrieve it from there based on this type system. The type system design uses a rather flat hierarchy and mostly loose coupling between annotations. It is offered as a set of modules, not as a single monolithic type system.

Types

Table 1. Top-level Types
Type Description

Anomaly

No description

ArticleInfo

Contains basic information about the article.

Chunk

No description

Compound

This type represents a decompounding word, e.g. flowerpot.

Constituent

No description

CoreferenceLink

A link in the coreference chain.

DBConfig

Database configuration for the connection to the database from which the CAS data was retrieved.

Dependency

A dependency relation between two tokens.

Div

Document structure element.

Field

No description

Lemma

No description

MetaDataStringField

A general purpose annotation to store document-wide information in the form of arbitrary key-value string pairs.

Morpheme

No description

MorphologicalFeatures

Morphological categories that can be attached to tokens.

NGram

No description

NamedEntity

Named entities refer e.g. to persons, locations, organizations and so on.

POS

The part of speech of a word or a phrase.

PennTree

The Penn Treebank-style phrase structure string.

PhoneticTranscription

Represents the phonetic transcription of some textual element (usually a Token).

ReadabilityScore

No description

SemanticArgument

The SemanticArgument annotation is attached to semantic arguments of semantic predicates.

SemanticField

The SemanticField is a coarse-grained semantic category that can be attached to nouns, verbs or adjectives.

SemanticPredicate

One of the predicates of a sentence (often a main verb, but nouns and adjectives can also be predicates).

Sentence

No description

SofaChangeAnnotation

Encodes an edit operation that can be interpreted by the ApplyChangesAnnotator.

Split

This type represents a part of a decompounding word.

StanfordSentimentAnnotation

Stanford CoreNLP Sentiment annotation

Stem

No description

StopWord

No description

SuggestedAction

No description

Tag

A multi-purpose tag

TagsetDescription

Information about a tagset (controlled vocabulary).

Tfidf

Annotates the tf.idf score of a token, stem, or lemma.

TimerAnnotation

Used for storing timing information (e.g. for performance testing).

Token

Token is one of the two types commonly produced by a segmenter (the other being Sentence).

TopicDistribution

An array representing the topic proportions in a document.

WikipediaLink

Wikipedia link

WikipediaRevision

Represents a revision in Wikipedia.

WordSense

No description

Anomalies

Anomaly

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.anomaly.type.Anomaly
Name: de.tudarmstadt.ukp.dkpro.core.api.anomaly.type.Anomaly
Supertype: Annotation

Features of Anomaly (3)
description (String)

No description

suggestions (FSArray of SuggestedAction)

An array of the suggested actions to be taken for this anomaly.

category (String)

No description

Producers and consumers of Anomaly

Producers

None declared

Consumers

None declared
Sub-types of Anomaly (2)
Type Description

GrammarAnomaly

No description

SpellingAnomaly

No description

SuggestedAction

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.anomaly.type.SuggestedAction
Name: de.tudarmstadt.ukp.dkpro.core.api.anomaly.type.SuggestedAction
Supertype: Annotation

Features of SuggestedAction (2)
replacement (String)

The text covered by the Anomaly annotation should be replaced with the contents of this feature.

certainty (Float)

A score representing how certain this suggested action is. Usually in [0,1].
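As a minimal sketch of how the two features might be consumed, the following Python models an anomaly and its suggestions with plain dataclasses. These classes are illustrative stand-ins, not the UIMA or DKPro Core API, and the begin/end offsets are assumed to behave like those of any span annotation. The helper picks the suggestion with the highest certainty and substitutes its replacement for the covered text.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SuggestedAction:
    replacement: str   # text that should replace the span covered by the anomaly
    certainty: float   # usually in [0, 1]


@dataclass
class Anomaly:
    begin: int         # character offsets of the covered text (assumed for this sketch)
    end: int
    suggestions: List[SuggestedAction] = field(default_factory=list)


def apply_best_suggestion(text: str, anomaly: Anomaly) -> str:
    """Replace the covered text with the most certain suggestion, if there is one."""
    if not anomaly.suggestions:
        return text
    best = max(anomaly.suggestions, key=lambda s: s.certainty)
    return text[:anomaly.begin] + best.replacement + text[anomaly.end:]


text = "I have a speling error."
anomaly = Anomaly(begin=9, end=16, suggestions=[
    SuggestedAction("spelling", 0.9),
    SuggestedAction("spieling", 0.2),
])
corrected = apply_best_suggestion(text, anomaly)
```

Choosing the maximum certainty is only one possible policy; a consumer could equally apply a threshold and leave the text unchanged when no suggestion is certain enough.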

Producers and consumers of SuggestedAction

Producers

Consumers

None declared

GrammarAnomaly

Producers and consumers of GrammarAnomaly

Producers

Consumers

None declared

SpellingAnomaly

Producers and consumers of SpellingAnomaly

Producers

Consumers

Coreference

Figure 1. Coreference types

This type system contains two types: CoreferenceChain and CoreferenceLink. The CoreferenceChain marks the beginning of a chain. It points to the first CoreferenceLink in the chain. Each CoreferenceLink then points to the next link.
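The chain structure described above is a singly linked list. The sketch below mirrors it with plain Python dataclasses (illustrative stand-ins, not the UIMA API; the `text` field is a hypothetical convenience standing in for the covered text) and walks a chain from its first link to its last.

```python
from dataclasses import dataclass
from typing import Iterator, Optional


@dataclass
class CoreferenceLink:
    text: str                                   # covered text, kept here for illustration
    next: Optional["CoreferenceLink"] = None    # next link in the chain, if any


@dataclass
class CoreferenceChain:
    first: Optional[CoreferenceLink] = None     # first link in the chain


def links(chain: CoreferenceChain) -> Iterator[CoreferenceLink]:
    """Walk the chain from the first link until a link has no successor."""
    link = chain.first
    while link is not None:
        yield link
        link = link.next


third = CoreferenceLink("him")
second = CoreferenceLink("he", next=third)
first = CoreferenceLink("John", next=second)
mentions = [link.text for link in links(CoreferenceChain(first=first))]
```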

CoreferenceChain

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.coref.type.CoreferenceChain
Name: de.tudarmstadt.ukp.dkpro.core.api.coref.type.CoreferenceChain
Supertype: AnnotationBase

Marks the beginning of a chain.

Features of CoreferenceChain (1)
first (CoreferenceLink)

The first coreference link in the coreference chain.

Producers and consumers of CoreferenceChain

Producers

Consumers

Tcf (format)

CoreferenceLink

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.coref.type.CoreferenceLink
Name: de.tudarmstadt.ukp.dkpro.core.api.coref.type.CoreferenceLink
Supertype: Annotation

A link in the coreference chain.

Features of CoreferenceLink (3)
next (CoreferenceLink)

The next coreference link after the current one, if there is one.

referenceType (String)

The role or type which the covered text has in the coreference chain.

referenceRelation (String)

The type of relation between this link and the next link in the chain.

Producers and consumers of CoreferenceLink

Producers

Consumers

Tcf (format)

Metadata

Recording tagset and tag descriptions in the CAS is still a feature under development. It is not supported by all components and it is not yet well defined. Expect changes and enhancements to this feature in future versions of DKPro Core.

Figure 2. Metadata types

DocumentMetaData

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.metadata.type.DocumentMetaData
Name: de.tudarmstadt.ukp.dkpro.core.api.metadata.type.DocumentMetaData
Supertype: DocumentAnnotation
similar: http://vocab.lappsgrid.org/Document (LAPPS)

The DocumentMetaData annotation stores information about a single processed document. There can only be one of these annotations per CAS. The annotation is created by readers and contains information to uniquely identify the document from which a CAS was created. Writer components use this information when determining under which filename a CAS is stored.

There are two principal ways of identifying a document:

  • collection id / document id: this simple system identifies a document within a collection. The ID of the collection and the ID of the document are each simple strings without any further semantics such as a hierarchy. For this reason, this identification scheme is not well suited to preserving information about directory structures.
  • document base URI / document URI: this system identifies a document using a URI. The base URI is used to derive the relative path of the document with respect to the base location from which it has been read. E.g. if the base URI is file:/texts and the document URI is file:/texts/english/text1.txt, then the relative path of the document is english/text1.txt. This information is used by writers to recreate the directory structure found under the base location in the target location.

It is possible and indeed common for a reader to initialize both systems of identification. If both systems are present, most writers default to using the URI-based system. However, most writers also allow forcing the use of the ID-based system.
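The derivation of the relative path, and the choice between the two identification schemes, can be sketched as follows. This is a simplified illustration in Python, not the logic of any actual writer: the function names and the `use_document_id` switch are hypothetical, and URI normalisation and escaping are ignored.

```python
from typing import Optional


def relative_path(base_uri: str, document_uri: str) -> str:
    """Derive the document path relative to the base location."""
    base = base_uri if base_uri.endswith("/") else base_uri + "/"
    if not document_uri.startswith(base):
        raise ValueError("document URI is not located under the base URI")
    return document_uri[len(base):]


def output_name(document_uri: Optional[str], base_uri: Optional[str],
                document_id: Optional[str], use_document_id: bool = False) -> str:
    """Prefer the URI-based scheme unless the ID-based one is forced or is the only one set."""
    if not use_document_id and document_uri and base_uri:
        return relative_path(base_uri, document_uri)
    if document_id:
        return document_id
    raise ValueError("no usable document identification present")


by_uri = output_name("file:/texts/english/text1.txt", "file:/texts", "text1.txt")
by_id = output_name("file:/texts/english/text1.txt", "file:/texts", "text1.txt",
                    use_document_id=True)
```

With the URI-based scheme the `english/` directory survives in the output name, whereas the ID-based scheme yields only the flat document ID.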

In addition to the features given here, there is a language feature inherited from UIMA's DocumentAnnotation. DKPro Core components expect a two-letter ISO 639-1 language code there.

Features of DocumentMetaData (6)
documentTitle (String)

The human readable title of the document.

documentId (String)

The id of the document.

documentUri (String)

The URI of the document.

collectionId (String)

The ID of the whole document collection.

documentBaseUri (String)

Base URI of the document.

isLastSegment (Boolean)

CAS de-multipliers need to know whether a CAS is the last multiplied segment. Thus CAS multipliers should set this field to true for the last CAS they produce.
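The contract behind isLastSegment can be illustrated with a small Python sketch. The `Segment` class and both functions are hypothetical stand-ins for CASes, a multiplier, and a de-multiplier; joining the segment texts with a space is an arbitrary choice made only for this example.

```python
from dataclasses import dataclass
from typing import Iterator, List


@dataclass
class Segment:
    text: str
    is_last_segment: bool   # mirrors the isLastSegment feature


def multiply(segments: List[str]) -> Iterator[Segment]:
    """Emit one Segment per input text, flagging only the final one as last."""
    for index, text in enumerate(segments):
        yield Segment(text, is_last_segment=(index == len(segments) - 1))


def demultiply(stream: Iterator[Segment]) -> List[str]:
    """Collect segments and emit a merged document each time the last-segment flag is seen."""
    merged, buffer = [], []
    for segment in stream:
        buffer.append(segment.text)
        if segment.is_last_segment:
            merged.append(" ".join(buffer))
            buffer = []
    return merged


documents = demultiply(multiply(["First part.", "Second part.", "Third part."]))
```

Without the flag, the de-multiplier in this sketch would have no way to tell where one original document ends and the next begins.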

Producers and consumers of DocumentMetaData

Producers

ApplyChangesAnnotator, AclAnthology (format), BlikiWikipedia (format), Bnc (format), Conll2000 (format), Conll2002 (format), Conll2006 (format), Conll2009 (format), Conll2012 (format), Html (format), ImsCwb (format), Jdbc (format), NegraExport (format), Pdf (format), PennTreebankChunked (format), PennTreebankCombined (format), String (format), Tcf (format), Tei (format), Text (format), WikipediaArticleInfo (format), WikipediaRevision (format), WikipediaRevisionPair (format), WikipediaTemplateFilteredArticle (format), Xmi (format), Xml (format), XmlText (format), XmlXPath (format)

Consumers

ApplyChangesAnnotator, BinaryCas (format), Conll2000 (format), Conll2002 (format), Conll2006 (format), Conll2009 (format), Conll2012 (format), ImsCwb (format), InlineXml (format), Json (format), PennTreebankCombined (format), SerializedCas (format), Tcf (format), Tei (format), Text (format), TigerXml (format), Xmi (format)

MetaDataStringField

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.metadata.type.MetaDataStringField
Name: de.tudarmstadt.ukp.dkpro.core.api.metadata.type.MetaDataStringField
Supertype: Annotation

A general purpose annotation to store document-wide information in the form of arbitrary key-value string pairs.

Features of MetaDataStringField (2)
key (String)

Name of a metadata field.

value (String)

The field value.

Producers and consumers of MetaDataStringField

Producers

None declared

Consumers

None declared

TagDescription

Description of an individual tag.

Features of TagDescription (1)
name (String)

The name of the tag.

Producers and consumers of TagDescription

Producers

None declared

Consumers

None declared

TagsetDescription

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.metadata.type.TagsetDescription
Name: de.tudarmstadt.ukp.dkpro.core.api.metadata.type.TagsetDescription
Supertype: Annotation

Information about a tagset (controlled vocabulary).

Features of TagsetDescription (3)
layer (String)

The layer to which the tagset applies. This is typically the name of a UIMA type such as "de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.pos.POS".

name (String)

The name of the tagset.

tags (FSArray of TagDescription)

Descriptions of the tags belonging to this tagset.

Producers and consumers of TagsetDescription

Producers

None declared

Consumers

None declared

Morphology

Figure 3. Morphology types

Morpheme

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.morph.Morpheme
Name: de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.morph.Morpheme
Supertype: Annotation

Features of Morpheme (1)
morphTag (String)

No description

Producers and consumers of Morpheme

Producers

Consumers

None declared

MorphologicalFeatures

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.morph.MorphologicalFeatures
Name: de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.morph.MorphologicalFeatures
Supertype: Annotation

Morphological categories that can be attached to tokens.

Features of MorphologicalFeatures (18)
gender (String)

No description

number (String)

Singular/plural

case (String)

Nouns: nominative, genitive, dative, …

degree (String)

Adjectives: comparative/superlative

verbForm (String)

No description

tense (String)

Verbs: past tense, present tense, future tense, etc.

mood (String)

Verbs: indicative, imperative, subjunctive

voice (String)

Verbs: active/passive

definiteness (String)

Definite or indefinite

value (String)

The original morphological analysis results as produced by a tool or as recorded in a corpus (if available). If the categories were originally encoded in such a string, the other features are filled by analyzing this string. If the categories were provided separately, e.g. by different attributes in an XML-encoded corpus, this field may remain empty.

person (String)

Verbs: 1st, 2nd, 3rd person

aspect (String)

Verbs: perfective, imperfective

animacy (String)

No description

negative (String)

No description

numType (String)

No description

possessive (String)

No description

pronType (String)

No description

reflex (String)

No description
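The description of the value feature says that the other features are filled by analyzing the original analysis string. A minimal sketch of that idea in Python is given below. The `key=value|key=value` input format is purely an assumption made for this example (tools and corpora use many different encodings), and the dictionary is an illustrative stand-in for the annotation.

```python
from typing import Dict

# Feature names taken from the list above; the "key=value|key=value" input
# format is an assumption of this sketch, not something the type system prescribes.
KNOWN_FEATURES = {
    "gender", "number", "case", "degree", "verbForm", "tense", "mood", "voice",
    "definiteness", "person", "aspect", "animacy", "negative", "numType",
    "possessive", "pronType", "reflex",
}


def parse_morph(value: str) -> Dict[str, str]:
    """Split an analysis string into known features, keeping the original in 'value'."""
    features = {"value": value}
    for pair in value.split("|"):
        key, sep, val = pair.partition("=")
        if sep and key in KNOWN_FEATURES:
            features[key] = val
    return features


morph = parse_morph("case=nom|number=sg|gender=masc|foo=bar")
```

Categories that do not correspond to one of the listed features (here the made-up `foo`) are dropped from the individual features but remain recoverable from the untouched value string.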

Producers and consumers of MorphologicalFeatures

Producers

Consumers

Conll2006 (format), Conll2009 (format)

POS

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.pos.POS
Name: de.tudarmstadt.ukp.dkpro.core.api.lexmorph.type.pos.POS
Supertype: Annotation

The part of speech of a word or a phrase.

Features of POS (1)
PosValue (String)

The string representation of the part of speech. In most cases this is the tag as produced by the wrapped POS tagger.

Producers and consumers of POS

Producers

Consumers

Sub-types of POS (12)
Type Description

ADJ

Adjectives

ADV

Adverbs

ART

Determiners and articles

CARD

Numerals

CONJ

Conjunctions

N

Nouns

O

Catch-all for other categories such as abbreviations or foreign words

PP

Prepositions and postpositions

PR

Pronoun

PRT

Particles

PUNC

Punctuation marks

V

Verbs

ADJ

Adjectives

Producers and consumers of ADJ

Producers

None declared

Consumers

None declared

ADV

Adverbs

Producers and consumers of ADV

Producers

None declared

Consumers

None declared

ART

Determiners and articles

Producers and consumers of ART

Producers

None declared

Consumers

None declared

AT

at-mention (indicates another user as a recipient of a tweet)

Producers and consumers of AT

Producers

None declared

Consumers

None declared

CARD

Numerals

Producers and consumers of CARD

Producers

None declared

Consumers

None declared

CONJ

Conjunctions

Producers and consumers of CONJ

Producers

None declared

Consumers

None declared

DM

discourse marker, indications of continuation of a message across multiple tweets

Producers and consumers of DM

Producers

None declared

Consumers

None declared

EMO

emoticon

Producers and consumers of EMO

Producers

None declared

Consumers

None declared

HASH

Hashtag (indicates topic/category for tweet)

Producers and consumers of HASH

Producers

None declared

Consumers

None declared

INT

proper noun + verbal

Producers and consumers of INT

Producers

None declared

Consumers

None declared

N

Nouns

Producers and consumers of N

Producers

None declared

Consumers

None declared
Sub-types of N (4)
Type Description

NN

Common noun

NNV

nominal + verbal

NP

Noun phrase

NPV

proper noun + verbal

NN

Common noun

Producers and consumers of NN

Producers

None declared

Consumers

None declared

NNV

nominal + verbal

Producers and consumers of NNV

Producers

None declared

Consumers

None declared

NP

Noun phrase

Producers and consumers of NP

Producers

None declared

Consumers

None declared

NPV

proper noun + verbal

Producers and consumers of NPV

Producers

None declared

Consumers

None declared

O

Catch-all for other categories such as abbreviations or foreign words

Producers and consumers of O

Producers

None declared

Consumers

None declared
Sub-types of O (6)
Type Description

AT

at-mention (indicates another user as a recipient of a tweet)

DM

discourse marker, indications of continuation of a message across multiple tweets

EMO

emoticon

HASH

Hashtag (indicates topic/category for tweet)

INT

proper noun + verbal

URL

URL or email address

PP

Prepositions and postpositions

Producers and consumers of PP

Producers

None declared

Consumers

None declared

PR

Pronoun

Producers and consumers of PR

Producers

None declared

Consumers

None declared

PRT

Particles

Producers and consumers of PRT

Producers

None declared

Consumers

None declared

PUNC

Punctuation marks

Producers and consumers of PUNC

Producers

None declared

Consumers

None declared

URL

URL or email address

Producers and consumers of URL

Producers

None declared

Consumers

None declared

V

Verbs

Producers and consumers of V

Producers

None declared

Consumers

None declared

Phonetics

Figure 4. Phonetics types

PhoneticTranscription

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.phonetics.type.PhoneticTranscription
Name: de.tudarmstadt.ukp.dkpro.core.api.phonetics.type.PhoneticTranscription
Supertype: Annotation

Represents the phonetic transcription of some textual element (usually a Token). Phonetic transcriptions are e.g. generated by transcription processes like Soundex or Metaphone.

Features of PhoneticTranscription (2)
transcription (String)

The actual transcription

name (String)

The name of the transcription process that was used
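To make the two features concrete, here is a compact Python sketch of American Soundex, one of the transcription processes mentioned above. It is an independent illustration of the algorithm, not the implementation used by any DKPro Core component, and the resulting dictionary is only a stand-in for a PhoneticTranscription annotation.

```python
# Consonant groups of American Soundex; vowels, h, w and y carry no code.
CODES = {}
for digit, group in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                     "4": "l", "5": "mn", "6": "r"}.items():
    for char in group:
        CODES[char] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code of a word containing at least one letter."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        raise ValueError("word must contain at least one letter")
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for letter in letters[1:]:
        code = CODES.get(letter, "")
        if code and code != previous:
            result += code
        if letter not in "hw":      # h and w do not separate equal codes
            previous = code
    return (result + "000")[:4]


transcription = {"name": "Soundex", "transcription": soundex("Robert")}
```

Storing the process name next to the transcription lets a consumer distinguish, say, Soundex codes from Metaphone codes attached to the same token.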

Producers and consumers of PhoneticTranscription

Producers

Consumers

None declared

ReadabilityScore

ReadabilityScore

Features of ReadabilityScore (2)
measureName (String)

No description

score (Double)

No description

Producers and consumers of ReadabilityScore

Producers

None declared

Consumers

None declared

Segmentation

Figure 5. Segmentation types

The segmentation type system consists of three primary areas: tokenization (including sentences), compound words, and document structure.

The Sentence annotation type is simply a span with no further attributes.

The Token type may be explicitly linked to a part of speech, a lemma, and a stem. If any of these annotations is present, the token is expected to refer to it explicitly. If more than one annotation of such a type is present, e.g. multiple part-of-speech annotations, the token is expected to link to the most probable one, while the others are merely located at the same offsets.

Additionally, the Token can link into the syntactic constituency structure via the parent feature.
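The linking convention for tokens can be sketched as follows. The Python classes are illustrative stand-ins for the annotation types, and the `probability` field is a hypothetical helper that exists only so this example can rank candidates; the POS type itself has no such feature.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class POS:
    begin: int
    end: int
    pos_value: str
    probability: float   # hypothetical helper field used only by this sketch


@dataclass
class Token:
    begin: int
    end: int
    pos: Optional[POS] = None


def link_most_probable(token: Token, candidates: List[POS]) -> None:
    """Point token.pos at the most probable candidate located at the token's offsets."""
    same_span = [p for p in candidates if (p.begin, p.end) == (token.begin, token.end)]
    if same_span:
        token.pos = max(same_span, key=lambda p: p.probability)


token = Token(0, 5)
candidates = [POS(0, 5, "NN", 0.7), POS(0, 5, "VB", 0.3), POS(6, 9, "DT", 0.99)]
link_most_probable(token, candidates)
```

The less probable `VB` candidate stays available at the same offsets, but a consumer that follows the explicit link sees only `NN`.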

The document structure can be encoded using the Div types. The type Div itself is a generic type representing some element of the document structure more closely specified by the divType attribute. The value of divType corresponds to the tag used in some original document format or to the output of a text segmentation tool. E.g. when reading an HTML document, the divType for a paragraph would be p, whereas in a DocBook XML file, it would instead be para.

For typical structural elements, the subtypes Document, Heading, and Paragraph are available. Document is rarely used, since the basic assumption is that a CAS always represents a document.
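How a reader might combine the generic divType with the more specific subtypes can be sketched like this. The Python classes are stand-ins for the annotation types, and the tag-to-subtype table is hypothetical; which tags map to which subtype is decided by each reader for its format.

```python
from dataclasses import dataclass


@dataclass
class Div:
    begin: int
    end: int
    div_type: str     # tag name from the source format, e.g. "p" or "para"


class Paragraph(Div):
    pass


class Heading(Div):
    pass


# Hypothetical tag-to-subtype table; a real reader decides this per format.
SUBTYPES = {"p": Paragraph, "para": Paragraph, "h1": Heading, "title": Heading}


def make_div(begin: int, end: int, tag: str) -> Div:
    """Create the most specific Div subtype known for a tag, keeping the tag in div_type."""
    return SUBTYPES.get(tag, Div)(begin, end, tag)


html_par = make_div(0, 42, "p")
docbook_par = make_div(0, 42, "para")
other = make_div(0, 42, "blockquote")
```

Both paragraphs end up with the same subtype while still recording the original tag, so a writer for the source format could restore `p` or `para` as appropriate.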

Compound

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Compound
Name: de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Compound
Supertype: Annotation

This type represents a decompounding word, e.g. flowerpot. Each Compound has at least two Splits.

Features of Compound (1)
splits (FSArray of Split)

A word that can be decomposed into different parts.

Producers and consumers of Compound

Producers

Consumers

None declared

Div

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Div
Name: de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Div
Supertype: Annotation

Document structure element.

Features of Div (2)
divType (String)

No description

id (String)

If this unit had an ID in the source format from which it was imported, it may be stored here. IDs are typically not assigned by DKPro Core components. If an ID is present, it should be respected by writers.

Producers and consumers of Div

Producers

None declared

Consumers

None declared
Sub-types of Div (3)
Type Description

Document

No description

Heading

Document title, section heading, etc.

Paragraph

No description

JapaneseToken

Features of JapaneseToken (4)
kana (String)

No description

ibo (String)

No description

kei (String)

No description

dan (String)

Specifies the kind of the verb if the current token is a verb. Either it is a vowel stem verb (ichi-dan) or a consonant stem verb (go-dan). Blank if not a verb.

Producers and consumers of JapaneseToken

Producers

Consumers

None declared

NGram

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.NGram
Name: de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.NGram
Supertype: Annotation

Features of NGram (1)
text (String)

No description

Producers and consumers of NGram

Producers

Consumers

None declared

Sentence

Features of Sentence (1)
id (String)

If this unit had an ID in the source format from which it was imported, it may be stored here. IDs are typically not assigned by DKPro Core components. If an ID is present, it should be respected by writers.

Producers and consumers of Sentence

Producers

Consumers

Split

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Split
Name: de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Split
Supertype: Annotation

This type represents a part of a decompounding word. A Split can be either a CompoundPart or a LinkingMorpheme.

Features of Split (1)
splits (FSArray of Split)

Sub-splits of the current split.

Producers and consumers of Split

Producers

Consumers

None declared
Sub-types of Split (2)
Type Description

CompoundPart

A CompoundPart represents one fragment from the compounding word.

LinkingMorpheme

This type represents a linking morpheme between two CompoundParts.

Stem

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Stem
Name: de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Stem
Supertype: Annotation

Features of Stem (1)
value (String)

No description

Producers and consumers of Stem

Producers

Consumers

None declared

StopWord

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.StopWord
Name: de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.StopWord
Supertype: Annotation

Producers and consumers of StopWord

Producers

None declared

Consumers

Token

Token is one of the two types commonly produced by a segmenter (the other being Sentence). A Token usually represents a word, although it may be used to represent multiple tightly connected words (e.g. "New York") or parts of a word (e.g. the possessive "'s"). One may choose to split compound words into multiple tokens, e.g. ("CamelCase" -> "Camel", "Case"; "Zauberstab" -> "Zauber", "stab"). Most processing components operate on Tokens, usually within the limits of the surrounding Sentence. E.g. a part-of-speech tagger analyses each Token in a Sentence and assigns a part-of-speech to each Token.
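The CamelCase example can be reproduced with a few lines of Python. This is only a sketch of one possible splitting rule (a regular expression over ASCII letters), not the behaviour of any particular segmenter; the tuples stand in for Token annotations with begin and end offsets.

```python
import re
from typing import List, Tuple


def split_camel_case(text: str, begin: int = 0) -> List[Tuple[int, int, str]]:
    """Return (begin, end, text) spans, one per capitalised or lowercase run of letters."""
    return [(begin + m.start(), begin + m.end(), m.group())
            for m in re.finditer(r"[A-Z][a-z]+|[a-z]+", text)]


tokens = split_camel_case("CamelCase")
```

Splitting "Zauberstab" into "Zauber" and "stab" cannot be done with a surface pattern like this one; it requires lexical knowledge about the parts of the compound.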

Features of Token (6)
parent (Annotation)

The parent of this token. This feature is meant to be used when the token participates in a constituency parse; it then refers to a constituent containing this token. The type of this feature is Annotation to avoid adding a dependency on the syntax API module.

lemma (Lemma)

No description

stem (Stem)

No description

pos (POS)

No description

morph (MorphologicalFeatures)

The morphological feature associated with this token.

id (String)

If this unit had an ID in the source format from which it was imported, it may be stored here. IDs are typically not assigned by DKPro Core components. If an ID is present, it should be respected by writers.

Producers and consumers of Token

Producers

Consumers

Sub-types of Token (1)
Type Description

JapaneseToken

No description

CompoundPart

A CompoundPart represents one fragment from the compounding word. Besides that, it can store other CompoundParts if it can be split again. The way it stores a decompounding word represents a decompounding tree.
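The decompounding tree can be pictured with a short Python sketch. The classes are plain stand-ins for the Compound, Split, CompoundPart, and LinkingMorpheme types, and the `text` field is a hypothetical convenience replacing the covered text. The German example word is chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Split:
    text: str
    splits: List["Split"] = field(default_factory=list)   # sub-splits, empty for a leaf


class CompoundPart(Split):
    pass


class LinkingMorpheme(Split):
    pass


@dataclass
class Compound:
    text: str
    splits: List[Split] = field(default_factory=list)     # at least two for a real compound


def leaves(node) -> List[str]:
    """Collect the texts of the smallest fragments below a Compound or Split, left to right."""
    if not node.splits:
        return [node.text]
    return [text for child in node.splits for text in leaves(child)]


compound = Compound("Donaudampfschiff", [
    CompoundPart("Donau"),
    CompoundPart("dampfschiff", [CompoundPart("dampf"), CompoundPart("schiff")]),
])
fragments = leaves(compound)
```

Because "dampfschiff" is itself a CompoundPart with two sub-splits, the structure forms a tree of depth two below the Compound.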

Producers and consumers of CompoundPart

Producers

Consumers

None declared

Document

Producers and consumers of Document

Producers

None declared

Consumers

None declared

Heading

Document title, section heading, etc.

Producers and consumers of Heading

Producers

None declared

Consumers

None declared

LinkingMorpheme

This type represents a linking morpheme between two CompoundParts.

Producers and consumers of LinkingMorpheme

Producers

Consumers

None declared

Paragraph

Producers and consumers of Paragraph

Producers

Consumers

Tei (format)

Semantics

Figure 6. Semantics types

NamedEntity

Named entities refer e.g. to persons, locations, organizations and so on. They often consist of multiple tokens.

Features of NamedEntity (1)
value (String)

No description

Producers and consumers of NamedEntity

Producers

Consumers

StanfordCoreferenceResolver, Conll2002 (format), Tcf (format), Tei (format)
Sub-types of NamedEntity (30)
Type Description

Animal

No description

Cardinal

No description

ContactInfo

No description

Date

No description

Disease

No description

Event

No description

Fac

No description

FacDesc

No description

Game

No description

Gpe

No description

GpeDesc

No description

Language

No description

Law

No description

Location

No description

Money

No description

Nationality

No description

Norp

No description

Ordinal

No description

OrgDesc

No description

Organization

No description

PerDesc

No description

Percent

No description

Person

No description

Plant

No description

Product

No description

ProductDesc

No description

Quantity

No description

Substance

No description

Time

No description

WorkOfArt

No description

SemanticArgument

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.semantics.type.SemanticArgument
Name: de.tudarmstadt.ukp.dkpro.core.api.semantics.type.SemanticArgument
Supertype: Annotation

The SemanticArgument annotation is attached to semantic arguments of semantic predicates. Semantic arguments are characterized by their semantic role, e.g. Agent, Experiencer, Topic. The semantic role of an argument is related to its semantic type (for communication verbs, the Agent can be a person or an organization, but typically not food). The semantic type of arguments is not yet covered by the SemanticType.

Features of SemanticArgument (1)
role (String)

The role which the argument takes. The value depends on the theory being used, e.g. Arg0, Arg1, etc. or Buyer, Seller, etc.

Producers and consumers of SemanticArgument

Producers

Consumers

Conll2009 (format), Conll2012 (format)

SemanticField

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.semantics.type.SemanticField
Name: de.tudarmstadt.ukp.dkpro.core.api.semantics.type.SemanticField
Supertype: Annotation

The SemanticField is a coarse-grained semantic category that can be attached to nouns, verbs or adjectives. Semantic field information is present e.g. in WordNet as lexicographer file names. Previously, this kind of semantic information has also been called supersenses or semantic types.

Features of SemanticField (1)
value (String)

The value or name of the semantic field. Examples of semantic field values are: location, artifact, event, communication, attribute

Producers and consumers of SemanticField

Producers

None declared

Consumers

None declared

SemanticPredicate

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.semantics.type.SemanticPredicate
Name: de.tudarmstadt.ukp.dkpro.core.api.semantics.type.SemanticPredicate
Supertype: Annotation

One of the predicates of a sentence (often a main verb, but nouns and adjectives can also be predicates). The SemanticPredicate annotation can be attached to predicates in a sentence. Semantic predicates express events or situations and take semantic arguments expressing the participants in these events or situations. All forms of main verbs can be annotated with a SemanticPredicate. However, there are also many nouns and adjectives that take arguments and can thus be annotated with a SemanticPredicate, e.g. event nouns, such as "suggestion" (with arguments what and by whom), or relational adjectives, such as "proud" (with arguments who and of what).

Features of SemanticPredicate (2)
category (String)

A more detailed specification of the predicate type depending on the theory being used, e.g. a frame name.

arguments (FSArray of SemanticArgument)

The predicate’s arguments.
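The relation between a predicate and its arguments can be sketched as below. The Python classes are illustrative stand-ins for the two annotation types; the example sentence, the frame-style category name, and the role labels are chosen for illustration only and depend on the theory being used.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SemanticArgument:
    begin: int
    end: int
    role: str            # e.g. "Arg0" or "Seller", depending on the theory


@dataclass
class SemanticPredicate:
    begin: int
    end: int
    category: str        # e.g. a frame name
    arguments: List[SemanticArgument] = field(default_factory=list)


def roles(text: str, predicate: SemanticPredicate) -> Dict[str, str]:
    """Map each argument role of the predicate to the text it covers."""
    return {arg.role: text[arg.begin:arg.end] for arg in predicate.arguments}


sentence = "Mary sold the car."
sold = SemanticPredicate(5, 9, "Commerce_sell", [
    SemanticArgument(0, 4, "Seller"),
    SemanticArgument(10, 17, "Goods"),
])
filled_roles = roles(sentence, sold)
```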

Producers and consumers of SemanticPredicate

Producers

Consumers

Conll2009 (format), Conll2012 (format)

StanfordSentimentAnnotation

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.sentiment.type.StanfordSentimentAnnotation
Name: de.tudarmstadt.ukp.dkpro.core.sentiment.type.StanfordSentimentAnnotation
Supertype: Annotation

Stanford CoreNLP Sentiment annotation

Features of StanfordSentimentAnnotation (5)
veryNegative (Double)

Value of veryNegative

negative (Double)

Value of negative

neutral (Double)

Value of neutral

positive (Double)

Value of positive

veryPositive (Double)

Value of veryPositive
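A consumer will often want a single label rather than five numbers. The Python sketch below picks the feature with the largest value. It assumes that the five values are comparable scores for the five classes, which the feature descriptions above do not spell out, and it uses a plain dictionary as a stand-in for the annotation.

```python
from typing import Dict

SENTIMENT_FEATURES = ("veryNegative", "negative", "neutral", "positive", "veryPositive")


def dominant_sentiment(scores: Dict[str, float]) -> str:
    """Return the name of the feature holding the largest value."""
    missing = [name for name in SENTIMENT_FEATURES if name not in scores]
    if missing:
        raise ValueError(f"missing features: {missing}")
    return max(SENTIMENT_FEATURES, key=lambda name: scores[name])


scores = {"veryNegative": 0.05, "negative": 0.10, "neutral": 0.20,
          "positive": 0.50, "veryPositive": 0.15}
label = dominant_sentiment(scores)
```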

Producers and consumers of StanfordSentimentAnnotation

Producers

None declared

Consumers

None declared

WordSense

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.semantics.type.WordSense
Name: de.tudarmstadt.ukp.dkpro.core.api.semantics.type.WordSense
Supertype: Annotation

Features of WordSense (1)
value (String)

The sense identifier.

Producers and consumers of WordSense

Producers

None declared

Consumers

None declared

Animal

Producers and consumers of Animal

Producers

None declared

Consumers

None declared

Cardinal

Producers and consumers of Cardinal

Producers

None declared

Consumers

None declared

ContactInfo

Producers and consumers of ContactInfo

Producers

None declared

Consumers

None declared

Date

Producers and consumers of Date

Producers

None declared

Consumers

None declared

Disease

Producers and consumers of Disease

Producers

None declared

Consumers

None declared

Event

Producers and consumers of Event

Producers

None declared

Consumers

None declared

Fac

Producers and consumers of Fac

Producers

None declared

Consumers

None declared

FacDesc

Producers and consumers of FacDesc

Producers

None declared

Consumers

None declared

Game

Producers and consumers of Game

Producers

None declared

Consumers

None declared

Gpe

Producers and consumers of Gpe

Producers

None declared

Consumers

None declared

GpeDesc

Producers and consumers of GpeDesc

Producers

None declared

Consumers

None declared

Language

Producers and consumers of Language

Producers

None declared

Consumers

None declared

Law

Producers and consumers of Law

Producers

None declared

Consumers

None declared

Location

Producers and consumers of Location

Producers

None declared

Consumers

None declared

Money

Producers and consumers of Money

Producers

None declared

Consumers

None declared

Nationality

Producers and consumers of Nationality

Producers

None declared

Consumers

None declared

Norp

Producers and consumers of Norp

Producers

None declared

Consumers

None declared

Ordinal

Producers and consumers of Ordinal

Producers

None declared

Consumers

None declared

OrgDesc

Producers and consumers of OrgDesc

Producers

None declared

Consumers

None declared

Organization

Producers and consumers of Organization

Producers

None declared

Consumers

None declared

PerDesc

Producers and consumers of PerDesc

Producers

None declared

Consumers

None declared

Percent

Producers and consumers of Percent

Producers

None declared

Consumers

None declared

Person

Producers and consumers of Person

Producers

None declared

Consumers

None declared

Plant

Producers and consumers of Plant

Producers

None declared

Consumers

None declared

Product

Producers and consumers of Product

Producers

None declared

Consumers

None declared

ProductDesc

Producers and consumers of ProductDesc

Producers

None declared

Consumers

None declared

Quantity

Producers and consumers of Quantity

Producers

None declared

Consumers

None declared

Substance

Producers and consumers of Substance

Producers

None declared

Consumers

None declared

Time

Producers and consumers of Time

Producers

None declared

Consumers

None declared

WorkOfArt

Producers and consumers of WorkOfArt

Producers

None declared

Consumers

None declared

Structure

Field

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.structure.type.Field
Name: de.tudarmstadt.ukp.dkpro.core.api.structure.type.Field
Supertype: Annotation

Features of Field (1)
name (String)

The name of the tag.

Producers and consumers of Field

Producers

Xml (format), XmlXPath (format)

Consumers

None declared

Syntax

Figure 7. Syntax types

Chunk

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.syntax.type.chunk.Chunk
Name: de.tudarmstadt.ukp.dkpro.core.api.syntax.type.chunk.Chunk
Supertype: Annotation

Features of Chunk (1)
chunkValue (String)

No description

Producers and consumers of Chunk

Producers

Consumers

Conll2000 (format)
Sub-types of Chunk (10)
Type Description

ADJC

adjective chunks

ADVC

adverb chunks

CONCJ

complex coordinating conjunctions such as "as well (as)" or "rather (than)"

INTJ

interjection

LST

enumeration symbol

NC

noun chunk (non-recursive noun phrase)

O

other or outside a chunk

PC

prepositional chunk

PRT

verb particle

VC

verb complex
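
Chunk annotations are typically read from per-token IOB tags, as in the CoNLL-2000 format mentioned above. The following is a minimal sketch of grouping such tags into chunk spans. The function name `iob_to_chunks` and the tag strings are assumptions for illustration, and this is not the code of the Conll2000 format component. Tokens tagged `O` are simply skipped here, although the type system also offers an O chunk type for them.

```python
def iob_to_chunks(tokens, tags):
    """Group (token, IOB tag) pairs into (chunk_value, [tokens]) spans."""
    chunks = []
    current = None
    for token, tag in zip(tokens, tags):
        if tag == "O":
            # Outside any chunk: close the current span, if there is one.
            current = None
            continue
        prefix, value = tag.split("-", 1)
        if prefix == "B" or current is None or current[0] != value:
            # A B- tag (or a stray I- tag) opens a new chunk.
            current = (value, [token])
            chunks.append(current)
        else:
            # An I- tag with the same value continues the open chunk.
            current[1].append(token)
    return chunks


tokens = ["The", "dog", "barks", "loudly", "."]
tags = ["B-NP", "I-NP", "B-VP", "B-ADVP", "O"]
chunks = iob_to_chunks(tokens, tags)
```

Each resulting pair corresponds to one Chunk annotation, with the first element playing the role of chunkValue.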

Constituent

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.syntax.type.constituent.Constituent
Name: de.tudarmstadt.ukp.dkpro.core.api.syntax.type.constituent.Constituent
Supertype: Annotation
similar: http://vocab.lappsgrid.org/Constituent (LAPPS)

Features of Constituent (4)
constituentType (String)

No description

parent (Annotation)

The parent constituent

children (FSArray of Annotation)

No description

syntacticFunction (String)

No description
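
The parent and children features link constituents into a tree. The sketch below illustrates that linkage with a hypothetical plain `Constituent` dataclass. It is a stand-in for illustration, not the UIMA/JCas class, and the helper names `add_child` and `path_to_root` are assumptions.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Constituent:
    """Hypothetical stand-in for a Constituent annotation (not the UIMA class)."""
    begin: int
    end: int
    constituent_type: str
    # Excluded from repr/eq to avoid infinite recursion through the back-link.
    parent: Optional["Constituent"] = field(default=None, repr=False, compare=False)
    children: List["Constituent"] = field(default_factory=list)

    def add_child(self, child: "Constituent") -> None:
        # Keep both features consistent: the child points back to its parent.
        child.parent = self
        self.children.append(child)


def path_to_root(node: Constituent) -> List[str]:
    """Follow the parent feature upwards and collect the constituent types."""
    path = []
    current: Optional[Constituent] = node
    while current is not None:
        path.append(current.constituent_type)
        current = current.parent
    return path


# "The dog barks": S -> NP VP
s = Constituent(0, 13, "S")
np = Constituent(0, 7, "NP")
vp = Constituent(8, 13, "VP")
s.add_child(np)
s.add_child(vp)
```

Calling `path_to_root(vp)` walks from the VP node up to the sentence node via the parent links.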

Producers and consumers of Constituent

Producers

Consumers

Sub-types of Constituent (28)
Type Description

ADJP

No description

ADVP

No description

CONJP

No description

FRAG

No description

INTJ

No description

LST

No description

NAC

No description

NP

No description

NX

No description

PP

No description

PRN

No description

PRP

No description

PRT

No description

QP

No description

ROOT

No description

RRC

No description

S

No description

SBAR

No description

SBARQ

No description

SINV

No description

SQ

No description

UCP

No description

VP

No description

WHADJP

No description

WHADVP

No description

WHNP

No description

WHPP

No description

X

No description

Dependency

A dependency relation between two tokens. The dependency annotation begin and end offsets correspond to those of the dependent.

Features of Dependency (4)
Governor (Token)

The governor word

Dependent (Token)

The dependent word

DependencyType (String)

The dependency type

flavor (String)

Flavor of the dependency relation (basic, collapsed, enhanced, etc.)
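
As stated in the description, a Dependency annotation takes its begin and end offsets from the dependent token. A minimal sketch of that convention follows, using hypothetical plain dataclasses instead of the UIMA types. The helper `make_dependency` and the lower-case field names are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    begin: int
    end: int
    text: str


@dataclass(frozen=True)
class Dependency:
    begin: int
    end: int
    governor: Token
    dependent: Token
    dependency_type: str
    flavor: str


def make_dependency(governor, dependent, dependency_type, flavor="basic"):
    # The annotation span is copied from the dependent, not from the governor.
    return Dependency(dependent.begin, dependent.end,
                      governor, dependent, dependency_type, flavor)


# "The dog barks": "dog" is the subject of "barks".
dog = Token(4, 7, "dog")
barks = Token(8, 13, "barks")
nsubj = make_dependency(governor=barks, dependent=dog, dependency_type="nsubj")
```

One consequence of this convention is that selecting the dependencies covered by a token yields the relations in which that token is the dependent.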

Producers and consumers of Dependency

Producers

Consumers

Sub-types of Dependency (57)
Type Description

ABBREV

No description

ACOMP

No description

ADVCL

No description

ADVMOD

No description

AGENT

No description

AMOD

No description

APPOS

No description

ATTR

No description

AUX0

No description

AUXPASS

No description

CC

No description

CCOMP

No description

COMPLM

No description

CONJ

No description

CONJP

No description

CONJ_YET

No description

COP

No description

CSUBJ

No description

CSUBJPASS

No description

DEP

No description

DET

No description

DOBJ

No description

EXPL

No description

INFMOD

No description

IOBJ

No description

MARK

No description

MEASURE

No description

MWE

No description

NEG

No description

NN

No description

NPADVMOD

No description

NSUBJ

No description

NSUBJPASS

No description

NUM

No description

NUMBER

No description

PARATAXIS

No description

PARTMOD

No description

PCOMP

No description

POBJ

No description

POSS

No description

POSSESSIVE

No description

PRECONJ

No description

PRED

No description

PREDET

No description

PREP

No description

PREPC

No description

PRT

No description

PUNCT

No description

PURPCL

No description

QUANTMOD

No description

RCMOD

No description

REF

No description

REL

No description

ROOT

Dependency tree root.

TMOD

No description

XCOMP

No description

XSUBJ

No description

PennTree

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.syntax.type.PennTree
Name: de.tudarmstadt.ukp.dkpro.core.api.syntax.type.PennTree
Supertype: Annotation

The Penn Treebank-style phrase structure string.

Features of PennTree (2)
PennTree (String)

Contains a Penn Treebank-style representation of a tree.

TransformationNames (String)

The name(s) of the transformation(s) that have been performed on the PennTree
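
To illustrate what a Penn Treebank-style phrase structure string looks like, the sketch below renders a small tree as a bracketed string. The tuple-based tree representation and the function `to_penn_string` are assumptions made for this example. The exact whitespace and root conventions used by any particular DKPro Core component are not specified here.

```python
def to_penn_string(node):
    """Render a (label, content) tuple tree as a bracketed string.

    A leaf is a (pos_tag, word) pair whose second element is a str;
    an inner node is a (label, [children]) pair.
    """
    label, content = node
    if isinstance(content, str):
        return f"({label} {content})"
    rendered_children = " ".join(to_penn_string(child) for child in content)
    return f"({label} {rendered_children})"


tree = ("S", [
    ("NP", [("DT", "The"), ("NN", "dog")]),
    ("VP", [("VBZ", "barks")]),
])
penn = to_penn_string(tree)
# penn == "(S (NP (DT The) (NN dog)) (VP (VBZ barks)))"
```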

Producers and consumers of PennTree

Producers

Consumers

None declared

ABBREV

Producers and consumers of ABBREV

Producers

None declared

Consumers

None declared

ACOMP

Producers and consumers of ACOMP

Producers

None declared

Consumers

None declared

ADJC

adjective chunks

Producers and consumers of ADJC

Producers

None declared

Consumers

None declared

ADJP

Producers and consumers of ADJP

Producers

None declared

Consumers

None declared

ADVC

adverb chunks

Producers and consumers of ADVC

Producers

None declared

Consumers

None declared

ADVCL

Producers and consumers of ADVCL

Producers

None declared

Consumers

None declared

ADVMOD

Producers and consumers of ADVMOD

Producers

None declared

Consumers

None declared

ADVP

Producers and consumers of ADVP

Producers

None declared

Consumers

None declared

AGENT

Producers and consumers of AGENT

Producers

None declared

Consumers

None declared

AMOD

Producers and consumers of AMOD

Producers

None declared

Consumers

None declared

APPOS

Producers and consumers of APPOS

Producers

None declared

Consumers

None declared

ATTR

Producers and consumers of ATTR

Producers

None declared

Consumers

None declared

AUX0

Producers and consumers of AUX0

Producers

None declared

Consumers

None declared

AUXPASS

Producers and consumers of AUXPASS

Producers

None declared

Consumers

None declared

CC

Producers and consumers of CC

Producers

None declared

Consumers

None declared

CCOMP

Producers and consumers of CCOMP

Producers

None declared

Consumers

None declared

COMPLM

Producers and consumers of COMPLM

Producers

None declared

Consumers

None declared

CONCJ

complex coordinating conjunctions such as "as well (as)" or "rather (than)"

Producers and consumers of CONCJ

Producers

None declared

Consumers

None declared

CONJ

Producers and consumers of CONJ

Producers

None declared

Consumers

None declared

CONJP

Producers and consumers of CONJP

Producers

None declared

Consumers

None declared

CONJP

Producers and consumers of CONJP

Producers

None declared

Consumers

None declared

CONJ_YET

Producers and consumers of CONJ_YET

Producers

None declared

Consumers

None declared

COP

Producers and consumers of COP

Producers

None declared

Consumers

None declared

CSUBJ

Producers and consumers of CSUBJ

Producers

None declared

Consumers

None declared

CSUBJPASS

Producers and consumers of CSUBJPASS

Producers

None declared

Consumers

None declared

DEP

Producers and consumers of DEP

Producers

None declared

Consumers

None declared

DET

Producers and consumers of DET

Producers

None declared

Consumers

None declared

DOBJ

Producers and consumers of DOBJ

Producers

None declared

Consumers

None declared

EXPL

Producers and consumers of EXPL

Producers

None declared

Consumers

None declared

FRAG

Producers and consumers of FRAG

Producers

None declared

Consumers

None declared

INFMOD

Producers and consumers of INFMOD

Producers

None declared

Consumers

None declared

INTJ

Producers and consumers of INTJ

Producers

None declared

Consumers

None declared

INTJ

interjection

Producers and consumers of INTJ

Producers

None declared

Consumers

None declared

IOBJ

Producers and consumers of IOBJ

Producers

None declared

Consumers

None declared

LST

Producers and consumers of LST

Producers

None declared

Consumers

None declared

LST

enumeration symbol

Producers and consumers of LST

Producers

None declared

Consumers

None declared

MARK

Producers and consumers of MARK

Producers

None declared

Consumers

None declared

MEASURE

Producers and consumers of MEASURE

Producers

None declared

Consumers

None declared

MWE

Producers and consumers of MWE

Producers

None declared

Consumers

None declared

NAC

Producers and consumers of NAC

Producers

None declared

Consumers

None declared

NC

noun chunk (non-recursive noun phrase)

Producers and consumers of NC

Producers

None declared

Consumers

None declared

NEG

Producers and consumers of NEG

Producers

None declared

Consumers

None declared

NN

Producers and consumers of NN

Producers

None declared

Consumers

None declared

NP

Producers and consumers of NP

Producers

None declared

Consumers

None declared

NPADVMOD

Producers and consumers of NPADVMOD

Producers

None declared

Consumers

None declared

NSUBJ

Producers and consumers of NSUBJ

Producers

None declared

Consumers

None declared

NSUBJPASS

Producers and consumers of NSUBJPASS

Producers

None declared

Consumers

None declared

NUM

Producers and consumers of NUM

Producers

None declared

Consumers

None declared

NUMBER

Producers and consumers of NUMBER

Producers

None declared

Consumers

None declared

NX

Producers and consumers of NX

Producers

None declared

Consumers

None declared

O

other or outside a chunk

Producers and consumers of O

Producers

None declared

Consumers

None declared

PARATAXIS

Producers and consumers of PARATAXIS

Producers

None declared

Consumers

None declared

PARTMOD

Producers and consumers of PARTMOD

Producers

None declared

Consumers

None declared

PC

prepositional chunk

Producers and consumers of PC

Producers

None declared

Consumers

None declared

PCOMP

Producers and consumers of PCOMP

Producers

None declared

Consumers

None declared

POBJ

Producers and consumers of POBJ

Producers

None declared

Consumers

None declared

POSS

Producers and consumers of POSS

Producers

None declared

Consumers

None declared

POSSESSIVE

Producers and consumers of POSSESSIVE

Producers

None declared

Consumers

None declared

PP

Producers and consumers of PP

Producers

None declared

Consumers

None declared

PRECONJ

Producers and consumers of PRECONJ

Producers

None declared

Consumers

None declared

PRED

Producers and consumers of PRED

Producers

None declared

Consumers

None declared

PREDET

Producers and consumers of PREDET

Producers

None declared

Consumers

None declared

PREP

Producers and consumers of PREP

Producers

None declared

Consumers

None declared

PREPC

Producers and consumers of PREPC

Producers

None declared

Consumers

None declared

PRN

Producers and consumers of PRN

Producers

None declared

Consumers

None declared

PRP

Producers and consumers of PRP

Producers

None declared

Consumers

None declared

PRT

Producers and consumers of PRT

Producers

None declared

Consumers

None declared

PRT

verb particle

Producers and consumers of PRT

Producers

None declared

Consumers

None declared

PRT

Producers and consumers of PRT

Producers

None declared

Consumers

None declared

PUNCT

Producers and consumers of PUNCT

Producers

None declared

Consumers

None declared

PURPCL

Producers and consumers of PURPCL

Producers

None declared

Consumers

None declared

QP

Producers and consumers of QP

Producers

None declared

Consumers

None declared

QUANTMOD

Producers and consumers of QUANTMOD

Producers

None declared

Consumers

None declared

RCMOD

Producers and consumers of RCMOD

Producers

None declared

Consumers

None declared

REF

Producers and consumers of REF

Producers

None declared

Consumers

None declared

REL

Producers and consumers of REL

Producers

None declared

Consumers

None declared

ROOT

Producers and consumers of ROOT

Producers

None declared

Consumers

None declared

ROOT

Dependency tree root.

Producers and consumers of ROOT

Producers

None declared

Consumers

None declared

RRC

Producers and consumers of RRC

Producers

None declared

Consumers

None declared

S

Producers and consumers of S

Producers

None declared

Consumers

None declared

SBAR

Producers and consumers of SBAR

Producers

None declared

Consumers

None declared

SBARQ

Producers and consumers of SBARQ

Producers

None declared

Consumers

None declared

SINV

Producers and consumers of SINV

Producers

None declared

Consumers

None declared

SQ

Producers and consumers of SQ

Producers

None declared

Consumers

None declared

TMOD

Producers and consumers of TMOD

Producers

None declared

Consumers

None declared

UCP

Producers and consumers of UCP

Producers

None declared

Consumers

None declared

VC

verb complex

Producers and consumers of VC

Producers

None declared

Consumers

None declared

VP

Producers and consumers of VP

Producers

None declared

Consumers

None declared

WHADJP

Producers and consumers of WHADJP

Producers

None declared

Consumers

None declared

WHADVP

Producers and consumers of WHADVP

Producers

None declared

Consumers

None declared

WHNP

Producers and consumers of WHNP

Producers

None declared

Consumers

None declared

WHPP

Producers and consumers of WHPP

Producers

None declared

Consumers

None declared

X

Producers and consumers of X

Producers

None declared

Consumers

None declared

XCOMP

Producers and consumers of XCOMP

Producers

None declared

Consumers

None declared

XSUBJ

Producers and consumers of XSUBJ

Producers

None declared

Consumers

None declared

Tag

Tag

A multi purpose tag

Features of Tag (1)
value (String)

No description

Producers and consumers of Tag

Producers

None declared

Consumers

None declared

Tfidf

Tfidf

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.frequency.tfidf.type.Tfidf
Name: de.tudarmstadt.ukp.dkpro.core.api.frequency.tfidf.type.Tfidf
Supertype: Annotation

Annotates the tf.idf score of a token, stem, or lemma.

Features of Tfidf (2)
tfidfValue (Double)

The tf.idf score.

term (String)

The string that was used to compute this tf.idf score. If a stem or lemma was used, the covered text of this annotation does not need to be equal to this string.

This string can be used to construct a vector space with the right terms without having to access the indexes again.
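
As a rough illustration of how a tf.idf score per term could be computed, the sketch below uses one common formulation, raw term frequency multiplied by log(N / df). The weighting scheme actually applied by the DKPro Core components is not stated in this reference, so the formula and the function name `tfidf_scores` are assumptions. The input terms are lemmas, which is why a term such as "bark" may differ from the covered text "barks".

```python
import math
from collections import Counter


def tfidf_scores(documents):
    """Return one {term: score} dict per document using tf * log(N / df)."""
    n_docs = len(documents)
    # Document frequency: in how many documents does each term occur?
    df = Counter()
    for doc in documents:
        df.update(set(doc))
    scores = []
    for doc in documents:
        tf = Counter(doc)
        scores.append({term: count * math.log(n_docs / df[term])
                       for term, count in tf.items()})
    return scores


# Each document is a list of terms (here: lemmas).
docs = [["dog", "bark", "dog"], ["cat", "meow"], ["dog", "cat"]]
scores = tfidf_scores(docs)
```

Because each score is stored together with its term string, a vector space can be built directly from the resulting dictionaries.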

Producers and consumers of Tfidf

Producers

Consumers

None declared

Topic Modeling

TopicDistribution

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.mallet.type.TopicDistribution
Name: de.tudarmstadt.ukp.dkpro.core.mallet.type.TopicDistribution
Supertype: Annotation

An array representing the topic proportions in a document.

Features of TopicDistribution (2)
TopicProportions (DoubleArray)

Each topic’s proportion in the document.

TopicAssignment (IntegerArray)

Pointers to topics the document has been assigned to.
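
The relation between the two features can be pictured as follows. This sketch assumes that a document is assigned to every topic whose proportion reaches a threshold. That rule, the threshold value, and the function name `assign_topics` are assumptions for illustration, since this reference does not say how assignments are derived.

```python
def assign_topics(proportions, threshold=0.25):
    """Return indices of topics whose proportion reaches the assumed threshold."""
    return [index for index, p in enumerate(proportions) if p >= threshold]


# One proportion per topic; the values sum to 1 for a single document.
topic_proportions = [0.05, 0.60, 0.30, 0.05]
topic_assignment = assign_topics(topic_proportions)
```

Here `topic_assignment` holds the indices of the second and third topics, which act as pointers into the proportions array.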

Producers and consumers of TopicDistribution

Producers

Consumers

None declared

Transformation

SofaChangeAnnotation

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.api.transform.type.SofaChangeAnnotation
Name: de.tudarmstadt.ukp.dkpro.core.api.transform.type.SofaChangeAnnotation
Supertype: Annotation

Encodes an edit operation that can be interpreted by the ApplyChangesAnnotator.

Features of SofaChangeAnnotation (3)
value (String)

In case of an "insert" or "replace" operation, this feature indicates the value to be inserted or replaced.

operation (String)

Operation to perform: "insert", "replace", "delete"

reason (String)

The reason for the change.
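
The three operations can be pictured as edits on the document text. The sketch below is not the ApplyChangesAnnotator implementation. It assumes non-overlapping changes, assumes that an "insert" places the value at the begin offset, and uses a hypothetical `Change` dataclass in place of the annotation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    begin: int
    end: int
    operation: str  # "insert", "replace" or "delete"
    value: str = ""
    reason: str = ""


def apply_changes(text, changes):
    # Process changes from the end of the text backwards so that the
    # offsets of earlier changes stay valid while the string is modified.
    for change in sorted(changes, key=lambda c: c.begin, reverse=True):
        if change.operation == "insert":
            text = text[:change.begin] + change.value + text[change.begin:]
        elif change.operation == "replace":
            text = text[:change.begin] + change.value + text[change.end:]
        elif change.operation == "delete":
            text = text[:change.begin] + text[change.end:]
        else:
            raise ValueError(f"unknown operation: {change.operation}")
    return text


original = "Teh dog barks loudly."
changes = [
    Change(0, 3, "replace", "The", reason="spelling"),
    Change(13, 20, "delete"),
    Change(4, 4, "insert", "big "),
]
result = apply_changes(original, changes)
# result == "The big dog barks."
```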

Producers and consumers of SofaChangeAnnotation

Producers

Consumers

Utility

TimerAnnotation

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.performance.type.TimerAnnotation
Name: de.tudarmstadt.ukp.dkpro.core.performance.type.TimerAnnotation
Supertype: Annotation

Used for storing timing information (e.g. for performance testing).

Features of TimerAnnotation (3)
startTime (Long)

No description

endTime (Long)

No description

name (String)

The name of the timer. Used to automatically determine whether this is an upstream or downstream timer.

Producers and consumers of TimerAnnotation

Producers

None declared

Consumers

None declared

Wikipedia

WikipediaLink

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.io.jwpl.type.WikipediaLink
Name: de.tudarmstadt.ukp.dkpro.core.io.jwpl.type.WikipediaLink
Supertype: Annotation

Wikipedia link

Features of WikipediaLink (3)
LinkType (String)

The type of the link, e.g. internal, external, image, etc.

Target (String)

The link target URL

Anchor (String)

The anchor of the link

Producers and consumers of WikipediaLink

Producers

WikipediaLink (format)

Consumers

None declared

Wikipedia (JWPL)

ArticleInfo

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.io.jwpl.type.ArticleInfo
Name: de.tudarmstadt.ukp.dkpro.core.io.jwpl.type.ArticleInfo
Supertype: Annotation

Contains basic information about the article.

Features of ArticleInfo (4)
Authors (Integer)

Number of unique authors of this article

Revisions (Integer)

Number of revisions of this article.

FirstAppearance (Long)

The Timestamp of the first appearance of this article.

LastAppearance (Long)

The Timestamp of the last appearance of this article.

Producers and consumers of ArticleInfo

Producers

Consumers

None declared

DBConfig

Database configuration for the connection to the database from which the CAS data was retrieved.

Features of DBConfig (5)
Host (String)

DB Host

DB (String)

Database

User (String)

Username

Password (String)

User password

Language (String)

Wikipedia Language Versions

Producers and consumers of DBConfig

Producers

Consumers

None declared

WikipediaRevision

URI: http://dkpro.github.io/dkpro-core/releases/1.8.0/docs/typesystem-reference.html#de.tudarmstadt.ukp.dkpro.core.io.jwpl.type.WikipediaRevision
Name: de.tudarmstadt.ukp.dkpro.core.io.jwpl.type.WikipediaRevision
Supertype: Annotation

Represents a revision in Wikipedia.

Features of WikipediaRevision (7)
revisionId (Integer)

The ID of the revision.

pageId (Integer)

The pageId of the Wikipedia page of this revision.

contributorName (String)

The username of the user/contributor who edited this revision.

comment (String)

The comment that the editor entered for this revision.

contributorId (Integer)

The userId of the user/contributor who created this revision

timestamp (Long)

The timestamp of the revision, given in milliseconds since the standard base time (January 1, 1970, 00:00:00 GMT)

minor (Boolean)

Whether this revision has been marked as minor edit by its contributor.
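
The timestamp feature counts milliseconds since January 1, 1970, 00:00:00 GMT. A minimal sketch of converting such a value into a readable date follows. The function name `revision_datetime` and the sample value are assumptions for illustration.

```python
from datetime import datetime, timedelta, timezone

# The standard base time named in the feature description.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def revision_datetime(timestamp_ms):
    """Convert milliseconds since the base time to a timezone-aware datetime."""
    return EPOCH + timedelta(milliseconds=timestamp_ms)


when = revision_datetime(1_000_000_000_000)
# when.isoformat() == "2001-09-09T01:46:40+00:00"
```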

Producers and consumers of WikipediaRevision

Producers

Consumers

None declared

Subtype tables

Table 2. Sub-types of Dependency
Type Description

ABBREV

No description

ACOMP

No description

ADVCL

No description

ADVMOD

No description

AGENT

No description

AMOD

No description

APPOS

No description

ATTR

No description

AUX0

No description

AUXPASS

No description

CC

No description

CCOMP

No description

COMPLM

No description

CONJ

No description

CONJP

No description

CONJ_YET

No description

COP

No description

CSUBJ

No description

CSUBJPASS

No description

DEP

No description

DET

No description

DOBJ

No description

EXPL

No description

INFMOD

No description

IOBJ

No description

MARK

No description

MEASURE

No description

MWE

No description

NEG

No description

NN

No description

NPADVMOD

No description

NSUBJ

No description

NSUBJPASS

No description

NUM

No description

NUMBER

No description

PARATAXIS

No description

PARTMOD

No description

PCOMP

No description

POBJ

No description

POSS

No description

POSSESSIVE

No description

PRECONJ

No description

PRED

No description

PREDET

No description

PREP

No description

PREPC

No description

PRT

No description

PUNCT

No description

PURPCL

No description

QUANTMOD

No description

RCMOD

No description

REF

No description

REL

No description

ROOT

Dependency tree root.

TMOD

No description

XCOMP

No description

XSUBJ

No description

Table 3. Sub-types of POS
Type Description

ADJ

Adjectives

ADV

Adverbs

ART

Determiners and articles

CARD

Numerals

CONJ

Conjunctions

N

Nouns

O

Catch-all for other categories such as abbreviations or foreign words

PP

Prepositions and postpositions

PR

Pronouns

PRT

Particles

PUNC

Punctuation marks

V

Verbs

Table 4. Sub-types of Chunk
Type Description

ADJC

adjective chunks

ADVC

adverb chunks

CONCJ

complex coordinating conjunctions such as "as well (as)" or "rather (than)"

INTJ

interjection

LST

enumeration symbol

NC

noun chunk (non-recursive noun phrase)

O

other or outside a chunk

PC

prepositional chunk

PRT

verb particle

VC

verb complex

Table 5. Sub-types of Constituent
Type Description

ADJP

No description

ADVP

No description

CONJP

No description

FRAG

No description

INTJ

No description

LST

No description

NAC

No description

NP

No description

NX

No description

PP

No description

PRN

No description

PRP

No description

PRT

No description

QP

No description

ROOT

No description

RRC

No description

S

No description

SBAR

No description

SBARQ

No description

SINV

No description

SQ

No description

UCP

No description

VP

No description

WHADJP

No description

WHADVP

No description

WHNP

No description

WHPP

No description

X

No description

Table 6. Sub-types of O
Type Description

AT

at-mention (indicates another user as a recipient of a tweet)

DM

discourse marker, indications of continuation of a message across multiple tweets

EMO

emoticon

HASH

Hashtag (indicates topic/category for tweet)

INT

interjection

URL

URL or email address

Table 7. Sub-types of NamedEntity
Type Description

Animal

No description

Cardinal

No description

ContactInfo

No description

Date

No description

Disease

No description

Event

No description

Fac

No description

FacDesc

No description

Game

No description

Gpe

No description

GpeDesc

No description

Language

No description

Law

No description

Location

No description

Money

No description

Nationality

No description

Norp

No description

Ordinal

No description

OrgDesc

No description

Organization

No description

PerDesc

No description

Percent

No description

Person

No description

Plant

No description

Product

No description

ProductDesc

No description

Quantity

No description

Substance

No description

Time

No description

WorkOfArt

No description

Table 8. Sub-types of Split
Type Description

CompoundPart

A CompoundPart represents one fragment from the compounding word.

LinkingMorpheme

This type represents a linking morpheme between two CompoundParts.

Table 9. Sub-types of AnnotationBase
Type Description

CoreferenceChain

Marks the beginning of a chain.

Table 10. Sub-types of Div
Type Description

Document

No description

Heading

Document title, section heading, etc.

Paragraph

No description

Table 11. Sub-types of DocumentAnnotation
Type Description

DocumentMetaData

The DocumentMetaData annotation stores information about a single processed document.

Table 12. Sub-types of Anomaly
Type Description

GrammarAnomaly

No description

SpellingAnomaly

No description

Table 13. Sub-types of Token
Type Description

JapaneseToken

No description

Table 14. Sub-types of N
Type Description

NN

Common noun

NNV

nominal + verbal

NP

Proper noun

NPV

proper noun + verbal

Table 15. Sub-types of TOP
Type Description

TagDescription

Description of an individual tag.