- ID - (ignored) Token counter, starting at 1 for each new sentence.
- FORM - (Token) Word form or punctuation symbol.
- LEMMA - (Lemma) Fine-grained part-of-speech tag, where the tagset depends on the
language, or identical to the coarse-grained part-of-speech tag if not available.
- PLEMMA - (ignored) Automatically predicted lemma of FORM
- POS - (POS) Fine-grained part-of-speech tag, where the tagset depends on the language,
or identical to the coarse-grained part-of-speech tag if not available.
- PPOS - (ignored) Automatically predicted major POS by a language-specific tagger
- FEAT - (Morpheme) Unordered set of syntactic and/or morphological features (depending
on the particular language), separated by a vertical bar (|), or an underscore if not available.
- PFEAT - (ignored) Automatically predicted morphological features (if applicable)
- HEAD - (Dependency) Head of the current token, which is either a value of ID or zero
('0'). Note that depending on the original treebank annotation, there may be multiple tokens with
an ID of zero.
- PHEAD - (ignored) Automatically predicted syntactic head
- DEPREL - (Dependency) Dependency relation to the HEAD. The set of dependency relations
depends on the particular language. Note that depending on the original treebank annotation, the
dependency relation may be meaningfull or simply 'ROOT'.
- PDEPREL - (ignored) Automatically predicted dependency relation to PHEAD
- FILLPRED - (auto-generated) Contains 'Y' for argument-bearing tokens
- PRED - (SemanticPredicate) (sense) identifier of a semantic 'predicate' coming from a
current token
- APREDs - (SemanticArgument) Columns with argument labels for each semantic predicate
(in the ID order)
Sentences are separated by a blank new line