Class | Description |
---|---|
Conll2000Reader |
Reads the CoNLL 2000 chunking format.
|
Conll2000Writer |
Writes the CoNLL 2000 chunking format.
|
Conll2002Reader |
Reads the CoNLL 2002 named entity format.
|
Conll2002Writer |
Writes the CoNLL 2002 named entity format.
|
Conll2006Reader |
Reads a file in the CoNLL-2006 format.
|
Conll2006Writer |
Writes the annotations from the CAS object to a tab-separated file in the CoNLL-2006 format.
|
Conll2009Reader |
Reads a file in the CoNLL-2009 format.
|
Conll2009Writer |
ID - (ignored) Token counter, starting at 1 for each new sentence.
FORM - (Token) Word form or punctuation symbol.
LEMMA - (Lemma) Lemma of FORM.
PLEMMA - (ignored) Automatically predicted lemma of FORM
POS - (POS) Fine-grained part-of-speech tag, where the tagset depends on the language,
or identical to the coarse-grained part-of-speech tag if not available.
PPOS - (ignored) Automatically predicted major POS by a language-specific tagger
FEAT - (Morpheme) Unordered set of syntactic and/or morphological features (depending
on the particular language), separated by a vertical bar (|), or an underscore if not available.
PFEAT - (ignored) Automatically predicted morphological features (if applicable)
HEAD - (Dependency) Head of the current token, which is either a value of ID or zero
('0').
|
Conll2012Reader |
Reads a file in the CoNLL-2012 format.
|
Conll2012Writer |
|
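
The column layout listed for Conll2009Writer can be illustrated with a small stand-alone sketch. It does not use the reader or writer classes from this package: `Conll2009LineSketch` and its methods are hypothetical names introduced only for illustration, and the sample line is invented. The sketch assumes one token per line with tab-separated columns in the order given above, and it only looks at the nine columns described there (ID through HEAD).

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

/** Hypothetical helper for illustration; not part of the documented package. */
public class Conll2009LineSketch {

    // Column names in the order given in the Conll2009Writer description.
    static final List<String> COLUMNS = Arrays.asList(
            "ID", "FORM", "LEMMA", "PLEMMA", "POS", "PPOS", "FEAT", "PFEAT", "HEAD");

    /** Returns the raw value of the named column from one tab-separated token line. */
    static String column(String line, String name) {
        int index = COLUMNS.indexOf(name);
        if (index < 0) {
            throw new IllegalArgumentException("Unknown column: " + name);
        }
        // Limit -1 keeps trailing empty fields so the column count stays reliable.
        String[] fields = line.split("\t", -1);
        if (fields.length < COLUMNS.size()) {
            throw new IllegalArgumentException(
                    "Expected at least " + COLUMNS.size() + " columns, got " + fields.length);
        }
        return fields[index];
    }

    /** Splits FEAT on the vertical bar; an underscore means no features are available. */
    static List<String> features(String line) {
        String feat = column(line, "FEAT");
        if ("_".equals(feat)) {
            return Collections.emptyList();
        }
        return Arrays.asList(feat.split("\\|"));
    }

    /** Parses HEAD, which is either the ID of another token or 0. */
    static int head(String line) {
        return Integer.parseInt(column(line, "HEAD"));
    }

    public static void main(String[] args) {
        // Invented sample line with the nine columns described above.
        String line = "2\tdogs\tdog\tdog\tNNS\tNNS\tnum=pl|case=nom\t_\t3";
        System.out.println(column(line, "FORM"));
        System.out.println(features(line));
        System.out.println(head(line));
    }
}
```

Any columns after HEAD are simply ignored by this sketch, so it also accepts longer lines.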
Copyright © 2011–2015. All rights reserved.