Reads a list of words from a text file (one token per line) and retains only tokens or other annotations that match any of these words.
Remove every token that does or does not match a given regular expression.
Removing trailing character (sequences) from tokens, e.g.
Copyright © 2007–2016 Ubiquitous Knowledge Processing (UKP) Lab, Technische Universität Darmstadt. All rights reserved.