public class WordListProcessor extends Object
Modifier and Type | Field and Description |
---|---|
protected static Pattern |
ESCAPE_DELIMITER1 |
protected static Pattern |
ESCAPE_DELIMITER2 |
protected static Pattern |
ESCAPE_DELIMITER3 |
protected static Pattern |
HTML_REMOVER |
protected static Pattern |
REFERENCE_PATTERN |
protected static Pattern |
SUPERSCRIPT_PATTERN |
Constructor and Description |
---|
WordListProcessor() |
Modifier and Type | Method and Description |
---|---|
protected String |
deWikify(String word) |
protected String |
escapeDelimiters(String text) |
protected String |
removeBrackets(String word) |
protected String |
removeComments(String word) |
protected String |
removeTemplates(String word) |
List<String> |
splitWordList(String text)
Splits the given text by comma, semicolon, line break, etc.
|
protected static final Pattern HTML_REMOVER
protected static final Pattern ESCAPE_DELIMITER1
protected static final Pattern ESCAPE_DELIMITER2
protected static final Pattern ESCAPE_DELIMITER3
protected static final Pattern REFERENCE_PATTERN
protected static final Pattern SUPERSCRIPT_PATTERN
Copyright © 2011-2016 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.