public class WPOSEn extends Object
1) ==English== ===Etymology=== ===Noun=== ===Verb=== ==Finnish== ===Etymology=== ===Noun=== (level 3 in English Wiktionary: ===Noun===) 2) In the case of multiple etymologies, all subordinate headers need to have their levels increased by 1: ===Etymology 1=== ====Pronunciation==== ====Noun==== POS=noun ===Etymology 2=== ====Pronunciation==== ====Noun==== POS=noun ====Verb==== POS=verb (level 4 in English Wiktionary: ===Verb===)see http://en.wiktionary.org/wiki/Wiktionary:Entry_layout_explained see http://en.wiktionary.org/wiki/Wiktionary:Entry_layout_explained/POS_headers
Constructor and Description |
---|
WPOSEn() |
Modifier and Type | Method and Description |
---|---|
static boolean |
isSecondLevelHeaderWordNotPOS(String str)
Gets true, if str is known header, e.g.
|
static POSText[] |
splitToPOSSections(String page_title,
LangText[] etymology_sections)
Splits each etymology section into POS sections.
|
public static boolean isSecondLevelHeaderWordNotPOS(String str)
public static POSText[] splitToPOSSections(String page_title, LangText[] etymology_sections)
page_title
- - word which are described in this article 'text'
1) Splits the following text to "Noun" and "Verb"
2) Extracts part of speech "noun" and "verb"
===Noun=== {{en-noun}} ===Verb===Todo: save info about the link Etymology <-> POS.
Copyright © 2011-2016 Ubiquitous Knowledge Processing (UKP) Lab. All Rights Reserved.