Design and Development of Automatic Unlexicalized Statistical Constituency Parser for Tigrigna

Gereziher, Hailesilassie

Files

Gereziher Hailesilassie Tafere.pdf (2.56 MB)

Date

2017-06

Authors

Gereziher, Hailesilassie

Publisher

ASTU

Abstract

Syntactic parsing is an intermediate step for higher-level components of Natural Language Processing such as Machine Translation. The design and development of an automatic unlexicalized statistical constituency parser for the Tigrigna language has not previously been attempted. In this research, a treebank of 250 syntactically parsed sentences was therefore built with the help of a professional linguist. A Viterbi-based, bottom-up probabilistic context-free parsing tool is used for parsing, together with an automatic probabilistic context-free grammar induction model. Maximum likelihood estimation is used to extract and learn probabilities automatically from context-free grammar repositories. A Tigrigna word and affix segmenter, a transliteration system, and Trigrams'n'Tags (TnT), an efficient language-independent tagger, are integrated as inputs to the parser. The segmenter splits morphological affixes and complex words into their representative base forms. The primary role of the parser is structural disambiguation, whereas the TnT tagger, together with the word and morphological segmenter, handles lexical (word-category) disambiguation. Finally, the parser is evaluated with the standard parser evaluation scoring tool Evaluate Bracketing (EVALB). The parser achieves state-of-the-art accuracy, with an F-score of 95.12% on a 75-25 percentage split.
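The two core techniques named in the abstract, maximum likelihood estimation of rule probabilities and Viterbi bottom-up parsing, can be illustrated with a minimal sketch. It is not the thesis implementation. The toy treebank, the tag names (DET, N, V, ADJ), and the function names `induce_pcfg` and `viterbi_cky` are hypothetical. The sketch also assumes that trees are already binarized and that the tagger output is given, so the parser works on tag sequences only, in keeping with the unlexicalized setting.

```python
from collections import Counter, defaultdict

# Hypothetical toy treebank (not from the thesis). Each internal node is
# (label, left_child, right_child); each leaf is a POS tag string.
TREEBANK = [
    ("S", ("NP", "DET", "N"), ("VP", "V", ("NP", "DET", "N"))),
    ("S", ("NP", "ADJ", "N"), ("VP", "V", ("NP", "DET", "N"))),
]

def rules(tree):
    """Yield (lhs, (rhs_left, rhs_right)) for every internal node."""
    if isinstance(tree, str):
        return
    lhs, left, right = tree
    label = lambda t: t if isinstance(t, str) else t[0]
    yield lhs, (label(left), label(right))
    yield from rules(left)
    yield from rules(right)

def induce_pcfg(treebank):
    """MLE: P(A -> B C) = count(A -> B C) / count(A)."""
    rule_counts = Counter(r for tree in treebank for r in rules(tree))
    lhs_counts = Counter()
    for (lhs, _), count in rule_counts.items():
        lhs_counts[lhs] += count
    return {rule: count / lhs_counts[rule[0]]
            for rule, count in rule_counts.items()}

def viterbi_cky(tags, pcfg, start="S"):
    """Return (probability, tree) of the best parse, or None if no parse."""
    n = len(tags)
    # best[(i, j)][label] = (probability, tree) for the span tags[i:j]
    best = defaultdict(dict)
    for i, tag in enumerate(tags):
        best[(i, i + 1)][tag] = (1.0, tag)  # tagger output taken as given
    for width in range(2, n + 1):           # bottom-up over span widths
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):       # split point
                for (lhs, (b, c)), p in pcfg.items():
                    if b in best[(i, k)] and c in best[(k, j)]:
                        pb, tb = best[(i, k)][b]
                        pc, tc = best[(k, j)][c]
                        prob = p * pb * pc
                        if prob > best[(i, j)].get(lhs, (0.0, None))[0]:
                            best[(i, j)][lhs] = (prob, (lhs, tb, tc))
    return best[(0, n)].get(start)

pcfg = induce_pcfg(TREEBANK)
result = viterbi_cky(["DET", "N", "V", "ADJ", "N"], pcfg)
```

Here `pcfg` maps each rule to its relative frequency among rules with the same left-hand side, and `result` holds the highest-probability tree for a tag sequence that does not occur in the toy treebank. A real system would also need unary rules, smoothing for unseen rules, and log probabilities to avoid underflow on long sentences.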

URI

http://10.240.1.28:4000/handle/123456789/1741

Collections

Thesis
