Using Machine Learning to Improve Rule based Arabic Named Entity Recognition

dc.Location2011 T 58.6 S56
dc.SupervisorDr Khaled Shaalan & Dr Sherief Abdallah
dc.contributor.authorShoaib, Muhammad
dc.date.accessioned2013-03-21T14:21:09Z
dc.date.available2013-03-21T14:21:09Z
dc.date.issued2011-01
dc.descriptionDISSERTATION WITH DISTINCTION
dc.description.abstractArabic Language is widely spoken and highly influential Language both politically and geographically. Thus it is crucial to perform Information Extraction on diverse Arabic texts. In past decade many researchers have targeted the Information Extraction in general and Named Entity Recognition in particular for Arabic language. Mostly researchers have applied Machine Learning for Arabic Named Entity Recognition while few researchers have used hand crafted rules for Named Entity Recognition task.The Machine Learning techniques and rule based techniques for named entity recognition are mostly viewed as rival approaches. The work presented in this thesis is an effort to combine rule based and Machine Learning approaches into a Hybrid System for Named Entity Recognition. The Person, Organization and Location entities identified by rule based system are used as features combined with several other features for Machine Learning system. The final outcome provides enhanced Named Entity annotations.The evaluation of the experiments conducted shows that the Hybrid approach stated in thesis significantly improves the quality of named entity recognition of independent rule based system and independent Machine Learning system. Moreover the statistical significance tests confirms that the results obtained are valid and not occurred by chance.en_US
dc.identifier.other90049
dc.identifier.urihttp://bspace.buid.ac.ae/handle/1234/58
dc.language.isoenen_US
dc.publisherThe British University in Dubai (BUiD)en_US
dc.subjectinformation extractionen_US
dc.subjectnamed entity recognitionen_US
dc.subjectmachine learningen_US
dc.subjectnamed entity annotationen_US
dc.titleUsing Machine Learning to Improve Rule based Arabic Named Entity Recognitionen_US
dc.typeDissertationen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis.pdf
Size:
1.34 MB
Format:
Adobe Portable Document Format
Description:
Full Text
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: