Positive Unlabelled Learning to Recognize Dishes as Named Entity

dc.Location2019 T 58.5 T37
dc.SupervisorProfessor Sherief Abdallah
dc.contributor.authorTAREK, AIMAN
dc.date.accessioned2019-08-27T08:14:33Z
dc.date.available2019-08-27T08:14:33Z
dc.date.issued2019-04
dc.description.abstractWith the growth of social media, there is a need to analyse the user-generated content; especially the text reviews. Online text reviews have a lot of potential and opportunities for both users and business owners. Many researches target analysing text reviews extracting useful info especially Named Entity Recognition. In this research, I focus on extracting food and dish names as a named entity. With the lack of labelled data, I try to overcome the cold start and avoid manual labelling by building a lookup table from a dictionary. I work with Yelp dataset, going through each text review, using each noun as a candidate, label the positive samples using the aforementioned lookup table, then using Positive Unlabelled learning techniques to recognise more entities within the unlabelled data, by predicting the probability for each candidate. I considered the surrounding words; preceding and following in building the model, as well as Part of Speech tag for each. To eliminate duplicates due to repeated candidates from different reviews or sentences, I calculate the average and represent each candidate entity only once. The results show how we can automate entity recognition process, using dictionaries and machine learning techniques and achieve an acceptable accuracy of 67% and boost the newly discovered entities by around 15% using Positive Unlabelled learning over automatically build lookup table. This research has the potential to be extended to other topics other than food and dish names, also it acts as a framework and algorithm independent.en_US
dc.identifier.other2016210186
dc.identifier.urihttps://bspace.buid.ac.ae/handle/1234/1459
dc.language.isoenen_US
dc.publisherThe British University in Dubai (BUiD)en_US
dc.subjectsocial mediaen_US
dc.subjectuser-generated contenten_US
dc.subjectnamed entity recognitionen_US
dc.titlePositive Unlabelled Learning to Recognize Dishes as Named Entityen_US
dc.typeDissertationen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2016210186.pdf
Size:
1.02 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: