A Novel Hybrid Genetic-Whale Optimization Model for Ontology Learning from Arabic Text
Date
2019
Journal Title
Journal ISSN
Volume Title
Publisher
MDPI
Abstract
Ontologies are used to model knowledge in several domains of interest, such as the
biomedical domain. Conceptualization is the basic task for ontology building. Concepts are identified,
and then they are linked through their semantic relationships. Recently, ontologies have constituted
a crucial part of modern semantic webs because they can convert a web of documents into a
web of things. Although ontology learning generally occupies a large space in computer science,
Arabic ontology learning, in particular, is underdeveloped due to the Arabic language’s nature
as well as the profundity required in this domain. The previously published research on Arabic
ontology learning from text falls into three categories: developing manually hand-crafted rules,
using ordinary supervised/unsupervised machine learning algorithms, or a hybrid of these two
approaches. The model proposed in this work contributes to Arabic ontology learning in two ways.
First, a text mining algorithm is proposed for extracting concepts and their semantic relations from
text documents. The algorithm calculates the concept frequency weights using the term frequency
weights. Then, it calculates the weights of concept similarity using the information of the ontology
structure, involving (1) the concept’s path distance, (2) the concept’s distribution layer, and (3) the
mutual parent concept’s distribution layer. Then, feature mapping is performed by assigning the
concepts’ similarities to the concept features. Second, a hybrid genetic-whale optimization algorithm
was proposed to optimize ontology learning from Arabic text. The operator of the G-WOA is a hybrid
operator integrating GA’s mutation, crossover, and selection processes with the WOA’s processes
(encircling prey, attacking of bubble-net, and searching for prey) to fulfill the balance between both
exploitation and exploration, and to find the solutions that exhibit the highest fitness. For evaluating
the performance of the ontology learning approach, extensive comparisons are conducted using
different Arabic corpora and bio-inspired optimization algorithms. Furthermore, two publicly
available non-Arabic corpora are used to compare the efficiency of the proposed approach with
those of other languages. The results reveal that the proposed genetic-whale optimization algorithm
outperforms the other compared algorithms across all the Arabic corpora in terms of precision, recall,
and F-score measures. Moreover, the proposed approach outperforms the state-of-the-art methods of
ontology learning from Arabic and non-Arabic texts in terms of these three measures.
Description
Keywords
text mining; ontology learning; hybrid models; genetic algorithms; whale optimization
algorithm
Citation
Rania M. Ghoniem, Nawal Alhelwa and Khaled Shaalan (2019) “A Novel Hybrid Genetic-Whale Optimization Model for Ontology Learning from Arabic Text,” Algorithms, 12(9), p. 182.