Sci.Int(Lahore),26(1),181-184,2014
ABSTRACT: The Holy Quran has shaped the lives of the Muslim nation and is among the most widely
read books. Although the Quran is recited extensively across the globe, little attention has been
paid to Quranic search. Currently available models rely on keyword based searching, which is not
only inefficient but also fails to find Quranic concepts accurately. This research paper addresses
the deficiencies of keyword based searching and the issues related to semantic search in the Holy
Quran, and proposes a model that is capable of performing semantic search.
INTRODUCTION:
The Holy Quran is the most sacred scripture of the Muslim
nation and is an ultimate source of information, covering a
diverse range of knowledge and subjects. It
discusses almost all fields of life and provides the basics for all
areas of knowledge. Today, one of every
five people on earth is Muslim [1]. Understanding the Holy
Quran is therefore highly significant for every Muslim as well as
for those scholars who are interested in the study of man and
society, since the Holy Quran has
been influential not only in molding the destinies
of Islamic societies but also in changing the destiny of
mankind as a whole [2]. Understanding the
concepts of the Quran is thus of paramount significance if one
wishes to study this book comprehensively.
The Holy Quran has its own style of describing
concepts, which is unique in many ways. Generally, a concept
is discussed in several chapters. For example, the
concept of Hell is discussed in various chapters, and similarly
the oneness of the Almighty is discussed throughout
the Holy Quran. A single verse may also
contain more than one theme. For example, Verse 40 of
chapter 78 contains only seven words yet conveys five different
concepts: first, We (Allah) have
warned you (humans); second, We (Allah) have warned
of chastisement; third, the chastisement is near at hand;
fourth, man shall see (on Qayamat) what his (the human's) two
hands have sent before; and last, the unbeliever shall
say (on Qayamat), "Would that I were dust". A notable point about this
verse is that the words Allah and Qayamat are never
used in it, yet the context reveals what is being said.
Another distinctive feature of the Quran is that one entity is
referred to by many different names depending on the context. For
example, Muhammad is also referred to as Ahmad, Mudhathir,
Muzammil, Mubashir and Nazeer, and Heaven is referred to as The
Garden, Paradise, etc. A term may also carry different
meanings. Disambiguating between these meanings depends
on the context in which the term is used.
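The naming pattern described above can be pictured as a lookup from surface terms to a canonical concept. The following is a minimal sketch only: the `ALIASES` table and the `canonical_concept` function are hypothetical names, the entries are limited to the examples given in the text, and context-dependent disambiguation is not attempted.

```python
# Hypothetical alias table built only from the examples in the text.
ALIASES = {
    "ahmad": "Muhammad",
    "mudhathir": "Muhammad",
    "muzammil": "Muhammad",
    "mubashir": "Muhammad",
    "nazeer": "Muhammad",
    "the garden": "Heaven",
    "paradise": "Heaven",
}


def canonical_concept(term: str) -> str:
    """Return the canonical concept for a term, or the term itself if unknown."""
    cleaned = term.strip()
    return ALIASES.get(cleaned.lower(), cleaned)


print(canonical_concept("Paradise"))  # Heaven
print(canonical_concept("Muzammil"))  # Muhammad
```

A flat table of this kind cannot handle a term that has several meanings; that case would need the surrounding context as an extra input.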
Even though the Quran is recited extensively across the
globe, little attention has been paid to searching
Quranic concepts digitally. Currently available models
rely on keyword based searching, and statistical and
keyword-based techniques have achieved some success in
data mining and information retrieval systems [3]. Despite
this, such systems are inefficient, and
keyword based searching techniques have several limitations
METHODOLOGY
This research proposes a theoretical framework
architecture for a Concept based Evolutionary Quranic Search
Engine (CEQSE) that takes user queries as input and
searches for concepts in the Quran accordingly. The benefit of
this framework is that search time and accuracy
are not fixed but improve with use. Initially the search
engine may take longer and may retrieve some
irrelevant verses compared with its searches after it has gained
experience. The framework consists of eight modules, as
shown in figure 1.
Quran Document:
This module acts as an input interface and is used to take
a Quran document as input from the user. It holds all
the verses of the Quran. Although this is a one-time task, it
gives the user the opportunity to add as many books to the
searchable text as desired. The module passes the text to the next module for
further processing.
Ontology Extractor:
The purpose of this module is to extract ontological
knowledge from factual knowledge. The module takes the
XML file and, modeled on how the human
brain stores semantic information, performs a tagging
operation at the sentence level. Tagging is done
by dividing each sentence into three tags: Subject,
Object and Predicate. This ontological knowledge is stored
in the Ontological Knowledgebase repository so that it can be
used as conceptual knowledge by the other modules.
The module is also responsible for providing the ontology to the
Query Engine according to the user query.
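The Subject, Predicate and Object tagging described above can be sketched as a small data structure. This is an illustration under stated assumptions: the `Triple` and `OntologicalKnowledgebase` classes are hypothetical, the store is a plain in-memory index, and the sentence is assumed to be already split into its three parts, since the text does not specify how that split is made.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    """One tagged sentence: subject, predicate and object, plus its source."""
    subject: str
    predicate: str
    obj: str
    verse_ref: str  # placeholder identifier for the source verse


class OntologicalKnowledgebase:
    """Hypothetical in-memory stand-in for the repository named in the text."""

    def __init__(self) -> None:
        self._by_term = defaultdict(list)

    def add(self, triple: Triple) -> None:
        # Index the triple under both its subject and its object.
        for term in (triple.subject, triple.obj):
            self._by_term[term.lower()].append(triple)

    def lookup(self, term: str) -> list:
        # Return a copy so callers cannot modify the index.
        return list(self._by_term.get(term.lower(), []))


kb = OntologicalKnowledgebase()
kb.add(Triple("Allah", "warned", "Human", verse_ref="example-verse"))
print(kb.lookup("human"))
```

Indexing by both subject and object is one possible design choice; it lets a query for either participant reach the same tagged sentence.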
Query Engine:
The Query Engine works as a controller: it receives user queries
and passes them to the subsystems for processing. It gets the query
from the user application and passes it to the POS Tagger. It is also
responsible for retrieving ontological knowledge from the
Ontology Extractor to answer the user's query. After the
concepts have been validated by the concept validator, it sends them to the
Ontology Extractor for refinement of the ontological
knowledge.
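The controller role described above can be sketched as one function that passes a query through the subsystems in order. All names here (`run_query` and the three stub functions) are hypothetical, and treating only nouns as concept terms is an assumption of this sketch, not something the text states.

```python
from typing import Callable, List, Tuple


def run_query(
    query: str,
    pos_tag: Callable[[str], List[Tuple[str, str]]],
    fetch_concepts: Callable[[List[str]], List[str]],
    validate: Callable[[str], bool],
) -> List[str]:
    """Route one query through tagging, ontology lookup and validation."""
    tagged = pos_tag(query)
    # Assumption of this sketch: only nouns are treated as concept terms.
    terms = [word for word, tag in tagged if tag == "NOUN"]
    candidates = fetch_concepts(terms)
    # Keep only the concepts the validator accepts.
    return [concept for concept in candidates if validate(concept)]


# Stub subsystems, used only to show the control flow.
def stub_tagger(q: str) -> List[Tuple[str, str]]:
    nouns = {"hell", "garden"}
    return [(w, "NOUN" if w in nouns else "OTHER") for w in q.lower().split()]


def stub_ontology(terms: List[str]) -> List[str]:
    return ["concept:" + t for t in terms]


def stub_validator(concept: str) -> bool:
    return concept != "concept:garden"


result = run_query("Hell and Garden", stub_tagger, stub_ontology, stub_validator)
print(result)  # ['concept:hell']
```

In the framework as described, the validated concepts would also be sent back to the Ontology Extractor for refinement; the sketch simply returns them.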
POS Tagger:
The POS Tagger is used for part-of-speech tagging and tokenization
of words. It labels each word of a sentence with
its suitable tag, such as verb, adverb or noun.
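As a rough illustration of tokenization followed by labelling, the sketch below uses a tiny hand-made lexicon. The `LEXICON` entries and the `pos_tag` function are hypothetical; a practical tagger would be trained on annotated text rather than use a fixed word list.

```python
import re

# Tiny hand-made lexicon, for illustration only.
LEXICON = {
    "man": "NOUN",
    "hell": "NOUN",
    "garden": "NOUN",
    "see": "VERB",
    "warned": "VERB",
    "never": "ADV",
}


def pos_tag(sentence: str) -> list:
    """Split a sentence into word tokens and label each one from the lexicon."""
    tokens = re.findall(r"[a-z']+", sentence.lower())
    # Words missing from the lexicon get the placeholder tag "UNK".
    return [(tok, LEXICON.get(tok, "UNK")) for tok in tokens]


print(pos_tag("Man shall see"))
# [('man', 'NOUN'), ('shall', 'UNK'), ('see', 'VERB')]
```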
XML Generator:
This module takes the Quran document and converts it into
the XML file format used by the CEQSE framework. XML is
widely used for the transmission of data in all
types of applications because of its popularity for storing and
describing information. The XML file is then passed
to the Ontology Extractor module.
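A conversion of this kind might look like the following sketch, which uses Python's standard `xml.etree.ElementTree` module. The element and attribute names (`quran`, `verse`, `chapter`, `number`) and the `verses_to_xml` function are assumptions for illustration; the text does not specify the CEQSE schema, and the verse texts below are placeholders.

```python
import xml.etree.ElementTree as ET


def verses_to_xml(verses: list) -> str:
    """Serialize (chapter, verse, text) tuples into an XML string."""
    root = ET.Element("quran")
    for chapter, verse, text in verses:
        node = ET.SubElement(
            root, "verse", chapter=str(chapter), number=str(verse)
        )
        node.text = text
    return ET.tostring(root, encoding="unicode")


# Placeholder verse texts, not actual translations.
sample = [
    (1, 1, "placeholder text of verse 1:1"),
    (1, 2, "placeholder text of verse 1:2"),
]
xml_text = verses_to_xml(sample)
print(xml_text)
```

Keeping chapter and verse numbers as attributes lets a later module address a verse without parsing its text.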
Morphological Analysis:
The first task of this module is to filter out of the verse or query those
words that occur most frequently, as they have
very little discriminating power for retrieving the relevant concept from the
Ontological Knowledgebase. Because a document or query contains
many morphological variations, the module is then used to
extract the constituent morphemes of each word. As a result, the
module reduces words to their stem or root form.
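The two steps described above, removing very frequent words and then reducing the rest to stems, can be sketched as follows. The stop-word list and the suffix-stripping rule are deliberately crude assumptions for English text; `STOP_WORDS`, `SUFFIXES`, `stem` and `normalize_query` are hypothetical names, and this is not the morphological analyser the paper has in mind.

```python
STOP_WORDS = {"the", "of", "is", "in", "and", "a", "to"}  # illustrative subset
SUFFIXES = ("ing", "ed", "es", "s")  # longer suffixes are tried first


def stem(word: str) -> str:
    """Strip one common English suffix, keeping at least three letters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def normalize_query(query: str) -> list:
    """Drop stop words, then reduce the remaining words to crude stems."""
    words = [w for w in query.lower().split() if w not in STOP_WORDS]
    return [stem(w) for w in words]


print(normalize_query("the punishment of sinners"))
# ['punishment', 'sinner']
```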
Table 1
Concept          Search Truth  Guided Ways  Islamicity  Corpus Quran  Al-Islam  Actual Result
Allah                       0            1          11             0         1             42
Paradise                    0            0          15             0         0             14
Garden                      3            0           5             0         0             14
Hell                        1            1           2             1         1              2
Sea                         2            1           2             1         0              4
Punishment                  0            0           0             0         0              0
Jinn                        6            3           5             5         6              6
Man                         6            6           4             5         0             36
Water                       1            1           3             1         3              0
Earth                       4            4           1             3         4              3
Believer                    0            0           2             0         0              0
Sinner/Criminal             0            0           0             0         0              3
DISCUSSION
The concept formulated in the proposed model and
algorithm is to provide concept based Quranic searching. To
justify the proposed model and algorithm, three parameters have
been selected: efficiency, accuracy and unbiased
searching of the Quranic text. Many Quranic software packages
and databases are currently available that
perform good searching. However, these tools can only
find the most significant words in a query and then
retrieve from the database those verses of the Quran that contain
such keywords [4-8, 11-12], whether or not
they are required. Critical analysis of these tools shows that
keyword based searching cannot
answer the user's query properly, as there are many verses
in the Quran that do not contain a given word explicitly
yet carry many hidden concepts. For instance,
Sura e Rehman contains many different concepts, and it is
clear from table 1 that different searching tools return
different results for any particular word.
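One way to make the differences in Table 1 concrete is to sum, for each tool, the absolute gap between its count and the actual result across all concepts. This measure is an assumption introduced here for illustration and is not a metric used in the paper; the figures are copied from Table 1, and the names `ACTUAL`, `TOOLS` and `total_abs_error` are hypothetical.

```python
# Counts per concept, in the order: Allah, Paradise, Garden, Hell, Sea,
# Punishment, Jinn, Man, Water, Earth, Believer, Sinner/Criminal.
ACTUAL = [42, 14, 14, 2, 4, 0, 6, 36, 0, 3, 0, 3]
TOOLS = {
    "Search Truth": [0, 0, 3, 1, 2, 0, 6, 6, 1, 4, 0, 0],
    "Guided Ways": [1, 0, 0, 1, 1, 0, 3, 6, 1, 4, 0, 0],
    "Islamicity": [11, 15, 5, 2, 2, 0, 5, 4, 3, 1, 2, 0],
    "Corpus Quran": [0, 0, 0, 1, 1, 0, 5, 5, 1, 3, 0, 0],
    "Al-Islam": [1, 0, 0, 1, 0, 0, 6, 0, 3, 4, 0, 0],
}


def total_abs_error(counts: list, actual: list) -> int:
    """Sum of absolute differences between a tool's counts and the actual ones."""
    return sum(abs(c - a) for c, a in zip(counts, actual))


errors = {name: total_abs_error(counts, ACTUAL) for name, counts in TOOLS.items()}
for name, err in sorted(errors.items(), key=lambda item: item[1]):
    print(f"{name}: {err}")
```

A smaller total means the tool's counts sit closer to the actual results; the measure says nothing about whether the retrieved verses are the right ones.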
Other than accuracy, another critical issue is that this
cycle is repeated every time a concept is
searched, which results in slow and seemingly wrong results.
In contrast, the proposed model extracts concepts from the
verse and tags each pronoun with the particular noun that
it represents.
There is software [16] that performs ontology based
Quranic search using NLP for knowledge acquisition, but its
limitation is that it consults Hadith to
obtain answers for many concepts that are not clearly
revealed in the Quran. This is problematic because
there are many Hadith books and every writer
provides his own interpretation of different Hadith, so
the concepts extracted for a user
query may confuse the user by providing different
interpretations for a single search. There is also software
with predefined concepts [15,14] that gives a
fixed answer to every user search: for example, if it
has the concept "ALLAH is one", then whenever the user searches for ALLAH
it returns the same verse, assuming that all concepts
of the Quran have been listed. However, the Quran is regarded as a living
book among Muslims, and scholars are still carrying out a lot of research
to understand Quranic concepts, so there is
a clear need to update the concepts and to bind new
concepts to relevant verses. CEQSE provides a theoretical
framework with the ability to refine its search by
evolving and improving its concepts through user validation.
It is therefore able to provide efficient searching with better
understanding of Quranic verses, delivering a deep
understanding of their meanings.
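The idea of refining concepts through user validation can be sketched as a store in which each link between a concept and a verse carries a score that feedback raises or lowers. This is only one possible reading of the framework: the `ConceptIndex` class, the scoring rule of plus or minus one, and the verse identifiers are all hypothetical.

```python
from collections import defaultdict


class ConceptIndex:
    """Hypothetical store linking concepts to verses with a feedback score."""

    def __init__(self) -> None:
        self._scores = defaultdict(dict)  # concept -> {verse_ref: score}

    def link(self, concept: str, verse_ref: str) -> None:
        # Bind a verse to a concept with a neutral starting score.
        self._scores[concept].setdefault(verse_ref, 0)

    def record_feedback(self, concept: str, verse_ref: str, relevant: bool) -> None:
        # Raise the score for a confirmed verse, lower it for a rejected one.
        current = self._scores[concept].get(verse_ref, 0)
        self._scores[concept][verse_ref] = current + (1 if relevant else -1)

    def search(self, concept: str) -> list:
        # Return verses with a non-negative score, highest score first.
        links = self._scores.get(concept, {})
        ranked = sorted(links.items(), key=lambda item: -item[1])
        return [ref for ref, score in ranked if score >= 0]


index = ConceptIndex()
index.link("Hell", "verse-A")
index.link("Hell", "verse-B")
index.record_feedback("Hell", "verse-B", relevant=True)
index.record_feedback("Hell", "verse-A", relevant=False)
print(index.search("Hell"))  # ['verse-B']
```

Because new links can be added at any time, the set of concepts is not fixed in advance, which is the contrast the text draws with tools that rely on predefined concepts.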
CONCLUSION
This paper proposes a theoretical framework that provides a
comprehensive basis for implementing a semantic based
concept extraction engine for Quranic search. Effective
search results give the user a better understanding of the underlying
concepts of Quranic verses. In future, an application
will be developed for Quranic search using this framework,