Construction of Durian Dataset from Web Collection for Query Reformulation Research
Azilawati Azizan1, Zainab Abu Bakar2, Nurazzah Abdul Rahman3
1Azilawati Azizan, Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Perak Branch, Tapah Campus, Tapah Road, Perak, Malaysia.
2Zainab Abu Bakar, Al-Madinah International University, Shah Alam, Selangor, Malaysia.
3Nurazzah Abd Rahman, Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia.
Manuscript received on 11 October 2019 | Revised Manuscript received on 20 October 2019 | Manuscript Published on 02 November 2019 | PP: 630-634 | Volume-8 Issue-2S11 September 2019 | Retrieval Number: B10980982S1119/2019©BEIESP | DOI: 10.35940/ijrte.B1098.0982S1119
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Research in Information Retrieval (IR) has a long history and continues to thrive. Most IR studies rely on available standard datasets for testing and evaluation, and new datasets continue to appear to meet the needs of specific studies. However, to the best of our knowledge, no dataset collected from web documents focuses on the fruit domain. In this paper, we therefore contribute a dataset of web documents about fruit, with a focus on durian. The durian dataset is suitable for query reformulation experiments, search systems, web information retrieval and other search engine experiments. It contains a collection of web documents on fruit and durian, a collection of queries and a set of relevance judgements. In addition, we publish a list of frequently asked queries about durian and an extended list of query characteristic categories.
Keywords: Dataset Construction, Durian, Relevance Judgement, Test Query, Web Collection.
Scope of the Article: Web Technologies
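Illustrative sketch: the abstract describes a test collection made of three parts (web documents, queries and relevance judgements). The minimal Python sketch below shows one way such a collection could be used to compare an original query with a reformulated one. All identifiers, document IDs, rankings and judgements here are hypothetical toy values chosen for illustration; they are not taken from the durian dataset, and precision at k is only one of several measures that could be applied.

```python
from typing import Dict, List, Set

# Hypothetical relevance judgements: query ID -> set of relevant document IDs.
# These toy values are assumptions, not data from the published dataset.
qrels: Dict[str, Set[str]] = {
    "q1": {"d1", "d3"},
}


def precision_at_k(ranked: List[str], relevant: Set[str], k: int) -> float:
    """Return the fraction of the top-k ranked documents judged relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant)
    return hits / k


# Hypothetical ranked results returned by some search system for the
# original query and for a reformulated version of the same query.
original_run = ["d2", "d1", "d4", "d3"]
reformulated_run = ["d1", "d3", "d2", "d4"]

p_original = precision_at_k(original_run, qrels["q1"], k=2)
p_reformulated = precision_at_k(reformulated_run, qrels["q1"], k=2)

print(f"P@2 original:     {p_original:.2f}")
print(f"P@2 reformulated: {p_reformulated:.2f}")
```

In this toy setting, a higher precision at k for the reformulated run would suggest that the reformulation helped for that query; a real experiment would average over many queries and use the dataset's own judgements.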