Olex-Genetic Algorithm Based Information Retrieval Model from Historical Document Images
N. Vanjulavalli
Dr. N. Vanjulavalli, *Asst. Professor, Dept of Computer Science Annai College of Arts and Science affiliated to Bharathidasan University, India.
Manuscript received on November 20, 2019. | Revised Manuscript received on November 26, 2019. | Manuscript published on 30 November, 2019. | PP: 3350-3356 | Volume-8 Issue-4, November 2019. | Retrieval Number: C6283098319/2019©BEIESP | DOI: 10.35940/ijrte.C6283.118419
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Presently, the process of retrieving the historical documents is treated as an important challenge due to the fact that the document possessesindividual structure and level of deprivation. The textual characters present in the printouts come together with the typographical objects. The retrieval and perusal of the visual typographical objects indicates that the content of heritage documents helps to effectively interpret the documents. The extraction of the visual objects finds useful in the interpretation and conveyance of more details regarding diverse practices of demonstration in past documents and the impact in the present status of publication. A pair of essential typographical objects linked to the history of knowledge and information is footnotes and tables. In this paper, the main intention is the detection of the existence of the visual elements from historical printable documents. A new Olex-GA based footnote recognition model (OFR) is developed. The footnote detection model make use of a collection of layout-based models for the extraction of few features concerning to the font and appearance. In addition, Olex-GA algorithm is for the classification purposes. This model is validated using a massive set of 18thcentury printed documents with higher than 32 million images, and the outcome showed their effective outcome of the presented model.
Keywords: Information Retrieval; Historical Documents; Olexga; Classification.
Scope of the Article: Classification.