A Conceptual Dependency Graph Based Keyword Extraction Model for Source Code to API Documentation Mapping
Nakul Sharma1, Prasanth Yalla2
1Nakul Sharma, Research Scholar, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Guntur (Dist), India
2Prasanth Yalla, Professor, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Guntur (Dist), India.
Manuscript received on 10 March 2019 | Revised Manuscript received on 18 March 2019 | Manuscript published on 30 July 2019 | PP: 5888-5895 | Volume-8 Issue-2, July 2019 | Retrieval Number: B1092078219/2019©BEIESP | DOI: 10.35940/ijrte.B1092.078219
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Natural language processing on software systems usually contain high dimensional noisy and irrelevant features which lead to inaccurate and poor contextual similarity between the project source code and its API documentation. Most of the traditional source code analysis models are independent of finding and extracting the relevant features for contextual similarity. As the size of the project source code and its related API documentation increases, these models incorporate the contextual similarity between the source code and API documentation for code analysis. One of the best solutions for this problem is finding the essential features using the source code dependency graph. In this paper, the dependency graph is used to compute the contextual similarity computation between the source code metrics and its API documents. A novel contextual similarity measure is used to find the relationship between the project source code metrics to the API documents. Proposed model is evaluated on different project source codes and API documents in terms of pre-processing, context similarity and runtime. Experimental results show that the proposed model has high computational efficiency compared to the existing models on the large size datasets.
Index Terms: Contextual Similarity, Natural Language Processing, Text Mining, Code analysis, Dependency Graph.
Scope of the Article: Text Mining