Sqoop usage in Hadoop Distributed File System and Observations to Handle Common Errors
K. UmaPavan Kumar1, S V N Srinivasu2, M N. Nachappa3
1Dr K Uma Pavan, Assoicate professor CS IT, Jain Univrsity, Bangalore,
2Dr. M N. Nachappa, Professor and Academic Head, School of CS and IT, Jain University,
3Dr S.V.N. Srinivasu, Professor CSE, Narasaraopeta Engineering College (Autonomous), Narasaraopet, A.P.
Manuscript received on October 06, 2020. | Revised Manuscript received on October 25, 2020. | Manuscript published on November 30, 2020. | PP: 452-454 | Volume-9 Issue-4, November 2020. | Retrieval Number: 100.1/ijrte.D4980119420 | DOI: 10.35940/ijrte.D4980.119420
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: The Hadoop framework provides a way of storing and processing the huge amounts of the data. The social media like Facebook, twitter and amazon uses Hadoop eco system tools so as to store the data in Hadoop distributed file system and to process the data Map Reduce (MR). The current work describes the usage of Sqoop in the process of import and export with HDFS. The work involves various possible import/export commands supported by the tool Sqoop in the eco system of Hadoop. The importance of the work is to highlight the common errors while installing Sqoop and working with Sqoop. Many developers and researchers were using Sqoop so as to perform the import/export process and to handle the source data in the relational format. In the current work the connectivity between mysql and sqoop were presented and various commands usage along with the results were presented. The outcome of the work is for each command the possible errors encountered and the corresponding solution is mentioned. The common configuration settings we have to follow so as to handle the Sqoop without any errors is also mentioned.
Keywords: Import, Export, Hadoop, Sqoop, Configuration, Relational Source.