Sqoop Introduction - Sqoop

What is Sqoop?

Sqoop is a traditional application management system, designed for the interaction of applications with relational database using RDBMS to generate Big Data. Big Data which is generated by RDBMS will be stored in Relational Database Servers in the relational database structure.
While,Big Data storages and analyzers like MapReduce, Hive, HBase, Cassandra, Pig, etc. of the Hadoop ecosystem exists there is a requirement of tool to transfer data in between the relational database servers. Here, Sqoop supports the Hadoop ecosystem to provide feasible interaction between relational database server and Hadoop’s HDFS.

Sqoop: “SQL to Hadoop and Hadoop to SQL”

Sqoop is a data transfer tool designed by Apache Software Foundation to transfer data between Hadoop and relational database servers. It enables data transfer relational databases like MySQL, Oracle to Hadoop HDFS, and export from Hadoop file system to relational databases.

How Sqoop Works?

Below mentioned image describes the workflow of Sqoop.
how sqoop works

How Sqoop Import tool works?

Sqoop import tool imports individual tables from RDBMS to HDFS. Each and every row in a table is considered as a record in HDFS. Stored records like text data in text files or as binary data in Avro and Sequence files.

How Sqoop Export tool works?

Sqoop export tool exports a set of files from HDFS back to an RDBMS. The files in Sqoop records are called as rows in table. Those are read and parsed into a set of records and delimited with user-specified delimiter.

All rights reserved © 2018 Wisdom IT Services India Pvt. Ltd DMCA.com Protection Status

Sqoop Topics