Sqoop is available in a few places. The primary home of the project is http://github.com/ cloudera/sqoop. This repository contains all the Sqoop source code and documentation.

Official releases are available at this site, as well as the source code for the version currently under development. The repository itself contains instructions for compiling the project. Alternatively, Cloudera’s Distribution for Hadoop contains an installation package for Sqoop alongside compatible editions of Hadoop and other tools like Hive.If you download a release from github, it will be placed in a directory such as /home/ yourname/sqoop-x.y.z/. We’ll call this directory $SQOOP_HOME. You can run Sqoop by running the executable script$SQOOP_HOME/bin/sqoop.

If you’ve installed a release from Cloudera, the package will have placed Sqoop’s scripts in standard locations like /usr/bin/sqoop. You can run Sqoop by simply typing sqoop at the command line. (Regardless of how you install Sqoop, we’ll refer to this script as just sqoop from here on.)

Running Sqoop with no arguments does not do much of interest:

Sqoop is organized as a set of tools or commands. Without selecting a tool, Sqoop does not know what to do. help is the name of one such tool; it can print out the list of available tools, like this:

Available commands:

See 'sqoop help COMMAND' for information on a specific command.

As it explains, the help tool can also provide specific usage instructions on a particular tool, by providing that tool’s name as an argument:

...

An alternate way of running a Sqoop tool is to use a tool-specific script. This script will be named sqoop-toolname. For example, sqoop-help, sqoop-import, etc. These commands are identical to running sqoop help or sqoop import.