Hadoop Tutorial


What is Hadoop tutorial?

Hadoop is an open-source framework which provides storage and big data processing in a distributed environment in various clusters of computers with simple programming models. It offers local computation and storage from single servers. This tutorial provides basic understanding about Big Data, MapReduce algorithm, and Hadoop Distributed File System.

Audience

This tutorial is prepared for the professionals who wish to learn the basics of Big Data Analytics using Hadoop Framework and become a Hadoop Developer. It is useful for Software Professionals, Analytics Professionals, and ETL developers.

Prerequisites

To learn this tutorial one must have prior knowledge of Core Java, database concepts, and any of the Linux operating system flavors.

Next Topics »
Data Sources
Data Storage And Analysis
Comparison With Other Systems
Meet Hadoop Data Sources Data Storage And Analysis Comparison With Other Systems A Brief History Of Hadoop Mapreduce A Weather Dataset Analyzing The Data With Hadoop Scaling Out Hadoop Streaming Hadoop Pipes The Hadoop Distributed Filesystem The Design Of Hdfs Hdfs Concepts The Command-line Interface Hadoop Filesystems The Java Interface Anatomy Of A File Read Parallel Copying With Distcp Hadoop Archives Hadoop I/o Data Integrity Compression Hadoop Serialization File-based Data Structures Developing A Mapreduce Application The Configuration Api Configuring The Development Environment Writing A Unit Test Running Locally On Test Data Running On A Cluster Tuning A Job Mapreduce Workflows How Mapreduce Works Anatomy Of A Mapreduce Job Run Failures Job Scheduling Shuffle And Sort Task Execution Mapreduce Types And Formats Mapreduce Types Input Formats Output Formats Mapreduce Features Counters Different Ways Of Sorting Datasets How Does Joins Performs Side Data Distribution Mapreduce Library Classes Setting Up A Hadoop Cluster Cluster Specification Cluster Setup And Installation Ssh Configuration Hadoop Configuration Hadoop Security Benchmarking A Hadoop Cluster Hadoop In The Cloud Administering Hadoop Hdfs Monitoring Maintenance Pig Installing And Running Pig An Example To Write Hadoop Comparison With Databases Pig Latin User-defined Functions- Filter Udf Data Processing Operators Pig In Practice Hive Installing Hive How To Use Hive To Run A Query Running Hive Comparison With Traditional Databases Hiveql Tables Querying Data User-defined Functions Have Query Hbase Hbasics Concepts How To Install Hadoop? Clients Example Of Hadoop Tools Hbase Versus Rdbms Praxis Zookeeper Installing And Running Zookeeper Servers In Hadoop The Zookeeper Service Building Applications With Zookeeper Zookeeper In Production Sqoop Getting Sqoop A Sample Import Generated Code Database Imports: A Deeper Look Working With Imported Data Importing Large Objects Performing An Export Exports: A Deeper Look Hadoop Interview Questions Hadoop Practice Tests