Apache Tajo Tutorial

Apache Tajo Tutorial

What do you understand by the term Apache Tajo?

Apache Tajo is a robust big data relational and distributed data warehouse system for Apache Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation, and ETL (extract-transform-load process) on large-data sets stored on HDFS (Hadoop Distributed File System) and other data sources.

Initially, a South Korean infrastructure company, Gruter has started Tajo, later on the contributions to the Tajo project came from experts from Intel, Etsy, NASA, Cloudera, Hortonworks. In Korean language, an Ostrich is referred as Tajo. A top level open source Apache project was granted to Tajo in the year 2014.

The tutorial covers the basics of the concept of Tajo, the cluster setup, Tajo shell, SQL queries. It also covers the integration of Tajo with big data technologies and some illustrations.

What are the prerequisites required and who are the audience for learning the concept of Apache Tajo?

Good knowledge and understanding of Core Java, any of the Linux OS and DBMS are required to understand the concept of Apache Tajo.

This tutorial is mainly targeted for the professionals aspiring to make a career in Big Data Analytics using Apache Tajo. Anyone on completion of this tutorial gets complete knowledge and understanding about the concept of Apache Tajo

Apache Tajo Tutorial: List of Topics

All rights reserved © 2020 Wisdom IT Services India Pvt. Ltd DMCA.com Protection Status