7013 Hadoop Foundation

Course: 7013

This hands-on Hadoop course teaches Data Analysts, BI Analysts, BI Developers, SAS Developers and other analysts who need to answer questions about Big Data stored in a Hadoop cluster how to develop applications and analyze that data in Apache Hadoop using Hive.

  • Duration: 2 days
  • Price: $1,490.00
December 12 - 13, 9:00 AM – 4:00 PM CST

January 16 - 17, 9:00 AM – 4:00 PM CST

February 13 - 14, 9:00 AM – 4:00 PM CST

March 13 - 14, 9:00 AM – 4:00 PM CST

  • Virtual Instructor-Led Training
  • Complete Hands-on Labs
  • Softcopy of Courseware
  • Learning Labs
  • You can check out using your Purchase Card
  • GSA Contract Number: 47QTCA20D000D
  • Call 800-453-5961 for details
  • Customize your class
  • Delivery Onsite or Online for your organization
  • Choice of Dates when and where you want
  • Guidance in choosing and customizing your class

Course Overview Hadoop Foundation

This hands-on Hadoop course teaches Data Analysts, BI Analysts, BI Developers, SAS Developers and other analysts and students how to develop applications and analyze Big Data stored in Apache Hadoop using Hive. It is intended for students who need to answer questions about, and analyze, Big Data stored in a Hadoop cluster.

Students will also learn the details of Hadoop, YARN and the Hadoop Distributed File System (HDFS), get an overview of MapReduce, and take a deep dive into using Hive to perform data analytics on Big Data. Lab exercises use the Hortonworks Data Platform for Windows to issue HDFS commands that add and remove files and folders in HDFS, run and monitor MapReduce jobs, retrieve HCatalog schemas from within a Pig script, join datasets, and use advanced Hive features such as windowing, views and multi-file inserts.

Prerequisite

  • Students should be familiar with SQL and have a minimal understanding of programming principles. No prior Hadoop knowledge is required.

Agenda Topics

  • Understand the architecture of the Hadoop Distributed File System (HDFS) and how HDFS Federation works in Hadoop
  • Use the Hadoop client to input data into HDFS
  • Understand the various tools and frameworks in the Hadoop 2.0 ecosystem
  • Use Sqoop to transfer data between Hadoop and a relational database
  • Understand the architecture of MapReduce and run a MapReduce job on Hadoop 2.0
  • Define and implement Hive tables
  • Write efficient Hive queries and use Hive to run SQL-like queries to perform data analysis
  • Perform data analytics on Big Data using Hive
  • Use HCatalog with Hive
