Hadoop Foundation

Course: 7013


  • Duration: 2 days

  • Virtual instructor-led training
  • Complete hands-on labs
  • Softcopy of courseware
  • Learning labs
  • You can check out with your Purchase Card
  • GSA Contract Number: 47QTCA20D000D
  • Call 800-453-5961 for details
  • Customize your class
  • Delivery Onsite or Online for your organization
  • Choice of Dates when and where you want
  • Guidance in choosing and customizing your class

Question About this Course?

Note: For additional questions or clarification, contact Bill Ramirez at bill@dynamicsedge.com, by office phone at (510) 804-3600, or by cell at (415) 200-6969.

COURSE OVERVIEW:

This hands-on Hadoop course is for Data Analysts, BI Analysts, BI Developers, SAS Developers, and other analysts who need to answer questions about Big Data stored in a Hadoop cluster. It teaches them how to develop applications and analyze that data in Apache Hadoop using Hive. Students learn the details of Hadoop, YARN, and the Hadoop Distributed File System (HDFS), get an overview of MapReduce, and take a deep dive into using Hive to perform data analytics on Big Data.

In lab exercises on the Hortonworks Data Platform for Windows, students issue HDFS commands to add and remove files and folders, run and monitor MapReduce jobs, retrieve HCatalog schemas from within a Pig script, join datasets, and use advanced Hive features such as windowing, views, and multi-file inserts.

PREREQUISITES:

  • Students should be familiar with SQL and have a minimal understanding of programming principles. No prior Hadoop knowledge is required.

AGENDA TOPICS:

  • Understand the architecture of the Hadoop Distributed File System (HDFS) and how HDFS Federation works in Hadoop
  • Use the Hadoop client to input data into HDFS
  • Understand the various tools and frameworks in the Hadoop 2.0 ecosystem
  • Use Sqoop to transfer data between Hadoop and a relational database
  • Understand the architecture of MapReduce and run a MapReduce job on Hadoop 2.0
  • Understand how Hive tables are defined and implemented
  • Write efficient Hive queries and use Hive to run SQL-like queries to perform data analysis
  • Perform data analytics on Big Data using Hive
  • Use HCatalog with Hive
