Call +60 3-2711 7241 Email: malaysiacourses@tertiaryinfotech.com

HRDF Approved Training Provider in Malaysia - Modular Fast Track Skill-Based Trainings

Apache Hadoop Big Data Training

Hadoop is indispensable when it comes to processing big data, as necessary to understanding your information as servers are to storing it. This two-day crash course on Apache Hadoop and Big Data aims to give a good overview of, and familiarity with, Big Data tool sets such as Hadoop, MapReduce, Pig, Hive, Impala, Sqoop, Oozie, ZooKeeper, and Apache Spark. It explains Hadoop, its file system (HDFS), and its processing engine (MapReduce).

Topics include:

  • Understanding Hadoop core components: HDFS and MapReduce
  • Setting up your Hadoop development environment
  • Working with the Hadoop file system
  • Running and tracking Hadoop jobs
  • Tuning MapReduce
  • Understanding Hive and HBase
  • Exploring Pig tools
  • Building workflows
  • Using other libraries, such as Impala, Mahout, and Storm
  • Understanding Spark
  • Visualizing Hadoop output

HRDF SBL Claimable for Employers Registered with HRDF

Course Code: M448

Course Booking

MYR1,800.00 (GST-exclusive)

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note that the training venue is subject to change depending on class size and classroom availability.
Note that the minimum class size to start a class is 3 participants.


Course Details

Day 1

Module 1: Get Started on Apache Hadoop

  • Why Hadoop?
  • Difference between HBase and Hadoop

Module 2: Hadoop Core Components

  • Java Virtual Machine (JVM)
  • HDFS
  • Hadoop Cluster Components
  • Exploring Hadoop Platforms

Module 3: Setup Hadoop Development Environment

  • Setup Cloudera Hadoop VM
  • Adding Hadoop Libraries
  • Programming Languages

Module 4: MapReduce 2.0/YARN

  • What is MapReduce?
  • MapReduce Components
  • MapReduce on HDFS
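
As a rough illustration of the map, shuffle, and reduce phases that Module 4 covers, the sketch below counts words in plain Python. It is not Hadoop code and is not taken from the course materials: the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample sentences are assumptions made up for this example, and everything runs in a single process rather than on a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group emitted values by key, the step a framework performs between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence over an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, used only for illustration.
    sample = ["big data needs big tools", "Hadoop processes big data"]
    print(word_count(sample))
```

In a real Hadoop job the map and reduce functions would run in parallel across the cluster, with the input read from HDFS; here the same logic is collapsed into ordinary function calls to show the flow of data.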

Module 5: Hive

  • What is Hive?
  • Hive Queries
  • Analyzing data with Hive

Day 2

Module 6: Pig

  • What is Pig?
  • Pig Data types
  • Pig Commands

Module 7: Connectors and Workflows

  • Introducing Sqoop
  • Importing Data with Sqoop
  • Introducing Flume
  • Importing Data with Flume
  • Introducing ZooKeeper
  • Using ZooKeeper to coordinate workflows
  • Introducing Oozie
  • Scheduling jobs using Oozie

Module 8: Exploring Other Hadoop Libraries

  • Introducing Impala
  • Introducing Mahout
  • Introducing Storm

Module 9: Apache Spark Basics

  • Why Apache Spark?
  • Apache Spark Components
  • Apache Spark Commands

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Hadoop Administrators
  • Big Data Analysts

Prerequisite

Nil

Trainers

Hassan Keshavarz is a Ph.D. candidate at MJIIT. He holds Microsoft certifications and is a member of the IEEE. He currently works at Xchanging Malaysia, a DXC-YTL joint venture, as Technical Lead in the EDW team, serving as Big Data Administrator/Developer and Hadoop architect. Hassan has 3+ years of experience installing, configuring, and using the Hadoop ecosystem in the Enterprise Data Warehouse, reporting, and analytics domains. His expertise includes Apache Hadoop, Apache Hive, Sqoop, and Flume, as well as preparing study narrative documents and case reviews and reviewing studies performed by staff. He has strong written and verbal communication skills and is an invited reviewer for journals and conferences.

Swamy Gurram is a Cloudera Certified Developer for Apache Hadoop (CCDH). He has extensive hands-on experience with Hadoop technologies (HDFS, MapReduce, Hive, Pig, HBase, Sqoop, Flume, Impala, Knox, etc.), with Hadoop cluster installation and administration, and with virtual server installation. He has 10+ years of experience in technical architecture, onsite relationship management, and automotive business analysis. He also has experience with Hadoop, mainframe technologies, Visual Basic, and mainframe migration tools. His expertise spans the system life cycle, including analysis, technical design, testing, implementation, and documentation.

Syed Muhammad Farrukh Akhtar has more than 15 years of experience analysing, designing, developing, integrating, and managing large applications for diverse industries. He has worked in Dubai, Pakistan, Germany, and Malaysia, and has strong hands-on experience in software design, development, and integration on platforms such as IBM J2EE, Oracle, Microsoft .NET, Big Data, Hadoop, Spark, HBase, Hive, Sqoop, Flume, and NoSQL. He also has expertise in machine learning and deep learning with TensorFlow, Keras, and Python, and excellent skills in React, Ionic 2, Angular 2, mobile apps with React Native, and Node.js. He is highly knowledgeable in object-oriented software development, requirements analysis, and database design, and has a deep understanding of how open-source technologies apply to emerging business areas. He has excellent knowledge of the Rational Unified Process (RUP), Rational Software Architect, data modeling and mapping, and extensible system design using UML and Visio. His professional experience covers J2EE, JMS, WebSphere, Oracle, Spring, Hibernate, Struts, and 3-tier web-based application development.

Customer Reviews (5)

Will Recommend Review by Kartini Mohamed
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
To include practical training

The use of Bash scripts, Talend Studio for Big Data, and Tableau software could be included in the practical training. (Posted on 10/16/2018)
Will Recommend Review by Shamiel Hashim Ibrahim
Extend the training to 3 days minimum (Posted on 8/5/2018)
Will Recommend Review by Tamir
Nil (Posted on 6/25/2018)
Might Recommend Review by Wong Joon Keet
Include more hands-on practice (Posted on 6/19/2018)
Might Recommend Review by Ho Kin Yee
More in-depth coverage of Hadoop and its ecosystem instead of other industry use cases.

Hassan was nice and he's really knowledgeable about Hadoop; I just wish he'd show more technical demos and teach us more in depth about Hadoop's architecture itself. (Posted on 6/19/2018)

You May Be Interested In These Courses

  • Hadoop Data Analysis Training: MYR880.00 (GST-exclusive), 1 Review(s)
  • Big Data Analysis with Apache Hive: MYR880.00 (GST-exclusive)
  • Apache Solr Search Platform Training: MYR880.00 (GST-exclusive)
  • Apache HBase Training: MYR880.00 (GST-exclusive)
  • Apache Spark Essential Training: MYR880.00 (GST-exclusive)
  • Machine Learning with Apache Spark: MYR880.00 (GST-exclusive)
  • Create a Cross Platform Mobile Apps with Apache Cordova: MYR880.00 (GST-exclusive), 1 Review(s)
  • Basic Scala Training for Apache Spark: MYR880.00 (GST-exclusive)