Call +60 3-7490 2093 Email:

HRDF Approved Training Provider in Malaysia - Modular Fast Track Skill-Based Trainings

Hadoop Data Analysis Training

Hadoop is the cloud computing platform data scientists use to perform highly parallelized operations on big data.  In this course, you will learn how to analyze data using Pig, Hive and YARN. You will also learn how to configure the Hadoop distributed file system (HDFS), perform processing and ingestion using MapReduce, copy data from cluster to cluster, create data summarizations, and compose queries.

Course Highlights

  • Setting up and administrating clusters
  • Ingesting data
  • Working with MapReduce, YARN, Pig, and Hive
  • Selecting and aggregating large datasets
  • Defining limits, unions, filters, and joins
  • Writing custom user-defined functions (UDFs)
  • Creating queries and lookups


All participants will receive a Certificate of Completion from Tertiary Courses after achieved at least 75% attendance.

HRDF SBL Claimable for Employers Registered with HRDF

HRDF claimable

Course Code: M409

Course Booking


Course Date

Course Time

* Required Fields

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note the venue of the training is subject to changes due to class size and availability of the classroom.
Note the minimal class size to start a class is 3 Pax.

Course Details

Topic 1 Hadoop Introduction

  • HDFS
  • MapReduce
  • YARN
  • Data Ingestion

Topic 2 : Hive

  • Select and Aggregate
  • Sort and Limit
  • Filters
  • Joins

Topic 3: Pig

  • Select and Aggregate
  • Sort and Limit
  • Filters
  • Joins
  • UDF
  • Streaming UDF

Topic 4 : HBase

  • What is NoSQL
  • CAP
  • Introduction to Hbase
  • Data Analysis with HBase

Course Admin


This is beginner course. No Hadoop or SQL knowledge is required.:

Software Requirement

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Big Data Analysts


Python TrainerSaeid Alizadeh is a technopreneur specialized in field of IOT (Internet of things), Building Management System (BMS), Building Automation System (BAS), Automotive Hydroponics Systems, and generally sense, monitor and control mechanical and electrical equipment such as ventilation, lighting, power systems, fire systems, and security systems.

Saeid’s past experience on IoT application include:

  1. Smart Building System
  2. Energy Monitoring, Controlling and Saving
  3. Environmental Monitoring 
  4. Flood detection and prediction system based on IoT and big data 
  5. Online Weather Station Based on IoT
  6. Smart Farming System (Long Range Wireless Sensor Networks)
  7. Smart Hydroponic System Based on IoT
  8. IP TV and Digital Signage System
  9. RFID Solution 
  10. GPS Tracking System
  11. Remote Sensing

Tableau TrainerLee Cheong Loong is a manager with an EMBA – 21 years working experience indifference role/ Dept. (Sales, Logistic, IT, Online Store, Internal Trainer, and Customer services), involved in multiple IT project (Server P2V upgrade, Network infra upgrade, and ERP system Deployment). He has switched career to Project management role, Big - data / Data scientist related project, with Professional Certification Big Data & Analytics, Professional Certificate in Tableau, & Pass the HRDF Certified Trainer Programme. He also involves in AI chatbot and application development to help small-medium practitioner (accounting industry).

Big data trainerRupesh Nanglia(RUPS) has more than 13 years' experience in software development/architecture and system integration. Has architected solutions in agile and scrum development environments. Is adept in Big Data Analytics, Internet of Things (IoT), Project Management, as well as Cloud Computing and Network Analysis. Moreover, while his on-the-job experience has afforded him a well-rounded skill set. He is an expert at Apache Hadoop, No SQL DB(Cassandra, MongoDB, Hbase, Neo4j...), Spark, Kafka, Storm, R, Data Analytics, Social Media Analytics, OpenStack Cloud Computing Platform, preparing study narratives documents and case reviews, Perform the tasks of reviewing studies performed by the staff. It is also working ‎ on the Data Science field as a trainer and Data Scientist. Has worked on Machine Learning and Process ‎ Mining projects. Strong written and verbal communication skills.Successful background working with stakeholders to develop and implement an architecture framework that aligns strategy, processes, and IT assets with a business goal. Has been associated with the training industry for more than half a decade.

Customer Reviews (1)

Will Recommend Review by Course Participant/Trainee
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
More exercises...hands on (Posted on 22/10/2016)

Write Your Own Review

You're reviewing: Hadoop Data Analysis Training

How do you rate this product? *

  1 star 2 stars 3 stars 4 stars 5 stars
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
  • Reload captcha
    Attention: Captcha is case sensitive.

Product Subjects

Use spaces to separate Subjects. Use single quotes (') for phrases.

You May Be Interested In These Courses