Call +60 3-2711 7241 Email: sales@tertiarycourses.com.my

HRDF Approved Training Provider in Malaysia - Modular Fast Track Skill-Based Trainings

Hadoop Data Analysis Training

Hadoop is an open-source distributed computing framework that data scientists use to perform highly parallelized operations on big data. In this course, you will learn how to analyze data using Pig and Hive, with YARN managing cluster resources. You will also learn how to configure the Hadoop Distributed File System (HDFS), perform processing and ingestion using MapReduce, copy data from cluster to cluster, create data summarizations, and compose queries.

Course Highlights

  • Setting up and administering clusters
  • Ingesting data
  • Working with MapReduce, YARN, Pig, and Hive
  • Selecting and aggregating large datasets
  • Defining limits, unions, filters, and joins
  • Writing custom user-defined functions (UDFs)
  • Creating queries and lookups

Certificate

All participants will receive a Certificate of Completion from Tertiary Courses after achieving at least 75% attendance.

HRDF SBL Claimable for Employers Registered with HRDF

Course Code: M409

Course Booking

MYR880.00

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note that the venue of the training is subject to change depending on class size and classroom availability.
Note that the minimum class size to start a class is 3 pax.


Course Details

Module 1: Hadoop Introduction

  • HDFS
  • MapReduce
  • YARN
  • Data Ingestion
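
As a rough illustration of the MapReduce model listed in this module, the sketch below runs a word count through map, shuffle, and reduce steps in plain Python. It does not use Hadoop or any Hadoop API, and it is not taken from the course material; the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical and chosen only for this example.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, mimicking the step between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence on an in-memory list of lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data big clusters", "data on clusters"]
    print(word_count(sample))
```

On a real cluster the map and reduce functions would run in parallel on different nodes, with the framework handling the shuffle; here everything runs in one process to keep the idea visible.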

Module 2: Hive

  • Select and Aggregate
  • Sort and Limit
  • Filters
  • Joins

Module 3: Pig

  • Select and Aggregate
  • Sort and Limit
  • Filters
  • Joins
  • UDF
  • Streaming UDF

Module 4: HBase

  • What is NoSQL
  • CAP
  • Introduction to HBase
  • Data Analysis with HBase

Course Admin

Prerequisite

This is a beginner course. No Hadoop or SQL knowledge is required.

Software Requirement

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Big Data Analysts

Trainers

Hassan Keshavarz has more than 5 years' experience as a computer engineer and researcher. He is adept in Big Data Analytics, Internet of Things (IoT), Project Management, Cloud Computing, and Network Analysis, and his on-the-job experience has afforded him a well-rounded skill set. His expertise includes Apache Hadoop and the OpenStack cloud computing platform, preparing study narrative documents and case reviews, and reviewing studies performed by staff. He has strong written and verbal communication skills and is an invited reviewer for journals and conferences.

Swamy Gurram is a Cloudera Certified Developer for Apache Hadoop (CCDH). He has extensive hands-on experience with Hadoop technologies (HDFS, MapReduce, Hive, Pig, HBase, Sqoop, Flume, Impala, Knox, etc.) and with Hadoop cluster installation and administration, along with virtual server installation. He has 10+ years of experience in technical architecture, onsite relationship management, and automotive business analysis. He also has experience in Hadoop, mainframe technologies, Visual Basic, and mainframe migration tools. He has expertise in system life cycle skills comprising analysis, technical design, testing, implementation, and documentation.

Tarun Sukhani is an IT executive, educator, author, speaker, data scientist, security expert, agile coach, polyglot coder, and entrepreneur with over 19 years of combined professional experience both in the U.S. and internationally. As a seasoned veteran, his expertise lies in leading teams in the design and delivery of highly scalable, concurrent, and performant enterprise software solutions with budgets of up to $100 million. He is particularly adept at building productive, self-managing agile teams with predictable velocities and delivery timeframes.

Tarun Sukhani is skilled in all phases of the SDLC/ALM, with a solid foundation in Agile (XP, SAFe, Lean, Scrum, Kanban, and Scrumban) and traditional (PMI and PRINCE2) project management frameworks and methodologies.

He is proficient in Big Data and Data Science tools: Hadoop, Pig, Hive, HBase, Spark, R/Rattle, Cassandra, YARN, ZooKeeper, Mahout, SimpleCV, and OpenCV.

Syed Muhammad Farrukh Akhtar has more than 15 years of experience analyzing, designing, developing, integrating, and managing large applications for diverse industries. He has worked in Dubai, Pakistan, Germany, and Malaysia, and has strong hands-on experience of software design, development, and integration on different platforms such as IBM J2EE, Oracle, and Microsoft .NET, as well as Big Data, Hadoop, Spark, HBase, Hive, Sqoop, Flume, and NoSQL. He also has expertise in Machine Learning and Deep Learning with TensorFlow, Keras, and Python, and excellent skills in React, Ionic 2, Angular 2, mobile apps with React Native, and Node.js.

He is highly knowledgeable in object-oriented software development, requirements analysis, and database design, and has a deep understanding of the applicability of open source technologies in emerging business areas. He has excellent knowledge of the Rational Unified Process (RUP), Rational Software Architect, data modeling and mapping, and extensible system design using UML and Visio. He has professional experience with J2EE, JMS, WebSphere, Oracle, Spring, Hibernate, Struts, and 3-tier web-based application development.

Customer Reviews (1)

Will Recommend
Review by Dwight
More exercises...hands on (Posted on 22/10/2016)
