Call +60 3-7490 2093

HRDF Approved Training Provider in Malaysia - Modular Fast Track Skill-Based Trainings

Big Data Analysis with Apache Hive

Apache Hive is a tool of choice for many data scientists because it lets them use SQL, a familiar syntax, to derive insights from data stored in Hadoop and to find the information businesses need to plan effectively. This course shows how to use Hive to process, structure, and optimize your data. It also shows how to use HUE, the Hadoop user interface, to run HiveQL when analyzing data.

Course Highlights

  • Defining data structures in Hive
  • Selecting data
  • Joining tables
  • Manipulating data
  • Filtering results
  • Aggregating data
  • Using built-in aggregate functions
  • Mastering built-in table-generating functions
  • Using CUBE and ROLLUP
  • Using clauses: WHERE and HAVING
  • Using LIKE, JOIN, and SEMI JOIN
  • Using functions: String, math, date, and conditional


All participants will receive a Certificate of Completion from Tertiary Courses after achieving at least 75% attendance.

HRDF SBL Claimable for Employers Registered with HRDF

Course Code: M410

Course Cancellation/Reschedule Policy

We reserve the right to cancel or reschedule the course due to unforeseen circumstances. If the course is cancelled, participants will receive a 100% refund.
Note that the training venue is subject to change depending on class size and classroom availability.
Note that the minimum class size to start a class is 3 pax.

Course Details

Topic 1: Get Started on Apache Hive

  • What is Hive?
  • How Hive Works with Hadoop
  • Install CDH on VirtualBox
  • Hue 4 UI Overview

Topic 2: Basic Hive Operations

  • Create and Drop Database
  • Create and Drop Table
  • Create Table from CSV File
  • Alter Table
  • Fix CSV File with SerDe
  • Load Data to Empty Table
  • Partition Tables

Topic 3: HiveQL

  • Retrieve Data with SELECT
  • SELECT Options
  • Operators and Built-in Functions
  • Filter Data with WHERE

Topic 4: Aggregating Data

  • Hive Aggregations
  • Having
  • Grouping Sets
  • Cube & Rollup
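
To give a feel for what the Grouping Sets, Cube and Rollup items cover, here is a minimal Python sketch of the grouping sets that ROLLUP and CUBE expand to. It is an illustration only, not Hive code and not course material: the sales rows, the column names and the helper functions are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical sales rows: (region, product, amount). Not from the course.
ROWS = [
    ("north", "apple", 10),
    ("north", "pear", 5),
    ("south", "apple", 7),
]
COLUMNS = ("region", "product")


def rollup_sets(columns):
    """Grouping sets for ROLLUP: every prefix of the column list, longest first."""
    return [tuple(columns[:n]) for n in range(len(columns), -1, -1)]


def cube_sets(columns):
    """Grouping sets for CUBE: every subset of the column list."""
    return [
        tuple(subset)
        for n in range(len(columns), -1, -1)
        for subset in combinations(columns, n)
    ]


def aggregate(rows, columns, grouping_sets):
    """Sum the amount per grouping set; columns outside the set become None (NULL)."""
    totals = defaultdict(int)
    for row in rows:
        record = dict(zip(columns, row))  # the trailing amount is not a grouping column
        amount = row[-1]
        for gset in grouping_sets:
            key = tuple(record[c] if c in gset else None for c in columns)
            totals[key] += amount
    return dict(totals)


if __name__ == "__main__":
    print(aggregate(ROWS, COLUMNS, rollup_sets(COLUMNS)))
    print(aggregate(ROWS, COLUMNS, cube_sets(COLUMNS)))
```

With two grouping columns, ROLLUP yields three grouping sets (both columns, region only, grand total), while CUBE adds the product-only subtotal as a fourth.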

Topic 5: Joining Tables

  • Combining Tables with JOIN
  • Joining Multiple Tables
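
As a rough illustration of how an inner JOIN differs from the LEFT SEMI JOIN listed in the course highlights, the Python sketch below mimics both on small in-memory tables. The customer and order rows and the function names are hypothetical; this models the result sets only, not how Hive executes a join.

```python
# Hypothetical tables, not from the course.
CUSTOMERS = [(1, "Aina"), (2, "Ben"), (3, "Chen")]  # (customer_id, name)
ORDERS = [(101, 1), (102, 1), (103, 3)]             # (order_id, customer_id)


def inner_join(customers, orders):
    """One output row per matching (customer, order) pair."""
    return [
        (cid, name, oid)
        for cid, name in customers
        for oid, order_cid in orders
        if order_cid == cid
    ]


def left_semi_join(customers, orders):
    """Each customer with at least one order, once, with left-side columns only."""
    customer_ids_with_orders = {order_cid for _, order_cid in orders}
    return [(cid, name) for cid, name in customers if cid in customer_ids_with_orders]


if __name__ == "__main__":
    print(inner_join(CUSTOMERS, ORDERS))
    print(left_semi_join(CUSTOMERS, ORDERS))
```

The inner join repeats a customer once per matching order, whereas the semi join returns each matching customer a single time and exposes no columns from the orders table.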

Topic 6: Data Analysis with Apache Hive

  • Math Functions
  • String Functions
  • Date Functions
  • Conditional Statements

Course Admin


This is a beginner course. No Hadoop or SQL knowledge is required.

Software Requirement

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Big Data Analysts


Big data trainer Rupesh Nanglia (RUPS) has more than 13 years' experience in software development, architecture, and system integration. He has architected solutions in agile and scrum development environments and is adept in Big Data Analytics, Internet of Things (IoT), Project Management, Cloud Computing, and Network Analysis. His on-the-job experience has afforded him a well-rounded skill set. He is an expert in Apache Hadoop, NoSQL databases (Cassandra, MongoDB, HBase, Neo4j...), Spark, Kafka, Storm, R, Data Analytics, Social Media Analytics, and the OpenStack cloud computing platform. He is also experienced in preparing study narrative documents and case reviews and in reviewing studies performed by staff. He works in the Data Science field as a trainer and Data Scientist, and has worked on Machine Learning and Process Mining projects. He has strong written and verbal communication skills and a successful background working with stakeholders to develop and implement an architecture framework that aligns strategy, processes, and IT assets with business goals. He has been associated with the training industry for more than half a decade.

Hadoop Big Data trainer Swamy Gurram is a Cloudera Certified Developer for Apache Hadoop (CCDH). He has extensive hands-on experience with Hadoop technologies (HDFS, MapReduce, Hive, Pig, HBase, Sqoop, Flume, Impala, Knox, etc.), Hadoop cluster installation and administration, and virtual server installation. He has 10+ years of experience in Technical Architecture and Onsite Relationship Management, and as an Automotive Business Analyst. He also has experience in Hadoop, Mainframe Technologies, Visual Basic, and Mainframe Migration Tools. He has expertise across the system life cycle, comprising analysis, technical design, testing, implementation, and documentation.

Hadoop Big Data trainer Syed Muhammad Farrukh Akhtar has more than 15 years of experience analyzing, designing, developing, integrating, and managing large applications for diverse industries. He has worked in Dubai, Pakistan, Germany, and Malaysia, and has strong hands-on experience in software design, development, and integration on platforms such as IBM J2EE, Oracle, Microsoft .NET, Big Data, Hadoop, Spark, HBase, Hive, Sqoop, Flume, and NoSQL. He also has expertise in Machine Learning and Deep Learning with TensorFlow, Keras, and Python, and excellent skills in React, Ionic 2, Angular 2, mobile apps with React Native, and Node.js. He is highly knowledgeable in object-oriented software development, requirements analysis, and database design, and has a deep understanding of how Open Source technologies apply to emerging business areas. He has excellent knowledge of the Rational Unified Process (RUP), Rational Software Architect, data modeling and mapping, and extensible system design using UML and Visio. His professional experience covers J2EE, JMS, WebSphere, Oracle, Spring, Hibernate, Struts, and 3-tier web-based application development.
