Call +60 3-2711 7241

HRDF Approved Training Provider in Malaysia - Learn New Skills to Enhance Your Employability from our HRDF Approved Courses

Apache Spark Essential Training

Apache Spark is a powerful platform that provides users with new ways to store and make use of big data. In this course, get up to speed with Spark and discover how to use this popular processing engine to deliver effective and comprehensive insights into your data. The trainer will show you how to analyze data in Spark using PySpark and Spark SQL, explore running machine learning algorithms using MLlib, demonstrate how to create a streaming analytics application using Spark Streaming, and more.
Topics include:

  • Overview of Apache Spark
  • Apache Spark components
  • Databricks
  • Data interfaces
  • Import files
  • Spark ML for Machine Learning
  • Spark SQL for querying streaming data

HRDF SBL Claimable for Employers Registered with HRDF

Course Code: M456

Course Booking

MYR880.00

Course Cancellation/Reschedule Policy

We reserve the right to cancel or reschedule the course due to unforeseen circumstances. If the course is cancelled, participants will receive a full refund.
Note that the training venue is subject to change depending on class size and classroom availability.
Note that the minimum class size to start a class is 3 participants.


Course Details

Module 2: Exploring Data

  • Data Interface
  • RDD Basic Operations
  • Import Data
  • Actions and Transformations
  • Saving Results

Module 3: Analyzing Data

  • Select and Filter Data
  • Aggregate Data
  • Save Data

Module 4: SparkSQL

  • Creating Tables
  • Querying Data 
  • Visualizing Data

Module 5: Machine Learning

  • ML or MLlib Module
  • Preprocessing Data
  • Linear Regression
  • Classification

Module 6: Spark Streaming

  • Streaming Setup
  • Querying Streaming Data

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Apache Spark developers who want to use Apache Spark for Hadoop Big Data analysis

Prerequisite

Basic knowledge of Python is assumed

Trainers

Project Manager and Big Data Trainer

Tarun Sukhani is an IT executive, educator, author, speaker, data scientist, security expert, agile coach, polyglot coder, and entrepreneur with over 19 years of combined professional experience both in the U.S. and internationally. As a seasoned veteran, he specializes in leading teams in the design and delivery of highly scalable, concurrent, and performant enterprise software solutions with budgets of up to $100 million. He is particularly adept at building productive, self-managing agile teams with predictable velocities and delivery timeframes.

Tarun Sukhani is skilled in all phases of the SDLC/ALM, with a solid foundation in Agile (XP, SAFe, Lean, Scrum, Kanban, and Scrumban) and traditional (PMI and PRINCE2) project management frameworks and methodologies.

He is proficient in Big Data and Data Science tools: Hadoop, Pig, Hive, HBase, Spark, R/Rattle, Cassandra, YARN, Zookeeper, Mahout, SimpleCV, and OpenCV.

Jason is a native of Kuala Lumpur, Malaysia. He studied for a Bachelor’s Degree in Accounting and Finance under the London School of Economics Program, University of London. He was raised in a typical Chinese family with an entrepreneurial business background in manufacturing and real estate development. He worked as an Executive in the Asset and License Management Department at Standard Chartered, Malaysia, and was promoted to Data Analyst six months later. He later joined Tune Hotels Regional Services, a hotel management company and hotel chain operator, as Senior Revenue Executive, and then served as a Research Analyst with Wealth-X, a company that provides prospecting, intelligence and wealth due diligence on ultra-high net worth individuals. Thereafter he served as Senior Data Analyst with Xchanging Malaysia, a joint venture between Xchanging and YTL Communications to develop and deliver enhanced mobile internet and cloud-based hosting offerings in Malaysia. He currently works as a Data Analyst with GoQuO, a full-service e-commerce solutions provider to airlines and OTAs. He is a Community Organizer of Big Data Malaysia, a professional network for individuals with an interest in all aspects of Big Data, and a Member of the Malaysian Chapter of the Founder Institute, the world’s largest entrepreneur training and startup launch program. He occasionally participates in marathons and is an avid off-road cyclist. He is passionate about technology and economics and enjoys social events.
