Call +603 9100 1312

Instructor-Led Classroom Adult Training in Malaysia - Learn New Skills to Enhance Your Employability from our HDRF Approved Courses

Machine Learning with Apache Spark

Apache Spark is a powerful platform that provides users with new ways to store and make use of big data. In this course, get up to speed with Spark, and discover how to leverage this popular processing engine to deliver effective and comprehensive insights into your data. The trainer will how you  how to analyze data in Spark using PySpark and Spark SQL, explores running machine learning algorithms using MLib, demonstrates how to create a streaming analytics application using Spark Streaming, and more.
Topics include:

  • Understanding Spark
  • Reviewing Spark components
  • Where Spark shines
  • Understanding data interfaces
  • Working with text files
  • Loading CSV data into DataFrames
  • Using Spark SQL to analyze data
  • Running machine learning algorithms using MLib
  • Querying streaming data
  • Connecting BI tools to Spark

 

HRDF SBL Claimable for Employers Registered with HRDF

HRDF claimable

Course Code: M537

Course Booking

MYR880.00

Course Date

Course Time

* Required Fields

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note the venue of the training is subject to changes due to class size and availability of the classroom.
Note the minimal class size to start a class is 3 Pax.

Course Details

Module 1: Overview of Apache Spark

  • What is Apache Spark
  • History of Apache Spark
  • Apache Spark Components
  • Install Spark on Local Computer
  • Databricks

Module 2: Resilient Distributed Datasets (RDD)

  • What is RDD
  • Create a RDD
  • Basic Operations of a RDD

Module 3: Machine Learning in Apache Spark

  • Intro to MLlib
  • MLlib Modules
  • Regression
  • Classification
  • Tree
  • Clustering

Module 4: Recommendation System with Spark

  • What is Recommendation System?
  • Matrix Factorization and Rating

Who Should Attend

  • Big Data Analysts
  • Data Scientists
  • Data Analysts

Prerequisite

Nil

Trainers

Data Science TrainerDr. Aanand is a Full Stack Data Scientist who once had a torrid love affair with Physics. He has consulted and published in the area of Public Health, Electricity Markets, Telecom, BFSI, Advertising & Communication Strategies and Digital & Social Media Technologies. He has worked on assignments with international agencies such as International Monetary Fund, World Bank, Royal Netherland Embassy etc. besides MNCs like Tata Consultancy Services, Kie Square Consulting and several government organizations of national importance.

He regularly conducts general training programs in Python (Pandas, NumPy, SciPy, Matplotlib, Bokeh), R (dplyr, rstanarm, knitR, ggplot2), Data Visualization (Tableau, D3.js) and Machine Learning (Reinforced Learning, Scikit Learn) and specialized training programs on Structural Equation Modeling and SAP Hana.

He holds a doctorate in Operations Research from Indian Institute of Management Ahmedabad and a post graduate in Physics from University of Mumbai. He has advanced training in mathematical programming including optimization, advanced multivariate data analysis, and simulation techniques. When he is not teaching or consulting he can be found meditating or heading for an adventurous trek.

Write Your Own Review

You're reviewing: Machine Learning with Apache Spark

How do you rate this product? *

  1 star 2 stars 3 stars 4 stars 5 stars
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
  • Reload captcha

Tags

Use spaces to separate Subjects. Use single quotes (') for phrases.