Call +60 3-2711 7241 Email: malaysiacourses@tertiaryinfotech.com

HRDF Approved Training Provider in Malaysia - Modular Fast Track Skill-Based Trainings

Machine Learning with Apache Spark

Apache Spark is a powerful platform that provides users with new ways to store and make use of big data. In this course, get up to speed with Spark, and discover how to leverage this popular processing engine to deliver effective and comprehensive insights into your data. The trainer will how you  how to analyze data in Spark using PySpark and Spark SQL, explores running machine learning algorithms using MLib, demonstrates how to create a streaming analytics application using Spark Streaming, and more.
Topics include:

  • Understanding Spark
  • Reviewing Spark components
  • Where Spark shines
  • Understanding data interfaces
  • Working with text files
  • Loading CSV data into DataFrames
  • Using Spark SQL to analyze data
  • Running machine learning algorithms using MLib
  • Querying streaming data
  • Connecting BI tools to Spark

 

HRDF SBL Claimable for Employers Registered with HRDF

HRDF claimable

Course Code: M537

Course Booking

MYR880.00

Course Date

Course Time

* Required Fields

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note the venue of the training is subject to changes due to class size and availability of the classroom.
Note the minimal class size to start a class is 3 Pax.


Course Details

Module 1: Overview of Apache Spark

  • What is Apache Spark
  • History of Apache Spark
  • Apache Spark Components
  • Install Spark on Local Computer
  • Databricks

Module 2: Resilient Distributed Datasets (RDD)

  • What is RDD
  • Create a RDD
  • Basic Operations of a RDD

Module 3: Machine Learning in Apache Spark

  • Intro to MLlib
  • MLlib Modules
  • Regression
  • Classification
  • Tree
  • Clustering

Module 4: Recommendation System with Spark

  • What is Recommendation System?
  • Matrix Factorization and Rating

Who Should Attend

  • Big Data Analysts
  • Data Scientists
  • Data Analysts

Prerequisite

Nil

Trainers

Data Science TrainerDr. Aanand is a Full Stack Data Scientist who once had a torrid love affair with Physics. He has consulted and published in the area of Public Health, Electricity Markets, Telecom, BFSI, Advertising & Communication Strategies and Digital & Social Media Technologies. He has worked on assignments with international agencies such as International Monetary Fund, World Bank, Royal Netherland Embassy etc. besides MNCs like Tata Consultancy Services, Kie Square Consulting and several government organizations of national importance.

He regularly conducts general training programs in Python (Pandas, NumPy, SciPy, Matplotlib, Bokeh), R (dplyr, rstanarm, knitR, ggplot2), Data Visualization (Tableau, D3.js) and Machine Learning (Reinforced Learning, Scikit Learn) and specialized training programs on Structural Equation Modeling and SAP Hana.

He holds a doctorate in Operations Research from Indian Institute of Management Ahmedabad and a post graduate in Physics from University of Mumbai. He has advanced training in mathematical programming including optimization, advanced multivariate data analysis, and simulation techniques. When he is not teaching or consulting he can be found meditating or heading for an adventurous trek.

Machine Learning TrainerAmir Othman is a software engineer by profession. Being educated in Bauhaus Universität Weimar and Hochschule Ulm, he brings experiences from different facades of the world.

With expertise in web technology, natural language processing and machine learning, he is a freelance data scientist. Some of his works include two international news aggregator

www.kronologimalaysia.com and www.diezeitachse.de

He also holds an impressive port folio for data visualizations, primarily focusing on web based techniques.

Big Data trainerSyed Muhammad Farrukh Akhtar has more than 15 years of experience analysis, designing, developing, integrating and managing large applications for diverse industries. He has experience working in Dubai, Pakistan, Germany and Malaysia, strong hands-on experience of software design, development and integration on different platform like IBM J2EE, Oracle and Microsoft .Net, Big data, Hadoop, Spark, HBase, Hive, Sqoop, Flume and NoSQL. He also has expertise in Machine Learning/ Deep Learning with Tensor Flow, Keras and Python, excellent skills in React, Ionic 2, Angular 2, Mobile Apps with React Native and Node.js.

He is highly knowledgeable in object oriented software development, requirements analysis, and database design. Possess deep understanding of Open Source technologies’ applicability in emerging business areas. He possesses excellent knowledge in Rational Unified Process (RUP); Rational Software Architect; data modeling and mapping; and extensible system design using the UML and Visio. Professional experience on J2EE, JMS, Web Sphere, Oracle, Spring, Hibernate, Struts and 3-Tier Web-based Applications Development.

Write Your Own Review

You're reviewing: Machine Learning with Apache Spark

How do you rate this product? *

  1 star 2 stars 3 stars 4 stars 5 stars
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
  • Reload captcha

Tags

Use spaces to separate Subjects. Use single quotes (') for phrases.

You May Be Interested In These Courses