Call +603 9100 1312

Instructor-Led Classroom Adult Training in Malaysia - Learn New Skills to Enhance Your Employability from our HDRF Approved Courses

Basic Scala Training for Apache Spark

Scala is a general-purpose programming language providing support for functional programming. It is the preferred programming language for the widely popular Apache Spark. This course will focus on using Scala with Apach Spark. The course will cover the basic syntax of Scala including functions, parallel processing, and programming Apache Spark with Scala, as well as how to use SQL from Scala.

The topics include:

  • The advantages of Scala for data science
  • Scala data types
  • Scala arrays, vectors, and ranges
  • Parallel processing in Scala
  • Mapping functions over parallel collections
  • When and when not to use parallel collections
  • Using SQL in Scala
  • Scala and Spark RDDs
  • Scala and Spark DataFrames
  • Creating DataFrames
Course Code: M585

Course Booking


Course Date

Course Time

* Required Fields

Course Cancellation/Reschedule Policy

We reserve the right to cancel or re-schedule the course due to unforeseen circumstances. If the course is cancelled, we will refund 100% to participants.
Note the venue of the training is subject to changes due to class size and availability of the classroom.
Note the minimal class size to start a class is 3 Pax.

Course Details

Module 1: Get Started on Scala

  • What is Scala
  • Why Scala
  • Installing Scala
  • Use Scala wih IntelliJ

Module 2: Scala Basics

  • Data Types
  • Variables
  • Collections 
  • Sets and Array

Module 3: Functions 

  • Define Functions 
  • Maps
  • Expressions
  • Scala Objects

Module 4: Parallel Processing

  • Parallel Collections
  • Mapping functions  
  • Filtering

Module 6: SQL

  • Install Postgre SQL
  • Loading Data
  • Query
  • SQL Summary

Module 6: Scala and Apache Spark

  • Apach Spark Introduction
  • Databricks
  • RDD
  • Mapping Functions over RDD
  • Statistics over RDD
  • Dataframes
  • Grouping and Filtering over Dataframes
  • Joining Dataframes

Who Should Attend

  • Data Scientists
  • Data Analysts
  • Big Data Analysts




Project Manager and Big Data TrainerTarun Sukhani is an IT executive, educator, author, speaker, data scientist, security expert, agile coach, polyglot coder, and entrepreneur with over 19 years of combined professional experience both in the U.S. and internationally. As a seasoned veteran, my expertise lies in leading teams in the design and delivery of highly scalable, concurrent, and performant enterprise software solutions with budgets of up to $100 million. I am particularly adept at building productive, self-managing agile teams with predictable velocities and delivery timeframes.

Tarun Sukhani is skilled in all phases of the SDLC/ALM, with a solid foundation in Agile (XP, SAFe, Lean, Scrum, Kanban, and Scrumban) and traditional (PMI and PRINCE2) project management frameworks and methodologies.

He is proficient in Big Data/Data Science: Hadoop, Pig, Hive, HBase, Spark, R/Rattle, Cassandra, YARN, Zookeeper, Mahout, SimpleCV, OpenCV

Big Data TrainerAjit is a certified Big data architect with 13 years of experience in the field of Business Data Analytics leading functions like Enterprise Data Warehouse Design, Development of BI Solutions around leading BI and Big Data Analytics platforms, IT Project and Service Management. Provided thought leadership in architecture design of Business Data Analytics solutions leveraging best practices and methodologies to implement Business Intelligence and Big Data solutions in corporate environments. Holds the credit of delivering breakthrough solutions in the areas of BI, Big Data Analytics - In-Memory Computing and Analytics to transform the Business performance of Fortune 500 Enterprises. Gained comprehensive hands-on implementation experience in the field of Big Data , SAP Analytics (BW & SAP HANA) and Business Objects Reporting Tools.

Actively involved in architecting the solution and implementation of high performance large volume data integration processes, database, storage, and other back-end services in fully virtualized environments. Certified Project Manager, Lead Auditor of ISO 22301/ ISO 27001/ ISO 20000/ ISO 9001, Agile SCRUM Master Certified practitioner with skills in managing the engineering resources optimally to get the best output with the minimum resources, using Agile Scrum methodology. Possess in-depth knowledge and experience in data modeling and business intelligence systems (dimensional modeling, data mining, predictive analytics). Strongly believe in facilitator approach to lead global cross-cultural teams and practices consultative approach in managing projects focused on implementing data warehousing and business intelligence solutions effectively and efficiently to meet today’s dynamic business environment.

Jason is a native of Kuala Lumpur, Malaysia; studied Bachelor’s Degree in Accounting and Finance from the London School of Economics Program, University of London. Raised in a typical Chinese family with entrepreneurial business background that is involved in manufacturing and real estate development. Worked as an Executive at the Asset and License Management Department in Standard Chartered, Malaysia; promoted to Data Analyst six months later. Later joined Tune Hotels Regional Services, a hotel management and hotel chain operator; served as Senior Revenue Executive. Served as Research Analyst with Wealth-X, a company that provides prospecting, intelligence and wealth due diligence on ultra-high net worth individuals. Thereafter served as Senior Data Analyst with Xchanging Malaysia, a joint venture between Xchanging and YTL Communications to develop and deliver enhanced mobile internet and cloud-based hosting offerings in Malaysia. Currently working as a Data Analyst with GoQuO, a full service e-commerce solutions provider to airlines and OTAs. Community Organizer of Big Data Malaysia, a professional network for individuals with interest in all aspects of Big Data, and Member of the Founder Institute for Malaysian Chapter, the world’s largest entrepreneur training and startup launch program. Occasionally participates in marathons and is an avid off-road cyclist. Passionate about technology, economics and enjoys social events.

Write Your Own Review

You're reviewing: Basic Scala Training for Apache Spark

How do you rate this product? *

  1 star 2 stars 3 stars 4 stars 5 stars
1. Do you find the course meet your expectation?
2. Do you find the trainer knowledgeable in this subject?
3. How do you find the training environment
  • Reload captcha


Use spaces to separate Subjects. Use single quotes (') for phrases.