Discover new courses and celebrate learning with us today. 🎓

Explore

Development

Web Development

Apache Spark and Scala

Unlock the power of Big Data with our comprehensive Apache Spark and Scala course. Dive into RDDs, Spark SQL, and more!

4.9

315,475 rating
11 Lessons
229 Students
Last updated 6 months ago

By Insculpt Technologies

via Udemy

Go To Course

Brief Summary

This course is your go-to guide for mastering Big Data using Apache Spark and Scala! You'll learn the ins and outs of big data tools in a friendly way.

Key Points

Understand what limits Hadoop mapreduce has and how Spark can fix them
Learn Scala programming and its cool features
Get hands-on with RDDs and create awesome apps in Spark

Learning Outcomes

Master the limitations of mapreduce and how Spark outshines it
Gain proficiency in Scala and its applications
Get comfortable working with RDDs and develop applications using Spark

About This Course

A complete Guide for Processing Big Data with Spark

This course on Apache Spark and Scala aims at providing an advanced expertise in big data Hadoop ecosystem. This course will provide a standard skillset which helps one become a specialist on the top of Big data Hadoop developer.

The course starts with a detailed description on limitations of mapreduce and how Spark can help overcome them. Further it covers a deeper dive into the Scala programming language.

Moving on it covers Spark as a standalone cluster and an understanding of Resiliient Distributed Datasets.

The course also covers concepts of Spark SQL using SQL queries through SQL context and Hive Queries through Hive context.

This course certainly provides material required for building a career path from Big data Hadoop developer to BIg data Hadoop architect.

Understand the limitations of Hadoop mapreduce and how Spark overcomes these limitations
Gain expertise in Scala programming language and its characteristics
Able to work with RDDs' and create applications in Spark

Course Curriculum

Module-1 Introduction to Big data, Hadoop and Spark

12 Lectures

1.1 Overview of Big Data

1.2 Introduction to Apache Hadoop

1.3 Hadoop Distributed File System

1.4 Hadoop Map Reduce

1.5 Introduction to Apache Spark

1.6 Characteristics of Apache Spark

1.7 Users and Use Cases of Apache Spark

1.8 Job Execution Flow and Spark Execution

1.9 Spark Unified Stack

1.10 Complete Picture of Apache Spark

1.11 Why Spark with Scala

1.12 Apache spark Architecture

Module 2: Introduction to Scala Programming Language

5 Lectures

2.1 Introduction to Scala

2.2 Scala Basic Syntax

2.3 Scala Class and Objects

2.4 If else Statements in Scala

2.5 Loops in Scala

Module 3: Advanced Scala Programming

9 Lectures

3.1 Functions and Procedures in Scala

3.2 Access Modifiers

3.3 Strings and Arrays

3.4 Scala Collections

3.5 Scala Traits

3.6 Pattern Matching

3.7 Scala Extractors

3.8 Scala Exception Handling

3.9 Scala Files IO

Module 4: Apache Spark RDDs

5 Lectures

4.1 Programming with RDDs

4.2 Starting with Spark

4.3 Creating RDDs

4.4 RDD Operations

4.5 Lifecycle of Spark

Module 5: Apache Spark RDDs II

4 Lectures

5.1 Spark Caching

5.2 Common Transformations and Actions

5.3 Spark Functions

5.4 Some more Spark functions

Module 6: Working with Key-Value pairs

5 Lectures

6.1 Key Value Pairs

6.2 Aggregate Functions

6.3 Working with Aggregate Functions

6.4 Joins in Spark

6.5 Practical on Word count example

Module 7: Advanced Spark Programming

5 Lectures

7.1 Spark Shared Variables

7.2 Spark and Fault Tolerance

7.3 Broadcast variables

7.4 Numeric RDD Operations

7.5 Per-Partition Operations

Module 8: Running Spark jobs on Cluster

5 Lectures

8.1 Spark Runtime Architecture

8.2 Spark Driver

8.3 Executors

8.4 Cluster Managers

8.5 Cluster Managers II

Module 9: Spark SQL

4 Lectures

9.1 Introduction to Spark SQL

9.2 Starting Point-SQL Context

9.3 Hive with Spark SQL

9.4 Spark SQL Caching

Module 10: Spark Streaming

1 Lectures

People.json, Employee.json

Module 11: Machine Learning in Spark

7 Lectures

11.1 machine learning with mllib

11.2 MLib Data Types

11.3 labeled point data types

11.4 Local Matrices in mllib

11.5 MLib Algorithms

11.6 Classification and Regression

11.7 Clustering

Module 12: GraphX in Spark

5 Lectures

12.1 GraphX Introduction

12.2 Creating Graphs

12.3 Graph Operators

12.4 Subgraph Transformation

12.5 Computation with map reduce triplets

Instructor

Insculpt Technologies

4.9

315,475 Reviews
345 Students
34 Course

Insculpt technologies is a leading publisher of development courses which provide in-depth knowledge and high quality training. Insculpt technologies is serving with a mission of providing right direction to people who are looking for a career in IT/software industry. Insculpt is the best place for learning new technologies and making things easy to understand virtually.

More Courses By Insculpt Technologies

C Programming:The best approach to learn C Language

4.9

(230)

Comprehensive Course on MongoDB and Data Modeling

4.9

(230)

Mastering ASP.NET and ADO.NET with C#

4.9

(230)

MongoDB: Initiating the Next Step

4.9

(230)

Process Big Data using Apache PIG

4.9

(230)

Review

4.9 course rating

4K ratings

Carson S.

3.5

4 years ago

I wish the course included more Spark scripting and creating RDDs from more complex data. The video could also be edited to remove some of the errors. The spacing on the syntax drove me crazy the first half. Just be consistent. The ML section of this course was hardly useful and should be better explored with larger synthetic or real data.

Helpful
Not helpful

Suhas J.

1.0

5 years ago

So far looks like just a bunch of sentences read by a person from a book.

Helpful
Not helpful

Pavan S. B.

1.0

5 years ago

Waste of money. not expected from udemy

Helpful
Not helpful

Tahsina K.

5.0

5 years ago

teacher is very very good.explained very well.

Helpful
Not helpful

Shikhar C.

1.5

5 years ago

Not at all

Helpful
Not helpful

Niladri M.

4.0

5 years ago

The transcript is faulty, this is unexpected. However the course is informative.

Helpful
Not helpful

Ivan M.

1.0

6 years ago

Examples from real live, not only slides

Helpful
Not helpful

Anil P.

2.0

6 years ago

no focus on how the function or method is working. Just explaining the output doesnt help in better understanding.

Helpful
Not helpful

Gaurav K.

1.0

6 years ago

no proper understanding of concepts, its like some ppt is being read out

Helpful
Not helpful

Prasad

3.0

6 years ago

need real time examples rather than only explanations

Helpful
Not helpful

Ratings

This course includes:

54.5 hours on-demand video
3 articles
249 downloadable resources
Access on mobile and TV
Full lifetime access
Certificate of completion

Courses You May Like

Lorem ipsum dolor sit amet elit

Show More Courses

Become a Certified Web Developer: HTML, CSS and JavaScript

4.9

(230)

By: Carolyn Welborn

Apache Spark and Scala

Brief Summary

Key Points

Learning Outcomes

About This Course

Course Curriculum