Brief Summary
This course is your go-to guide for mastering Big Data using Apache Spark and Scala! You'll learn the ins and outs of big data tools in a friendly way.
Key Points
-
Understand what limits Hadoop mapreduce has and how Spark can fix them
-
Learn Scala programming and its cool features
-
Get hands-on with RDDs and create awesome apps in Spark
Learning Outcomes
-
Master the limitations of mapreduce and how Spark outshines it
-
Gain proficiency in Scala and its applications
-
Get comfortable working with RDDs and develop applications using Spark
About This Course
A complete Guide for Processing Big Data with Spark
This course on Apache Spark and Scala aims at providing an advanced expertise in big data Hadoop ecosystem. This course will provide a standard skillset which helps one become a specialist on the top of Big data Hadoop developer.
The course starts with a detailed description on limitations of mapreduce and how Spark can help overcome them. Further it covers a deeper dive into the Scala programming language.
Moving on it covers Spark as a standalone cluster and an understanding of Resiliient Distributed Datasets.
The course also covers concepts of Spark SQL using SQL queries through SQL context and Hive Queries through Hive context.
This course certainly provides material required for building a career path from Big data Hadoop developer to BIg data Hadoop architect.
Understand the limitations of Hadoop mapreduce and how Spark overcomes these limitations
Gain expertise in Scala programming language and its characteristics
Able to work with RDDs' and create applications in Spark
Carson S.
I wish the course included more Spark scripting and creating RDDs from more complex data. The video could also be edited to remove some of the errors. The spacing on the syntax drove me crazy the first half. Just be consistent. The ML section of this course was hardly useful and should be better explored with larger synthetic or real data.