Last Update: 11/2021Duration: 57h 22m | Video: .MP4, 1280x720 30 fps | Audio: AAC, 44.1 kHz, 2ch | Size: 24.2 GBGenre: eLearning | Language: English
Learn key Data Eeering Skills such as SQL, Python and Spark with tons of Hands-on tasks and exercises
What you'll learn:
Setup Development Environment to learn building Data Eeering Applications on GCP
Database Essentials for Data Eeering using Postgres
Data Eeering Programming Essentials using Python
Data Eeering using Spark Dataframe APIs (PySpark)
Data Eeering using Spark SQL (PySpark and Spark SQL)
Relevance of Spark Metastore and integration of Dataframes and Spark SQL
Ability to build Data Eeering Pipelines using Spark leveraging Python as Programming Language
Use of different file formats such as Parquet, JSON, CSV etc in building Data Eeering Pipelines
Setup self support single node Hadoop and Spark Cluster to get enough practice on HDFS and YARN
Requirements:
Laptop with decent configuration (Minimum 4 GB RAM and Dual Core)
Sign up for GCP with the available credit or AWS Access
Setup self support lab on cloud platforms (you might have to pay the applicable cloud fee unless you have credit)
CS or IT degree or prior IT experience is highly desired
Description:
As part of this course, you will learn all the Data Eeering Essentials related to building Data Pipelines using SQL, Python as well as Spark.