Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Data engineering road map, Cheat Sheet of Computer Science

Data engineering road mapData engineering road mapData engineering road map

Typology: Cheat Sheet

2023/2024

Uploaded on 12/18/2024

sivaprakash-rayachoti
sivaprakash-rayachoti 🇮🇳

1 document

1 / 6

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
pf3
pf4
pf5

Partial preview of the text

Download Data engineering road map and more Cheat Sheet Computer Science in PDF only on Docsity!

56 59 60 61 62 63 64 65 66 67 68 70 7gil 72 TAS) Milestone 1 week 1-2 Big Data Fundamentals / Data Lake Storage - Introduction : — Database vs Datawarehouse vs Datalake — Hadoop Overview -— Spark Overview — Linux Commands — HDFS - Role of Data Engineers week 3 — understanding how distributed processing works internally week 4 — Apache Spark Core API's week 5 -— Getting Started with Dataframes and Spark SQL week 6 - More of Spark Dataframe transformations week 7 — Apache Spark Caching week 8 - Spark Architecture and Aggregate functions I week 9 - Internals of Spark (calculating initial number of partitions, parallelism, partioning, bucketing)