Sumit Mittal

@bigdatasumit

9,138
Followers
21
Following
117
Media
2,255
Statuses

Big Data Trainer • Founder & CEO TrendyTech • Tweets about #BigData & #DataEngineering. Helping you get hike 💰 & find your Dream Job 🚀 Join 20000+ students👇

Joined March 2018
@bigdatasumit
Sumit Mittal
3 months
I asked my team member to give me one decent resume. I got one resume, I checked the ATS score, it was 38 I did a lot of research over the past few days, I made multiple changes, without using any AI software (to strike the right balance between the ATS score & human
3K
896
2K
@bigdatasumit
Sumit Mittal
3 months
I asked my team member to give me one decent resume. I got one resume, I checked the ATS score, it was 38 I did a lot of research over the past few days, I made multiple changes, without using any AI software (to strike the right balance between the ATS score & human
2K
785
2K
@bigdatasumit
Sumit Mittal
6 months
From a CTC of 12 LPA to 30+ LPA in 7 months Multiple offers & an extremely good hike, that's what a well planned learning can help you achieve. It's such a joy to see your students getting tremendous success. My advice to everyone out there, a few failures in life should not
26
50
797
@bigdatasumit
Sumit Mittal
10 months
SQL Full course in 9 hours - (totally free & rock solid content) This includes all major topics like 1. SQL Fundamentals, CRUD Operations 2. Primary Key vs Unique Key, Auto Increment Values - 3. DDL vs DML, Truncate vs Delete 4. Foreign Key Constraint 5. Distinct, Order By,
3
157
507
@bigdatasumit
Sumit Mittal
6 months
Here Comes the Gold mine for Data Engineers Python Complete Playlist (4 videos already released) lecture 1 - lecture 2 - lecture 3 - lecture 4 - The 5th lecture is coming on Tuesday
2
121
421
@bigdatasumit
Sumit Mittal
4 months
23 SQL videos that will make you fall in love with SQL SQL Basics (14 videos) 📌 SQL Fundamentals, CRUD Operations & Setting Environment - 📌 Primary Key vs Unique Key, Auto Increment Values - 📌 DDL vs DML, Truncate vs Delete
0
95
361
@bigdatasumit
Sumit Mittal
8 months
Get 10X more Interview calls - ATS Score 38 to 100 I'm giving you free access to my session where I will show how to optimize your resume & take the ATS score from 38 to 100. yes you heard it right, a fully ATS compliant resume with 100 score. This will help you get 10X more
361
177
342
@bigdatasumit
Sumit Mittal
9 months
I'm giving you access to 10 FREE Courses 🦑 1. Artificial Intelligence 2. Machine Learning 3. Cloud Computing 4. Ethical Hacking 5. Data Analytics 6. AWS Certified 7. Data Science 8. Business Intelligence 9. Python 10. Deep Learning To get, just: - Like & Retweet
334
209
332
@bigdatasumit
Sumit Mittal
9 months
I'm giving you access to 10 FREE Courses 🦑 1. Artificial Intelligence 2. Machine Learning 3. Cloud Computing 4. Ethical Hacking 5. Data Analytics 6. AWS Certified 7. Data Science 8. Business Intelligence 9. Python 10. Deep Learning To get, just: - Like & Retweet
307
209
339
@bigdatasumit
Sumit Mittal
4 months
Data Engineers Interview Preparation - A complete Package (Free) Complete SQL Free Course - Complete Python Free Course (6 videos already released till now) lecture 1 - lecture 2 - lecture 3 -
0
108
341
@bigdatasumit
Sumit Mittal
5 months
Complete Guide for Data Engineers (Save it) Full SQL Course (Free) - Complete Python Free Course (6 videos already released till now) lecture 1 - lecture 2 - lecture 3 - lecture 4 -
1
102
340
@bigdatasumit
Sumit Mittal
5 months
Data Engineers Interview Preparation - A complete Package (Free) Complete SQL Free Course - Complete Python Free Course (6 videos already released till now) lecture 1 - lecture 2 - lecture 3 -
0
98
304
@bigdatasumit
Sumit Mittal
14 days
Data Engineers Interview Preparation - A complete Package (Free) Complete SQL Free Course - Complete Python Free Course (6 videos already released till now) lecture 1 - lecture 2 - lecture 3 -
0
75
304
@bigdatasumit
Sumit Mittal
6 months
Data Engineers Interview Preparation - A complete Package (Free) Complete SQL Free Course - Complete Python Free Course (6 videos already released till now) lecture 1 - lecture 2 - lecture 3 -
0
77
281
@bigdatasumit
Sumit Mittal
4 months
From a CTC of 12 LPA to 30+ LPA in 7 months Multiple offers & an extremely good hike, that's what a well planned learning can help you achieve. It's such a joy to see your students getting tremendous success. My advice to everyone out there, a few failures in life should not
5
28
272
@bigdatasumit
Sumit Mittal
5 months
Learn SQL for free - Complete plan SQL Basics (14 videos) 📌 SQL Fundamentals, CRUD Operations & Setting Environment - 📌 Primary Key vs Unique Key, Auto Increment Values - 📌 DDL vs DML, Truncate vs Delete -
1
91
260
@bigdatasumit
Sumit Mittal
5 months
A Data Engineering Roadmap beyond just cracking Interviews! Here is a 32 weeks Step by Step Plan 1. Introduction to Big Data/DataLake Storage (3 weeks) Big Data - The Big Picture, Linux Commands, Introducing the Multi Node Practice Environment, Distributed Storage 2.
1
58
237
@bigdatasumit
Sumit Mittal
8 months
I'm giving you free access to my session where I will show how to optimize your resume & take the ATS score from 38 to 100. yes you heard it right, a fully ATS compliant resume with 100 score. This will help you get 10X more Interview calls. In this session I have talked
239
132
226
@bigdatasumit
Sumit Mittal
7 months
All Data Engineers should definitely read these 10 posts.. 1. From 0 to Hero in SQL - Follow this Plan 2. Crunching Big Data in absolute layman terms 🔥 3. Normalization vs Denormalization 4. Super
3
51
222
@bigdatasumit
Sumit Mittal
7 days
Secret Revealed - From 0 interview calls to 100+ calls I asked one of my team members to give me one resume (struggling to fetch interview calls) I got one resume, I checked the ATS score, it was 38 I did a lot of research over the past few days, I made multiple changes,
178
98
217
@bigdatasumit
Sumit Mittal
9 months
Big Data End to End Pipeline on major Cloud Platforms! Ingest -> Store -> Process -> Serve Ingest - Get the data from multiple sources using some ingestion framework. Example - AWS Glue, Azure Data Factory, NiFi, Sqoop Store - Since we are going to store huge amount of data
3
39
198
@bigdatasumit
Sumit Mittal
6 days
All Data Engineers should definitely read these 10 posts.. 1. From 0 to Hero in SQL - Follow this Plan 2. Crunching Big Data in absolute layman terms 🔥 3. Normalization vs Denormalization 4. Super
0
46
204
@bigdatasumit
Sumit Mittal
2 months
I asked my team member to give me one decent resume. I got one resume, I checked the ATS score, it was 38 I did a lot of research over the past few days, I made multiple changes, without using any AI software (to strike the right balance between the ATS score & human
264
107
196
@bigdatasumit
Sumit Mittal
5 months
Data Engineers Interview Preparation - A complete Package (Free) Complete SQL Free Course - Complete Python Free Course (6 videos already released till now) lecture 1 - lecture 2 - lecture 3 -
1
48
194
@bigdatasumit
Sumit Mittal
3 months
Link to the session: Please like this comment so that it stays at the top.
6
23
193
@bigdatasumit
Sumit Mittal
6 months
Here Comes the Gold mine for Data Engineers - A complete Pack (Free) Complete SQL Free Course - Complete Python Free Course (6 videos already released till now) lecture 1 - lecture 2 - lecture 3 -
0
56
184
@bigdatasumit
Sumit Mittal
4 months
Order of execution in a SQL query We all know SQL, but most of us do not understand the internals of it. Let me take an example to explain this better. Select p.plan_name, count(plan_id) as total_count From plans p Join subscriptions s on s.plan_id=p.plan_id Where p.plan_name
3
30
179
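A minimal sketch of the logical order of execution, using Python's built-in `sqlite3` module. The table contents, the `'trial'` filter value, and the `HAVING`, `ORDER BY`, and `LIMIT` clauses are assumptions added for illustration, since the query in the post is cut off after `Where p.plan_name`. The count is written as `COUNT(s.plan_id)` here because both tables carry a `plan_id` column.

```python
import sqlite3

# Hypothetical data: the post only names the tables and the join column.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE plans (plan_id INTEGER PRIMARY KEY, plan_name TEXT);
    CREATE TABLE subscriptions (sub_id INTEGER PRIMARY KEY, plan_id INTEGER);
    INSERT INTO plans VALUES (1, 'basic'), (2, 'pro'), (3, 'trial');
    INSERT INTO subscriptions (plan_id) VALUES (1), (1), (2), (2), (2), (3);
""")

# The numbers mark the logical evaluation order, not the written order.
query = """
    SELECT p.plan_name, COUNT(s.plan_id) AS total_count   -- 5. SELECT
    FROM plans p                                          -- 1. FROM
    JOIN subscriptions s ON s.plan_id = p.plan_id         -- 1. JOIN
    WHERE p.plan_name != 'trial'                          -- 2. WHERE (row filter)
    GROUP BY p.plan_name                                  -- 3. GROUP BY
    HAVING COUNT(s.plan_id) >= 2                          -- 4. HAVING (group filter)
    ORDER BY total_count DESC                             -- 6. ORDER BY
    LIMIT 5                                               -- 7. LIMIT
"""
rows = conn.execute(query).fetchall()


def aggregate_in_where_fails(connection):
    """WHERE runs before grouping, so an aggregate is not allowed there."""
    try:
        connection.execute(
            "SELECT plan_id FROM subscriptions WHERE COUNT(*) > 1 GROUP BY plan_id"
        )
    except sqlite3.OperationalError:
        return True
    return False


print(rows)
print(aggregate_in_where_fails(conn))
```

Filtering on the aggregate has to go in `HAVING` because the groups do not exist yet when `WHERE` is evaluated.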
@bigdatasumit
Sumit Mittal
5 months
Big Data Interviews in 2020 vs Big Data Interviews in 2024. It's amazing to see how the technology landscape has shifted in last 4 years. I conducted Big Data Mock Interviews back in 2020, Here is a playlist (2020) We again Started conducting these
2
37
165
@bigdatasumit
Sumit Mittal
4 months
From 0 to Hero in SQL - Follow this Plan SQL Basics (14 videos) 📌 SQL Fundamentals, CRUD Operations & Setting Environment - 📌 Primary Key vs Unique Key, Auto Increment Values - 📌 DDL vs DML, Truncate vs Delete -
0
42
164
@bigdatasumit
Sumit Mittal
5 months
A Data Engineering Roadmap beyond just cracking Interviews! Here is a 32 weeks Step by Step Plan 1. Introduction to Big Data/DataLake Storage (3 weeks) Big Data - The Big Picture, Linux Commands, Introducing the Multi Node Practice Environment, Distributed Storage 2.
2
44
154
@bigdatasumit
Sumit Mittal
6 months
A Data Engineering Roadmap beyond just cracking Interviews! Here is a 32 weeks Step by Step Plan 1. Introduction to Big Data/DataLake Storage (3 weeks) Big Data - The Big Picture, Linux Commands, Introducing the Multi Node Practice Environment, Distributed Storage 2.
1
38
152
@bigdatasumit
Sumit Mittal
6 months
All Data Engineers should definitely read these 10 posts.. 1. From 0 to Hero in SQL - Follow this Plan 2. Crunching Big Data in absolute layman terms 🔥 3. Normalization vs Denormalization 4. Super
0
32
153
@bigdatasumit
Sumit Mittal
6 months
From a CTC of 5.37 LPA to 30 LPA in a span of just 3 years! This story is of my amazing and hardworking student (not revealing name as company names & CTC are disclosed) Here is the journey - year 2020 - 5.37 LPA CTC with overall experience of 6 years Got to know about my "Big
8
10
132
@bigdatasumit
Sumit Mittal
9 months
CICD for Data Engineers in a Super Easy way! Let's say you are working on a RetailAnalysis Project & have a Jira ticket assigned to you RA-17843 If you are a developer you would create a feature branch, feature-RA-17843 & work on it. As soon as you make a git push, and github
2
21
128
@bigdatasumit
Sumit Mittal
9 months
Data Engineering - 10 Managerial round Interview Questions 1. what is the size of your cluster 2. How much data you deal with on a daily basis 3. what is your role in your big data project 4. Are you using onpremise setup or you are working on cloud 5. which big data
4
17
129
@bigdatasumit
Sumit Mittal
9 months
Step by Step Plan to learn Big Data (All Free resources Included) 1. Learn SQL Basics - SQL will be used at a lot of places - Hive/Spark SQL/RDBMS queries Joins & windowing functions are very important 2. Learn Programming/Python for Data Engineering -
1
40
125
@bigdatasumit
Sumit Mittal
9 months
If you are learning SQL, learn all the below things.. 1. SQL Fundamentals, CRUD Operations 2. Primary Key vs Unique Key, Auto Increment Values - 3. DDL vs DML, Truncate vs Delete 4. Foreign Key Constraint 5. Distinct, Order By, Limit, Like Keyword 6. Order of execution in SQL 7.
2
33
124
@bigdatasumit
Sumit Mittal
6 months
CICD for Data Engineers in a Super Easy way! Let's say you are working on a RetailAnalysis Project & have a Jira ticket assigned to you RA-17843 If you are a developer you would create a feature branch, feature-RA-17843 & work on it. As soon as you make a git push, and github
3
22
127
@bigdatasumit
Sumit Mittal
6 months
30 Data Engineering videos on trending topics - Interview preparation 1. Explaining Data Lake Versus Data Warehouse - 2. Learn Columnar Storage - 3. Analysing the failed Spark Jobs using Log Files - 4.
1
32
123
@bigdatasumit
Sumit Mittal
2 months
Get 10X more Interview calls (Free access to the session) Take your ATS score close to 100 now! Here is what I did.. I asked my team member to give me one decent resume. I got one resume, I checked the ATS score, it was 38 I did a lot of research over the past few days, I
126
52
122
@bigdatasumit
Sumit Mittal
10 months
All Data Engineers should definitely read these 10 posts.. 1. From 0 to Hero in SQL - Follow this Plan 2. Crunching Big Data in absolute layman terms 🔥 3. Normalization vs Denormalization 4. Super
0
30
112
@bigdatasumit
Sumit Mittal
10 months
One Person whom I truly admire in the field of Data Engineering is @EcZachly (Zach Wilson) Here are 9 excellent technical posts by him. I urge all the Big Data Enthusiasts to check these. 1. order of execution in SQL 2. Important Tips - Data
0
13
117
@bigdatasumit
Sumit Mittal
7 months
Apache Spark - Let's cover multiple scenarios in this post. Consider you have a 20 node spark cluster Each node is of size - 16 cpu cores / 64 gb RAM Let's say each node has 3 executors, with each executor of size - 5 cpu cores / 21 GB RAM => 1. What's the total capacity of
3
18
114
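The arithmetic behind the scenario above can be sketched as follows. The post is cut off before its answers, so this is only a back-of-the-envelope calculation from the stated numbers; `ClusterSpec` and `capacity` are hypothetical helper names, not part of any Spark API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClusterSpec:
    nodes: int
    cores_per_node: int
    ram_gb_per_node: int
    executors_per_node: int
    cores_per_executor: int
    ram_gb_per_executor: int


def capacity(spec):
    """Derive cluster-wide totals from the per-node and per-executor sizes."""
    total_executors = spec.nodes * spec.executors_per_node
    used_cores_per_node = spec.executors_per_node * spec.cores_per_executor
    used_ram_per_node = spec.executors_per_node * spec.ram_gb_per_executor
    return {
        "raw_cores": spec.nodes * spec.cores_per_node,
        "raw_ram_gb": spec.nodes * spec.ram_gb_per_node,
        "executors": total_executors,
        "executor_cores": total_executors * spec.cores_per_executor,
        "executor_ram_gb": total_executors * spec.ram_gb_per_executor,
        # What is left on each node for the OS and cluster daemons.
        "spare_cores_per_node": spec.cores_per_node - used_cores_per_node,
        "spare_ram_gb_per_node": spec.ram_gb_per_node - used_ram_per_node,
    }


# Numbers taken from the post: 20 nodes, 16 cores / 64 GB each,
# 3 executors per node with 5 cores / 21 GB each.
spec = ClusterSpec(20, 16, 64, 3, 5, 21)
summary = capacity(spec)
for name, value in summary.items():
    print(f"{name}: {value}")
```

With one core per task (Spark's default `spark.task.cpus` is 1), the `executor_cores` figure is also the upper bound on tasks running at the same time.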
@bigdatasumit
Sumit Mittal
10 months
Recently Zach Wilson @EcZachly has created a public Github repo with all the resources, books, companies, and social media accounts you should be following to stay current on data engineering topics. This should act like a gold mine for all the Data Engineering enthusiasts. If
1
22
109
@bigdatasumit
Sumit Mittal
7 months
I am starting with a Free Python Course for Data folks on my youtube channel Here is my promise - this will be even better than the paid courses. The first video will be released on 26th February, Monday @ 5 pm After receiving wonderful feedback on my free SQL series, it was
4
15
108
@bigdatasumit
Sumit Mittal
10 months
Working 8 PM to 5 AM at a call center to help my family with Financial crisis Walking 3 km everyday to attend my MCA coaching classes in order to save a few bucks. Working on a Diwali night for extra 300 rupees. Working in extreme dental pain, even when I was supposed to get a
2
3
101
@bigdatasumit
Sumit Mittal
9 months
Step by Step Plan to learn Big Data (All Free resources Included) 1. Learn SQL Basics - SQL will be used at a lot of places - Hive/Spark SQL/RDBMS queries Joins & windowing functions are very important 2. Learn Programming/Python for Data Engineering -
3
24
100
@bigdatasumit
Sumit Mittal
4 months
Order of execution in a SQL query We all know SQL, but most of us do not understand the internals of it. Let me take an example to explain this better. Select p.plan_name, count(plan_id) as total_count From plans p Join subscriptions s on s.plan_id=p.plan_id Where p.plan_name
0
17
99
@bigdatasumit
Sumit Mittal
9 days
2024 - Rockstar Data Engineer Roadmap
Prerequisites
---------------------
1. Linux commands
2. Programming fundamentals (preferably python)
3. SQL is very important
You should learn the below things
--------------------------------------
1. Distributed Computing Fundamentals
0
19
79
@bigdatasumit
Sumit Mittal
8 months
I am giving Free access to my session where I have covered 10 recently asked Pyspark Interview questions. If you are going for a pyspark Interview most likely you will face these questions. I have given the answers to all of them, and you need to portray it the same way in your
64
31
79
@bigdatasumit
Sumit Mittal
5 months
30 Data Engineering videos on trending topics - Interview preparation 1. Explaining Data Lake Versus Data Warehouse - 2. Learn Columnar Storage - 3. Analysing the failed Spark Jobs using Log Files - 4.
0
25
79
@bigdatasumit
Sumit Mittal
7 months
Python for Data Engineers / Data Analysts & Data Scientists I have released 3 videos till now : video 1 - - Introduction - Installing python 3 - Our first program - Variable - Datatypes - Type errors are caught at runtime - Typecasting - String
1
17
78
@bigdatasumit
Sumit Mittal
10 days
These 30 videos will help you for your next Data Engineering Interview! 1. Explaining Data Lake Versus Data Warehouse - 2. Learn Columnar Storage - 3. Analysing the failed Spark Jobs using Log Files -
0
23
75
@bigdatasumit
Sumit Mittal
8 months
CICD for Data Engineers in a Super Easy way! Let's say you are working on a RetailAnalysis Project & have a Jira ticket assigned to you RA-17843 If you are a developer you would create a feature branch, feature-RA-17843 & work on it. As soon as you make a git push, and github
3
12
75
@bigdatasumit
Sumit Mittal
7 months
I'm giving you free access to my session where I will show how to optimize your resume & take the ATS score from 38 to 100. yes you heard it right, a fully ATS compliant resume with 100 score. This will help you get 10X more Interview calls. In this session I have talked
65
41
72
@bigdatasumit
Sumit Mittal
6 months
Even if you have done a few python paid courses, check these 6 videos that I have uploaded on youtube. You will truly realize that you didn't know things this way & in this depth. 1. 2. 3. 4.
12
19
72
@bigdatasumit
Sumit Mittal
10 months
I asked my team member to give me one decent resume. I got one resume, I checked the ATS score, it was 26 I did a lot of research over the past few days, I made multiple changes, without using any AI software (to strike the right balance between the ATS score & human
2
6
73
@bigdatasumit
Sumit Mittal
7 months
If you are learning SQL, learn all the below things.. 1. SQL Fundamentals, CRUD Operations 2. Primary Key vs Unique Key, Auto Increment Values - 3. DDL vs DML, Truncate vs Delete 4. Foreign Key Constraint 5. Distinct, Order By, Limit, Like Keyword 6. Order of execution in SQL 7.
0
13
68
@bigdatasumit
Sumit Mittal
6 months
From 0 to Hero in SQL - Follow this Plan SQL Basics (14 videos) 📌 SQL Fundamentals, CRUD Operations & Setting Environment - 📌 Primary Key vs Unique Key, Auto Increment Values - 📌 DDL vs DML, Truncate vs Delete -
0
20
65
@bigdatasumit
Sumit Mittal
8 days
Big Data End to End Pipeline Ingest -> Store -> Process -> Serve Ingest - Get the data from multiple sources using some ingestion framework. Example - AWS Glue, Azure Data Factory, NiFi Store - Since we are going to store huge amount of data we need a Distributed/Object
0
10
65
@bigdatasumit
Sumit Mittal
10 months
Normalization vs Denormalization Normalization is a process of dividing the data into multiple smaller tables with an intent to reduce data redundancy & inconsistency. However, Denormalization is totally opposite of above idea. Denormalization is the technique of combining
0
9
65
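A small plain-Python sketch of the trade-off described above. The `customers` and `orders` data and the `denormalize` helper are made up for illustration; a real system would express the same idea with tables and a join.

```python
# Normalized: each fact is stored once and linked by customer_id.
customers = {
    1: {"customer_name": "Asha", "city": "Pune"},
    2: {"customer_name": "Ravi", "city": "Delhi"},
}
orders = [
    {"order_id": 101, "customer_id": 1, "amount": 250},
    {"order_id": 102, "customer_id": 1, "amount": 400},
    {"order_id": 103, "customer_id": 2, "amount": 150},
]


def denormalize(order_rows, customer_lookup):
    """Combine both tables into one wide table (one row per order)."""
    return [
        {**order, **customer_lookup[order["customer_id"]]}
        for order in order_rows
    ]


wide_orders = denormalize(orders, customers)

# The customer's city now appears once per order. That repetition is the
# redundancy normalization avoids and denormalization accepts in exchange
# for reading everything without a join.
city_copies = sum(1 for row in wide_orders if row["customer_id"] == 1)

print(wide_orders[0])
print(city_copies)
```

In the normalized form a change of city is a single update; in the wide table every copy has to be updated, which is where inconsistency can creep in.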
@bigdatasumit
Sumit Mittal
8 months
Apache Spark Partition skew explained in a super simple way Let's say we have 1 lakh coins of different denominations and we want to find the total sum. If one person has to do that, then it's a monolithic style and this will take time. so the best way is to distribute it to
1
9
65
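The coin analogy can be sketched in plain Python. The post is cut off, so the skewed split below is an assumption about where the analogy goes: the denomination mix, the four workers, and the helper names are all hypothetical, and no Spark API is involved.

```python
DENOMINATIONS = [1, 2, 5, 10]

# 1 lakh (100,000) coins, heavily dominated by one denomination.
coins = [1] * 97_000 + [2] * 1_000 + [5] * 1_000 + [10] * 1_000


def split_by_denomination(all_coins):
    """One worker per denomination: partition sizes follow the key distribution."""
    buckets = {denomination: [] for denomination in DENOMINATIONS}
    for coin in all_coins:
        buckets[coin].append(coin)
    return list(buckets.values())


def split_evenly(all_coins, workers):
    """Round-robin split: every worker gets roughly the same number of coins."""
    return [all_coins[i::workers] for i in range(workers)]


def slowest_worker_load(partitions):
    """The job finishes only when the most loaded worker finishes."""
    return max(len(partition) for partition in partitions)


skewed = split_by_denomination(coins)
balanced = split_evenly(coins, workers=4)

print(slowest_worker_load(skewed), slowest_worker_load(balanced))
print(sum(sum(p) for p in skewed), sum(sum(p) for p in balanced))
```

Both splits give the same total, but in the skewed split one worker counts almost all the coins while the other three sit idle, which is the effect partition skew has on a distributed job.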
@bigdatasumit
Sumit Mittal
5 months
A Data Engineering Roadmap beyond just cracking Interviews! Here is a 32 weeks Step by Step Plan 1. Introduction to Big Data/DataLake Storage (3 weeks) Big Data - The Big Picture, Linux Commands, Introducing the Multi Node Practice Environment, Distributed Storage 2.
2
11
61
@bigdatasumit
Sumit Mittal
7 months
Internal working of Apache Spark - One of the most liked writeup Let's say you have a 20 node spark cluster Each node is of size - 16 cpu cores / 64 gb RAM Let's say each node has 3 executors, with each executor of size - 5 cpu cores / 21 GB RAM => 1. What's the total capacity
1
4
57
@bigdatasumit
Sumit Mittal
7 days
I am developing an end to end project on Azure Data Engineering and will release it on my youtube channel. This project will be on healthcare domain. For now I will design and develop the version 1 of it. There will be 3 datasets - Patients data - Clinical Trial data (results
2
6
57
@bigdatasumit
Sumit Mittal
6 months
I am looking to hire 2 people for developing innovative solutions around Big Data technologies. Initially I will provide a paid internship and then if things go well, I will convert it to a full time role. My preferred choice for this hiring will be - Candidates with a career
28
5
33
@bigdatasumit
Sumit Mittal
7 months
From yesterday, I kept exploring different names for the Biggest Data Engineering Community that I am building. So finally, here is the name. "The Data Engineers Club" Deciding on a name is just the very first step, from the upcoming week we will be in action with a lot of
4
4
55
@bigdatasumit
Sumit Mittal
7 months
I have a vision to create a really Strong community in Data Engineering (The Biggest Community) To achieve this mission, I am starting with multiple Non Profit initiatives. I don't know till what level this will go, but I am determined to give my best! Few of the things that I
3
7
54
@bigdatasumit
Sumit Mittal
26 days
From a CTC of 12 LPA to 30+ LPA in just 7 months Multiple offers & an extremely good hike, that's what a well planned learning can help you achieve. It's such a joy to see your students getting tremendous success. My advice to everyone out there, a few failures in life
0
5
53
@bigdatasumit
Sumit Mittal
6 months
Here Comes the Gold mine for Data Engineers Python Complete Playlist (5 videos already released) lecture 1 - lecture 2 - lecture 3 - lecture 4 - lecture 5 -
2
13
53
@bigdatasumit
Sumit Mittal
7 months
1,00,000 Subscribers on YouTube 🔥 No clickbaits, no controversies, Just Educational Content. On May 4, 2019 I posted my first video on my YouTube channel. So far we have uploaded complete SQL course & there is an ongoing Python series for students. In the long run, we don’t
8
1
51
@bigdatasumit
Sumit Mittal
6 months
7 offers in Big Data with a whopping 230% hike Ideally I do not recommend my students taking more than 3 offers, but I can't stop anyone! Got this message from one of my students who cracked multiple companies. As a mentor It motivates me when I receive such messages from my
3
5
48
@bigdatasumit
Sumit Mittal
7 months
Till 2004 - used to be below average student throughout 2004 - failed in 12th Pre-boards 2006 - Got 1.5 lakh+ rank in AIEEE even after 1 year coaching 2007 - Could only manage to get admission in BCA Distant Had neither the skills nor the money :) Looks like it's the end of my
2
3
48
@bigdatasumit
Sumit Mittal
3 months
10 offers in Big Data with a whopping 90% hike (~2X CTC) Got this message from one of my students who cracked multiple companies. As a mentor It motivates me when I receive such messages from my students. I offer a super premium big data program that gives top results. if you
1
2
46
@bigdatasumit
Sumit Mittal
8 months
I have talked to a lot of students who recently attended the interview on Azure cloud. The way interviews are conducted is mostly the interviewer asks one question and has a lot of related follow up questions. For Azure Storage account I have framed 3 sets of questions which
0
3
48
@bigdatasumit
Sumit Mittal
5 months
12 ways to improve the performance of your Spark jobs 1. Caching Data In Memory 2. Join Strategy Hints for SQL Queries 3. Coalesce Hints for SQL Queries 4. Adaptive Query Execution
2
9
48
@bigdatasumit
Sumit Mittal
7 months
I have seen a lot of candidates who keep postponing the interviews. They keep saying I am still preparing, or I am not prepared. They feel they need more time, and want perfection. Remember one thing, you will never feel that you are 100% prepared. My suggestion, decide on a
0
6
47
@bigdatasumit
Sumit Mittal
6 months
Today's Data Engineering Mock interview will be at next level. It will simulate the SQL & DSA round for Data Engineers in a product based Company. The one who is taking the interview is working in Google & the one who is attending the interviews works in Walmart. Both of these
0
3
46
@bigdatasumit
Sumit Mittal
7 months
As part of the Data Engineers Club, we got a few mock interviews done this weekend. I sincerely thank the volunteers, who are making this a huge success! we will soon be uploading those to youtube so that it can help a bigger crowd. I will talk about what went well & what could
5
2
44
@bigdatasumit
Sumit Mittal
8 months
Learn Apache Spark Step by Step (Follow the Sequence) 1. A quick introduction to the Spark API 2. Overview of Spark - RDD, accumulators, broadcast variable 3. Spark SQL, Datasets, and DataFrames: 4.
0
7
44
@bigdatasumit
Sumit Mittal
5 months
"Sir I travelled all the way from Singapore, Just to meet you! " My students made me realise, ‘everything is possible’ - Student Success Celebration at The Taj Bangalore Key highlights of the event >> 130 Selected Students were Invited, 126 Attended showing their love for me.
2
1
43
@bigdatasumit
Sumit Mittal
8 months
Learn Apache Spark Step by Step (Follow the Sequence) 1. A quick introduction to the Spark API 2. Overview of Spark - RDD, accumulators, broadcast variable 3. Spark SQL, Datasets, and DataFrames: 4.
0
7
44
@bigdatasumit
Sumit Mittal
7 months
I am super excited to announce the launch of my new program "The Elite Data Engineering Program" It's a 20 weeks program High level topics covered in the Elite DE program are: =>Distributed Storage & Processing Fundamentals =>Pyspark in Depth =>A lot of performance tuning
2
7
43
@bigdatasumit
Sumit Mittal
8 months
What is Databricks? Databricks is a company formed by creators of apache spark. Databricks provides an apache spark based unified analytics platform optimized for cloud. when we talk about open source version of spark we have to deal with all the below challenges: =>
0
8
43
@bigdatasumit
Sumit Mittal
8 months
20 Recently asked Pyspark Interview questions 1. Difference between client and cluster mode 2. what is partition skew, reasons for it. How to solve partition skew issues? 3. what is a broadcast join in apache spark 4. what is the difference between partition and bucketing
0
10
43
@bigdatasumit
Sumit Mittal
2 months
2024 - Rockstar Data Engineer Roadmap
Prerequisites
---------------------
1. Linux commands
2. Programming fundamentals (preferably python)
3. SQL is very important
You should learn the below things
--------------------------------------
1. Distributed Computing Fundamentals
1
10
41
@bigdatasumit
Sumit Mittal
3 months
How to Learn SQL Step by Step (Complete Plan) SQL Basics (14 videos) 📌 SQL Fundamentals, CRUD Operations & Setting Environment - 📌 Primary Key vs Unique Key, Auto Increment Values - 📌 DDL vs DML, Truncate vs Delete -
0
9
40
@bigdatasumit
Sumit Mittal
20 days
What is Databricks? Databricks is a company formed by creators of apache spark. Databricks provides an apache spark based unified analytics platform optimized for cloud. when we talk about open source version of spark we have to deal with all the below challenges: =>
1
7
39
@bigdatasumit
Sumit Mittal
9 months
Most important question asked in Apache Spark Interviews. How do you optimize your spark jobs? Here are 10 ways you can optimize your spark job! 1. Filter irrelevant data as early as possible 2. make sure broadcast hash join strategy comes in play when joining one large and
0
4
37
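The first two tips above can be illustrated without Spark. This is a plain-Python sketch of the idea behind a broadcast hash join, not Spark's implementation: the small side is turned into an in-memory lookup that every worker could hold, and the large side is streamed past it after filtering. The post is cut off at "one large and", so the small-table side is an assumption, and the data and function names are hypothetical.

```python
def broadcast_hash_join(large_rows, small_rows, key):
    """Inner join: hash the small side once, then stream the large side."""
    lookup = {}
    for row in small_rows:
        lookup.setdefault(row[key], []).append(row)
    for row in large_rows:
        for match in lookup.get(row[key], []):
            yield {**match, **row}


# Hypothetical data: a large fact table and a small dimension table.
orders = [
    {"order_id": 1, "country": "IN", "amount": 50},
    {"order_id": 2, "country": "US", "amount": 5},
    {"order_id": 3, "country": "IN", "amount": 70},
    {"order_id": 4, "country": "XX", "amount": 90},
]
countries = [
    {"country": "IN", "region": "APAC"},
    {"country": "US", "region": "AMER"},
]

# Tip 1: drop irrelevant rows before the join so less data reaches it.
relevant_orders = (row for row in orders if row["amount"] >= 50)

# Tip 2: join against the small table held entirely in memory.
joined = list(broadcast_hash_join(relevant_orders, countries, key="country"))

for row in joined:
    print(row)
```

Because the small table fits in memory, the large table never has to be shuffled by the join key, which is the saving a broadcast join aims for.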
@bigdatasumit
Sumit Mittal
5 months
My new Data Engineering batch is starting tomorrow. It's a 32 weeks extensive program covering the fundamentals, Pyspark, Azure Cloud, AWS Cloud, Streaming, Performance Tuning and Projects. Who should opt for this? People in IT who want to move into Data Engineering, or people
2
11
36
@bigdatasumit
Sumit Mittal
2 months
These 30 videos will help you for your next Data Engineering Interview! 1. Explaining Data Lake Versus Data Warehouse - 2. Learn Columnar Storage - 3. Analysing the failed Spark Jobs using Log Files -
0
7
39
@bigdatasumit
Sumit Mittal
10 months
10 trending questions asked in Apache Spark interviews 1. how are initial number of partitions calculated in a dataframe 2. what happens internally when you execute spark-submit 3. what is a partition skew and how to tackle it 4. what are the spark optimization techniques you
0
3
39
@bigdatasumit
Sumit Mittal
7 months
Data Engineering - Walmart Complete hiring Process I have created a youtube video for all the aspiring Data Engineers, who are looking to get into Walmart. In this video, I have covered the below questions 1. How to get Interview Call from Walmart ? 2. Positions that you can
1
5
39
@bigdatasumit
Sumit Mittal
3 months
Important Data Engineering services in AWS & Azure Cloud
DataLake Storage
==============
Amazon S3
ADLS gen2
Interactive query service - Serverless
=============================
AWS Athena
Synapse serverless
Datawarehouse
==============
Amazon Redshift
Synapse Serverful
1
7
38
@bigdatasumit
Sumit Mittal
5 months
I have seen a lot of candidates who keep postponing the interviews. They keep saying I am still preparing, or I am not prepared. They feel they need more time, and want perfection. Remember one thing, you will never feel that you are 100% prepared. My suggestion, decide on a
0
4
39
@bigdatasumit
Sumit Mittal
7 months
As part of the Data Engineers Club we have got an amazing start. In the last 2 weeks we managed to conduct 12 Mock interviews. I will be uploading all of these to my youtube channel. For the next 30 days, each single day we will release one mock interview. This is going to be
0
4
39
@bigdatasumit
Sumit Mittal
4 months
12 ways to improve the performance of your Spark jobs 1. Caching Data In Memory 2. Join Strategy Hints for SQL Queries 3. Coalesce Hints for SQL Queries 4. Adaptive Query Execution
1
7
37
@bigdatasumit
Sumit Mittal
6 months
15 Must See Big Data Mock Interviews! Interview 1 - Interview 2 - Interview 3 - Interview 4 - Interview 5 - Interview 6 - Interview 7 -
0
8
37
@bigdatasumit
Sumit Mittal
8 months
I just uploaded a new video on my youtube channel covering 10 pyspark interview questions which were recently asked in the interviews. Here are the questions which I have answered in the video. 1. Difference between client mode and cluster mode 2. what is partition skew,
0
3
38