1,319 likes, 45,805 views

data engineer interview questions

In this video, I talk about salting in Spark.
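
For a quick picture of what salting does, here is a minimal sketch in plain Python (not Spark code, and not taken from the video). It assumes a skewed join where one key dominates the large side: a random salt is appended to each large-side key, the small side is replicated once per salt, and the join runs on the (key, salt) pair. The names `NUM_SALTS`, `salt_large_side`, `explode_small_side` and the sample rows are all made up for illustration.

```python
import random
from collections import Counter, defaultdict

NUM_SALTS = 4  # hypothetical value; in practice it depends on how skewed the key is


def salt_large_side(rows, num_salts=NUM_SALTS, seed=42):
    """Attach a random salt to every key on the large (skewed) side."""
    rng = random.Random(seed)
    return [((key, rng.randrange(num_salts)), value) for key, value in rows]


def explode_small_side(rows, num_salts=NUM_SALTS):
    """Replicate each small-side row once per salt so every salted key can match."""
    return [((key, salt), value) for key, value in rows for salt in range(num_salts)]


def hash_join(left, right):
    """Plain inner join on the first element of each row."""
    index = defaultdict(list)
    for key, value in right:
        index[key].append(value)
    return [(key, lv, rv) for key, lv in left for rv in index.get(key, [])]


# Made-up skewed data: store_1 has 1000 rows, store_2 only 10.
sales = [("store_1", i) for i in range(1000)] + [("store_2", i) for i in range(10)]
stores = [("store_1", "region_a"), ("store_2", "region_b")]

plain = hash_join(sales, stores)

salted_sales = salt_large_side(sales)
salted = hash_join(salted_sales, explode_small_side(stores))
# Drop the salt again so the result has the same shape as the plain join.
unsalted = [(key, lv, rv) for (key, _salt), lv, rv in salted]

rows_per_key_plain = Counter(key for key, _ in sales)
rows_per_key_salted = Counter(key for key, _ in salted_sales)
print("largest group without salt:", max(rows_per_key_plain.values()))
print("largest group with salt:   ", max(rows_per_key_salted.values()))
```

The join result is the same either way; what changes is that the hot key is split into several smaller groups, which in Spark would let several tasks share the work instead of one. The cost is that the small side grows by a factor of `NUM_SALTS`.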

Directly connect with me on:- topmate.io/manish_kumar25

Discord channel:- discord.gg/6r9QF6TE3F

Project details for resume :-

Successfully led a data engineering project in a retail environment using technologies such as Apache Spark, Python, SQL, and Amazon S3 to optimize data processing.

Implemented structured data models, including dimension and fact tables, to provide valuable context for point-of-sale data analysis.

Designed and executed an incentive program based on sales performance, enhancing motivation among sales teams by rewarding top performers.

Managed extensive daily data volumes of approximately 100GB, demonstrating the ability to handle large-scale data pipelines.

Employed Spark optimization techniques like caching and broadcast joins to improve data processing speed and efficiency.

Utilized Azure CI/CD pipelines for code deployment, and orchestrated workflows using Airflow and CRON jobs.


Detailed writeup to explain more during interview:-

As a Data Engineer on a project for a prominent offline grocery and kitchen supplies retailer, I applied my expertise in data engineering to drive critical improvements in their data processing and analysis operations.

The project primarily focused on processing and analyzing point-of-sale data, which was structured into dimension and fact tables to provide meaningful context for sales analysis. To further enhance employee motivation and performance, we designed and implemented an incentive program that rewarded salespeople with the highest sales volumes in each store.

Handling a substantial daily data volume of approximately 100GB, we leveraged Apache Spark and applied optimization techniques like data caching and broadcast joins to significantly accelerate data processing. This not only improved the speed of our data pipelines but also increased the efficiency of our data analysis.
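
If the interviewer asks how caching and broadcast joins help, a small plain-Python sketch like the one below can make the idea concrete. It is only an illustration under assumptions, not the project's code: `load_store_dimension`, `broadcast_join`, and the sample rows are hypothetical, `lru_cache` stands in for caching a reused dataset, and the per-partition dictionary copy stands in for the broadcast copy each Spark executor would receive.

```python
from functools import lru_cache

LOAD_CALLS = 0  # counts how often the "expensive" load actually runs


@lru_cache(maxsize=None)
def load_store_dimension():
    """Pretend this is an expensive read; caching makes it run only once."""
    global LOAD_CALLS
    LOAD_CALLS += 1
    return (("store_1", "North"), ("store_2", "South"))  # made-up dimension rows


def broadcast_join(partitions, small_table):
    """Map-side join: each partition looks keys up in its own copy of the
    small table, so the large side never has to be shuffled."""
    joined = []
    for part in partitions:
        local_copy = dict(small_table)  # stand-in for the broadcast copy
        joined.append(
            [(key, amount, local_copy[key]) for key, amount in part if key in local_copy]
        )
    return joined


# Made-up fact data, already split into two partitions.
fact_partitions = [
    [("store_1", 120.0), ("store_2", 80.0)],
    [("store_1", 45.5)],
]

result_a = broadcast_join(fact_partitions, load_store_dimension())
result_b = broadcast_join(fact_partitions, load_store_dimension())  # served from cache
print(result_a)
print("dimension loaded", LOAD_CALLS, "time(s)")
```

The point to make in the interview is that a broadcast join only pays off when one side is small enough to copy to every worker, and caching only pays off when the same data is reused more than once.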

We seamlessly integrated the code deployment process into the Azure CI/CD pipeline. As part of workflow automation, we orchestrated task scheduling using Airflow and CRON jobs.

One of the project's major achievements was the implementation of a customer engagement strategy that identified infrequent buyers and provided incentives in the form of coupons. This initiative not only boosted customer retention but also had a positive impact on the overall business growth.



For more queries, reach out to me on the social media handles below.

Follow me on LinkedIn:- www.linkedin.com/in/manish-kumar-373b86176/
Follow Me On Instagram:- www.instagram.com/competitive_gyan1/
Follow me on Facebook:- www.facebook.com/MANISH12340

My Second Channel -- @competitivegyan1

Interview series Playlist:- Interview Questions and answers


My Gear:-
Rode Mic:-- amzn.to/3RekC7a
Boya M1 Mic-- amzn.to/3uW0nnn
Wireless Mic:-- amzn.to/3TqLRhE
Tripod1 -- amzn.to/4avjyF4
Tripod2:-- amzn.to/46Y3QPu
camera1:-- amzn.to/3GIQlsE
camera2:-- amzn.to/46X190P
Pentab (Medium size):-- amzn.to/3RgMszQ (Recommended)
Pentab (Small size):-- amzn.to/3RpmIS0
Mobile:-- amzn.to/47Y8oa4 (You should definitely not buy this one)
Laptop -- amzn.to/3Ns5Okj
Mouse+keyboard combo -- amzn.to/3Ro6GYl
21 inch Monitor-- amzn.to/3TvCE7E
27 inch Monitor-- amzn.to/47QzXlA
iPad Pencil:-- amzn.to/4aiJxiG
iPad 9th Generation:-- amzn.to/470I11X
Boom Arm/Swing Arm:-- amzn.to/48eH2we

My PC Components:-
intel i7 Processor:-- amzn.to/47Svdfe
G.Skill RAM:-- amzn.to/47VFffI
Samsung SSD:-- amzn.to/3uVSE8W
WD blue HDD:-- amzn.to/47Y91QY
RTX 3060Ti Graphic card:- amzn.to/3tdLDjn
Gigabyte Motherboard:-- amzn.to/3RFUTGl
O11 Dynamic Cabinet:-- amzn.to/4avkgSK
Liquid cooler:-- amzn.to/472S8mS
Antec Prizm FAN:-- amzn.to/48ey4Pj
