Solicité el puesto a través de un captador. El proceso duró 1 día. Acudí a una entrevista en Birlasoft (Hyderabad) en mar 2021
Entrevista
I received a call from the recruiter and the interview was scheduled the very next day. It was a video call with someone from the Team I was interviewing for. The Interview was divided into 3 parts (1 - Python related questions, 2- ETL/Data Warehouse related questions and 3- AWS related questions)
Preguntas de entrevista [2]
Pregunta 1
PYTHON
------------
1) What is the use of Class Method?
2) What is the Garbage Collection?
3) What are data structures in python?
4) What is the difference between Boto resource and Boto Client Object?
ETL/DWH
-------------
1) What is the difference between Data Lake and Data WareHouse?
2) Can you pull data from multiple sources for Glue? S3, csv files etc
3) Is Teradata RDBMS or NoSQL DB?
4) Difference between RDBMS and NoSQL DB?
5) Which DB is better for storing redundant data?
6) Difference betwen denormalized RDBMS and NoSQL for storing redundant data?
AWS
------
1) Why is AWS Glue Used? Is it used to create a Data Lake or Data Warehouse?
2) What is Cold Start Procedure in Lambda?
3) What is the maximum exection time of Lambda?
4) How to execute a 40 minute job using Lambda?
5) How to compress an image file using Lambda?
6) Assume a S3 Bucket hosted in another account which is private and I need to access the file path in
that S3 Bucket using a another aws account
7) Vertical vs Horizontal scaling in AWS
What you’re describing sounds like an interviewer who was checking boxes rather than evaluating thinking, which is a weak interviewing style. When someone just fires questions and looks for their expected keywords, they’re not really assessing your real capability—especially for a role like data engineering where reasoning matters more than memorized answers.
Preguntas de entrevista [1]
Pregunta 1
SCD. ADF, PySpark, delta format, write and schema evolution
Solicité el puesto a través de un captador. Acudí a una entrevista en Birlasoft
Entrevista
Basic Data warehouse concepts , Pyspark internal and Sql Questions joins and 5th highest salary, windows functions, hive internal and external tables,denserrank, rank, group by reduce by, hadoop and hdfs