Impetus DataEngineer Interview Questions


Impetus DataEngineer Interview Questions 

By Karthik Kondpak



Impetus Inteview Pattern

========================

Totally 3 Rounds

1) Technical Round 1

2) Technical Round 2

3) Hr Round


Technical Round 1:-

==================

1) Difference Between Val and Final Keyword in scala?

2) What is Trait?

3)Lets Consider there is a list,How will you print Each element in the list? Can you tell me Different Type of Iterators available 

  in Scala?  (Foreach ,Println)

4) How to Find even and odd Numbers using Map and Filter if you have given a List of Numbers?

5)Can you Explain About the Architecture of your Project?

6) How much data flows daily in your dataPipeline?

7)Explain About the Design Pattern You used in Your technical Pipeline?

Ex: Singleton,Factory Design Pattern

8)Difference Between GroupBy and ReduceByKey?

9) What is OOM Issue, Have You Ever come Across if you have come across how you dealt?

10) What is meant By DataSkewNess?

11) What are accumulator and BroadCast Variables?

12)What is BroadCastJoin ?


Technical Round 2 :-

====================

1). For below table A and B write left, right and inner join query and output?

Table A (id, name)

 1, amit

 2, Aditya 

Table B (id, name)

 2, Sunil

 4, suresh

2). Query to find second highest salary?

 Employee table - (id, name,salary)6. Query to find the employee name and 

manager name of employee and write output for below data?

id, name, manager id 

1, aditya, null

 2, amit, 1

 3, sunit, 2

3. You have employee table. Find the employee names having same salary?

4. What is difference between RDD and Dataframe?

5. If we do self join in spark then will it create another stage?

6. Will reading or writing the spark data frame creates new stage?

7. In what case sort merge join is not preferred to use in spark?

8. How can we see which join is being in our spark program?

9)lets Consider you data is Stored in Aws S3 then when you load data into spark how many partitions will be created?

10)Difference Between Hive and Amazon Glue?

11)Expain Difference Between DataFrame and DataSet?

12) How you deal with Nulls in the Spark Program?

13) What is Difference Between Map and FlatMap?

14) Given a list and write a Program to reverse the elements?


Comments

  1. Hi,
    Just now I started following you and loved your interview questions for data engineer. Currently I am in a production support and really want to get into data.I know the roadmap but have no idea like to which skill should I develop first or give priority to. Eagerly waiting for your response as I am unable to text you on LinkedIn.

    ReplyDelete

Post a Comment

Popular Posts