Accenture DataEngineer/ Bigdata Developer Interview Questions

By Karthik Kondpak

1. Intro like past experience current project and primary skills

2. Rate yourself in pyspark/scala

Sql/HQL

Spark

Etl

Bigdata

3. Explain the Data flow in your project?

4. What is Denormalization?

5.Different Type Hive tables?

6. Explain Spark job execution?

7. Which one does spark prefer cluster mode or client mode?

8. What will you do in order to pertain schema in hive if you drop external table in hive

9. Explain Bucketmap join, broadcast join?

10. Tell me about Few Spark optimization techniques which you have Worked?

11. How do you debug the code or perform unit testing?

12. What are the main challenges you have faced and how did you fix it?

13. How do you optimise the code(Continuation of 12th question )

14.scenario: in prod you have observed that when the data is increasing the job is running long it has many buckets and joins so what is your approach to fix this problem as it is the prod data you can not delete or modify data

15. If there is any error how do you try to fix it(Continuation of 14th question)

16. Emp table with empfirstname,lastname,id print the last name of 3rd employee,Do it using dataframes ?

18. What if I have to add #doj column to it and specify datatype to this how to do it in DataFrames?

19. Suppose you have a file with few words seperated with space, how do you count the number of characters in the file?

20. How good r u to learn different technologies, r u planning to take aws certification, do you have any questions for me

Search This Blog

BigData Tech Stack

Accenture DataEngineer Interview Questions

Accenture DataEngineer/ Bigdata Developer Interview Questions

By Karthik Kondpak

Comments

Post a Comment

Popular Posts

Impetus DataEngineer Interview Questions