Accenture
Global professional services company with leading capabilities in digital, cloud and security.
4 Rounds
~21 Days
Medium
The Interview Loop
Recruiter Screen (30 min)
Standard fit check, behavioral questions, and resume overview.
Technical Loop (3-4 Rounds)
Deep dive into domain knowledge, coding, and system design.
Interview Question Bank
Data Engineer • Coding • medium
Write a PySpark script to read a parquet file, drop duplicate rows based on 'user_id' keeping the row with the most recent 'update_timestamp', and write back to a Delta table.
#PySpark
#Window Functions
#Delta Lake
Data Engineer • Technical • hard
How do you handle data skewness in PySpark when joining a very large fact table with a dimension table?
#PySpark
#Performance Tuning
#Data Skewness
#Salting
Data Engineer • Technical • easy
What is the difference between repartition() and coalesce() in PySpark? When would you use one over the other?
#PySpark
#Data Partitioning
#Performance Optimization
Data Engineer • Technical • hard
How does the Catalyst Optimizer work in Spark? Explain the logical and physical plan generation.
#Spark Internals
#Catalyst Optimizer
#Query Execution
Data Engineer • Technical • medium
What is the 'small files problem' in Hadoop/Spark, and how do you resolve it?
#HDFS
#Spark
#Performance Tuning
#File Formats
Data Engineer • Technical • easy
What is the difference between an RDD, a DataFrame, and a Dataset in Spark?
#Spark
#Data Structures
Difficulty Radar
Based on recent AI-sourced data.
Meet Your Interviewers
The "Standard" Interviewer
Senior Engineer
Focuses on core competencies, system constraints, and clear communication.
Unwritten Rules
Think Out Loud
Always explain your thought process before writing code or drawing architecture.