DXC Technology
American multinational B2B IT services provider.
4 Rounds
~21 Days
Medium
The Interview Loop
Recruiter Screen (30 min)
Standard fit check, behavioral questions, and resume overview.
Technical Loop (3-4 Rounds)
Deep dive into domain knowledge, coding, and system design.
Interview Question Bank
Data Engineer • Coding • Medium
Write a PySpark snippet to group a dataframe by 'department' and calculate the average salary and total employee count for each.
#PySpark
#Aggregations
#DataFrames
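One possible answer, as a minimal sketch: it assumes the input DataFrame has columns named `department` and `salary` (the column names and the function name `department_summary` are illustrative, not taken from a real schema).

```python
def department_summary(df):
    """Return one row per department with average salary and employee count."""
    # Imported inside the function so the module loads even without Spark installed.
    from pyspark.sql import functions as F

    return (
        df.groupBy("department")  # assumed column name
        .agg(
            F.avg("salary").alias("avg_salary"),         # assumed column name
            F.count("*").alias("employee_count"),        # rows per department
        )
    )
```

Calling `department_summary(employees_df).show()` would print the result; both aggregates are computed in a single `agg` call, so the data is grouped only once.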
Data Engineer • Technical • Medium
Explain the difference between repartition() and coalesce() in PySpark. When would you use one over the other?
#PySpark
#Data Partitioning
#Performance Optimization
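The contrast can be sketched in code. The helper names below are hypothetical; the only Spark calls used are `DataFrame.coalesce` and `DataFrame.repartition`.

```python
def shrink_partitions(df, target_partitions):
    """Reduce the partition count, e.g. before writing a small result."""
    # coalesce merges existing partitions and avoids a full shuffle,
    # but it can only lower the partition count, not raise it.
    return df.coalesce(target_partitions)


def rebalance_by_key(df, num_partitions, key):
    """Redistribute rows evenly, e.g. before a wide join or aggregation."""
    # repartition performs a full shuffle; it can raise or lower the
    # partition count and can hash-partition by one or more columns.
    return df.repartition(num_partitions, key)
```

A common rule of thumb: reach for `coalesce` when only reducing the number of output files, and for `repartition` when partitions are uneven or more parallelism is needed.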
Data Engineer • Technical • Medium
What is a Broadcast Join in Spark? How does it improve performance compared to a Sort Merge Join?
#PySpark
#Joins
#Distributed Computing
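As an illustration, a broadcast join can be requested with the `broadcast` hint. This is a sketch that assumes `dim_df` is small enough to fit in each executor's memory; the function and argument names are hypothetical.

```python
def join_with_small_table(facts_df, dim_df, key):
    """Join a large table to a small one without shuffling the large side."""
    # Imported inside the function so the module loads even without Spark installed.
    from pyspark.sql.functions import broadcast

    # The hint asks Spark to ship a full copy of dim_df to every executor,
    # so each partition of facts_df is joined locally. A sort merge join
    # would instead shuffle and sort both sides on the join key.
    return facts_df.join(broadcast(dim_df), on=key, how="inner")
```

Running `.explain()` on the result should show a `BroadcastHashJoin` node in the physical plan when the hint is honored.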
Data Engineer • Technical • Hard
During a PySpark job, you notice that one task takes significantly longer than the others, causing a bottleneck. What is the likely cause and how do you fix it?
#PySpark
#Data Skew
#Troubleshooting
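If data skew is the suspected cause, a quick way to check is to count rows per key and look at the largest groups. The helper below is a hypothetical sketch, not a standard Spark utility.

```python
def top_keys_by_row_count(df, key, n=10):
    """Return the n most frequent values of `key` with their row counts."""
    # Imported inside the function so the module loads even without Spark installed.
    from pyspark.sql import functions as F

    # groupBy(...).count() adds a column named "count"; a handful of keys
    # holding most of the rows points to skew on that key.
    return df.groupBy(key).count().orderBy(F.desc("count")).limit(n)
```

If one or two keys dominate the output, typical remedies include broadcasting the smaller side of the join, salting the hot keys, or enabling adaptive query execution's skew-join handling in Spark 3.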
Data Engineer • Technical • Easy
Explain lazy evaluation in Apache Spark. Why is it beneficial?
#Spark Architecture
#DAG
#Transformations vs Actions
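A small sketch can make the transformation/action split concrete. It assumes a DataFrame with columns `salary` and `name`; those names and the function name are illustrative only.

```python
def high_earner_names(df, threshold):
    """Build (but do not run) a query for names of employees above a salary."""
    # Imported inside the function so the module loads even without Spark installed.
    from pyspark.sql import functions as F

    # filter and select are transformations: they only extend the logical
    # plan (the DAG). No data is read until an action is called.
    return df.filter(F.col("salary") > threshold).select("name")
```

Calling `high_earner_names(df, 100000)` returns immediately; `.explain()` prints the plan Spark has built, and an action such as `.count()` or `.show()` triggers execution. Deferring work this way lets the optimizer combine steps, for example pushing the filter down to the data source and reading only the needed columns.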
Data Engineer • Technical • Hard
Explain the concept of 'salting' in PySpark. Write a conceptual code snippet showing how you would implement it to fix a skewed join.
#PySpark
#Data Skew
#Advanced Optimization
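One way to sketch salting, under stated assumptions: `skewed_df` is the large side with a few hot keys, `other_df` is small enough to be replicated `num_salts` times, and the column name `salt` is free in both. The function name and the default of 8 salts are hypothetical.

```python
def salted_join(skewed_df, other_df, key, num_salts=8):
    """Join on `key`, spreading each hot key across `num_salts` partitions."""
    # Imported inside the function so the module loads even without Spark installed.
    from pyspark.sql import functions as F

    # Skewed side: tag every row with a random salt in [0, num_salts).
    left = skewed_df.withColumn("salt", (F.rand() * num_salts).cast("int"))

    # Other side: replicate every row once per salt value so each
    # (key, salt) pair on the left still finds its match.
    all_salts = F.array(*[F.lit(i) for i in range(num_salts)])
    right = other_df.withColumn("salt", F.explode(all_salts))

    # Joining on (key, salt) splits each hot key into num_salts smaller groups.
    return left.join(right, on=[key, "salt"], how="inner").drop("salt")
```

The trade-off is that the non-skewed side grows by a factor of `num_salts`, so the salt count is usually kept small and, in practice, often applied only to the keys known to be hot.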
Difficulty Radar
Based on recent AI-sourced data.
Meet Your Interviewers
The "Standard" Interviewer
Senior Engineer
Focuses on core competencies, system constraints, and clear communication.
Unwritten Rules
Think Out Loud
Always explain your thought process before writing code or drawing architecture.