IBM
Global technology and consulting firm with deep roots in enterprise IT and AI.
3 Rounds
~14 Days
Medium
The Interview Loop
Recruiter Screen (30 min)
Standard fit check, behavioral questions, and resume overview.
Technical Loop (3-4 Rounds)
Deep dive into domain knowledge, coding, and system design.
Interview Question Bank
Data Engineer
•
Technical
•
medium
Explain how Apache Spark handles fault tolerance.
#Spark
#RDD Lineage
#DAG
Data Engineer
•
Technical
•
hard
How do you resolve an OutOfMemory (OOM) error in a Spark application?
#Spark
#Performance Tuning
#Memory Management
Data Engineer
•
Technical
•
medium
Explain the difference between repartition() and coalesce() in Spark. When would you use each?
#Spark
#Data Shuffling
#Optimization
Data Engineer
•
Technical
•
hard
What is data skew in Spark, and how do you mitigate it?
#Spark
#Data Skew
#Salting
Data Engineer
•
Technical
•
medium
Explain the concept of Broadcast Variables and Accumulators in Spark.
#Spark
#Shared Variables
Data Engineer
•
Technical
•
hard
What is the Catalyst Optimizer in Spark SQL?
#Spark
#Internals
#Optimization
Data Engineer
•
Technical
•
medium
What are Parquet and ORC formats? Why are they preferred in Big Data over CSV or JSON?
#File Formats
#Storage Optimization
Difficulty Radar
Based on recent AI-sourced data.
Meet Your Interviewers
The "Standard" Interviewer
Senior EngineerFocuses on core competencies, system constraints, and clear communication.
SimulateUnwritten Rules
Think Out Loud
Always explain your thought process before writing code or drawing architecture.