Flipkart
Indian e-commerce giant tackling massive supply chain and scale problems.
4 Rounds
~18 Days
Hard
The Interview Loop
Recruiter Screen (30 min)
Standard fit check, behavioral questions, and resume overview.
Technical Loop (3-4 Rounds)
Deep dive into domain knowledge, coding, and system design.
Interview Question Bank
Cloud Engineer • Behavioral • medium
Tell me about a time you caused a production outage. How did you handle it, and what did the Root Cause Analysis (RCA) find?
#Ownership
#Incident Management
#Continuous Improvement
Cloud Engineer • Behavioral • medium
Describe a situation where you had to push back on a development team's architectural choice because it wasn't cloud-native or scalable.
#Communication
#Stakeholder Management
#Cloud Native
Cloud Engineer • Behavioral • easy
Tell me about a time you automated a repetitive operational task. What was the impact on the team?
#Automation
#Efficiency
#Initiative
Cloud Engineer • Coding • medium
Write a Python script to efficiently parse a 50GB Nginx access log file and find the top 10 IP addresses making requests.
#Python
#File I/O
#Data Structures
#Memory Management
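A minimal sketch of one way to approach the log question above, assuming the default Nginx access log layout in which the client IP is the first space-delimited field. The function name `top_ips` and the file path in the usage note are illustrative, not part of the original question. Iterating line by line keeps memory proportional to the number of distinct IPs rather than to the 50GB file size.

```python
from collections import Counter


def top_ips(lines, n=10):
    """Return the n most frequent client IPs as (ip, count) pairs.

    `lines` can be any iterable of log lines, including an open file
    object, so the whole file is never held in memory at once.
    """
    counts = Counter()
    for line in lines:
        # Assumption: the client IP is the first space-delimited field.
        ip = line.split(" ", 1)[0].strip()
        if ip:  # skip blank lines
            counts[ip] += 1
    return counts.most_common(n)
```

A typical call would be `with open("access.log", errors="replace") as f: print(top_ips(f))`. If the number of distinct IPs were itself too large for memory, a follow-up could shard the file by IP hash and merge per-shard results.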
Cloud Engineer • Coding • easy
Write a bash script to monitor disk space on 1000 Linux VMs and alert via an API webhook if any partition exceeds 85%.
#Bash
#Linux
#Automation
#Monitoring
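The question asks for Bash, but the threshold logic is easier to show in isolation. The sketch below uses Python, assuming each VM returns the text of `df -P` (POSIX format, with the usage percentage in the fifth column). The function name `partitions_over` and the alert dictionary shape are hypothetical. How the output is collected from 1000 VMs and how the webhook is called are deliberately left out.

```python
def partitions_over(df_output, threshold=85):
    """Return one alert dict per partition whose usage exceeds `threshold` percent.

    `df_output` is the raw text of `df -P` from a single host.
    """
    alerts = []
    for line in df_output.splitlines()[1:]:  # skip the header row
        fields = line.split()
        if len(fields) < 6:
            continue  # malformed or wrapped line
        try:
            use_pct = int(fields[4].rstrip("%"))
        except ValueError:
            continue  # e.g. "-" reported for some pseudo filesystems
        if use_pct > threshold:
            alerts.append({
                "filesystem": fields[0],
                "mount": " ".join(fields[5:]),  # mount points may contain spaces
                "use_pct": use_pct,
            })
    return alerts
```

Each returned dictionary could be serialized with `json.dumps` and posted to the alerting webhook. A Bash answer would apply the same comparison with `df -P | awk`.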
Cloud Engineer • Coding • hard
Given a list of server logs with timestamps and error codes, write a function to find the longest contiguous window where the error rate exceeded 5%.
#Sliding Window
#Data Structures
#Log Parsing
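One possible approach to the window question above, sketched under stated assumptions: logs arrive as `(timestamp, status_code)` pairs already sorted by time, a "window" means a run of consecutive entries, and a status of 500 or above counts as an error. A plain two-pointer sliding window does not work directly because the error rate is not monotonic as the window grows, so this sketch swaps in prefix sums with a monotonic stack. Each entry is weighted `1 - threshold` if it is an error and `-threshold` otherwise, so a window exceeds the threshold exactly when its weights sum to more than zero.

```python
from fractions import Fraction


def longest_high_error_window(logs, threshold=Fraction(5, 100),
                              is_error=lambda code: code >= 500):
    """Return (start, end) indices, end exclusive, of the longest run of
    consecutive entries whose error rate is strictly above `threshold`.
    Returns None when no run qualifies. Runs in O(n) time.
    """
    # prefix[k] = sum of weights of the first k entries (exact arithmetic).
    prefix = [Fraction(0)]
    for _, code in logs:
        prefix.append(prefix[-1] + (1 if is_error(code) else 0) - threshold)

    # Candidate start indices: positions where the prefix sum hits a new minimum.
    stack = []
    for i, value in enumerate(prefix):
        if not stack or value < prefix[stack[-1]]:
            stack.append(i)

    best_start, best_end = 0, 0
    # Scan end indices from the right, pairing each with the leftmost valid start.
    for j in range(len(prefix) - 1, -1, -1):
        while stack and prefix[j] > prefix[stack[-1]]:
            i = stack.pop()
            if j - i > best_end - best_start:
                best_start, best_end = i, j
    if best_end == best_start:
        return None
    return best_start, best_end
```

The timestamps of the window are then `logs[start][0]` and `logs[end - 1][0]`. If the interviewer means fixed time buckets rather than entry counts, the same idea applies after aggregating entries per bucket.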
Cloud Engineer • Coding • medium
Write a Go program that concurrently checks the health of 100 API endpoints and aggregates the results, timing out after 2 seconds.
#Go
#Concurrency
#Goroutines
#Channels
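The question asks for Go, where the usual answer is one goroutine per endpoint, a results channel, and a `context.WithTimeout` deadline. The same fan-out and fan-in shape is sketched here in Python's `asyncio`. The `probe` coroutine is a hypothetical stand-in for the real HTTP call and is injected by the caller, and the result strings are illustrative.

```python
import asyncio


async def check_all(endpoints, probe, timeout=2.0):
    """Probe every endpoint concurrently and return {endpoint: result}.

    `probe` is an async callable taking an endpoint and returning a status.
    Each probe is cut off after `timeout` seconds; since all probes start
    together, the whole batch finishes in roughly `timeout` seconds.
    """
    async def check_one(endpoint):
        try:
            status = await asyncio.wait_for(probe(endpoint), timeout)
            return endpoint, status
        except asyncio.TimeoutError:
            return endpoint, "timeout"
        except Exception as exc:  # one failing endpoint must not sink the batch
            return endpoint, f"error: {exc}"

    # gather() plays the role of the results channel: it collects every outcome.
    pairs = await asyncio.gather(*(check_one(e) for e in endpoints))
    return dict(pairs)
```

A caller would run it with `asyncio.run(check_all(urls, probe))`. In a Go answer, the points interviewers tend to look for are the same: bounded waiting, per-endpoint error isolation, and no leaked workers after the deadline.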
Cloud Engineer • System Design • hard
Design the cloud infrastructure to handle Flipkart's Big Billion Days (BBD) flash sale traffic, ensuring zero downtime and sub-second latency.
#Scalability
#High Availability
#Load Balancing
#Caching
Cloud Engineer • System Design • hard
How would you design a multi-region active-active Kubernetes cluster setup for Flipkart's checkout service?
#Kubernetes
#Multi-region
#Disaster Recovery
#Global Load Balancing
Cloud Engineer • System Design • hard
Design a centralized logging and observability stack for 5000+ microservices generating petabytes of logs daily.
#Observability
#Data Pipelines
#Kafka
#Elasticsearch
Cloud Engineer • System Design • hard
Design a zero-downtime deployment pipeline for a stateful service like a distributed cache or database.
#CI/CD
#Stateful Deployments
#Blue-Green
#Canary
Cloud Engineer • System Design • hard
How would you migrate a 10TB relational database from an on-premise data center to a managed cloud database with near-zero downtime?
#Database Migration
#Replication
#Cloud Architecture
Cloud Engineer • System Design • hard
Design a rate-limiting service for Flipkart's public API to prevent DDoS attacks and aggressive scraping.
#Rate Limiting
#Security
#Algorithms
#Redis
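Token bucket is one of the algorithms commonly discussed for the rate-limiting question above. The sketch below is a single-process, in-memory version, assuming one bucket per client. The class name, parameters, and injectable `clock` are illustrative. A production design would keep the bucket state in a shared store such as Redis and update it atomically.

```python
import time


class TokenBucket:
    """Allow bursts of up to `capacity` requests, refilled at `rate` tokens per second."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock  # injectable so tests can control time
        self.tokens = float(capacity)
        self.updated = clock()

    def allow(self, cost=1.0):
        """Return True and spend `cost` tokens if the request may proceed."""
        now = self.clock()
        # Refill lazily based on elapsed time, capped at the bucket capacity.
        elapsed = now - self.updated
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

A gateway would typically hold a dictionary of buckets keyed by API key or client IP and return HTTP 429 when `allow()` is false. Volumetric DDoS traffic usually has to be absorbed further upstream, so this covers only the application-level part of the question.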
Cloud Engineer • System Design • medium
Design a cloud-based image resizing service that dynamically processes millions of product images based on device screen size.
#Serverless
#Caching
#Event-driven Architecture
Cloud Engineer • System Design • hard
Explain how you would design the IAM role hierarchy and cross-account access for Flipkart's staging, pre-prod, and prod environments.
#IAM
#Cloud Security
#Governance
#AWS/GCP
Cloud Engineer • System Design • hard
Design a disaster recovery (DR) strategy for Flipkart's order management system with an RTO of 15 minutes and RPO of 1 minute.
#Disaster Recovery
#RTO/RPO
#Data Replication
#High Availability
Cloud Engineer • Technical • medium
During a high-traffic event, you notice a sudden spike in 504 Gateway Timeouts on the API Gateway. Walk me through your step-by-step debugging process.
#Networking
#API Gateway
#Observability
#Incident Response
Cloud Engineer • Technical • medium
Explain the packet flow when a user searches for a product on Flipkart.com, from the browser to the backend microservice.
#DNS
#TCP/IP
#TLS
#Load Balancing
#Reverse Proxy
Cloud Engineer • Technical • medium
How do you manage Terraform state for a team of 50+ cloud engineers working concurrently on the same infrastructure?
#Terraform
#State Management
#Concurrency
#CI/CD
Cloud Engineer • Technical • medium
How do you optimize costs for a massive cloud deployment without degrading performance during peak sales?
#FinOps
#Auto-scaling
#Spot Instances
#Resource Allocation
Cloud Engineer • Technical • hard
A Linux server's CPU load average is 50, but CPU utilization is only 10%. What is happening and how do you fix it?
#Linux
#Process States
#I/O Wait
#Troubleshooting
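On Linux, the load average counts tasks in uninterruptible sleep (state `D`, usually waiting on I/O) as well as runnable tasks (state `R`), which is why load can be high while CPU utilization stays low. A first diagnostic step can be sketched as below, assuming the input is the text output of `ps -eo state,pid,comm`. The function names are illustrative.

```python
from collections import Counter


def summarize_states(ps_output):
    """Count processes by the first letter of the state column."""
    counts = Counter()
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split()
        if fields:
            counts[fields[0][0]] += 1
    return counts


def load_contributors(counts):
    """Tasks that count toward the Linux load average: runnable plus uninterruptible."""
    return counts["R"] + counts["D"]
```

A large `D` count with few `R` tasks points toward storage or network filesystem stalls rather than CPU pressure. Tools such as `iostat` or `vmstat` can then help narrow down the device involved.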
Cloud Engineer • Technical • hard
Explain how Kubernetes handles network partitions (split-brain) and what happens to the pods on the isolated node.
#Kubernetes
#Distributed Systems
#Networking
#Node Lifecycle
Cloud Engineer • Technical • medium
Differentiate between Application Load Balancer (ALB) and Network Load Balancer (NLB). When would you use NLB for Flipkart's infrastructure?
#Load Balancing
#OSI Model
#AWS/GCP
Cloud Engineer • Technical • medium
How do you securely manage secrets and API keys in a Kubernetes environment at scale?
#Kubernetes
#Secret Management
#Vault
#RBAC
Cloud Engineer • Technical • easy
Compare block storage, file storage, and object storage. Which would you choose for storing user-uploaded product review images and why?
#Storage
#AWS S3
#EBS
#EFS
Cloud Engineer • Technical • medium
What is the difference between a StatefulSet and a Deployment in Kubernetes? Give a real-world use case for each at Flipkart.
#Kubernetes
#Workload Management
#Stateful vs Stateless
Cloud Engineer • Technical • medium
Explain the Linux boot process from power-on to the login prompt.
#Linux
#OS Internals
#Bootloader
#Systemd
Cloud Engineer • Technical • medium
How does DNS resolution work? How would you configure DNS for a multi-region failover setup?
#DNS
#High Availability
#Failover
#Routing
Cloud Engineer • Technical • medium
What are Terraform modules? How do you version and test them before deploying to production?
#Terraform
#Modularity
#Testing
#CI/CD
Cloud Engineer • Technical • medium
How do you set up distributed tracing for a request that traverses 15 different microservices?
#Tracing
#OpenTelemetry
#Microservices
Cloud Engineer • Technical • medium
You notice OOMKilled errors on a critical Kubernetes pod. How do you investigate if it's a memory leak in the app or misconfigured resource limits?
#Kubernetes
#Memory Management
#Debugging
Cloud Engineer • Technical • medium
Explain the CAP theorem. How does it apply to a distributed NoSQL database like Cassandra used for Flipkart's shopping cart?
#CAP Theorem
#Distributed Systems
#NoSQL
#Cassandra
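Cassandra lets each request choose how many replicas must acknowledge it, which is where the CAP trade-off becomes concrete. The arithmetic can be sketched as below: a quorum is a majority of the replication factor, and a read is guaranteed to overlap the most recent acknowledged write only when the read and write replica counts together exceed the replication factor. The function names are illustrative.

```python
def quorum(replication_factor):
    """Replicas needed for a QUORUM read or write: a strict majority."""
    return replication_factor // 2 + 1


def reads_overlap_writes(read_replicas, write_replicas, replication_factor):
    """True when every read set must share at least one replica with every write set."""
    return read_replicas + write_replicas > replication_factor
```

With a replication factor of 3, QUORUM reads and writes (2 + 2 > 3) overlap, while ONE reads and writes (1 + 1) do not. For a shopping cart, an interview answer would typically weigh that weaker setting's availability during a partition against the risk of reading a stale cart.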
Cloud Engineer • Technical • hard
What is BGP? How is it used in hybrid cloud setups connecting on-premise data centers to AWS/GCP?
#BGP
#Hybrid Cloud
#Direct Connect
#Routing
Cloud Engineer • Technical • medium
What are inodes? What happens if a server runs out of inodes but still has disk space available?
#Linux
#File Systems
#Troubleshooting
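When inodes run out, creating a file fails with "No space left on device" even though `df -h` still shows free blocks, and `df -i` reveals the exhaustion. A small sketch of the same check from Python, assuming a Unix-like host where `os.statvfs` is available. The function names are illustrative.

```python
import os


def inode_pct_used(total, free):
    """Percentage of inodes in use, or None when the filesystem reports no inode count."""
    if total == 0:
        return None  # some filesystems allocate inodes dynamically and report 0
    return 100.0 * (total - free) / total


def inode_usage(path="/"):
    """Return (total_inodes, free_inodes, percent_used) for the filesystem holding `path`."""
    st = os.statvfs(path)
    return st.f_files, st.f_ffree, inode_pct_used(st.f_files, st.f_ffree)
```

The usual culprit is a directory holding millions of tiny files, such as session or cache files. The fix is to delete or archive them, since on many filesystems the inode count is fixed when the filesystem is created.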
Cloud Engineer • Technical • medium
Explain the concept of GitOps. How would you implement GitOps using ArgoCD for a fleet of Kubernetes clusters?
#GitOps
#ArgoCD
#Kubernetes
#Continuous Deployment
Difficulty Radar
Based on recent AI-sourced data.
Meet Your Interviewers
The "Standard" Interviewer
Senior Engineer
Focuses on core competencies, system constraints, and clear communication.
Unwritten Rules
Think Out Loud
Always explain your thought process before writing code or drawing architecture.