Twitter / X
Real-time social platform with petabyte-scale data and ML ranking systems.
4 Rounds
~14 Days
Hard
The Interview Loop
Recruiter Screen (30 min)
Standard fit check, behavioral questions, and resume overview.
Technical Loop (3-4 Rounds)
Deep dive into domain knowledge, coding, and system design.
Interview Question Bank
Cloud Engineer
•
Behavioral
•
medium
X has a strong focus on efficiency. Tell me about a time you had to drastically reduce cloud infrastructure costs. What was your approach and the outcome?
#Cost Optimization
#Impact
#Problem Solving
Cloud Engineer
•
Behavioral
•
medium
Describe a situation where you had to push back on a product or feature deployment because of infrastructure stability or security concerns. How did you handle the conflict?
#Communication
#Conflict Resolution
#Ownership
Cloud Engineer
•
Behavioral
•
medium
X moves extremely fast and sometimes things break. Tell me about a time you had to deploy a critical fix or make a major infrastructure decision with incomplete information.
#Adaptability
#Decision Making
#High Pressure
Cloud Engineer
•
Behavioral
•
hard
Tell me about the most complex production outage you've ever debugged. What was the root cause, how did you find it, and what did you do to ensure it never happened again?
#Incident Management
#Debugging
#Post-mortems
Cloud Engineer
•
Behavioral
•
medium
How do you handle taking ownership of a legacy system where the original creators have left the company, there is minimal documentation, and it frequently fails?
#Legacy Systems
#Ownership
#Resilience
Cloud Engineer
•
Behavioral
•
easy
Describe a time you automated a tedious operational task. How did you identify the opportunity, what did you build, and how did you measure the impact?
#Automation
#Toil Reduction
#Initiative
Cloud Engineer
•
Behavioral
•
medium
Tell me about a time you strongly disagreed with a senior engineer or architect on a system design choice. How did you articulate your point, and what was the resolution?
#Communication
#Technical Debate
#Teamwork
Cloud Engineer
•
Coding
•
medium
Write a Python script to parse a 50GB Nginx access log file and output the top 10 IP addresses making requests. Optimize for memory usage.
#Python
#Log Parsing
#Memory Management
#Data Structures
Cloud Engineer
•
Coding
•
medium
Implement a rate limiter class in Python or Go using the Token Bucket algorithm. It should support multiple users identified by user_id.
#Python
#Go
#Rate Limiting
#Concurrency
Cloud Engineer
•
Coding
•
easy
Write a Bash one-liner or short script to find and gracefully terminate all zombie processes on a Linux machine.
#Bash
#Linux
#Process Management
Cloud Engineer
•
Coding
•
medium
Given a list of server connection logs represented as [start_time, end_time], write a function to find the maximum number of concurrent connections and the time it occurred.
#Algorithms
#Sweep Line
#Sorting
Cloud Engineer
•
Coding
•
medium
Write a Terraform module to provision a secure AWS VPC with 2 public subnets, 2 private subnets, a NAT gateway, and strict security groups allowing only internal traffic to the private subnets.
#Terraform
#AWS
#Networking
Cloud Engineer
•
Coding
•
medium
Implement a function that takes an IPv4 address string and a CIDR block string (e.g., '192.168.1.5', '192.168.1.0/24') and returns a boolean indicating if the IP belongs to the CIDR block.
#Networking
#Bit Manipulation
#Python
#Go
Cloud Engineer
•
Coding
•
medium
Write a script (Python/Boto3 or Bash/AWS CLI) to automate the detection and rotation of AWS IAM access keys older than 90 days across multiple accounts.
#Python
#AWS IAM
#Security
#Automation
Cloud Engineer
•
Coding
•
medium
Given a list of microservices and their dependencies (e.g., Service A depends on Service B), write an algorithm to determine a valid deployment order.
#Graphs
#Topological Sort
#Algorithms
Cloud Engineer
•
System Design
•
hard
Design the cloud infrastructure to support Twitter's timeline generation at scale, ensuring sub-200ms latency for 500 million daily active users.
#Scalability
#Caching
#Load Balancing
#Microservices
Cloud Engineer
•
System Design
•
hard
X has recently moved several workloads from AWS back to on-prem bare metal to save costs. Walk me through how you would architect the migration of a massive stateful service (like a distributed database) from AWS to an on-prem data center with zero downtime.
#Cloud Migration
#Bare Metal
#Database Replication
#Networking
Cloud Engineer
•
System Design
•
hard
Design a global rate-limiting service for the X API that can handle millions of requests per second across multiple geographic regions.
#Rate Limiting
#Distributed Systems
#Redis
#Concurrency
Cloud Engineer
•
System Design
•
hard
How would you design a highly available, multi-region Kubernetes cluster architecture to host X's core microservices?
#Kubernetes
#High Availability
#Multi-region
#Service Mesh
Cloud Engineer
•
System Design
•
medium
Design a cost-optimized storage tiering system for X's media assets (images and videos) that balances retrieval speed with AWS S3/GCS storage costs.
#Storage
#Cost Optimization
#AWS S3
#CDN
Cloud Engineer
•
System Design
•
hard
Design an infrastructure deployment pipeline capable of deploying a critical security patch to 10,000+ nodes in under 5 minutes without causing a global outage.
#CI/CD
#Automation
#Deployment Strategies
#Infrastructure as Code
Cloud Engineer
•
System Design
•
medium
Design a distributed caching layer for trending topics that can handle extreme traffic spikes (e.g., during the Super Bowl or a major global news event).
#Caching
#Redis
#Memcached
#Thundering Herd
Cloud Engineer
•
System Design
•
medium
Architect a disaster recovery plan for X's core authentication service. How do you ensure RTO (Recovery Time Objective) is under 1 minute and RPO (Recovery Point Objective) is zero?
#Disaster Recovery
#High Availability
#Databases
#Failover
Cloud Engineer
•
Technical
•
medium
You notice a sudden 500ms latency spike in a critical microservice running on EKS. Walk me through your exact troubleshooting steps.
#Kubernetes
#Observability
#Networking
#Debugging
Cloud Engineer
•
Technical
•
hard
How does Kubernetes handle network partitions between nodes? Explain the split-brain scenario and how etcd consensus prevents it.
#Kubernetes
#Distributed Systems
#etcd
#Consensus Algorithms
Cloud Engineer
•
Technical
•
hard
Describe the process of tuning a Linux kernel for high-throughput network traffic, specifically to handle millions of concurrent TCP connections.
#Linux
#Networking
#Kernel Tuning
#TCP/IP
Cloud Engineer
•
Technical
•
medium
Walk me through the lifecycle of a DNS query when a user types x.com, focusing specifically on how global load balancing and Anycast routing are involved.
#DNS
#Load Balancing
#Anycast
#BGP
Cloud Engineer
•
Technical
•
medium
How do you manage Terraform state for a massive, multi-team infrastructure without causing locking bottlenecks or state file corruption?
#Terraform
#State Management
#Collaboration
#CI/CD
Cloud Engineer
•
Technical
•
easy
Explain the difference between an Application Load Balancer (ALB) and a Network Load Balancer (NLB). When would you explicitly choose an NLB for a service at X?
#Load Balancing
#AWS
#OSI Model
Cloud Engineer
•
Technical
•
hard
How would you implement zero-downtime database schema migrations for a live service handling 100k queries per second?
#Databases
#Migrations
#High Availability
Cloud Engineer
•
Technical
•
medium
What happens at the OS and container level when a Kubernetes pod runs out of memory (OOMKilled)? How do you debug and prevent it?
#Kubernetes
#Linux
#Memory Management
#Docker
Cloud Engineer
•
Technical
•
hard
Explain BGP routing. How does X use BGP to announce IP prefixes for its global CDN network?
#BGP
#CDN
#Routing
#Networking
Cloud Engineer
•
Technical
•
medium
How do you secure internal microservice-to-microservice communication across different cloud VPCs or hybrid environments?
#Security
#Networking
#mTLS
#Service Mesh
Cloud Engineer
•
Technical
•
medium
What are the engineering and financial tradeoffs between using managed cloud databases (like AWS RDS or GCP Spanner) versus self-hosting databases on bare metal servers?
#Cost Optimization
#Databases
#Bare Metal
#Cloud Computing
Cloud Engineer
•
Technical
•
hard
Describe how you would architect a Prometheus and Grafana monitoring stack to ingest and query metrics from a fleet of 50,000 servers efficiently.
#Prometheus
#Grafana
#Monitoring
#Scalability
Difficulty Radar
Based on recent AI-sourced data.
Meet Your Interviewers
The "Standard" Interviewer
Senior EngineerFocuses on core competencies, system constraints, and clear communication.
SimulateUnwritten Rules
Think Out Loud
Always explain your thought process before writing code or drawing architecture.