Amazon

Amazon

E-commerce and cloud computing giant with AWS, the world's leading cloud platform.

5 Rounds ~28 Days Very Hard
Start Mock Interview

The Interview Loop

Recruiter Screen (30 min)

Standard fit check, behavioral questions, and resume overview.

Technical Loop (3-4 Rounds)

Deep dive into domain knowledge, coding, and system design.

Interview Question Bank

Cloud Engineer Behavioral hard

Tell me about a major cloud outage you experienced. How did you respond?

#Outage #On-Call
Cloud Engineer Behavioral hard

Describe a time you migrated a critical workload to the cloud with zero downtime.

#Cloud Migration
Cloud Engineer Behavioral easy

How do you stay updated with new cloud services and features?

#Continuous Learning
Cloud Engineer Behavioral medium

Tell me about a time you significantly reduced cloud infrastructure costs.

#FinOps #Impact
Cloud Engineer Behavioral medium

Describe a situation where you had to choose between two cloud architectures. How did you decide?

#Architecture #Tradeoffs
Cloud Engineer Behavioral medium

Tell me about a time you improved the reliability of a cloud-based data system.

#SRE #Impact
Cloud Engineer Behavioral medium

How do you communicate a complex cloud architecture to non-technical stakeholders?

#Stakeholders
Cloud Engineer Behavioral medium

Describe your experience with incident post-mortems. What do you include?

#Post-Mortem #Learning
Cloud Engineer Behavioral medium

Tell me about a time when you had to deal with a difficult customer or internal stakeholder who was unhappy with your cloud infrastructure delivery. How did you resolve it?

#Customer Obsession #Conflict Resolution #Communication
Cloud Engineer Behavioral hard

Tell me about a time you had to dive deep into a complex technical problem that others couldn't solve. What was the root cause and how did you find it?

#Dive Deep #Root Cause Analysis #Problem Solving
Cloud Engineer Behavioral medium

Describe a situation where you had a tight deadline to migrate a workload to the cloud. How did you prioritize your tasks to ensure you delivered on time without compromising security?

#Deliver Results #Time Management #Cloud Migration
Cloud Engineer Behavioral medium

Tell me about a time when you had to make a critical architectural decision with incomplete information. What was the risk, and how did you proceed?

#Bias for Action #Decision Making #Risk Management
Cloud Engineer Coding medium

Write a Python script using Boto3 to find all unattached EBS volumes in a specific region and delete them if they have been unattached for more than 30 days.

#Python #Boto3 #EBS #Automation
Cloud Engineer Coding easy

Write a bash one-liner or short script to parse an Apache access log file, find the top 10 IP addresses making the most requests, and count how many requests each made.

#Bash #Linux #Log Analysis
Cloud Engineer System Design hard

Design a data lake on AWS using S3, Glue, and Athena.

#AWS #S3 #Athena
Cloud Engineer System Design hard

How would you set up a streaming data pipeline on GCP using Pub/Sub and Dataflow?

#GCP #Pub/Sub #Dataflow
Cloud Engineer System Design hard

How would you architect a data platform that reduces spend by 40% without impacting performance?

#FinOps #Cloud
Cloud Engineer System Design hard

How do you implement disaster recovery for a cloud data warehouse?

#DR #RTO #RPO
Cloud Engineer System Design hard

How do you design a multi-region active-active architecture on AWS?

#Multi-Region #HA
Cloud Engineer System Design hard

Design a highly available, scalable web application on AWS that handles sudden spikes in traffic, similar to Prime Day. The application consists of a stateless web tier and a relational database.

#Auto Scaling #ALB #RDS Multi-AZ #ElastiCache #High Availability
Cloud Engineer System Design medium

Design an event-driven serverless architecture to process image uploads. When a user uploads an image, it needs to be resized, watermarked, and its metadata stored in a database.

#Serverless #Lambda #S3 #DynamoDB #Event-Driven Architecture
Cloud Engineer System Design hard

A financial services client requires a Disaster Recovery plan with a Recovery Point Objective (RPO) of 5 minutes and a Recovery Time Objective (RTO) of 1 hour. How would you design this across two AWS regions?

#Disaster Recovery #RPO/RTO #Cross-Region Replication #Route53
Cloud Engineer Technical hard

How does container networking work in Kubernetes?

#Networking #CNI
Cloud Engineer Technical hard

Compare AWS, GCP, and Azure for a data-intensive workload. What are the key differentiators?

#AWS #GCP #Azure
Cloud Engineer Technical medium

What is the shared responsibility model in cloud security?

#Cloud Security #IAM
Cloud Engineer Technical easy

Explain IaaS, PaaS, and SaaS with examples.

#IaaS #PaaS #SaaS
Cloud Engineer Technical hard

What is a VPC (Virtual Private Cloud)? How do you design a secure VPC architecture?

#VPC #Security
Cloud Engineer Technical easy

Explain the difference between regions, availability zones, and edge locations.

#Regions #AZs
Cloud Engineer Technical medium

How does auto-scaling work? What are the different scaling strategies?

#Auto-Scaling #EC2
Cloud Engineer Technical medium

What is a cloud-native application? How does it differ from a lifted-and-shifted one?

#Cloud Native #Migration
Cloud Engineer Technical hard

Explain multi-cloud vs hybrid cloud architectures and their tradeoffs.

#Multi-Cloud #Hybrid
Cloud Engineer Technical hard

Explain Kubernetes architecture: control plane, nodes, pods, and services.

#K8s #Containers
Cloud Engineer Technical hard

What is a Kubernetes Operator and when would you build one?

#Operators #CRD
Cloud Engineer Technical medium

Explain Kubernetes resource requests vs limits. What happens if a pod exceeds its memory limit?

#Resources #OOM
Cloud Engineer Technical hard

What is a service mesh? Explain how Istio works.

#Istio #Service Mesh
Cloud Engineer Technical hard

How would you set up horizontal pod autoscaling based on custom metrics?

#HPA #Custom Metrics
Cloud Engineer Technical medium

Explain the difference between Docker and containerd.

#Docker #containerd
Cloud Engineer Technical medium

How does a Kubernetes Ingress controller work?

#Ingress #Load Balancing
Cloud Engineer Technical hard

Explain Terraform's state management. What happens if the state file is corrupted?

#IaC #State
Cloud Engineer Technical medium

What is the difference between Terraform and Pulumi?

#Terraform #Pulumi
Cloud Engineer Technical medium

How do you manage secrets in cloud infrastructure? (HashiCorp Vault, AWS Secrets Manager)

#Secrets Management #Vault
Cloud Engineer Technical medium

Explain idempotency in infrastructure provisioning.

#Idempotency #Terraform
Cloud Engineer Technical hard

How do you handle Terraform state across multiple teams?

#State Management #Collaboration
Cloud Engineer Technical hard

Compare AWS EMR, GCP Dataproc, and Azure HDInsight for Spark workloads.

#EMR #Dataproc #Spark
Cloud Engineer Technical medium

Explain the difference between AWS Lambda and EC2 for data processing.

#Lambda #Serverless
Cloud Engineer Technical hard

What is BigQuery Slots? How do you optimize BigQuery query costs?

#GCP #Cost
Cloud Engineer Technical medium

Explain AWS S3 storage classes and lifecycle policies.

#S3 #Cost
Cloud Engineer Technical medium

How does AWS Glue Data Catalog work with Athena?

#Glue #Athena
Cloud Engineer Technical hard

What is zero-trust networking? How do you implement it on cloud?

#Zero Trust #Networking
Cloud Engineer Technical medium

Explain TLS/SSL termination in a cloud load balancer.

#TLS #Load Balancer
Cloud Engineer Technical medium

How do cloud IAM roles and policies work? Explain least-privilege principle.

#IAM #Permissions
Cloud Engineer Technical medium

What is AWS PrivateLink? When would you use it?

#PrivateLink #VPC
Cloud Engineer Technical hard

How would you implement network segmentation for a multi-tier application?

#Security #Subnets
Cloud Engineer Technical medium

What are SLOs, SLAs, and SLIs? How do you define them for a data platform?

#SLO #Reliability
Cloud Engineer Technical hard

Explain chaos engineering. How would you implement it for a data pipeline?

#Chaos Engineering #Fault Injection
Cloud Engineer Technical medium

How do you do capacity planning for a cloud data platform?

#Scaling #Planning
Cloud Engineer Technical easy

What is a runbook? How do you create effective runbooks for data infrastructure?

#Runbook #On-Call
Cloud Engineer Technical medium

Explain the three pillars of observability: logs, metrics, and traces.

#Logs #Metrics #Traces
Cloud Engineer Technical medium

How would you set up CloudWatch dashboards for a data pipeline?

#CloudWatch #AWS
Cloud Engineer Technical medium

What is OpenTelemetry? How does it standardize observability?

#OpenTelemetry #Tracing
Cloud Engineer Technical medium

Explain the AWS Well-Architected Framework's five pillars.

#Well-Architected
Cloud Engineer Technical medium

What is AWS Transit Gateway? When would you use it?

#Transit Gateway #Networking
Cloud Engineer Technical hard

How do you implement cost governance in a large AWS organization?

#Cost #AWS
Cloud Engineer Technical medium

A customer complains that their EC2 instance in a private subnet cannot reach the internet to download updates, even though a NAT Gateway is configured. Walk me through your troubleshooting steps.

#VPC #EC2 #NAT Gateway #Troubleshooting
Cloud Engineer Technical hard

How would you design an IAM strategy for a large enterprise moving to AWS to ensure least privilege while allowing developers to innovate? Explain how you would use SCPs, IAM Roles, and Permission Boundaries.

#IAM #AWS Organizations #SCPs #Security
Cloud Engineer Technical hard

You have an S3 bucket receiving thousands of PUT requests per second. Users are reporting 503 Slow Down errors. What is causing this and how do you architect around S3 request limits?

#S3 #Performance Optimization #Throttling
Cloud Engineer Technical medium

An EC2 Linux instance is experiencing high CPU utilization, but when you run 'top', the CPU usage from user processes is low, while 'wa' (iowait) is very high. What does this mean and how do you fix it?

#Linux #Performance Tuning #EBS #I/O
Cloud Engineer Technical medium

You are deploying infrastructure using AWS CloudFormation. A stack update fails and is stuck in the UPDATE_ROLLBACK_FAILED state. Why does this happen and how do you recover the stack?

#CloudFormation #IaC #Troubleshooting
Cloud Engineer Technical easy

Compare and contrast Amazon RDS and Amazon DynamoDB. In what specific scenarios would you choose DynamoDB over RDS for a microservice?

#RDS #DynamoDB #NoSQL #Relational

Difficulty Radar

Based on recent AI-sourced data.

Meet Your Interviewers

The "Standard" Interviewer

Senior Engineer

Focuses on core competencies, system constraints, and clear communication.

Simulate

Unwritten Rules

Think Out Loud

Always explain your thought process before writing code or drawing architecture.

Practice Now