Meta
Social media and metaverse company behind Facebook, Instagram, and WhatsApp.
4 Rounds
~21 Days
Very Hard
The Interview Loop
Recruiter Screen (30 min)
Standard fit check, behavioral questions, and resume overview.
Technical Loop (3-4 Rounds)
Deep dive into domain knowledge, coding, and system design.
Interview Question Bank
AI Engineer
•
Behavioral
•
hard
Tell me about an AI project where you had to balance innovation with reliability.
#Reliability
#Innovation
AI Engineer
•
Behavioral
•
hard
Describe a situation where you had to debug a hard-to-reproduce AI model failure.
#Problem Solving
AI Engineer
•
Behavioral
•
hard
Describe an AI product you built from scratch. What were the key technical decisions?
#Product Development
AI Engineer
•
Behavioral
•
medium
How do you handle stakeholder uncertainty around AI capabilities and limitations?
#Stakeholders
#Expectations
AI Engineer
•
Behavioral
•
medium
Tell me about a time you optimized an LLM application for cost or latency.
#Cost
#Latency
AI Engineer
•
Behavioral
•
medium
Describe a time you had to choose between using an AI model and a simpler rule-based system.
#Tradeoffs
#Pragmatism
AI Engineer
•
Behavioral
•
easy
How do you stay current with the fast-moving AI/ML research landscape?
#Research
#Continuous Learning
AI Engineer
•
Behavioral
•
hard
Tell me about a time an AI system you built produced unexpected or harmful outputs.
#Responsibility
#Ethics
AI Engineer
•
Coding
•
medium
Write a Python class to manage conversation history for a multi-turn chatbot.
#Chatbot
#Memory
AI Engineer
•
Coding
•
medium
Write a retry mechanism with exponential backoff for LLM API calls.
#Reliability
#APIs
AI Engineer
•
Coding
•
hard
Implement a simple RAG pipeline using Python, LangChain, and FAISS.
#RAG
#Python
AI Engineer
•
Coding
•
hard
Implement a semantic chunking strategy for long documents.
#Chunking
#Embeddings
AI Engineer
•
System Design
•
hard
How would you architect an AI platform that supports 1000 concurrent LLM requests?
#Scaling
#LLM Serving
AI Engineer
•
System Design
•
hard
Design a real-time AI safety filter for user-generated content.
#Content Moderation
#Real-Time
AI Engineer
•
System Design
•
hard
How would you build a multi-modal AI system that processes both text and images?
#Multi-Modal
#Vision
AI Engineer
•
System Design
•
hard
Design an AI code review system that integrates with GitHub PRs.
#Code Review
#LLM
AI Engineer
•
System Design
•
hard
Design a document question-answering system using RAG.
#RAG
#Vector Search
AI Engineer
•
System Design
•
hard
Design an AI-powered customer support chatbot for an e-commerce platform.
#Chatbot
#LLM
AI Engineer
•
System Design
•
hard
Design an AI agent system that can autonomously browse the web and complete tasks.
#Agents
#Tool Use
AI Engineer
•
Technical
•
medium
How do you choose the right embedding model for a domain-specific search task?
#Embedding Models
#Search
AI Engineer
•
Technical
•
medium
What is semantic search? How does it differ from keyword-based search?
#Semantic Search
#NLP
AI Engineer
•
Technical
•
hard
Explain function calling / tool use in LLMs. How do you implement it?
#Function Calling
#Tool Use
AI Engineer
•
Technical
•
medium
What is LangChain? What are its key components (Chains, Agents, Tools)?
#LangChain
#Agents
AI Engineer
•
Technical
•
medium
How do you integrate OpenAI API or Gemini API into a production application?
#OpenAI
#Gemini
AI Engineer
•
Technical
•
hard
Explain the difference between dense and sparse retrieval in RAG.
#Dense Retrieval
#BM25
AI Engineer
•
Technical
•
hard
How do you evaluate retrieval quality in a RAG system?
#Evaluation
#Retrieval
AI Engineer
•
Technical
•
hard
How do you evaluate the quality of an LLM-generated response?
#LLM Evaluation
#RAGAS
AI Engineer
•
Technical
•
hard
Explain the concept of AI bias. How do you detect and mitigate it in production?
#Bias
#Fairness
AI Engineer
•
Technical
•
hard
What is AI alignment? What are the key safety concerns with large-scale AI deployment?
#Alignment
#Safety
AI Engineer
•
Technical
•
hard
What is Constitutional AI? How does Anthropic use it?
#Constitutional AI
#Anthropic
AI Engineer
•
Technical
•
hard
How do you red-team an AI system?
#Red Teaming
#Security
AI Engineer
•
Technical
•
medium
What are guardrails in LLM applications? How do they work?
#Guardrails
#Output Filtering
AI Engineer
•
Technical
•
medium
What is streaming response from an LLM API? How do you implement it in a web app?
#Streaming
#API
AI Engineer
•
Technical
•
medium
How do you manage LLM API rate limits and costs in production?
#Rate Limiting
#Cost
AI Engineer
•
Technical
•
medium
Explain structured output generation from LLMs (JSON mode, Instructor library).
#Structured Output
#JSON
AI Engineer
•
Technical
•
hard
Explain the difference between GPT, BERT, and T5 architectures.
#GPT
#BERT
#T5
AI Engineer
•
Technical
•
medium
What is prompt engineering? What are few-shot, zero-shot, and chain-of-thought prompting?
#Prompt Engineering
#Few-Shot
AI Engineer
•
Technical
•
hard
Explain how RLHF (Reinforcement Learning from Human Feedback) improves LLMs.
#RLHF
#Alignment
AI Engineer
•
Technical
•
hard
What is RAG (Retrieval-Augmented Generation)? When would you use it over fine-tuning?
#RAG
#Fine-Tuning
AI Engineer
•
Technical
•
medium
Explain the difference between fine-tuning and in-context learning.
#Fine-Tuning
#ICL
AI Engineer
•
Technical
•
medium
What is token context window? How do you handle documents longer than the context limit?
#Context Window
#Chunking
AI Engineer
•
Technical
•
hard
Explain positional encoding in transformers. What are the differences between absolute and rotary position embeddings?
#Positional Encoding
#RoPE
AI Engineer
•
Technical
•
hard
What is hallucination in LLMs? How do you detect and mitigate it?
#Hallucination
#Safety
AI Engineer
•
Technical
•
medium
Explain the difference between autoregressive and masked language modeling.
#Autoregressive
#Masked LM
AI Engineer
•
Technical
•
hard
What is a mixture of experts (MoE) architecture? How does it scale?
#MoE
#Scaling
AI Engineer
•
Technical
•
hard
Explain how vector similarity search works. What are HNSW and IVF indices?
#HNSW
#Similarity Search
AI Engineer
•
Technical
•
medium
Compare vector databases: Pinecone, Weaviate, Qdrant, and pgvector.
#Vector DB
#Embeddings
Cloud Engineer
•
Behavioral
•
medium
Tell me about a time you improved the reliability of a cloud-based data system.
#SRE
#Impact
Cloud Engineer
•
Behavioral
•
medium
Describe a situation where you had to choose between two cloud architectures. How did you decide?
#Architecture
#Tradeoffs
Cloud Engineer
•
Behavioral
•
medium
How do you communicate a complex cloud architecture to non-technical stakeholders?
#Stakeholders
Cloud Engineer
•
Behavioral
•
medium
Describe your experience with incident post-mortems. What do you include?
#Post-Mortem
#Learning
Cloud Engineer
•
Behavioral
•
medium
Tell me about a time you had to optimize a system or infrastructure to reduce costs or improve efficiency. What metrics did you use to measure success?
#Efficiency
#Impact
#Metrics
Cloud Engineer
•
Behavioral
•
hard
Tell me about a major cloud outage you experienced. How did you respond?
#Outage
#On-Call
Cloud Engineer
•
Behavioral
•
hard
Describe a time you migrated a critical workload to the cloud with zero downtime.
#Cloud Migration
Cloud Engineer
•
Behavioral
•
medium
Describe a situation where you strongly disagreed with a senior engineer or a cross-functional partner on an architectural decision. How did you resolve the conflict?
#Conflict Resolution
#Communication
#Data-Driven Decisions
Cloud Engineer
•
Behavioral
•
easy
How do you stay updated with new cloud services and features?
#Continuous Learning
Cloud Engineer
•
Behavioral
•
medium
Tell me about a time you caused a significant production outage. How did you handle the immediate mitigation, and what was the post-mortem process?
#Incident Management
#Ownership
#Blameless Culture
Cloud Engineer
•
Behavioral
•
medium
Tell me about a time you significantly reduced cloud infrastructure costs.
#FinOps
#Impact
Cloud Engineer
•
Coding
•
medium
Given a massive, 100GB web server log file, write a script to find the top K most frequent IP addresses making requests. The script must run on a machine with only 4GB of RAM.
#Log Parsing
#Memory Management
#Heaps
#Hash Maps
Cloud Engineer
•
Coding
•
easy
Write a function to validate if a given string is a valid IPv4 or IPv6 address. Do not use built-in IP validation libraries.
#String Manipulation
#Validation
#Edge Cases
Cloud Engineer
•
Coding
•
hard
Implement a thread-safe bounded blocking queue. It should support enqueue and dequeue operations, blocking if the queue is full or empty, respectively.
#Multithreading
#Synchronization
#Data Structures
Cloud Engineer
•
Coding
•
medium
Write a concurrent script in Python or Go to monitor a local directory for new files and upload them to a remote storage service. It must handle network failures and retries gracefully.
#Concurrency
#Error Handling
#File I/O
#Asynchronous Programming
Cloud Engineer
•
System Design
•
hard
How would you set up a streaming data pipeline on GCP using Pub/Sub and Dataflow?
#GCP
#Pub/Sub
#Dataflow
Cloud Engineer
•
System Design
•
hard
How would you architect a data platform that reduces spend by 40% without impacting performance?
#FinOps
#Cloud
Cloud Engineer
•
System Design
•
hard
How do you implement disaster recovery for a cloud data warehouse?
#DR
#RTO
#RPO
Cloud Engineer
•
System Design
•
hard
Design a photo storage system similar to what Instagram uses. It must handle billions of reads and millions of writes per day with extremely low latency.
#Blob Storage
#CDN
#Caching
#Database Scaling
Cloud Engineer
•
System Design
•
medium
Design a distributed caching system for Meta's newsfeed. How do you handle cache invalidation and the 'thundering herd' problem?
#Caching
#Distributed Systems
#Memcached
Cloud Engineer
•
System Design
•
hard
Design a globally distributed rate limiter for Meta's Graph API. It needs to handle millions of requests per second and enforce limits based on user ID and application ID.
#Rate Limiting
#Distributed Systems
#Redis
#Concurrency
Cloud Engineer
•
System Design
•
hard
Design a data lake on AWS using S3, Glue, and Athena.
#AWS
#S3
#Athena
Cloud Engineer
•
Technical
•
medium
How do cloud IAM roles and policies work? Explain least-privilege principle.
#IAM
#Permissions
Cloud Engineer
•
Technical
•
medium
What is AWS PrivateLink? When would you use it?
#PrivateLink
#VPC
Cloud Engineer
•
Technical
•
hard
How would you implement network segmentation for a multi-tier application?
#Security
#Subnets
Cloud Engineer
•
Technical
•
medium
What are SLOs, SLAs, and SLIs? How do you define them for a data platform?
#SLO
#Reliability
Cloud Engineer
•
Technical
•
hard
Explain chaos engineering. How would you implement it for a data pipeline?
#Chaos Engineering
#Fault Injection
Cloud Engineer
•
Technical
•
medium
How do you do capacity planning for a cloud data platform?
#Scaling
#Planning
Cloud Engineer
•
Technical
•
easy
What is a runbook? How do you create effective runbooks for data infrastructure?
#Runbook
#On-Call
Cloud Engineer
•
Technical
•
medium
Explain the three pillars of observability: logs, metrics, and traces.
#Logs
#Metrics
#Traces
Cloud Engineer
•
Technical
•
medium
How would you set up CloudWatch dashboards for a data pipeline?
#CloudWatch
#AWS
Cloud Engineer
•
Technical
•
medium
What is OpenTelemetry? How does it standardize observability?
#OpenTelemetry
#Tracing
Cloud Engineer
•
Technical
•
medium
What happens exactly when a process runs out of memory in Linux? Explain the OOM Killer mechanism and how you would configure a critical database process to avoid being killed.
#Memory Management
#OOM Killer
#Kernel Tuning
Cloud Engineer
•
Technical
•
medium
Explain the difference between TCP and UDP. How does the QUIC protocol improve upon TCP, and why would Meta use it for video streaming or WhatsApp calls?
#Protocols
#TCP/UDP
#QUIC
#Latency
Cloud Engineer
•
Technical
•
hard
You notice a sudden spike in 5xx HTTP errors from a specific microservice cluster. Walk me through your exact debugging steps from the edge load balancer down to the application layer.
#Microservices
#Debugging
#Observability
Cloud Engineer
•
Technical
•
medium
Explain the exact difference between an L4 load balancer and an L7 load balancer. Describe how Direct Server Return (DSR) works in a massive scale environment like Meta's.
#Load Balancing
#OSI Model
#TCP/IP
#DSR
Cloud Engineer
•
Technical
•
hard
You are paged because a Linux server is experiencing a high load average (e.g., 40 on a 16-core machine), but CPU utilization is under 10%. How do you troubleshoot this?
#OS Troubleshooting
#I/O Wait
#Process States
Cloud Engineer
•
Technical
•
hard
Compare AWS, GCP, and Azure for a data-intensive workload. What are the key differentiators?
#AWS
#GCP
#Azure
Cloud Engineer
•
Technical
•
medium
What is the shared responsibility model in cloud security?
#Cloud Security
#IAM
Cloud Engineer
•
Technical
•
easy
Explain IaaS, PaaS, and SaaS with examples.
#IaaS
#PaaS
#SaaS
Cloud Engineer
•
Technical
•
hard
What is a VPC (Virtual Private Cloud)? How do you design a secure VPC architecture?
#VPC
#Security
Cloud Engineer
•
Technical
•
easy
Explain the difference between regions, availability zones, and edge locations.
#Regions
#AZs
Cloud Engineer
•
Technical
•
medium
How does auto-scaling work? What are the different scaling strategies?
#Auto-Scaling
#EC2
Cloud Engineer
•
Technical
•
medium
What is a cloud-native application? How does it differ from a lifted-and-shifted one?
#Cloud Native
#Migration
Cloud Engineer
•
Technical
•
hard
Explain multi-cloud vs hybrid cloud architectures and their tradeoffs.
#Multi-Cloud
#Hybrid
Cloud Engineer
•
Technical
•
hard
Explain Kubernetes architecture: control plane, nodes, pods, and services.
#K8s
#Containers
Cloud Engineer
•
Technical
•
hard
What is a Kubernetes Operator and when would you build one?
#Operators
#CRD
Cloud Engineer
•
Technical
•
hard
How does container networking work in Kubernetes?
#Networking
#CNI
Cloud Engineer
•
Technical
•
medium
Explain Kubernetes resource requests vs limits. What happens if a pod exceeds its memory limit?
#Resources
#OOM
Cloud Engineer
•
Technical
•
hard
What is a service mesh? Explain how Istio works.
#Istio
#Service Mesh
Cloud Engineer
•
Technical
•
hard
How would you set up horizontal pod autoscaling based on custom metrics?
#HPA
#Custom Metrics
Cloud Engineer
•
Technical
•
medium
Explain the difference between Docker and containerd.
#Docker
#containerd
Cloud Engineer
•
Technical
•
medium
How does a Kubernetes Ingress controller work?
#Ingress
#Load Balancing
Cloud Engineer
•
Technical
•
hard
Explain Terraform's state management. What happens if the state file is corrupted?
#IaC
#State
Cloud Engineer
•
Technical
•
medium
What is the difference between Terraform and Pulumi?
#Terraform
#Pulumi
Cloud Engineer
•
Technical
•
medium
How do you manage secrets in cloud infrastructure? (HashiCorp Vault, AWS Secrets Manager)
#Secrets Management
#Vault
Cloud Engineer
•
Technical
•
medium
Explain idempotency in infrastructure provisioning.
#Idempotency
#Terraform
Cloud Engineer
•
Technical
•
hard
How do you handle Terraform state across multiple teams?
#State Management
#Collaboration
Cloud Engineer
•
Technical
•
hard
Compare AWS EMR, GCP Dataproc, and Azure HDInsight for Spark workloads.
#EMR
#Dataproc
#Spark
Cloud Engineer
•
Technical
•
medium
Explain the difference between AWS Lambda and EC2 for data processing.
#Lambda
#Serverless
Cloud Engineer
•
Technical
•
hard
What is BigQuery Slots? How do you optimize BigQuery query costs?
#GCP
#Cost
Cloud Engineer
•
Technical
•
medium
Explain AWS S3 storage classes and lifecycle policies.
#S3
#Cost
Cloud Engineer
•
Technical
•
medium
How does AWS Glue Data Catalog work with Athena?
#Glue
#Athena
Cloud Engineer
•
Technical
•
hard
What is zero-trust networking? How do you implement it on cloud?
#Zero Trust
#Networking
Cloud Engineer
•
Technical
•
medium
Explain TLS/SSL termination in a cloud load balancer.
#TLS
#Load Balancer
Data Analyst
•
Behavioral
•
medium
How do you handle a situation where a stakeholder challenges your analysis?
#Stakeholders
#Confidence
Data Analyst
•
Behavioral
•
medium
Describe a time you found an insight that was counterintuitive.
#Curiosity
Data Analyst
•
Behavioral
•
hard
Tell me about a time you had incomplete data but still needed to deliver analysis.
#Ambiguity
Data Analyst
•
Behavioral
•
medium
How do you prioritize analytical requests when multiple teams need you?
#Time Management
Data Analyst
•
Behavioral
•
medium
Describe a dashboard you built that was widely adopted. What made it successful?
#Visualization
Data Analyst
•
Behavioral
•
medium
Tell me about a time you discovered data quality issues mid-analysis. What did you do?
#Problem Solving
Data Analyst
•
Behavioral
•
easy
How do you ensure your analyses are reproducible?
#Reproducibility
Data Analyst
•
Behavioral
•
medium
Tell me about an analysis that changed a major business decision.
#Business Impact
#Influence
Data Analyst
•
Coding
•
medium
Explain window functions. Write a query using LAG() to compute day-over-day change.
#Window Functions
Data Analyst
•
Coding
•
hard
What is a funnel query? Write one for a 3-step user onboarding flow.
#Funnel Analysis
Data Analyst
•
Coding
•
medium
What is a pivot table in SQL? How would you implement it without native PIVOT support?
#Pivot
#Data Transformation
Data Analyst
•
Coding
•
hard
Write a SQL query to find customers who made purchases in both January and February but not March.
#Set Operations
Data Analyst
•
Coding
•
hard
Write a SQL query to calculate the rolling 28-day average session duration per user.
#Rolling Average
#Sessions
Data Analyst
•
Coding
•
hard
How would you build a cohort analysis for user retention in SQL?
#Cohort Analysis
#Retention
Data Analyst
•
Coding
•
medium
Write a SQL query to calculate month-over-month revenue growth.
#Revenue
#Growth Analytics
Data Analyst
•
Coding
•
medium
How would you use pandas to compute a 7-day rolling average of sessions?
#Pandas
#Time Series
Data Analyst
•
Coding
•
easy
Explain how groupby and agg work in pandas with an example.
#Pandas
#GroupBy
Data Analyst
•
Coding
•
medium
How would you merge two large DataFrames efficiently in pandas?
#Pandas
#Merging
Data Analyst
•
Coding
•
medium
Describe how to detect and handle outliers in a dataset using Python.
#Outliers
#Data Cleaning
Data Analyst
•
Coding
•
easy
Write Python code to load a CSV, clean missing values, and compute summary statistics.
#Data Cleaning
#Pandas
Data Analyst
•
Coding
•
hard
How would you detect anomalies in a daily revenue time series using SQL?
#Anomaly Detection
#SQL
Data Analyst
•
Technical
•
easy
What is ARPU (Average Revenue Per User)? How do you segment ARPU analysis?
#ARPU
#Revenue
Data Analyst
•
Technical
•
medium
How would you measure the success of a new feature launch?
#Feature Success
#Metrics
Data Analyst
•
Technical
•
easy
Explain the difference between DAU, WAU, and MAU. Which is most useful and when?
#Engagement
#KPIs
Data Analyst
•
Technical
•
medium
What is customer lifetime value (LTV)? How would you calculate it?
#LTV
#Retention
Data Analyst
•
Technical
•
hard
Daily Active Users dropped 15% yesterday. Walk me through how you'd investigate.
#Root Cause Analysis
#Metrics
Data Analyst
•
Technical
•
medium
How do you handle timezone conversions in SQL analytics?
#Timezones
#Analytics
Data Analyst
•
Technical
•
easy
Explain the difference between a HAVING clause and a WHERE clause.
#SQL Basics
Data Analyst
•
Technical
•
medium
Describe your process for creating an executive-level analytics presentation.
#Executive Reporting
Data Analyst
•
Technical
•
easy
How do you choose between a bar chart, line chart, and scatter plot?
#Charts
#Design
Data Analyst
•
Technical
•
easy
What tools do you use for dashboarding? Compare Tableau, Looker, and Metabase.
#Tableau
#Looker
Data Analyst
•
Technical
•
medium
How would you explain statistical significance to a non-technical product manager?
#Storytelling
#Statistics
Data Analyst
•
Technical
•
medium
What makes a good data visualization? Walk me through your design principles.
#Design
#Communication
Data Analyst
•
Technical
•
hard
How do you handle multiple metrics in an A/B test (metric tradeoffs)?
#Multiple Metrics
#Tradeoffs
Data Analyst
•
Technical
•
medium
What is a novelty effect in experimentation? How do you account for it?
#Novelty Effect
#Bias
Data Analyst
•
Technical
•
hard
A/B test shows p=0.04, but the effect size is tiny. Would you ship?
#Practical Significance
#Decision Making
Data Analyst
•
Technical
•
hard
What sample size do you need for an A/B test? How do you calculate it?
#Sample Size
#Power
Data Analyst
•
Technical
•
hard
Explain how you'd set up an A/B test to validate a new checkout flow.
#A/B Testing
#Statistics
Data Analyst
•
Technical
•
hard
How would you measure the impact of a pricing change on revenue?
#Pricing
#A/B Test
Data Analyst
•
Technical
•
easy
What is net promoter score (NPS)? How do you analyse NPS trends?
#NPS
#Customer Satisfaction
Data Analyst
•
Technical
•
hard
What metrics would you use to measure the health of a marketplace?
#Marketplace
#Supply & Demand
Data Analyst
•
Technical
•
medium
How would you build a dashboard to monitor e-commerce funnel health?
#Visualization
#Funnel
Data Analyst
•
Technical
•
hard
Explain the concept of attribution modeling. What are last-click vs multi-touch models?
#Marketing Analytics
Data Engineer
•
Behavioral
•
medium
How do you handle disagreements with data analysts or scientists who want features that compromise pipeline reliability?
#Conflict Resolution
Data Engineer
•
Behavioral
•
medium
Describe a situation where a data pipeline you owned went down in production. How did you handle it?
#On-Call
#Problem Solving
Data Engineer
•
Behavioral
•
medium
Tell me about a time you simplified a complex data platform decision across multiple teams.
#Communication
#Stakeholders
Data Engineer
•
Behavioral
•
medium
Tell me about a time you had a disagreement with a cross-functional partner (like a Data Scientist or Product Manager) regarding the definition of a metric or a data pipeline requirement. How did you resolve it?
#Conflict Resolution
#Communication
#Cross-functional Collaboration
Data Engineer
•
Behavioral
•
medium
Tell me about a time you identified a major bottleneck or inefficiency in an existing data pipeline. What steps did you take to optimize it, and what was the impact?
#Impact
#Proactivity
#Optimization
Data Engineer
•
Behavioral
•
easy
How do you stay current with rapidly evolving data engineering tools and practices?
#Growth Mindset
Data Engineer
•
Behavioral
•
easy
Describe your experience mentoring junior data engineers.
#Mentoring
#Collaboration
Data Engineer
•
Behavioral
•
medium
Tell me about a time you onboarded a new data source that had significant quality issues.
#Problem Solving
Data Engineer
•
Behavioral
•
hard
Describe how you've balanced technical debt vs. new feature development in a data platform.
#Prioritization
Data Engineer
•
Behavioral
•
medium
Tell me about a time you significantly improved the performance of a data system.
#Performance
#Optimization
Data Engineer
•
Coding
•
easy
Given a `friend_requests` table (sender_id, receiver_id, date, status) and an `acceptances` table, write a SQL query to find the overall acceptance rate of friend requests by date.
#Joins
#Aggregations
#Ratios
Data Engineer
•
Coding
•
medium
Given a `user_logins` table with `user_id` and `login_date`, write a SQL query to calculate the 7-day rolling average of Daily Active Users (DAU) for the last 30 days.
#Window Functions
#Rolling Averages
#DAU
Data Engineer
•
Coding
•
easy
Given an array of integers representing the number of likes on a user's posts, write a Python function to move all zeros (posts with zero likes) to the end of the array while maintaining the relative order of the non-zero elements. Do this in-place.
#Arrays
#Two Pointers
#In-place Manipulation
Data Engineer
•
Coding
•
medium
Write a Python script to parse a massive JSONL file (100GB+) containing WhatsApp message metadata. Calculate the total number of messages sent per country code. You cannot load the entire file into memory.
#File I/O
#Memory Management
#Generators
#JSON
Data Engineer
•
Coding
•
medium
Write a SQL query to find users who have interacted with a Meta ad and subsequently made a purchase on the advertiser's website within 24 hours. You have an `ad_clicks` table and a `conversions` table.
#Joins
#Date/Time Functions
#Attribution
Data Engineer
•
Coding
•
medium
Write a SQL query to compute a 7-day rolling average of daily sales.
#Window Functions
#Analytics
Data Engineer
•
Coding
•
medium
Write a SQL query to find the second highest salary per department.
#Window Functions
#SQL
Data Engineer
•
Coding
•
hard
Write a SQL query to calculate the retention rate of new users on a 1-day, 7-day, and 30-day basis. You are given a `user_activity` table with `user_id` and `activity_date`.
#Cohort Analysis
#Retention
#Self Joins
#Conditional Aggregation
Data Engineer
•
Coding
•
medium
Given a list of dictionaries representing Facebook post interactions (user_id, post_id, interaction_type, timestamp), write a Python script to return the top 3 most engaged posts for each interaction type.
#Dictionaries
#Heaps
#Data Aggregation
Data Engineer
•
Coding
•
medium
Write a Python function to merge overlapping user session intervals. Given an array of intervals where intervals[i] = [start_i, end_i], merge all overlapping intervals and return an array of the non-overlapping intervals.
#Arrays
#Sorting
#Intervals
Data Engineer
•
System Design
•
hard
How would you design a data warehouse for a ride-sharing company from scratch?
#Architecture
#Design
Data Engineer
•
System Design
•
hard
Design a system to detect ad-click fraud in real-time. The system processes billions of events per day and needs to flag suspicious IPs or user accounts within seconds.
#Real-time Processing
#Fraud Detection
#Distributed Systems
#Caching
Data Engineer
•
System Design
•
hard
Design a data pipeline to process and store telemetry data for Instagram Reels. The pipeline needs to support real-time dashboarding for creators and batch processing for machine learning recommendations.
#Lambda Architecture
#Kafka
#Stream Processing
#Data Warehousing
Data Engineer
•
System Design
•
hard
Design an ad delivery data pipeline that tracks impressions at 10M/sec.
#Streaming
#Scale
Data Engineer
•
System Design
•
hard
How would you design Meta's data pipeline for News Feed ranking signals?
#Ranking
#Pipeline
Data Engineer
•
System Design
•
hard
Design an ETL pipeline that ingests 10TB of raw clickstream data daily.
#ETL
#Batch Processing
Data Engineer
•
System Design
•
hard
Design a data model for an e-commerce platform tracking orders, users, and products.
#ER Modeling
#Dimensional Modeling
Data Engineer
•
System Design
•
hard
How would you design a real-time anomaly detection pipeline for 100K events/sec?
#Real-Time
#Anomaly Detection
Data Engineer
•
System Design
•
hard
How would you design a data pipeline that needs exactly-once delivery guarantees?
#Exactly-Once
#Kafka
Data Engineer
•
Technical
•
hard
Explain how Meta uses Scribe for structured logging at petabyte scale.
#Scribe
#Infrastructure
Data Engineer
•
Technical
•
medium
Explain the difference between OLAP and OLTP systems. When would you use each?
#OLAP
#OLTP
#Databases
Data Engineer
•
Technical
•
hard
What is a slowly changing dimension (SCD)? Describe SCD Type 1, 2, and 3 with examples.
#SCD
#Dimensional Modeling
Data Engineer
•
Technical
•
hard
How would you optimize a SQL query that is running slowly on a 1 billion row table?
#Query Optimization
#Indexing
Data Engineer
•
Technical
•
medium
Explain the difference between RANK(), DENSE_RANK(), and ROW_NUMBER().
#Window Functions
#SQL
Data Engineer
•
Technical
•
medium
What is a materialized view? How does it differ from a regular view?
#Materialized Views
#Performance
Data Engineer
•
Technical
•
hard
Describe partitioning strategies in a data warehouse. When would you use range vs hash partitioning?
#Partitioning
#Performance
Data Engineer
•
Technical
•
medium
What are CTEs (Common Table Expressions) and how do they differ from subqueries?
#CTEs
#SQL
Data Engineer
•
Technical
•
medium
Explain ACID properties. Which databases sacrifice ACID for performance and why?
#ACID
#Distributed Systems
Data Engineer
•
Technical
•
hard
How do you handle late-arriving data in a streaming pipeline?
#Kafka
#Watermarks
Data Engineer
•
Technical
•
medium
What is idempotency and why is it critical in data pipelines?
#Idempotency
#Data Quality
Data Engineer
•
Technical
•
hard
Explain the Lambda architecture. What are its tradeoffs vs Kappa architecture?
#Lambda
#Kappa
#Streaming
Data Engineer
•
Technical
•
hard
What is backfilling? How do you handle a backfill of 2 years of historical data without impacting production?
#Backfill
#Airflow
Data Engineer
•
Technical
•
medium
How do you monitor data pipeline health in production? What metrics do you track?
#Monitoring
#Alerting
Data Engineer
•
Technical
•
medium
What is Apache Airflow? How does it differ from Prefect or Dagster?
#Airflow
#Prefect
#Dagster
Data Engineer
•
Technical
•
easy
Explain the difference between push-based and pull-based data ingestion.
#Push
#Pull
#CDC
Data Engineer
•
Technical
•
hard
Explain how Apache Spark's execution model works. What is a DAG in Spark?
#Spark
#DAG
#Distributed Computing
Data Engineer
•
Technical
•
hard
What is data skew in Spark? How do you diagnose and fix it?
#Data Skew
#Performance
Data Engineer
•
Technical
•
hard
Explain the difference between map-side and reduce-side joins in MapReduce/Spark.
#Joins
#MapReduce
Data Engineer
•
Technical
•
medium
What is Apache Kafka? Explain topics, partitions, consumer groups, and offsets.
#Kafka
#Streaming
Data Engineer
•
Technical
•
medium
How does Kafka handle message ordering guarantees?
#Ordering
#Partitions
Data Engineer
•
Technical
•
medium
What is the CAP theorem? Give an example of a real-world system tradeoff.
#CAP
#Consistency
#Availability
Data Engineer
•
Technical
•
medium
Explain how Parquet and ORC file formats work and when you'd use each.
#Parquet
#ORC
#Columnar
Data Engineer
•
Technical
•
hard
What is Delta Lake? How does it provide ACID transactions on data lakes?
#Delta Lake
#ACID
#Time Travel
Data Engineer
•
Technical
•
medium
Explain compaction in Delta Lake / Iceberg. Why is it important?
#Compaction
#Performance
Data Engineer
•
Technical
•
medium
What is the star schema vs snowflake schema? When would you use each?
#Star Schema
#Snowflake Schema
Data Engineer
•
Technical
•
hard
What is Data Vault methodology? How does it differ from Kimball?
#Data Vault
#Kimball
Data Engineer
•
Technical
•
medium
Explain the concept of a data lakehouse. What are its advantages over a traditional data warehouse?
#Data Lakehouse
#Data Warehouse
Data Engineer
•
Technical
•
hard
How do you handle schema evolution in a data pipeline without breaking downstream consumers?
#Schema Evolution
#Backward Compatibility
Data Engineer
•
Technical
•
medium
What is a medallion architecture (Bronze/Silver/Gold)?
#Medallion
#Data Lake
Data Engineer
•
Technical
•
medium
How do you implement data quality checks in a production pipeline?
#Great Expectations
#Data Validation
Data Engineer
•
Technical
•
medium
What is data lineage and why is it important? How do you implement it?
#Lineage
#Metadata
Data Engineer
•
Technical
•
hard
How would you detect and handle data drift in a production system?
#Data Drift
#Monitoring
Data Engineer
•
Technical
•
medium
What is PII (Personally Identifiable Information) and how do you handle it in a data pipeline?
#PII
#Privacy
#Compliance
Data Engineer
•
Technical
•
medium
Explain the concept of a data catalog. What tools have you used?
#Data Catalog
#Metadata
Data Engineer
•
Technical
•
hard
Compare AWS Redshift, Google BigQuery, and Snowflake for a petabyte-scale warehouse.
#Redshift
#BigQuery
#Snowflake
Data Engineer
•
Technical
•
hard
How does BigQuery handle large joins efficiently? What is its columnar storage approach?
#BigQuery
#Columnar Storage
Data Engineer
•
Technical
•
medium
Explain the difference between S3, HDFS, and GCS for data storage.
#S3
#HDFS
#GCS
Data Engineer
•
Technical
•
medium
How would you reduce costs in a cloud-based data platform?
#Cloud
#Cost
Data Engineer
•
Technical
•
medium
What is infrastructure as code (IaC)? Have you used Terraform for data infrastructure?
#Terraform
#IaC
Data Engineer
•
Technical
•
hard
What is Presto? How does Meta use it at scale?
#Presto
#SQL
Data Engineer
•
Technical
•
medium
Describe how you'd implement circuit breakers in a data pipeline.
#Circuit Breakers
#Fault Tolerance
Data Engineer
•
Technical
•
hard
How would you handle data consistency across Meta's global sharded MySQL?
#Sharding
#Consistency
Data Engineer
•
Technical
•
medium
Design the data model for Facebook Marketplace. We need to track users, product listings, categories, and transactions. How would you structure the fact and dimension tables to allow product managers to analyze daily sales volume by category and user demographics?
#Dimensional Modeling
#Star Schema
#Fact Tables
#Dimension Tables
Data Engineer
•
Technical
•
hard
How would you handle late-arriving data in a daily ETL pipeline that computes Facebook's Daily Active Users (DAU)? Assume the pipeline runs at 2 AM UTC, but mobile clients might upload offline logs days later.
#ETL
#Late-Arriving Data
#Idempotency
#Backfilling
Data Engineer
•
Technical
•
medium
How do you design a Slowly Changing Dimension (SCD) Type 2 table for Facebook user profiles? Explain how you would handle updates to a user's 'current_city' while preserving the history of their previous locations.
#SCD Type 2
#Data Warehousing
#Historical Tracking
Data Scientist
•
Behavioral
•
medium
Tell me about a time when you discovered a significant flaw in your own analysis after you had already presented the findings to leadership. What did you do?
#Integrity
#Accountability
#Communication
Data Scientist
•
Behavioral
•
medium
Tell me about a data science project where the results surprised you. What did you do?
#Analytical Thinking
Data Scientist
•
Behavioral
•
medium
Describe how you communicated a complex model result to a non-technical stakeholder.
#Storytelling
Data Scientist
•
Behavioral
•
hard
Tell me about a time you had to push back on a business request for an analysis that would be misleading.
#Ethics
#Communication
Data Scientist
•
Behavioral
•
medium
Describe a project where you had to iterate significantly on your initial approach.
#Iteration
#Learning
Data Scientist
•
Behavioral
•
medium
How do you prioritize between multiple data science requests from different teams?
#Stakeholder Management
Data Scientist
•
Behavioral
•
hard
Tell me about a time your model failed in production. What did you learn?
#Production
#MLOps
Data Scientist
•
Behavioral
•
medium
How do you approach ethical considerations in ML model building?
#Fairness
#Bias
Data Scientist
•
Behavioral
•
hard
Describe a time you used data to challenge a widely held assumption in your organization.
#Influence
#Analytics
Data Scientist
•
Behavioral
•
medium
Tell me about a time when you strongly disagreed with a Product Manager about the interpretation of an A/B test result. How did you handle it and what was the outcome?
#Conflict Resolution
#Stakeholder Management
#Communication
Data Scientist
•
Coding
•
hard
Write a SQL query to calculate 30-day user retention.
#Retention
#Analytics
Data Scientist
•
Coding
•
hard
How would you write a funnel analysis query in SQL?
#Funnel
#Analytics
Data Scientist
•
Coding
•
medium
Write a query to identify duplicate records and deduplicate them.
#Deduplication
#Data Quality
Data Scientist
•
Coding
•
hard
Given a table `friend_requests` (sender_id, receiver_id, date, status), write a SQL query to calculate the overall friend acceptance rate for each day. Note that a request sent on day 1 might be accepted on day 3.
#Self Joins
#Window Functions
#Ratio Calculation
Data Scientist
•
Coding
•
medium
Given a pandas DataFrame containing user login logs with columns `user_id` and `login_timestamp`, write a Python function to find the longest consecutive streak of login days for each user.
#Python
#Pandas
#Data Manipulation
Data Scientist
•
Coding
•
medium
Write a SQL query to calculate the 7-day rolling retention rate of users who signed up through a specific marketing campaign. You have a `users` table (user_id, signup_date, campaign_id) and a `user_activity` table (user_id, activity_date).
#Retention
#Rolling Metrics
#Joins
Data Scientist
•
Coding
•
medium
Given two tables: `ad_campaigns` (campaign_id, advertiser_id, spend) and `ad_clicks` (click_id, campaign_id, user_id, timestamp), write a SQL query to find the top 5 advertisers with the highest number of clicks per dollar spent who have spent at least $10,000 in the last 30 days.
#Joins
#Aggregations
#Date Functions
Data Scientist
•
System Design
•
hard
Design a feature store. What are its key components?
#Feature Store
#MLOps
Data Scientist
•
System Design
•
hard
We are evaluating a new machine learning model for the Facebook News Feed ranking. The offline AUC is significantly higher than the production model, but an online A/B test shows no change in user engagement. What could be causing this discrepancy?
#Model Evaluation
#Offline vs Online Metrics
#Ranking
Data Scientist
•
System Design
•
hard
How would you build a recommendation system? Compare collaborative vs content-based filtering.
#Collaborative Filtering
#Content-Based
Data Scientist
•
System Design
•
hard
Design a real-time fraud detection system for a payments platform.
#Fraud Detection
#Real-Time ML
Data Scientist
•
System Design
•
hard
How would you build and deploy a churn prediction model?
#Churn
#MLOps
Data Scientist
•
System Design
•
hard
We want to test a new feature in Facebook Groups that relies heavily on user interaction. How would you design an A/B test for this feature, keeping in mind that treating one user might affect the experience of another user in the same group?
#Network Effects
#Cluster Randomization
#A/B Testing
Data Scientist
•
Technical
•
medium
What is transfer learning? How would you fine-tune a pre-trained model?
#Transfer Learning
#Fine-Tuning
Data Scientist
•
Technical
•
medium
What is embedding? How do word embeddings like Word2Vec and GloVe work?
#Embeddings
#Word2Vec
Data Scientist
•
Technical
•
medium
Explain batch normalization and why it helps training.
#Batch Normalization
#Training
Data Scientist
•
Technical
•
medium
What is a confidence interval? How does it differ from a prediction interval?
#Confidence Interval
#Intervals
Data Scientist
•
Technical
•
hard
Explain the curse of dimensionality and its implications for ML models.
#Dimensionality
#Feature Engineering
Data Scientist
•
Technical
•
medium
How would you detect and handle multicollinearity in a regression model?
#Multicollinearity
#Regression
Data Scientist
•
Technical
•
hard
Explain gradient boosting. How does XGBoost differ from a standard gradient boosting machine?
#Gradient Boosting
#XGBoost
Data Scientist
•
Technical
•
medium
How does a Random Forest work? What are its hyperparameters and how do you tune them?
#Random Forest
#Hyperparameter Tuning
Data Scientist
•
Technical
•
medium
What is regularization? Explain L1 vs L2 regularization and their effects.
#Regularization
#L1
#L2
Data Scientist
•
Technical
•
medium
How do you handle class imbalance in a classification problem?
#Imbalanced Data
#SMOTE
Data Scientist
•
Technical
•
medium
Explain the ROC curve and AUC metric. When would you prefer AUC over accuracy?
#ROC
#AUC
#Metrics
Data Scientist
•
Technical
•
hard
If we introduce a new ad format in the Instagram feed that increases ad revenue by 10% but decreases user time spent by 1%, how do you decide whether to roll this out globally?
#Trade-offs
#LTV
#Monetization
Data Scientist
•
Technical
•
hard
Meta wants to estimate the proportion of fake accounts on the platform. Since we cannot manually review billions of accounts, how would you design a sampling strategy to estimate this proportion with a 95% confidence interval and a 1% margin of error?
#Sampling
#Confidence Intervals
#Sample Size Calculation
Data Scientist
•
Technical
•
medium
Facebook Marketplace is seeing a high number of items listed but a low number of transactions completed. What metrics would you look at to understand where the friction is, and what product changes would you propose?
#Funnel Analysis
#Marketplace
#Conversion Rate
Data Scientist
•
Technical
•
medium
The number of daily active users (DAU) on WhatsApp has dropped by 3% week-over-week. Walk me through how you would investigate the root cause of this decline.
#Root Cause Analysis
#DAU
#Metric Debugging
Data Scientist
•
Technical
•
medium
You have two coins. One is fair, and the other is biased, coming up heads 75% of the time. You pick one coin at random and flip it three times. It comes up heads all three times. What is the probability that you picked the biased coin?
#Bayes' Theorem
#Conditional Probability
Data Scientist
•
Technical
•
hard
We ran an A/B test on Instagram where we increased the size of the 'Like' button. The results show a 5% increase in likes, but a 2% decrease in comments. Would you launch this change? How do you investigate further?
#Experimentation
#Trade-offs
#Statistical Significance
Data Scientist
•
Technical
•
medium
Meta is considering launching a new feature that allows users to tip creators on Reels. How would you determine if this feature is successful?
#Product Strategy
#Metrics Definition
#Reels
Data Scientist
•
Technical
•
medium
What is cross-validation? Explain k-fold and stratified k-fold.
#Cross Validation
#k-Fold
Data Scientist
•
Technical
•
easy
What is the difference between Type I and Type II errors?
#Hypothesis Testing
#Errors
Data Scientist
•
Technical
•
medium
Explain the bias-variance tradeoff. How does it influence model selection?
#Bias-Variance
#Model Selection
Data Scientist
•
Technical
•
medium
How do you approach feature selection?
#Feature Selection
#LASSO
Data Scientist
•
Technical
•
hard
Explain Bayesian vs Frequentist statistics. When would you use each?
#Bayesian
#Frequentist
Data Scientist
•
Technical
•
hard
How does Meta measure the ROI of a Reels algorithm change?
#Reels
#Metrics
Data Scientist
•
Technical
•
hard
What is a 'triggered analysis' in the context of Meta's experimentation framework?
#Triggered Analysis
Data Scientist
•
Technical
•
hard
How would you detect bot/spam activity using data analysis at Meta?
#Bot Detection
#Analytics
Data Scientist
•
Technical
•
medium
Explain the difference between bagging and boosting.
#Bagging
#Boosting
Data Scientist
•
Technical
•
medium
What is principal component analysis (PCA)? What are its limitations?
#PCA
#SVD
Data Scientist
•
Technical
•
medium
Explain how backpropagation works.
#Backpropagation
#Neural Networks
Data Scientist
•
Technical
•
hard
What is the vanishing gradient problem? How do LSTM and ResNet address it?
#LSTM
#ResNet
#Gradients
Data Scientist
•
Technical
•
medium
What is a p-value? Why is a p-value of 0.05 not always sufficient?
#Hypothesis Testing
#p-value
Data Scientist
•
Technical
•
medium
Explain the central limit theorem and its importance in data science.
#CLT
#Sampling
Data Scientist
•
Technical
•
hard
Explain the transformer architecture. What are attention mechanisms?
#Transformers
#Attention
#BERT
Data Scientist
•
Technical
•
medium
How would you approach an NLP problem like sentiment analysis from scratch?
#Sentiment Analysis
#Text Classification
Data Scientist
•
Technical
•
hard
How do you monitor model performance in production? What is model drift?
#Model Drift
#Monitoring
Data Scientist
•
Technical
•
easy
Explain the difference between INNER JOIN, LEFT JOIN, and CROSS JOIN.
#Joins
#SQL
Data Scientist
•
Technical
•
easy
What is an experiment holdout group?
#Holdout
#Control Group
Data Scientist
•
Technical
•
hard
How would you identify the root cause of a sudden 20% drop in DAU?
#Root Cause Analysis
#Debugging
Data Scientist
•
Technical
•
easy
Explain the difference between a leading indicator and a lagging indicator.
#Metrics
#KPIs
Data Scientist
•
Technical
•
medium
How do you choose a north star metric for a product?
#Metrics
#Product Strategy
Data Scientist
•
Technical
•
hard
What is a network effect in experimentation? How do you handle SUTVA violation?
#SUTVA
#Network Effects
Data Scientist
•
Technical
•
hard
How would you design an experiment to measure the impact of a new ranking algorithm?
#Experimentation
#Metrics
Data Scientist
•
Technical
•
hard
What is the multiple testing problem? How do you correct for it?
#Bonferroni
#FDR
Data Scientist
•
Technical
•
hard
How do you design an A/B test for a new product feature?
#A/B Testing
#Statistics
Data Scientist
•
Technical
•
medium
How would you detect and mitigate overfitting in a neural network?
#Overfitting
#Dropout
#Regularization
Machine Learning Engineer
•
Behavioral
•
medium
Describe a situation where a machine learning model you deployed degraded in production. How did you detect the degradation, and what steps did you take to resolve it?
#Model Monitoring
#Incident Response
#Ownership
Machine Learning Engineer
•
Behavioral
•
medium
Give an example of a project where you had to pivot your technical strategy halfway through due to changing business requirements or unexpected technical roadblocks.
#Agility
#Problem Solving
#Resilience
Machine Learning Engineer
•
Behavioral
•
medium
Tell me about a time you had a fundamental disagreement with a cross-functional partner, such as a Product Manager, regarding the choice of an ML metric versus a business metric.
#Conflict Resolution
#Cross-functional Collaboration
#Business Acumen
Machine Learning Engineer
•
Behavioral
•
medium
Tell me about a time you had to make a difficult trade-off between model accuracy and inference latency. How did you approach the decision?
#Trade-offs
#Optimization
#System Constraints
Machine Learning Engineer
•
Coding
•
medium
Given two sparse vectors represented as arrays of non-zero elements and their indices, write a function to compute their dot product. Optimize for both time and space complexity.
#Arrays
#Hash Table
#Two Pointers
Machine Learning Engineer
•
Coding
•
medium
Write a function to sample a batch of data from a large dataset on disk without loading the entire dataset into memory. Implement a custom PyTorch-like DataLoader class with __iter__ and __next__ methods that supports shuffling and batching.
#Object-Oriented Design
#Data Structures
#Generators
Machine Learning Engineer
•
Coding
•
medium
Given a binary tree, write an algorithm to find the lowest common ancestor (LCA) of two given nodes. Assume each node has a pointer to its parent.
#Trees
#Pointers
#Hash Table
Machine Learning Engineer
•
Coding
•
medium
Implement a function to calculate the Intersection over Union (IoU) of two bounding boxes. Extend this to implement Non-Maximum Suppression (NMS) for a list of bounding boxes and their confidence scores.
#Computer Vision
#Geometry
#Sorting
Machine Learning Engineer
•
System Design
•
hard
Design a multimodal content moderation system to detect hate speech in Facebook posts containing both text and images. How do you fuse the modalities?
#Multimodal ML
#Classification
#NLP
#Computer Vision
Machine Learning Engineer
•
System Design
•
hard
Design an Ads Click-Through Rate (CTR) prediction system for Meta's news feed. How do you handle the extreme class imbalance and delayed feedback in ad clicks?
#Ads Ranking
#Imbalanced Data
#Streaming Pipelines
#DLRM
Machine Learning Engineer
•
System Design
•
hard
Design the machine learning architecture for Instagram Reels recommendations. How would you structure the funnel from candidate generation to final ranking?
#Recommendation Systems
#Two-Tower Models
#Ranking
#Candidate Generation
Machine Learning Engineer
•
System Design
•
hard
Design the 'People You May Know' (PYMK) feature. How would you scale the graph traversals and ML inference to billions of users?
#Graph Neural Networks
#Link Prediction
#Batch Processing
#Scalability
Machine Learning Engineer
•
Technical
•
hard
You are training a large PyTorch model across multiple GPUs using DistributedDataParallel (DDP) and notice that the GPU utilization is consistently low (around 30%). How do you diagnose and fix this?
#PyTorch
#Distributed Training
#Performance Profiling
Machine Learning Engineer
•
Technical
•
medium
In a deep learning recommendation model (DLRM), how do you handle the explosion of vocabulary size for categorical features like user IDs or item IDs?
#Embeddings
#Hashing
#Memory Optimization
Machine Learning Engineer
•
Technical
•
medium
Explain the difference between Contrastive Loss and Triplet Loss. In what scenarios would you choose one over the other for training a retrieval model?
#Loss Functions
#Metric Learning
#Retrieval
ML Engineer
•
Behavioral
•
medium
Describe how you collaborated with data scientists to productionize their research code.
#Research to Production
ML Engineer
•
Behavioral
•
medium
Describe a model you deployed to production. What were the biggest challenges?
#Deployment
#Challenges
ML Engineer
•
Behavioral
•
medium
What is Meta's approach to responsible AI?
#Responsible AI
#Fairness
ML Engineer
•
Behavioral
•
medium
How do you decide when a model is 'good enough' to ship?
#Quality
#Judgment
ML Engineer
•
Behavioral
•
medium
Tell me about a disagreement you had with a researcher. How did you resolve it?
#Communication
ML Engineer
•
Behavioral
•
hard
Describe a time you had to re-architecture a system because the original ML approach didn't scale.
#Scalability
ML Engineer
•
Behavioral
•
easy
How do you keep up with the rapidly evolving ML landscape?
#Continuous Learning
ML Engineer
•
Behavioral
•
hard
Tell me about a time an ML model caused an unexpected real-world impact.
#Responsibility
#AI Safety
ML Engineer
•
Behavioral
•
hard
Tell me about a time you had to optimize a model for latency without sacrificing too much accuracy.
#Latency
#Accuracy
ML Engineer
•
Coding
•
hard
How would you write a batched inference pipeline using Python and Triton server?
#Triton
#Batching
ML Engineer
•
Coding
•
hard
Implement a K-means clustering algorithm from scratch in Python.
#K-Means
#Clustering
ML Engineer
•
Coding
•
hard
Implement logistic regression with gradient descent in NumPy.
#Logistic Regression
#NumPy
ML Engineer
•
Coding
•
hard
Write a custom PyTorch Dataset and DataLoader for irregular time series data.
#PyTorch
#DataLoader
ML Engineer
•
Coding
•
medium
Implement a sliding window approach to detect anomalies in a time series.
#Anomaly Detection
#Time Series
ML Engineer
•
System Design
•
hard
What is a feature store? Design one from scratch.
#Feature Engineering
#MLOps
ML Engineer
•
System Design
•
hard
Design a training and serving architecture for a large language model at scale.
#Infrastructure
#Scale
ML Engineer
•
System Design
•
hard
Design a real-time content moderation system.
#NLP
#Real-Time
ML Engineer
•
System Design
•
hard
Design a search ranking system for an e-commerce platform.
#Ranking
#Relevance
ML Engineer
•
System Design
•
hard
Design YouTube's video recommendation system end to end.
#Recommendations
#Ranking
ML Engineer
•
System Design
•
hard
How would you serve a model that needs to respond in under 10ms?
#Low Latency
#Serving
ML Engineer
•
System Design
•
hard
Design a system to retrain models automatically when performance degrades.
#Retraining
#Automation
ML Engineer
•
System Design
•
hard
How would you build a personalized ad targeting system?
#Targeting
#ML Systems
ML Engineer
•
System Design
•
hard
Design a CI/CD pipeline for ML models.
#CI/CD
#Deployment
ML Engineer
•
Technical
•
hard
Explain blue-green deployment vs canary deployment for ML models.
#Blue-Green
#Canary
ML Engineer
•
Technical
•
hard
How do you detect data drift vs model drift? How do you respond to each?
#Drift
#Production
ML Engineer
•
Technical
•
medium
What is shadow mode deployment in ML?
#Shadow Mode
#A/B Testing
ML Engineer
•
Technical
•
medium
Explain model serialization formats: ONNX, TorchScript, SavedModel.
#ONNX
#Serialization
ML Engineer
•
Technical
•
medium
What is Kubernetes? How is it used for ML model serving?
#Kubernetes
#Serving
ML Engineer
•
Technical
•
hard
How do you optimize GPU utilization during training?
#GPU
#Performance
ML Engineer
•
Technical
•
hard
Explain mixed precision training (FP16/BF16). What are the risks?
#Mixed Precision
#Performance
ML Engineer
•
Technical
•
medium
What are the differences between PyTorch and TensorFlow for production?
#PyTorch
#TensorFlow
ML Engineer
•
Technical
•
medium
How do you profile and debug a slow training run?
#Profiling
#Debugging
ML Engineer
•
Technical
•
hard
Explain the RLHF (Reinforcement Learning from Human Feedback) training approach.
#RLHF
#Fine-Tuning
ML Engineer
•
Technical
•
hard
What is LoRA (Low-Rank Adaptation)? How does it reduce fine-tuning costs?
#LoRA
#Fine-Tuning
ML Engineer
•
Technical
•
hard
What is RAG (Retrieval-Augmented Generation)? Describe its architecture.
#RAG
#Vector Search
ML Engineer
•
Technical
•
hard
How would you evaluate an LLM for a production use case?
#Evaluation
#Benchmarking
ML Engineer
•
Technical
•
medium
Explain vector databases. What are FAISS, Pinecone, and Weaviate?
#Vector DB
#Embeddings
ML Engineer
•
Technical
•
medium
What is model ensembling? When does it help, and when does it hurt?
#Ensembling
#Performance
ML Engineer
•
Technical
•
hard
How does PyTorch Distributed work for large-scale model training at Meta?
#PyTorch
#DDP
ML Engineer
•
Technical
•
hard
Explain how Meta's DLRM (Deep Learning Recommendation Model) works.
#DLRM
#Embeddings
ML Engineer
•
Technical
•
easy
What is the difference between a data scientist and an ML engineer?
#Roles
#MLOps
ML Engineer
•
Technical
•
medium
Explain the model training pipeline from raw data to deployment.
#Pipeline
#Training
ML Engineer
•
Technical
•
medium
What is the difference between online learning and offline learning?
#Online Learning
#Batch Learning
ML Engineer
•
Technical
•
medium
How do you handle missing data in ML model features?
#Imputation
#Missing Data
ML Engineer
•
Technical
•
medium
Explain gradient descent variants: batch, stochastic, and mini-batch.
#Gradient Descent
#Optimization
ML Engineer
•
Technical
•
medium
What are learning rate schedulers and why are they important?
#Learning Rate
#Training
ML Engineer
•
Technical
•
hard
Explain the attention mechanism in transformers with mathematical detail.
#Attention
#Transformers
ML Engineer
•
Technical
•
hard
What is quantization in neural networks? How does it reduce inference cost?
#Quantization
#Inference
ML Engineer
•
Technical
•
hard
Explain knowledge distillation. When would you use it?
#Distillation
#Compression
ML Engineer
•
Technical
•
hard
What is the difference between model parallelism and data parallelism in distributed training?
#Parallelism
#Training
ML Engineer
•
Technical
•
medium
How do you version ML models and datasets? What tools do you use?
#Versioning
#DVC
#MLflow
Product Manager
•
Behavioral
•
medium
Tell me about a time a product launch failed to meet its goals. What did you learn, and how did you communicate the failure to Meta leadership?
#Accountability
#Post-mortem
#Resilience
Product Manager
•
Behavioral
•
medium
Tell me about a time you had to pivot your product roadmap significantly due to a sudden shift in company priorities or resource constraints.
#Adaptability
#Prioritization
#Stakeholder Management
Product Manager
•
Behavioral
•
medium
Tell me about a time you strongly disagreed with an Engineering Lead on a technical trade-off. How did you resolve it?
#Conflict Resolution
#Cross-functional Collaboration
#Influence without Authority
Product Manager
•
Coding
•
medium
Write a SQL query to find the top 10% of users by engagement (defined as messages sent) on Facebook Marketplace over the last 30 days.
#Window Functions
#Data Analysis
#Marketplace
Product Manager
•
System Design
•
hard
Design a system to detect and mitigate spam accounts generated by AI bots on Facebook Threads.
#Trust & Safety
#Machine Learning
#System Architecture
Product Manager
•
System Design
•
hard
Design the high-level architecture for a real-time collaborative feature in Instagram Notes where multiple users can type simultaneously.
#Scalability
#Real-time Systems
#Concurrency
Product Manager
•
Technical
•
medium
We want to launch a new paid monetization feature for WhatsApp Business. How do you size the market and set year-one revenue goals?
#Market Sizing
#Pricing Strategy
#B2B
Product Manager
•
Technical
•
hard
How would you integrate Generative AI (like Llama 3) into Facebook Messenger to increase daily active users among Gen Z?
#Generative AI
#User Segmentation
#Feature Design
Product Manager
•
Technical
•
hard
Meta's overall ad revenue dropped by 5% yesterday. Walk me through how you would investigate the root cause of this decline.
#Root Cause Analysis
#Metrics
#Advertising Ecosystem
Product Manager
•
Technical
•
medium
You are the Product Manager for Instagram Reels. What is your North Star metric, and what counter-metrics would you track to ensure you aren't hurting the broader Instagram ecosystem?
#Goal Setting
#Ecosystem Impact
#Cannibalization
Product Manager
•
Technical
•
hard
WhatsApp Status usage is up 20%, but direct messages sent are down 5%. Is this a good or bad thing? How do you decide whether to intervene?
#A/B Testing
#Trade-offs
#Data Interpretation
Product Manager
•
Technical
•
hard
Design a feature for Meta Quest to improve remote work collaboration for enterprise clients.
#AR/VR
#Enterprise Software
#Innovation
Product Manager
•
Technical
•
hard
You are running an A/B test for a new Feed ranking algorithm. Engagement (likes/comments) goes up by 4%, but average session length goes down by 2%. Do you launch this change?
#Experimentation
#Ranking Algorithms
#Decision Making
Product Manager
•
Technical
•
medium
Meta is considering entering the travel space. How would you design a travel discovery product integrated into Instagram?
#Market Expansion
#User Journey
#Monetization
Product Manager
•
Technical
•
easy
Describe a product you recently used that you loved, and tell me how you would improve it if you were the PM.
#Product Critique
#User Empathy
#Design Thinking
Software Engineer
•
Behavioral
•
medium
Describe a project where you had to 'Move Fast' but realized you were accumulating significant technical debt. How did you balance the need for speed with long-term maintainability?
#Trade-offs
#Technical Debt
#Move Fast
Software Engineer
•
Behavioral
•
medium
Tell me about a time you identified a problem outside of your immediate team's scope and took the initiative to solve it.
#Ownership
#Cross-functional Collaboration
#Focus on Long-Term Impact
Software Engineer
•
Behavioral
•
medium
Tell me about a time you received critical feedback from a peer or manager that surprised you. How did you process it and what changes did you make?
#Self-Awareness
#Growth Mindset
#Feedback
Software Engineer
•
Behavioral
•
medium
Tell me about a time you had a technical disagreement with a senior engineer or tech lead. How did you handle the situation, and what was the outcome?
#Conflict Resolution
#Communication
#Be Direct and Respect Your Colleagues
Software Engineer
•
Coding
•
medium
Given a list of user session intervals [start, end] representing the time a user spent active on the Facebook app, merge all overlapping sessions to return an array of non-overlapping continuous active sessions.
#Sorting
#Arrays
#Intervals
Software Engineer
•
Coding
•
medium
Given two nodes p and q in a binary tree representing an organizational chart, return their lowest common ancestor (LCA). Each node has a reference to its parent node, but you do not have a reference to the root.
#Trees
#Pointers
#Linked Lists
Software Engineer
•
Coding
•
medium
Given an array of integers representing daily ad click counts and a target integer k, return the total number of continuous days (subarrays) whose sum of clicks equals exactly k.
#Prefix Sum
#Hash Table
#Arrays
Software Engineer
•
Coding
•
easy
Given a string s representing a user's text input, return true if the string can be a palindrome after deleting at most one character.
#Two Pointers
#String Manipulation
Software Engineer
•
Coding
•
medium
Given a string of '(' , ')' and lowercase English characters representing a malformed mathematical expression in a Meta internal tool. Remove the minimum number of parentheses so that the resulting string is valid and return it.
#Stacks
#String Manipulation
Software Engineer
•
Coding
•
medium
Given the root of a binary tree representing a hierarchical DOM structure, return the vertical order traversal of its nodes' values (from top to bottom, column by column).
#Breadth-First Search
#Hash Table
#Binary Trees
Software Engineer
•
System Design
•
hard
Design a system to track and display 'Trending Topics' (Top K Heavy Hitters) across Facebook and Instagram in real-time. The system must process billions of events per day.
#Stream Processing
#Count-Min Sketch
#MapReduce
#Kafka
Software Engineer
•
System Design
•
hard
Design the Facebook News Feed system. Users should be able to see a continuously updated list of posts from their friends and pages they follow, ranked by relevance.
#Distributed Systems
#Database Schema
#Caching
#Fan-out
Software Engineer
•
System Design
•
medium
Design a distributed Rate Limiter for the Meta Graph API. It needs to handle millions of requests per second globally, ensuring that individual third-party developers do not exceed their allocated quota.
#Rate Limiting
#Redis
#Distributed Consensus
Software Engineer
•
System Design
•
hard
Design a distributed web crawler that Meta could use to scrape the internet to build an index for a Large Language Model (LLM) training dataset. Focus on politeness, deduplication, and scale.
#Distributed Systems
#Graph Traversal
#Deduplication
#DNS Resolution
Software Engineer
•
System Design
•
hard
Design WhatsApp's end-to-end message delivery system. Focus on how messages are routed when a user is offline, and how media (images/videos) are handled efficiently.
#WebSockets
#Message Queues
#Blob Storage
#E2E Encryption
Difficulty Radar
Based on recent AI-sourced data.
Meet Your Interviewers
The "Standard" Interviewer
Senior EngineerFocuses on core competencies, system constraints, and clear communication.
SimulateUnwritten Rules
Think Out Loud
Always explain your thought process before writing code or drawing architecture.