Snowflake
Cloud data platform enabling data warehousing, data lakes, and data sharing.
4 Rounds
~21 Days
Hard
The Interview Loop
Recruiter Screen (30 min)
Standard fit check, behavioral questions, and resume overview.
Technical Loop (3-4 Rounds)
Deep dive into domain knowledge, coding, and system design.
Interview Question Bank
Cloud Engineer
•
Coding
•
medium
Given a list of IP CIDR blocks, write a function to merge all overlapping blocks and return the minimized list of CIDRs.
#Arrays
#Networking
#Intervals
#Sorting
Data Engineer
•
Coding
•
medium
Implement a rate limiter in Python that allows a maximum of N requests per minute per user. The function should return True if the request is allowed, and False otherwise.
#Hash Maps
#Queues
#Concurrency
#System Design Basics
Data Engineer
•
Coding
•
medium
Write a Python function to merge overlapping time intervals for user sessions. Given an array of intervals where intervals[i] = [start_i, end_i], merge all overlapping intervals and return an array of the non-overlapping intervals that cover all the intervals in the input.
#Arrays
#Sorting
#Intervals
#Python
Data Engineer
•
Coding
•
hard
Given an array of daily stock prices, find the maximum profit you can achieve with at most two transactions. You may not engage in multiple transactions simultaneously (i.e., you must sell the stock before you buy again).
#Dynamic Programming
#Arrays
#State Machine
Data Scientist
•
Coding
•
easy
You are given a dataset containing JSON strings in a VARIANT column representing query execution metadata. Write a Python script using pandas to parse this JSON, extract the 'bytes_scanned' and 'execution_time_ms' fields, and identify queries that scan massive data but execute suspiciously fast.
#Data Parsing
#JSON
#Pandas
Data Scientist
•
Coding
•
medium
Write a Python function that takes a list of query execution intervals (start_time, end_time) for a specific compute warehouse and an auto-suspend threshold (in seconds). Calculate the total billed uptime, keeping in mind that the warehouse stays on for the auto-suspend duration after the last query finishes.
#Merge Intervals
#Array Manipulation
#Python
Machine Learning Engineer
•
Coding
•
medium
Given a list of log entries containing timestamps and user IDs, write a function to find the longest continuous session for each user. A session ends if there is a gap of more than 30 minutes.
#Sliding Window
#Hash Maps
#Time Series Data
Machine Learning Engineer
•
Coding
•
medium
Implement a Trie (Prefix Tree) to support fast autocomplete for SQL keywords and table names in a web-based query editor.
#Trie
#String Manipulation
#Search
Machine Learning Engineer
•
Coding
•
hard
Design a concurrent hash map from scratch that supports high-throughput read and write operations, typical of a distributed database environment.
#Concurrency
#Data Structures
#Multithreading
#Locks
Machine Learning Engineer
•
Coding
•
hard
Given a list of words, a begin word, and an end word, find all shortest transformation sequences from the begin word to the end word. (Word Ladder II)
#Graphs
#BFS
#Backtracking
Software Engineer
•
Coding
•
medium
Given a list of materialized views and their dependencies on other views or base tables, write a function to determine a valid build order. If a circular dependency exists, detect and report it.
#Graph Theory
#Topological Sort
#Breadth-First Search
Software Engineer
•
Coding
•
hard
Implement a function that supports wildcard string matching with '?' and '*'. '?' matches any single character, and '*' matches any sequence of characters. Optimize it for large strings, simulating how a database engine might evaluate a complex LIKE clause.
#Dynamic Programming
#String Manipulation
#Greedy Algorithms
Software Engineer
•
Coding
•
hard
Write an algorithm to serialize and deserialize an N-ary tree. Assume this tree represents a SQL query execution plan where nodes are operators (Scan, Join, Filter) and edges are data flows.
#Trees
#Serialization
#Depth-First Search
Software Engineer
•
Coding
•
medium
Implement an algorithm to merge K sorted iterators. Assume this is part of an external sort operation where data exceeds available RAM, and you are merging sorted runs from disk.
#Heaps
#Pointers
#Sorting
Difficulty Radar
Based on recent AI-sourced data.
Meet Your Interviewers
The "Standard" Interviewer
Senior EngineerFocuses on core competencies, system constraints, and clear communication.
SimulateUnwritten Rules
Think Out Loud
Always explain your thought process before writing code or drawing architecture.