LTIMindtree

Global technology consulting and digital solutions company.

4 Rounds ~21 Days Medium

Start Mock Interview

The Interview Loop

Recruiter Screen (30 min)

Standard fit check, behavioral questions, and resume overview.

Technical Loop (3-4 Rounds)

Deep dive into domain knowledge, coding, and system design.

Interview Question Bank

All Roles Backend Engineer 69 Cloud Engineer 35 Data Engineer 50 Data Scientist 35 DevOps Engineer 35 Frontend Engineer 35 Full Stack Engineer 35 Machine Learning Engineer 35 Product Manager 35

All Topics Machine Learning 9 SQL 4 NLP 3 Pandas 2 GenAI 2 Algorithms 2 MLOps 2 Leadership 1

Data Scientist • Behavioral • medium

Tell me about a time you had to explain a complex machine learning model's predictions to a non-technical business stakeholder.

#Stakeholder Management #Explainable AI #Communication

Practice

Data Scientist • Behavioral • hard

Describe a situation where your model performed exceptionally well in training and testing but failed in production. How did you debug and fix it?

#Debugging #Data Drift #Production ML

Practice

Data Scientist • Behavioral • easy

Why do you want to join LTIMindtree, and how does your experience align with our focus on digital transformation for enterprise clients?

#Company Knowledge #Motivation #Consulting

Practice

Data Scientist • Behavioral • medium

Tell me about a time you had to work with a difficult client or team member to deliver a data science project on time.

#Conflict Resolution #Teamwork #Client Management

Practice

Data Scientist • Coding • easy

Write a Python function using Pandas to calculate the 7-day rolling average of daily sales data for a retail client.

#Python #Pandas #Time Series #Data Manipulation

Practice

Data Scientist • Coding • medium

Implement a custom cross-validation split function in Python for time-series data without using scikit-learn's TimeSeriesSplit.

#Python #Time Series #Cross Validation #Algorithms

Practice

Data Scientist • Coding • easy

Given a dictionary of employee names and their salaries, write a Python script to find the second highest salary efficiently.

#Python #Data Structures #Sorting

Practice

Data Scientist • Coding • medium

Write a SQL query to find all customers who purchased a product in three consecutive months.

#SQL #Window Functions #Date Functions

Practice

Data Scientist • Coding • medium

Calculate the cumulative sum of revenue partitioned by geographic region and ordered by transaction date.

#SQL #Window Functions #Aggregations

Practice

Data Scientist • Coding • hard

Identify duplicate records in a massive transaction table and write a query to delete them, keeping only the row with the lowest ID, without using a temporary table.

#SQL #Data Cleaning #CTEs

Practice

Data Scientist • Coding • medium

Given a string, write a Python program to find the length of the longest substring without repeating characters.

#Python #Sliding Window #Strings

Practice

Data Scientist • Coding • medium

Write a Pandas script to merge two datasets on a common key, but only keep rows where a specific column's value is above the 75th percentile of the merged dataset.

#Python #Pandas #Data Merging #Statistical Filtering

Practice

Data Scientist • Coding • medium

Write a SQL query to find the top 3 highest-grossing products per category. Assume a 'sales' table and a 'products' table.

#SQL #Window Functions #Joins

Practice

Data Scientist • Coding • medium

Implement a Python function from scratch to calculate the cosine similarity between two sparse vectors represented as dictionaries.

#Python #Math #Vectors #Algorithms

Practice

Data Scientist • Coding • medium

Write a Python generator function to read a massive 50GB CSV file line by line, process it, and yield the result to prevent memory overflow.

#Python #Generators #Memory Management #Big Data

Practice

Data Scientist • System Design • medium

Design a churn prediction system for a telecom client. What features would you engineer, and how would you frame the target variable?

#System Design #Feature Engineering #Classification

Practice

Data Scientist • System Design • medium

Walk me through how you would deploy a trained machine learning model as a REST API using FastAPI on Azure.

#Azure #FastAPI #Deployment #Docker

Practice

Data Scientist • System Design • hard

A retail client wants to forecast inventory demand for 10,000 SKUs for the next 4 weeks. Walk me through your end-to-end approach.

#Forecasting #System Design #Scalability #ARIMA/Prophet/LGBM

Practice

Data Scientist • System Design • medium

Explain how you would build a hybrid recommendation engine for an e-commerce platform.

#Collaborative Filtering #Content-Based Filtering #System Design

Practice

Data Scientist • System Design • hard

We have a client in the BFSI sector looking to automate loan approvals using ML. How do you ensure the model is fair, unbiased, and compliant with regulations?

#Fairness #Bias #BFSI #Regulatory Compliance

Practice

Data Scientist • System Design • medium

How do you monitor data drift and concept drift in a deployed machine learning model? What actions do you take if drift is detected?

#MLOps #Model Monitoring #Drift

Practice

Data Scientist • Technical • medium

Explain the bias-variance tradeoff. How does this concept apply differently to Random Forests compared to Gradient Boosting Machines?

#Theory #Ensemble Methods #Model Evaluation

Practice

Data Scientist • Technical • medium

We are building a fraud detection model for a banking client where the fraud rate is 0.01%. How do you handle this highly imbalanced dataset?

#Imbalanced Data #SMOTE #Class Weights #Evaluation Metrics

Practice

Data Scientist • Technical • hard

Explain the mathematical intuition behind Support Vector Machines (SVM) and the kernel trick. When would you use an RBF kernel over a linear kernel?

#SVM #Mathematics #Kernels

Practice

Data Scientist • Technical • medium

What is the difference between L1 (Lasso) and L2 (Ridge) regularization? How do they affect feature selection?

#Regularization #Feature Selection #Linear Models

Practice

Data Scientist • Technical • medium

How does XGBoost handle missing values internally without requiring explicit imputation beforehand?

#XGBoost #Missing Data #Tree Algorithms

Practice

Data Scientist • Technical • hard

Explain the architecture of Transformers and the mathematical mechanism behind scaled dot-product self-attention.

#Deep Learning #Transformers #Attention Mechanism

Practice

Data Scientist • Technical • medium

How would you extract specific entities like invoice numbers, dates, and amounts from a large corpus of unstructured PDF documents?

#OCR #NER #Information Extraction

Practice

Data Scientist • Technical • hard

How do you implement a Retrieval-Augmented Generation (RAG) pipeline for a corporate knowledge base? Discuss vector databases and chunking strategies.

#RAG #LLMs #Vector Databases #Embeddings

Practice

Data Scientist • Technical • medium

Explain the difference between fine-tuning an LLM (like Llama 2) and using prompt engineering with few-shot learning. When would you choose which?

#LLMs #Fine-tuning #Prompt Engineering

Practice

Data Scientist • Technical • medium

What are the core assumptions of Linear Regression? How do you detect and correct for heteroscedasticity?

#Linear Regression #Statistics #Assumptions

Practice

Data Scientist • Technical • easy

Explain the fundamental difference between bagging and boosting ensemble methods.

#Ensemble Methods #Bagging #Boosting

Practice

Data Scientist • Technical • medium

How do you choose the optimal number of clusters in a K-Means algorithm? Explain how the Silhouette score works.

#Clustering #K-Means #Evaluation Metrics

Practice

Data Scientist • Technical • medium

What evaluation metrics would you use for a multi-class classification problem where the classes are highly imbalanced?

#Metrics #Multi-class #Imbalanced Data

Practice

Data Scientist • Technical • easy

What is the difference between Word2Vec and TF-IDF? In what scenarios would you choose one over the other?

#NLP #Embeddings #Text Processing

Practice

Difficulty Radar

Based on recent AI-sourced data.

Meet Your Interviewers

The "Standard" Interviewer

Senior Engineer

Focuses on core competencies, system constraints, and clear communication.

Simulate

Unwritten Rules

Think Out Loud

Always explain your thought process before writing code or drawing architecture.

Practice Now