The Deployed Data Scientist

The Deployed Data Scientist: MLOps and Analytics in Practice, by Ankit Anand, Scott Burk, and Kinshuk Dutta

Transform your Machine Learning Operations (MLOps) projects into reliable and scalable data products that meet the complex demands of data science.

Topics

Chapter 1: The Mindset Shift

Field Story: The Ghost in the Join

Case Study: The Silent Model Failure

From Projects to Products

The Netflix Recommendation Engine

Illustrative Code Examples

A Bank’s Real-Time Fraud Detection System

The Deployed Data Scientist’s Manifesto

The Data Scientist as a Business Leader

The Ethics of Deployment: A Proactive Responsibility

Chapter Summary

Chapter 2: Data Strategy: The Lifeblood and Liability of Your Model

Beyond the CSV

The Modern Data Stack: The Best of All Worlds

Towards Resilient Data Products

The Philosophy of Model Decay: Why All Models Are Born to Fail

Field Story: When Seasons Change

The Necessary Guardrails

Building a Data-First Culture

Chapter Summary

Chapter 3: Forging Production-Ready Models

Strategic Model Selection

The Model Selection Decision Framework

Advanced Evaluation Metrics

For Probabilistic Predictions: Log-Loss

The Model Registry: Your Single Source of Truth

Model Interpretability and Explainability

A Preview of Common Techniques

Comprehensive Explainability Framework

Field Story: The Hyperparameter Nightmare

The Silent Failure

Chapter Summary

Chapter 4: Your Automated Assembly Line to Production

The Role of Containerization: Docker and Kubernetes

The ML Testing Pyramid: A Stratified Defense

Layer 2: Integration Tests (The Glue)

Layer 3: End-to-End Tests (The Dress Rehearsal)

Continuous Integration (CI): Your Automated Quality Gates

Continuous Delivery (CD): Your Automated Release Engine

Stories from the Assembly Line

Chapter Summary

Chapter 5: Cloud Infrastructure: Architecting a Home for Your Model

Architecting a Cloud Strategy: Choosing Your Foundation

The Two Deployment Philosophies: Managed vs. Self-Managed

Architecting for Performance and Cost: Serverless vs. Dedicated

Field Story: The Billion-Request Challenge

Chapter Summary

Chapter 6: Model Monitoring and Observability: The Unwavering Watchtower

The Watchtower and the Map of Failures

From Monitoring to Observability: A Critical Distinction

The Taxonomy of Failure: A Field Guide to Drift

Building Your Proactive Warning System

In the Trenches and in the Boardroom

The Business of MLOps: Quantifying the ROI

Chapter Summary

Chapter 7: Advanced Deployment: From Black Boxes to the Physical World

Explainable AI (XAI)

Human-in-the-Loop (HITL) Frameworks: AI-Human Collaboration

Edge Computing and TinyML: Deploying Intelligence on Devices

Chapter Summary

Chapter 8: The Next Frontier: Building with Generative AI and LLMOps

The LLM Lifecycle: A New Set of Rules

LLMOps in Practice: Building the Guardrails

Monitoring LLMs in Production: A New Breed of Metrics

Field Story: Building an AI Co-pilot for Data Governance

An Architect of Intelligence

Chapter Summary

Chapter 9: Data Leader’s Playbook: Navigating the New Frontier

The FAANG Fallacy: Why You Can’t ‘Throw Money at the Problem’

The Architect’s Choice: Buy the Infrastructure, Build the Advantage

The Internal Data Platform: A “Data Ecosystem” for Your Enterprise

Navigating the Great Consolidation: Enclosure vs. Unification

Building and Scaling Data and AI Teams

The Economics of AI: Managing the FinOps Frontier

Chapter Summary

This practical guide is for data scientists, machine learning engineers, data leaders, and analytics professionals who want to move beyond notebooks, experiments, and one-time models. Analyze the real reason so many machine learning projects fail, and you will find that the problem is often not the algorithm. It is the data pipeline, the deployment process, the missing monitoring, the weak governance, or the lack of business ownership. This book shows how to treat models as living data products that must be designed, deployed, monitored, improved, and trusted.

Explore the full MLOps lifecycle, from data strategy and data contracts to model engineering, CI/CD pipelines, cloud infrastructure, model observability, and production machine learning. Design systems that can handle schema changes, data drift, feature drift, silent failures, unreliable data feeds, and changing business needs. Apply practical thinking to modern data platforms, data warehouses, data lakes, lakehouses, streaming architecture, automated retraining, model registries, and the tools that help data teams build dependable AI systems.

Evaluate the next frontier of applied AI with chapters on LLMOps, generative AI, prompt engineering, Retrieval-Augmented Generation (RAG), hallucination monitoring, explainable AI (XAI), Human-in-the-Loop (HITL) systems, and responsible AI governance. Create better enterprise AI applications by understanding how large language models change the deployment game while still requiring the same discipline, testing, observability, cost management, and accountability that define strong MLOps.

Assess your role not just as a model builder, but as an owner of business outcomes. The Deployed Data Scientist helps readers connect data science, machine learning, data governance, AI strategy, model deployment, cloud architecture, and business value into one practical roadmap. Whether you are building your first production model or leading a team responsible for enterprise AI, this book gives you the mindset, methods, and language to turn data science into systems that work.

About Ankit, Scott, and Kinshuk

Ankit Anand is a strategic Data Management Leader with over 17 years of experience designing and implementing enterprise-scale data solutions. He has led complex, global data initiatives at enterprises including Koch Industries and Lennox Industries, focusing on CI/CD automation, data governance, and modern data stack implementation. At Koch, he architected a pioneering multi-tenant, multi-domain MDM strategy and led the implementation of a robust CI/CD pipeline.

Dr. Scott Burk is the founder of It’s All Analytics. He is the author of nine books on AI, data science, and analytics, including the It’s All Analytics Series. He currently teaches in the MS in Data Science program at CUNY and has taught and developed curriculum at Baylor, Texas A&M, and SMU. He has solved complex AI, statistical, and analytical problems at Dell, Texas Instruments, PayPal, eBay, Overstock.com, healthcare organizations, energy firms, semiconductor and manufacturing companies, startups, and many others worldwide. Scott holds a BS in biology and chemistry, MS degrees in finance, statistics, and data mining, and a PhD in statistics. Data has been the unifying thread across his professional experience.

Kinshuk Dutta is a technology leader with nearly two decades of experience driving innovation in Data, Integration & Cyber Security. He is the co-author of the recent best selling Data for AI, founder of Be Cognizant of Data (datanizant.com) where he advocates for responsible data practices and AI enablement, a named inventor on AI patent. Kinshuk is a Senior Member of IEEE, recognized for shaping the future of intelligent systems and guiding organization through AI-driven transformation.

Bestsellers

Faculty may request complimentary digital desk copies

Please complete all fields.

The Deployed Data Scientist PDF Instant Download quantity	The Deployed Data Scientist PDF Instant Download	Original price was: $49.95.Current price is: $44.95.
The Deployed Data Scientist Print Version quantity	The Deployed Data Scientist Print Version	$49.95