Browse writing by topic. AI/ML, product, engineering, and investing now live under one blog view. For production AI systems, visit the dedicated AI Engineering column.
-
[2026] [Applied ML] A 90% Accurate Model That Still Loses Money: Why Churn Prediction Fails Without Uplift Thinking
[Applied ML] Why high-accuracy churn prediction can improve renewal rate while losing revenue, and how uplift thinking targets incremental renewals instead of risk scores.
-
[2026] [Applied ML] Beyond A/B Testing: A Practical Guide to Uplift Modeling in Industry
[Applied ML] How uplift modeling moves from average A/B test impact to user-level incremental effect, with CATE, meta-learners, Qini, AUUC, decile lift, and online ROI.
-
[2026] [LLM Evaluation] RAG Evaluation Guide: Metrics, Frameworks, and Python Examples
[LLM Evaluation] A practical guide to retrieval metrics, golden test sets, generation faithfulness, citations, no-answer behavior, and production monitoring.
-
[2026] [AI Engineering] Copilot Agent Golden Test Set: Cases, Rubrics, and Regression Gates
[AI Engineering] A practical guide to reusable Copilot agent test cases, rubrics, schema checks, regression gates, and release decisions.
-
[2026] [Experimentation] A/B Testing Sample Size in Python: Power, MDE, and Guardrails
[Experimentation] Calculate sample size, minimum detectable effect, statistical power, and guardrail planning before product experiments launch.
-
[2026] [Product Analytics] Causal Inference for Product Analytics
[Product Analytics] Connect experiments, observational data, ATE, CATE, uplift modeling, guardrails, and product decision quality.
-
[2026] [Trustworthy ML] Model Calibration in Python: Reliability Diagrams, ECE, and Decision Thresholds
[Trustworthy ML] Evaluate calibrated probabilities with reliability diagrams, expected calibration error, confidence bins, thresholds, and monitoring checks.
-
[2026] [LLM Evaluation] The Most Important Part of AI Agents Is Not Prompting. It Is Evaluation.
[LLM Evaluation] Why production AI agents need custom eval sets, trajectory checks, calibrated judges, regression tests, and business-ready metrics.
-
[2026] [AI Engineering] Testing and Evaluating Copilot Agents: From Demo to Reliable AI System
[AI Engineering] A schema-first guide to testing Copilot Studio agents with evaluation sets, custom graders, validation gates, human review, and post-publish monitoring.
-
[2026] PyTorch Review 01 (Popularized 2018-2020)
[ML] A practical PyTorch review of tensors, shapes, broadcasting, reshaping, and torch.distributions.
-
[2024] [MLOps] Leveraging Docker in Machine Learning and Data Science (Popularized 2014-2016)
[MLOps] A comprehensive guide to using Docker for ML/DS projects - from development to deployment
-
[2024] [MLOps] A Must-Have Skill for Efficient Model Management and Deployment (Popularized 2019-2021)
[MLOps] Understanding MLOps principles and implementation with MLflow and Weights & Biases
-
[2025] [ML] Trustworthy Machine Learning (Popularized 2018-2021)
[ML] What is trustworthy machine learning.
-
[2025] [ML] Fine-Tune BERT for Sentiment Analysis (Popularized 2018-2020)
[ML] sft Large Language Model
-
[2025] [ML] Fine-Tune BERT for Sentiment Analysis 02 (Popularized 2018-2020)
[ML] sft Tune Large Language Model2
-
[2025] [MLOps] Ray for Distributed ML (Popularized 2019-2021)
[MLOps] What is Ray.
-
[2024] [ML] Why Decoder-Only Architectures Became Standard in LLMs (Popularized 2020-2023)
[ML] Understanding the dominance of decoder-only architectures in modern LLMs
-
[2025] [ML] Building an Enterprise-Level AI Agent for Document Transformation (Popularized 2023-2025)
[ML] A comprehensive guide to building document processing AI agents using LlamaReport and LlamaCloud
-
[2025] [AI] Model Context Protocol (MCP): Connected AI Tools (Popularized 2024-2025)
[ML] A comprehensive guide to understanding and implementing the Model Context Protocol for AI integration
-
[2025] [ML] Uncertainty Quantification for LLMs with UQLM (Popularized 2024-2025)
[ML] A hands-on teaching guide for using UQLM to quantify and understand uncertainty in large language models.
-
[2025] [AI] Agentic AI Systems with n8n (Popularized 2023-2025)
[ML] A comprehensive guide to agentic AI system architecture, tool use, and production integration.
-
[2025] [Classical ML] Random Forest: Ensemble Learning Explained (Popularized 2001-2005)
[Classical ML] Dive into the mechanics, advantages, and applications of Random Forest - an ensemble learning algorithm that combines multiple decision trees for robust classification and regression tasks.
-
[2025] [ML] Machine Unlearning: A Complete Technical Guide (Popularized 2019-2023)
[ML] A step-by-step guide to implementing machine unlearning systems with algorithms, APIs, and monitoring processes.