Library

StatsTest Blog

Experimental design, data analysis, and statistical tooling for modern teams. No hype, just the math.

Cohen's Kappa
RelationshipJan 29New

Cohen's Kappa

Cohen's Kappa measures inter-rater agreement between two raters classifying items into categories. Use it to quantify how much two raters agree beyond what would be expected by chance.

Confounding: The One Thing That Breaks Every Observational Study
Causal InferenceJan 29New

Confounding: The One Thing That Breaks Every Observational Study

What confounding is, why it invalidates naive causal claims, and how to identify and handle confounders in product analytics and observational studies.

Cox Proportional Hazards Regression
Survival AnalysisJan 29New

Cox Proportional Hazards Regression

Cox Proportional Hazards Regression models the effect of multiple variables on time-to-event outcomes. Use it to identify which factors accelerate or delay an event such as churn, conversion, or failure.

Credible Intervals vs. Confidence Intervals: What Changes
Bayesian MethodsJan 29New

Credible Intervals vs. Confidence Intervals: What Changes

Understand the real difference between credible and confidence intervals. Learn what each actually means, when it matters, and how to interpret both correctly.

DAGs for Analysts: Drawing Your Assumptions Before You Analyze
Causal InferenceJan 29New

DAGs for Analysts: Drawing Your Assumptions Before You Analyze

How directed acyclic graphs help analysts identify confounders, avoid collider bias, and choose the right variables to control for in causal analysis.

Detecting Trends in Metrics: Mann-Kendall, LOESS, and Change Points
Time SeriesJan 29New

Detecting Trends in Metrics: Mann-Kendall, LOESS, and Change Points

How to statistically detect trends in product metrics using Mann-Kendall tests, LOESS smoothing, and change point analysis.

Comparing Pre/Post Periods: Difference-in-Differences for Product
Time SeriesJan 29New

Comparing Pre/Post Periods: Difference-in-Differences for Product

How to use difference-in-differences to measure product impact by comparing treatment and control groups across pre and post periods.

Double/Debiased Machine Learning: Causal Effects with Flexible Models
Causal InferenceJan 29New

Double/Debiased Machine Learning: Causal Effects with Flexible Models

How double/debiased machine learning combines ML flexibility with valid causal inference. Learn cross-fitting, Neyman orthogonality, and practical DML workflows.

Forecasting Product Metrics: ARIMA, Prophet, and When Simple Wins
Time SeriesJan 29New

Forecasting Product Metrics: ARIMA, Prophet, and When Simple Wins

A practical guide to forecasting product metrics with ARIMA, Prophet, and baseline methods. Learn when complexity helps and when it hurts.

Granger Causality: Does Feature Usage Actually Drive Retention?
Time SeriesJan 29New

Granger Causality: Does Feature Usage Actually Drive Retention?

How to use Granger causality to test whether feature usage predicts retention, and why correlation over time is not causation.

Instrumental Variables: Finding Natural Experiments in Product Data
Causal InferenceJan 29New

Instrumental Variables: Finding Natural Experiments in Product Data

How instrumental variables let you estimate causal effects when unmeasured confounding makes direct comparison impossible. Practical IV examples for tech.

Interrupted Time Series: Measuring Impact Without a Control Group
Time SeriesJan 29New

Interrupted Time Series: Measuring Impact Without a Control Group

How to use interrupted time series analysis to measure causal impact of launches, policy changes, and events without a control group.