Library

StatsTest Blog

Experimental design, data analysis, and statistical tooling for modern teams. No hype, just the math.

StatsTest
DistributionsJan 26New

Comparing ARPU and ARPPU: Segmentation vs. Modeling Approaches

How to properly analyze revenue per user metrics in A/B tests. Learn the statistical pitfalls of ARPU vs. ARPPU, when to segment, and how to avoid Simpson's paradox.

StatsTest
Two-Group ComparisonsJan 26New

Comparing Medians: Statistical Tests and Better Options

When you need to compare medians instead of means, standard tests often fall short. Learn about Mood's median test, quantile regression, and bootstrap methods for proper median comparison.

StatsTest
Two-Group ComparisonsJan 26New

Comparing Rates: Events per User, Events per Time, and Rate Ratios

How to properly compare rates like clicks per user, purchases per session, or events per hour. Covers rate ratios, Poisson tests, and common pitfalls with ratio metrics.

StatsTest
Survival AnalysisJan 26New

Comparing Retention Curves Across Segments: Multiplicity and Visualization

A practical guide to comparing survival curves across multiple segments. Learn how to visualize multiple retention curves, handle multiple comparisons, and communicate segment differences clearly.

StatsTest
Model EvaluationJan 26New

Comparing Two Models: Win Rate, Binomial CI, and Proper Tests

How to rigorously compare two ML models using win rate analysis. Learn about binomial confidence intervals, significance tests, and how many examples you actually need.

StatsTest
Two-Group ComparisonsJan 26New

Comparing Variances: Levene's Test, Bartlett's Test, and the F-Test

When you need to test whether two or more groups have equal variances. Covers Levene's test, Bartlett's test, Brown-Forsythe, and when each is appropriate.

StatsTest
Effect SizesJan 26New

Confidence Intervals for Non-Normal Metrics: Bootstrap Methods

How to construct confidence intervals when your data isn't normal. Covers percentile, BCa, and studentized bootstrap methods with practical guidance on when each works best.

StatsTest
Multi-Group ComparisonsJan 26New

Controlling for Covariates: ANCOVA vs. Regression

When and how to control for covariates in group comparisons. Covers ANCOVA, regression adjustment, and the key assumptions that make covariate adjustment valid.

StatsTest
Survival AnalysisJan 26New

Cox Proportional Hazards: What 'Proportional' Actually Means

A practical guide to Cox regression for product analysts. Learn what the proportional hazards assumption means, how to check it, what to do when it fails, and how to interpret hazard ratios correctly.

StatsTest
A/B TestingJan 26New

CUPED and Variance Reduction: When It Helps and When It Backfires

Learn how CUPED (Controlled-experiment Using Pre-Experiment Data) can dramatically reduce variance in A/B tests, when to use it, and the pitfalls that can make it backfire.

StatsTest
DistributionsJan 26New

Dealing with Zeros: Zero-Inflated and Two-Part Models

How to handle metrics with many zeros—revenue from non-purchasers, engagement from inactive users, events that didn't happen. Learn when to use zero-inflated models, two-part models, and simpler alternatives.

StatsTest
DistributionsJan 26New

Delta Method vs. Bootstrap: When Each Is Appropriate

A practical guide to choosing between delta method and bootstrap for variance estimation. Learn when each approach excels, their assumptions, and how to implement both.