The mathematics of the last century worth of experiment design.
Compare with multiple testing
Probably the least sexy thing in statistics and as such, usually taught by the least interesting professor in the department, or at least one who couldn’t find an interesting enough excuse to get out of it, which is a fair indication. Said professor will then teach it to you as if you were in turn the least interesting student in the school, and so it goes on.
This is unfair, because it turns out to be elegant and powerful tool if you can move past block- and combinatorial design stamp collecting, which few classes do, because it is the easiest way to fill in those long lecture hours.
- Jonas Kristoffer Lindeløv’s anti stamp-collecting prescription is to unify a lot of the classic tests: Common statistical tests are linear models
- Lucile L, Robert Chang and Dmitriy Ryaboy of Twitter have a practical guide to risky testing at scale: Power, minimal detectable effect, and bucket size estimation in A/B tests
- Bob Sturm’s neat take, from Bailey, R. A. (2008). Design of Comparative Experiments. Cambridge; New York: Cambridge University Press.