Hypothesis Testing
Hypothesis Test Statistics
-test:
, Student's Test:
, Two-Sample -Test:
, Two-Sample Test:
, Paired Test:
Goodness-of-Fit:
where:
In the previous chapter we established the framework for estimating parameters of a population using sample data , and then characterising the uncertainty around these estimates using confidence intervals (i.e. since they will vary with every new sample). In this chapter we will heavily draw upon these tools we have created for computing confidence intervals to solve the inverse problem: if a sample(s) come from different population(s). For instance, we could be interested at claiming that the experiment results are not consistent with the previously reported ones, and the inconsistency is unlikely to have been caused due to pure chance. We call this branch of statistics hypothesis testing.
Generally, we approach hypothesis testing in a three-step fashion:
- First, we establish two contradictory hypotheses about the population: the null hypothesis (written ) and alternative hypothesis (written ). We are usually interested in reasoning about the alternative hypothesis, while the null hypothesis represents the status quo.
- Once we have a null hypothesis, we then obtain the sampling distribution for the statistic of interest (e.g. sample means, sample variance) assuming the null hypothesis was true - this is referred to as the null distribution.
- Finally, we collect a sample to compute our test statistic (e.g. sample mean), and see where this falls on our null distribution. If the test statistic falls on the tail(s) of the null distribution then we say we reject the null hypothesis (and thus accept the alternative hypothesis).
This is somewhat analogous to how a jury reaches a decision during a court trial. In this scenario, the null hypothesis is analogous to the defendant being innocent until "proven" guilty, and this null hypothesis is "accepted" unless substantial evidence (or in our case, our sample data) is provided to prove them guilty (i.e. alternate hypothesis, ) beyond any reasonable doubt.