Conditional Probability and Chain Rule
We can also define the probability of an event conditional upon "evidence" of another event. Let's consider the following Venn diagram to illustrate this concept:
We can formally express this as: Let and be events with . The conditional probability (probability of given ) is defined as:
This represents the probability that will occur given that has occurred.
Conditioning on means that we are restricting the sample space to the outcomes contained in . We know that either or will occur.
Rearrangement of this definition can be further generalised to provide the Chain Rule in probability theory:
Derivation:
We first condition on :
Then we condition on :
Combining these equations results in the Chain Rule definition.
Note that conditioning changes the probability, not the event. In the same way that the function assigns a probability to any event , the function assigns a conditional probability (given ) to any event . Thus, all the identities we have encountered so far will still work, for example:
Example: Conditional Probability
Using the survey example from the previous section, what is the probability that a person has seen Game of Thrones, given that the said person is a Breaking Bad fan?
Using the same notation as before, we want to find . This is a straightforward application of the conditional-probability definition:
Since we know the probabilities and we can directly compute the answer:
Note that this probability is the same as the probability of selecting a Game of Thrones viewer at random from the list of Breaking Bad viewers. Conditional probabilities can be thought to be reducing the sample space to outcomes satisfied by the conditioned event only.