Outcomes, Events, and Sample Space

Chapter 1 Notes

Author

Meike Niederhausen

Published

September 23, 2023

Modified

July 31, 2023

Note that I will keep the example and definition numbering that is presented in the textbook, so it may seem like numbers are being skipped. This just means I may not have presented an example from the textbook here. Feel free to consult the textbook for more examples!

1.1 Introduction

The introduction in this chapter starts with the following quote:

“Probability theory is the study of randomness and all things associated with randomness.”

I couldn’t say it better myself. There are examples with obvious hints at randomness (i.e. dice rolls, coins flips) and there are examples with not-so-obvious randomness involved (i.e. time until any given traffic light turns green).

Definition 1.1

When something happens at random there are several potential outcomes. Exactly one of the outcomes occurs. An event is defined to be a collection of some outcomes.

Our goal in probability and statistics is to characterize randomness. Then as statisticians, we can use what we know about randomness to see if certain features of the world is within randomness or outside the randomness. For example, according to a New York Times article published on July 25, 2023, the Education Department is investigating Harvard University’s legacy admissions policy. Here is an excerpt from that article:

Harvard gives preference to applicants who are recruited athletes, legacies, relatives of donors and children of faculty and staff. As a group, they make up less than 5 percent of applicants, but around 30 percent of those admitted each year. About 67.8 percent of these applicants are white, according to court papers.

Using probability, we can start to see why this practice might be inequitable. If these applicants make up 5% of the total applicant pool, and we completely, randomly chose applicants to admit, then we would expect 5% of admitted students to be this group of recruited athletes, legacies, relatives of donors, and children of faculty and staff. We could argue that recruited athletes are more likely to be admitted since they are, in fact, recruited, or that children of faculty and staff might “game” high school succesfully because they live in a home that subscribes to the academic world. If we want to compare the acceptance rate of 30% for this group to the average acceptance rate, the article actually fails to present a crucial peice of information: the average acceptance rate. I looked it up, and it is ~4%. So now we can start to think about questions like: With an average acceptance rate of 4%, does an acceptance of 30% of this group of recruited athletes, legacies, relatives of donors and children of faculty and staff make sense? We will explore this example further as we progress through our class.

Baaack to our definitions: When we have several potential outcomes, and an event is a collection of outcomes, then we can start thinking of the set of potential outcomes. When there are no outcomes, then we have the empty set \(\emptyset\). When we look at all outcomes, we call it the sample space \(S\). For now, when we look at a single event, we will focus on the sample space. When we start thinking of multiple events, the empty set might come into play. And just to reiterate: The empty set is an event. It is an impossible event, but it is still considered an event. So when you consider all possible events, the empty set is included. When you consider all possible outcomes, this is the sample space.

Now let’s look at at an example with an obvious hint at randomness:

Example 1.2

You roll a 6-sided die.

Example 1.2 Explanation

A 6-sided die has sides labelled 1-6. Thus, the sample space is \(S=\{1,2,3,4,5,6\}\) since we can land on any of the 6 sides. For every roll, only one of these outcomes occur, and that outcome is random.

An event can be any subgroup (more formally, subset) of the sample space, but not necessarily. Events can technically be outside of the sample space, but we often only consider events within the sample space.

If we can say our event is if we roll a 6, then our event is defined mathematically as \(\{6\}\). We can define a different event as rolling an odd numbered side, then our event is defined mathematically as \(\{1,3,5\}\). See TB pg 4 for more event examples.

We introduced subset, but here’s the definition:

Definition 1.3

Event A is a subset of event B, written \(A \subset B\). if every outcome in A is also an outcome in B.

Let’s work on defining different types of events within sample spaces:

Example 1.4

A student buys a book and opens it to a random page. They note the number of typographical errors on the page. Let’s define the sample space and discuss one potential event.

Example 1.4 Explanation

Since there must be 0 or more errors, and errors are counted with whole numbers, the sample space will be the set of nonnegative integers: \(S=\mathbb{Z}^{>=0}\). More plainly, \(S=\{0, 1, 2, 3, 4, ...\}\).

Here are a few possible events we can consider: (I invite you to think of others if you’d like)

event that a page contains 4 errors
event that a page contains at most 3 errors
event that a page contains more than 3 errors
event that a page contains an odd number of errors.

Definitions:

Definition 1.11

The union of events \(A\) and \(B\), denoted by \(A \cup B\), contains all outcomes that are in \(A\) or \(B\).

Definition 1.12

The intersection of events \(A\) and \(B\), denoted by \(A \cap B\), contains all outcomes that are both in \(A\) and \(B\).

1.2 Complements and DeMorgan’s Laws

Notes

There are a few things I’d like to address from the book:

There is a set of examples, 1.5 and 1.6, that talk about the birth of a baby. While this example is helpful for considering potential sample spaces, there is an oversight on the biology of births. The examples refer to the event of the sex assigned at birth (SAB). There are two issues in this example. First, SAB is considered binary in this example (I will get back to this). Second, the binary SAB is written as “boy” or “girl.” These labels have gender identity inherently attached to their meaning, and thus, the more accurate binary versions of these are “male” or “female,” respectively (“Assigned Sex at Birth,” n.d.). These are often denoted as “assigned male at birth” (AMAB) or “assigned female at birth” (AFAB). Going back the first issue, a binary representation of SAB does not cover all possible events. People can also be intersex, meaning their genitals, chromosomes, and/or reproductive organs are not exclusively AMAB or AFAB (“Intersex: What Is Intersex, Gender Identity, Intersex Surgery,” n.d.).

Extension to Example 1.5 and 1.6

You can go back to these examples (TB pg. 5) and work through them with the more accurate representation of sex at birth. Please use the possible outcomes of AFAB, AMAB, or intersex.

References

“Assigned Sex at Birth.” n.d. https://www.bmc.org/glossary-culture-transformation/assigned-sex-birth.

“Intersex: What Is Intersex, Gender Identity, Intersex Surgery.” n.d. https://my.clevelandclinic.org/health/articles/16324-intersex.