Lesson 5: So Many Questions
Lesson 5: So Many Questions [The Data Cycle: Ask Questions]
Objective
You will deepen your understanding of statistical questions.
Vocabulary
statistical questions
Essential Concepts
Lesson 5 Essential Concepts
Statistical questions address variability.
ATTENTION
For every single lesson:
Answer the questions in red font in your Introduction to Data Science (IDS) Journal.
Lesson
-
During the previous lesson, you were introduced to the Data Cycle. In your IDS Journal, list the 4 components of the Data Cycle.
-
You will continue to learn about what makes a question statistical. Recall the definition of a statistical question. Statistical questions are questions that:
- address variability
- can be answered with data.
-
A good way to determine if a question is a statistical question is to ask: Do we need to see the data to answer the question? The following are examples of non-statistical questions that can be modified to be statistical questions:
- Non-statistical: What is LeeAnn's favorite color? While this can be answered with data (by asking LeeAnn), it does not address variability, since LeeAnn presumably has only one favorite color.
- Statistical: What is the most common favorite color in our classroom? This can be answered by collecting data (asking each student his or her favorite color), and through recognizing that different students may have different answers.
- Non-statistical: Did anyone select red as a favorite color? Although the question can be answered with data, this question can be answered with a single observation. Statistical questions require looking at all of the data available (or a substantial portion) for a variable.
- Statistical: What proportion of people selected red as their favorite color? Now we need to examine everyone.
- Non-statistical: What is the airspeed velocity of an eagle? This is non-statistical because it assumes all eagles have the same velocity. A follow-up question, such as "African or European?" might point out that there is variation between species, but were we to collect data, we would see variation between individuals. This question could be rephrased into a statistical question.
- Statistical: What is the typical (or most common, or most extreme, etc.) airspeed velocity of a European Eagle?
-
The Statistical Questions Scenarios handout contains 4 different scenarios with a set of statistical questions. Read through Scenario #1:
-
Below you will find a student's responses to Scenario #1.
Do you agree or disagree with the student's assessment of statistical questions?
-
Go back to the Statistical Scenarios handout. Read Scenarios 2-4 and choose the scenario that most interests you. Using the definition of a statistical question, write in your IDS Journal what makes each question in the scenario you chose a statistical question.
-
In the next lesson, you will begin using the Data Cycle to learn about your food habits. To prepare for this, you should begin collecting the “Nutrition Facts” labels from foods/snacks you typically eat.
Reflection
What are the essential learnings you are taking away from this lesson?
Homework
Collect at least two cutouts of the “Nutrition Facts” labels of the snacks you typically eat (e.g., chips, yogurt, blended drinks, etc.).