Lesson 2: What Does Mean Mean?
Lesson 2: What Does Mean Mean?
Objective
You will learn that values that gather around the center of a distribution show the typical value. This value is also referred to as the mean, or average.
Vocabulary
measures of central tendency (or center), measures of variability (or spread), typical, mean, average, balancing point
Essential Concepts
Lesson 2 Essential Concepts
The center of a distribution is the "typical" value. One way of measuring the center is with the mean, which finds the balancing point of the distribution. The mean gives us the typical value, but does not tell the whole story. We need a way to measure the variability to understand how observations might differ from the typical value.
Lesson
-
Read the following definitions for measures of central tendency and measures of variability.
-
Measures of central tendency. A value that shows the tendency of quantitative data to gather around a central, or typical, value. Also known as measures of center.
-
Measures of variability. Values that show how much the quantitative data varies. Also known as measures of spread.
-
-
Then in your IDS Journal, paraphrase the definition as if you were explaining it to a 5th-grader:
-
You will be learning more about these measures and what they tell us about data as we progress through this unit.
-
Do you still remember how to compute the mean, or average?
-
In your IDS Journal, write a sentence or two explaining how to compute the mean, or average.
-
Another way to find the mean is to find the balancing point of a distribution. You will learn about the balancing point via the activity in Steps 7 & 8.
-
Using the IDS Balancing Point app, answer the questions in the Balancing Point handout.
Click on the document name to download a fillable copy of the Balancing Point handout (LMR_2.2b).
-
As you answer the questions in the in the Balancing Point handout, write any additional wonderings or thoughts you may have in your IDS Journal. It is very important to understand the idea that the mean of a distribution can be identified by finding its balancing point.
-
Consider the visualization below showing the results from the Personality Color survey from another class. Recall your own predominant and secondary colors from when you completed the survey.
-
Find the dotplot Blue (whether or not that was your predominant color when you took the survey). Refer to the picture to answer the following questions:
a. What do you think the typical Blue score is (predicted balancing point)?
b. Are the data roughly symmetric? Where is the balancing point of this distribution? Once a value is chosen, describe the location of the point in the dotplot in your IDS Journal.
-
Compute the mean Blue score for the entire class and compare this value to the your own prediction of the balancing point. Blue scores: 8, 7, 12, 18, 12, 19, 12, 14, 16, 10, 8, 14, 14, 13, 15, 14, 8, 5, 8, 9, 10, 14, 12, 10, 11.
You may not remember exactly how to compute the mean, so try to remember the general algorithm, or refer back to the responses from Step 5 above.
-
The formula for calculating the mean:
-
Now that you have calculated the mean for the Blue score, copy the formula in your IDS Journal. Label each symbol in the formula with a step in your algorithm for finding the mean, and write the meaning of the symbols in the formula in your IDS Journal.
For example, xi represents each individual data point and n represents the total number of observations.
-
Sketch the dotplot Blue in your IDS Journal. Indicate the location of the calculated mean on your sketch by drawing a vertical line at the value on the x-axis. Notice how close the mean value is to your predicted balancing point.
-
Your teacher may ask you to compute the mean score for your actual class's blue scores or another one of the personality colors.
-
During the next lesson, you will learn about another method that can be used for measuring the center of a distribution.
-
There is an even easier method of calculating the mean – using RStudio! The command RStudio uses to calculate the mean incorporates the algorithm of summing up all the data and dividing by the total number of observations. You will be able to use this command for quick calculations now.
If you have already "Exported, Downloaded, Imported" the class's Personality Color campaign data, you can simply use the exact command below to calculate the mean Blue score:
> mean(~blue, data = colors)
In general, the function can be denoted as follows:
> mean(~variable, data = datafile)
-
Write a short answer entry in your IDS Journal about how the mean value of a group of data could be used to easily describe complicated things. For example, instead of giving someone the entire class’s Blue scores, we could just tell him/her the mean score and he/she would have a general idea about the class.
Reflection
What are the essential learnings you are taking away from this lesson?
Homework
Complete the Mr. Jones's Mile Run Times handout and answer the questions below in your IDS Journal. You can practice finding the mean of distributions by determining a balancing point for the data. The mean values in #3 on the handout do NOT need to be exact.
-
What kind of plots did Mr. Jones create for his classes?
-
Where does each distribution balance? Find and label the balancing point of each distribution.
-
Based on the balancing points you found, what would you say the mean mile run time is for each class?
i. Period 1:
ii. Period 2:
iii. Period 3:
iv. Period 4:
Click on the document name to download a fillable copy of the Mr. Jones's Mile Rule Times handout (LMR_2.3).