
{"id":137,"date":"2021-12-09T22:32:08","date_gmt":"2021-12-09T22:32:08","guid":{"rendered":"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/"},"modified":"2025-08-28T00:20:50","modified_gmt":"2025-08-28T00:20:50","slug":"chapter-3","status":"publish","type":"chapter","link":"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/","title":{"raw":"Chapter 3: Measures of Central Tendency and Variability","rendered":"Chapter 3: Measures of Central Tendency and Variability"},"content":{"raw":"<div class=\"textbox textbox--sidebar textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Key Terms<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor099\"><span class=\"Hyperlink-underscore\">arithmetic mean<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor092\"><span class=\"Hyperlink-underscore\">central tendency<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor122\"><span class=\"Hyperlink-underscore\">degrees of freedom<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor115\"><span class=\"Hyperlink-underscore\">dispersion<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor117\"><span class=\"Hyperlink-underscore\">interquartile range (IQR)<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor101\"><span class=\"Hyperlink-underscore\">median<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor102\"><span class=\"Hyperlink-underscore\">mode<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor116\"><span class=\"Hyperlink-underscore\">range<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor121\"><span class=\"Hyperlink-underscore\">robustness<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor114\"><span class=\"Hyperlink-underscore\">spread<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor123\"><span class=\"Hyperlink-underscore\">standard deviation<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor118\"><span class=\"Hyperlink-underscore\">sum of squares<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor113\"><span class=\"Hyperlink-underscore\">variability<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor120\"><span class=\"Hyperlink-underscore\">variance<\/span><\/a><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<p data-start=\"195\" data-end=\"910\">As we turn from visualizing data to summarizing it with numbers, it\u2019s important to remember that descriptive statistics are not just mathematical tools\u2014they are also tools of social interpretation. Measures like the mean, median, and mode can shape how we understand inequality, fairness, and access. For example, reporting the <em data-start=\"523\" data-end=\"532\">average<\/em> income in a community may hide the fact that most families earn far less than a few wealthy outliers. By contrast, the <em data-start=\"652\" data-end=\"660\">median<\/em> provides a more equitable sense of the \u201ctypical\u201d experience. Likewise, measures of spread such as the range or standard deviation can reveal whether opportunities and resources are distributed evenly across groups or concentrated among only a few.<\/p>\r\n<p data-start=\"912\" data-end=\"1405\">From a social justice perspective, these concepts remind us that numbers are not neutral: they reflect the lived realities of people\u2019s lives. When we describe the center and variability of data, we are making choices about how to represent communities, whose voices are amplified, and whose experiences might be obscured. Learning to use these descriptive statistics critically allows us to uncover patterns of inequality and ensure that our analyses do not reproduce existing social biases.<\/p>\r\n\r\n\r\n<hr data-start=\"1407\" data-end=\"1410\" \/>\r\n<p class=\"Text-1st\">Now that we have visualized our data to understand its shape, we can begin with numerical analyses. The descriptive statistics presented in this chapter serve to describe the distribution of our data objectively and mathematically\u2014our first step into statistical analysis! The topics here will serve as the basis for everything we do in the rest of the course.<\/p>\r\n\r\n<h3 class=\"H1\"><a id=\"_idTextAnchor089\"><\/a>What Is Central Tendency?<\/h3>\r\n<p class=\"Text-1st\">What is central tendency, and why do we want to know the central tendency of a group of scores? Let us first try to answer these questions intuitively. Then we will proceed to a more formal discussion.<\/p>\r\n<p class=\"Text\">Imagine this situation: You are in a class with just four other students, and the five of you took a 5-point pop quiz. Today your instructor is walking around the room, handing back the quizzes. She stops at your desk and hands you your paper. Written in bold black ink on the front is \u201c3\/5.\u201d How do you react? Are you happy with your score of 3 or disappointed? How do you decide? You might calculate your percentage correct, realize it is 60%, and be appalled. But it is more likely that when deciding how to react to your performance, you will want additional information. What additional information would you like?<\/p>\r\n<p class=\"Text\">If you are like most students, you will immediately ask your classmates, \u201cWhat\u2019d ya get?\u201d and then ask the instructor, \u201cHow did the class do?\u201d In other words, the additional information you want is how your quiz score compares to other students\u2019 scores. You therefore understand the importance of comparing your score to the class distribution of scores. Should your score of 3 turn out to be among the higher scores, then you\u2019ll be pleased after all. On the other hand, if 3 is among the lower scores in the class, you won\u2019t be quite so happy.<\/p>\r\n<p class=\"Text\">This idea of comparing individual scores to a distribution of scores is fundamental to statistics. So let\u2019s explore it further, using the same example (the pop quiz you took with your four classmates). Three possible outcomes are shown in <a href=\"#_idTextAnchor090\"><span class=\"Fig-table-number-underscore\">Table 3.1<\/span><\/a>. They are labeled \u201cDataset A,\u201d \u201cDataset B,\u201d and \u201cDataset C.\u201d Which of the three datasets would make you happiest? In other words, in comparing your score with your fellow students\u2019 scores, in which dataset would your score of 3 be the most impressive?<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer122\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor090\"><\/a>Table 3.1.<\/span> Three possible datasets for the 5-point make-up quiz.<\/p>\r\n\r\n<table id=\"table019\" class=\"Foster-table\"><colgroup> <col class=\"_idGenTableRowColumn-44\" \/> <col class=\"_idGenTableRowColumn-12\" \/> <col class=\"_idGenTableRowColumn-12\" \/> <col class=\"_idGenTableRowColumn-14\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-2\">\r\n<p class=\"Table-col-hd\">Student<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-2\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Dataset A<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-2\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Dataset B<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Dataset C<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">You<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">Ahmed<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">2<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">Rosa<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">2<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">Tamika<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">2<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-11\">\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-2\">\r\n<p class=\"Table-body\">Luther<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<p class=\"Text\">In Dataset A, everyone\u2019s score is 3. This puts your score at the exact center of the distribution. You can draw satisfaction from the fact that you did as well as everyone else. But of course it cuts both ways: everyone else did just as well as you.<\/p>\r\n<p class=\"Text\">Now consider the possibility that the scores are described as in Dataset B. This is a depressing outcome even though your score is no different than the one in Dataset A. The problem is that the other four students had higher grades, putting yours below the center of the distribution.<\/p>\r\n<p class=\"Text\">Finally, let\u2019s look at Dataset C. This is more like it! All of your classmates score lower than you, so your score is above the center of the distribution.<\/p>\r\n<strong data-start=\"423\" data-end=\"470\">Social Justice Example (Income Inequality):<\/strong><br data-start=\"470\" data-end=\"473\" \/><em data-start=\"473\" data-end=\"997\">Consider annual incomes in a city: four families earn $35,000, $38,000, $40,000, and $42,000. A fifth family earns $1,000,000. The mean income is over $200,000, which suggests that the \u201caverage\u201d family is wealthy. But the median income\u2014the middle value, $40,000\u2014better reflects the experience of most families. In social justice work, this distinction matters. Reporting only the mean can mask inequality and give the false impression that everyone is doing well, when in fact most households struggle to meet basic needs.<\/em>\r\n<p class=\"Text\">Now let\u2019s change the example in order to develop more insight into the center of a distribution. <a href=\"#_idTextAnchor091\"><span class=\"Fig-table-number-underscore\">Figure 3.1<\/span><\/a> shows the results of an experiment on memory for chess positions. Subjects were shown a chess position and then asked to reconstruct it on an empty chess board. The number of pieces correctly placed was recorded. This was repeated for two more chess positions. The scores represent the total number of chess pieces correctly placed for the three chess positions. The maximum possible score was 89.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer123\" class=\"Side-legend\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor091\"><\/a>Figure 3.1.<\/span> Back-to-back stem-and-leaf display. The left side shows the memory scores of the non-players. The right side shows the scores of the tournament players. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/38\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Memory Scores Back-to-Back Stem and Leaf<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer124\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2021\/12\/Memory_Scores_Back-to-Back_Stem_and_Leaf-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Two groups are compared. On the left are people who don\u2019t play chess. On the right are people who play a great deal (tournament players). It is clear that the location of the center of the distribution for the non-players is much lower than the center of the distribution for the tournament players.<\/p>\r\n<p class=\"Text\">We\u2019re sure you get the idea now about the center of a distribution. It is time to move beyond intuition. We need a formal definition of the center of a distribution. In fact, we\u2019ll offer you three definitions! This is not just generosity on our part. There turn out to be (at least) three different ways of thinking about the center of a distribution, all of them useful in various contexts. In the remainder of this section we attempt to communicate the idea behind each concept. In the succeeding sections we will give statistical measures for these concepts of central tendency.<\/p>\r\n&nbsp;\r\n<h4 class=\"H2\">Definitions of Center<\/h4>\r\n<p class=\"Text-1st\">Now we explain the three measures of [pb_glossary id=\"635\"]<a id=\"_idTextAnchor092\"><\/a>[\/pb_glossary]<span class=\"key-term\">central tendency<\/span>: (1) the point on which a distribution will balance, (2) the value whose average absolute deviation from all the other values is minimized, and (3) the value whose squared deviation from all the other values is minimized.<\/p>\r\n\r\n<h5 class=\"H3\">Balance Scale<\/h5>\r\n<p class=\"Text-1st\">One definition of central tendency is the point at which the distribution is in balance. <a href=\"#_idTextAnchor093\"><span class=\"Fig-table-number-underscore\">Figure 3.2<\/span><\/a> shows the distribution of the five numbers 2, 3, 4, 9, 16 placed upon a balance scale. If each number weighs one pound, and is placed at its position along the number line, then it would be possible to balance them by placing a fulcrum at a particular point.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer125\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor093\"><\/a>Figure 3.2.<\/span> A balance scale. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/39\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Balance Scale<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer126\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Balance_Scale-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">For another example, consider the distribution shown in <a href=\"#_idTextAnchor094\"><span class=\"Fig-table-number-underscore\">Figure 3.3<\/span><\/a>. It is balanced by placing the fulcrum in the geometric middle.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer127\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor094\"><\/a>Figure 3.3.<\/span> A distribution balanced on the tip of a triangle. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/40\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Balanced Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer128\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Balanced_Distribution-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\"><a href=\"#_idTextAnchor095\"><span class=\"Fig-table-number-underscore\">Figure 3.4<\/span><\/a> illustrates that the same distribution can\u2019t be balanced by placing the fulcrum to the left of center.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer129\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor095\"><\/a>Figure 3.4.<\/span> The distribution is not balanced. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/41\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Unbalanced Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer130\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Unbalanced_Distribution-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\"><a href=\"#_idTextAnchor096\"><span class=\"Fig-table-number-underscore\">Figure 3.5<\/span><\/a> shows an asymmetric distribution. To balance it, we cannot put the fulcrum halfway between the lowest and highest values (as we did in <a href=\"#_idTextAnchor094\"><span class=\"Fig-table-number-underscore\">Figure 3.3<\/span><\/a>). Placing the fulcrum at the \u201chalf way\u201d point would cause it to tip towards the left.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer131\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor096\"><\/a>Figure 3.5.<\/span> An asymmetric distribution balanced on the tip of a triangle. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/42\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Asymmetric Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer132\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Asymmetric_Distribution-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<h5 class=\"H3\">Smallest Absolute Deviation<\/h5>\r\n<p class=\"Text-1st\">Another way to define the center of a distribution is based on the concept of the sum of the absolute deviations (differences). Consider the distribution made up of the five numbers 2, 3, 4, 9, 16. Let\u2019s see how far the distribution is from 10 (picking a number arbitrarily). <a href=\"#_idTextAnchor097\"><span class=\"Fig-table-number-underscore\">Table 3.2<\/span><\/a> shows the sum of the absolute deviations of these numbers from the number 10.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer133\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor097\"><\/a>Table 3.2.<\/span> An example of the sum of absolute deviations.<\/p>\r\n\r\n<table id=\"table020\" class=\"Foster-table\"><colgroup> <col class=\"_idGenTableRowColumn-45\" \/> <col class=\"_idGenTableRowColumn-46\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-12\">\r\n<p class=\"Table-col-hd\">Values<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd\">Absolute Deviations from 10<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\r\n<p class=\"Table-body\">8<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">7<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">6<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">9<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">16<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">6<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-8\">\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-12\">\r\n<p class=\"Table-body ParaOverride-4\"><span class=\"bold\">Sum<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\">\r\n<p class=\"Table-body\"><span class=\"bold\">28<\/span><\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<p class=\"Text\">The first row of the table shows that the absolute value of the difference between 2 and 10 is 8; the second row shows that the absolute difference between 3 and 10 is 7, and similarly for the other rows. When we add up the five absolute deviations, we get 28. So, the sum of the absolute deviations from 10 is 28. Likewise, the sum of the absolute deviations from 5 equals 3 + 2 + 1 + 4 + 11 = 21. So, the sum of the absolute deviations from 5 is smaller than the sum of the absolute deviations from 10. In this sense, 5 is closer, overall, to the other numbers than is 10.<\/p>\r\n<p class=\"Text\">We are now in a position to define a second measure of central tendency, this time in terms of absolute deviations. Specifically, according to our second definition, the center of a distribution is the number for which the sum of the absolute deviations is smallest. As we just saw, the sum of the absolute deviations from 10 is 28 and the sum of the absolute deviations from 5 is 21. Is there a value for which the sum of the absolute deviations is even smaller than 21? Yes. For these data, there is a value for which the sum of absolute deviations is only 20. See if you can find it.<\/p>\r\n\r\n<h5 class=\"H3\">Smallest Squared Deviation<\/h5>\r\n<p class=\"Text-1st\">We shall discuss one more way to define the center of a distribution. It is based on the concept of the sum of squared deviations (differences). Again, consider the distribution of the five numbers 2, 3, 4, 9, 16. <a href=\"#_idTextAnchor098\"><span class=\"Fig-table-number-underscore\">Table 3.3<\/span><\/a> shows the sum of the squared deviations of these numbers from the number\u00a010.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer134\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor098\"><\/a>Table 3.3.<\/span> An example of the sum of squared deviations.<\/p>\r\n\r\n<table id=\"table021\" class=\"Foster-table\"><colgroup> <col class=\"_idGenTableRowColumn-45\" \/> <col class=\"_idGenTableRowColumn-47\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-12\">\r\n<p class=\"Table-col-hd\">Values<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd\">Squared Deviations from 10<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\r\n<p class=\"Table-body\">64<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">49<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">9<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">16<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-8\">\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-12\">\r\n<p class=\"Table-body ParaOverride-4\"><span class=\"bold\">Sum<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\">\r\n<p class=\"Table-body\"><span class=\"bold\">186<\/span><\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<p class=\"Text\">The first row in the table shows that the squared value of the difference between 2 and 10 is 64; the second row shows that the squared difference between 3 and 10 is 49, and so forth. When we add up all these squared deviations, we get 186.<\/p>\r\n<p class=\"Text\">Changing the target from 10 to 5, we calculate the sum of the squared deviations from 5 as 9 + 4\u00a0+ 1 + 16 + 121 = 151. So, the sum of the squared deviations from 5 is smaller than the sum of the squared deviations from 10. Is there a value for which the sum of the squared deviations is even smaller than 151? Yes, it is possible to reach 134.8. Can you find the target number for which the sum of squared deviations is 134.8?<\/p>\r\n<p class=\"Text\">The target that minimizes the sum of squared deviations provides another useful definition of central tendency (the last one to be discussed in this section). It can be challenging to find the value that minimizes this sum.<\/p>\r\n\r\n<h3 class=\"H1\">Measures of Central Tendency<\/h3>\r\n<p class=\"Text-1st\">In the previous section we saw that there are several ways to define central tendency. This section defines the three most common measures of central tendency: the mean, the median, and the mode. The relationships among these measures of central tendency and the definitions given in the previous section will probably not be obvious to you.<\/p>\r\n<p class=\"Text\">This section gives only the basic definitions of the mean, median and mode. A further discussion of the relative merits and proper applications of these statistics is presented in a <a href=\"#_idTextAnchor104\"><span class=\"Hyperlink-underscore\">later section<\/span><\/a>.<\/p>\r\n\r\n<h4 class=\"H2\">Arithmetic Mean<\/h4>\r\n<p class=\"Text-1st\">The [pb_glossary id=\"634\"]<a id=\"_idTextAnchor099\"><\/a>[\/pb_glossary]<span class=\"key-term\">arithmetic mean<\/span>\u2014the sum of the numbers divided by the number of numbers\u2014is the most common measure of central tendency. The symbol \u201c<img class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/>\u201d (pronounced \u201cmew\u201d) is used for the mean of a population. The symbol <img class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/> is used for the mean of a sample. (In advanced statistics textbooks, the symbol <img class=\"_idGenObjectAttribute-33\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.1-xbar-2.png\" alt=\"ModAbove Upper X bar\" \/>, pronounced \u201cx bar,\u201d may be used to represent the mean of a sample.) The formula for <img class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/> is shown below:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-34\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.1-2.png\" alt=\"mu equals, Start-Frac, sigma-summation, Upper X, Over, Upper N, End-Frac\" \/><\/p>\r\n<p class=\"Text\">where <img class=\"_idGenObjectAttribute-35\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.15-SigX-2.png\" alt=\"sigma-summation Upper X\" \/> is the sum of all the numbers in the population and <img class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> is the number of numbers in the population.<\/p>\r\n<p class=\"Text\">The formula for <img class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/> is essentially identical:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-37\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-2.png\" alt=\"Upper M equals, Start-Frac, sigma-summation, Upper X, Over, n, End-Frac\" \/><\/p>\r\n<p class=\"Text\">where <img class=\"_idGenObjectAttribute-35\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.15-SigX-2.png\" alt=\"sigma-summation Upper X\" \/> is the sum of all the numbers in the sample and <span class=\"italic\">n <\/span>is the number of numbers in the sample. The only distinction between these two equations is whether we are referring to the population (in which case we use <img class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/> and <img class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/>) or a sample of that population (in which case we use <img class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/> and <span class=\"italic\">n<\/span>).<\/p>\r\n<p class=\"Text\">As an example, the mean of the numbers 1, 2, 3, 6, 8 is 20\/5 = 4 regardless of whether the numbers constitute the entire population or just a sample from the population.<\/p>\r\n<p class=\"Text\"><a href=\"#_idTextAnchor100\"><span class=\"Fig-table-number-underscore\">Figure 3.6<\/span><\/a> shows the number of touchdown (TD) passes thrown by each of the 31 teams in the National Football League in the 2000 season. The mean number of touchdown passes thrown is 20.45, as shown below.<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-38\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.4a-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">Although the arithmetic mean is not the only \u201cmean\u201d (there is also a geometric mean, a harmonic mean, and many others that are all beyond the scope of this course), it is by far the most commonly used. Therefore, if the term \u201cmean\u201d is used without specifying whether it is the arithmetic mean, the geometric mean, or some other mean, it is assumed to refer to the arithmetic mean.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer149\" class=\"Side-legend\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor100\"><\/a>Figure 3.6.<\/span> Number of touchdown passes. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/43\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Touchdown Passes Raw Data<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer150\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Touchdown_Passes_Raw_Data-4.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<h4 class=\"H2\">Median<\/h4>\r\n<p class=\"Text-1st\">The median is also a frequently used measure of central tendency. The [pb_glossary id=\"639\"]<a id=\"_idTextAnchor101\"><\/a>[\/pb_glossary]<span class=\"key-term\">median<\/span> is the midpoint of a distribution: the same number of scores is above the median as below it. For the data in <a href=\"#_idTextAnchor100\"><span class=\"Fig-table-number-underscore\">Figure 3.6<\/span><\/a>, there are 31 scores. The 16th highest score (which equals 20) is the median because there are 15 scores below the 16th score and 15 scores above the 16th score. The median can also be thought of as the 50th percentile.<\/p>\r\n<p class=\"Text\">When there is an odd number of numbers, the median is simply the middle number. For example, the median of 2, 4, and 7 is 4. When there is an even number of numbers, the median is the mean of the two middle numbers. Thus, the median of the numbers 2, 4, 7, 12 is:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-39\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.5-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">When there are numbers with the same values, each appearance of that value gets counted. For example, in the set of numbers 1, 3, 4, 4, 5, 8, and 9, the median is 4 because there are three numbers (1, 3, and 4) below it and three numbers (5, 8, and 9) above it. If we only counted 4 once, the median would incorrectly be calculated at 4.5 (4 + 5, divided by 2). When in doubt, writing out all of the numbers in order and marking them off one at a time from the top and bottom will always lead you to the correct answer.<\/p>\r\n\r\n<h4 class=\"H2\">Mode<\/h4>\r\n<p class=\"Text-1st\">The [pb_glossary id=\"640\"]<a id=\"_idTextAnchor102\"><\/a>[\/pb_glossary]<span class=\"key-term\">mode<\/span> is the most frequently occurring value in the dataset. For the data in <a href=\"#_idTextAnchor100\"><span class=\"Fig-table-number-underscore\">Figure 3.6<\/span><\/a>, the mode is 18 since more teams (4) had 18 touchdown passes than any other number of touchdown passes. With continuous data, such as response time measured to many decimals, the frequency of each value is one since no two scores will be exactly the same (see <a href=\"#_idTextAnchor012\"><span class=\"Hyperlink-underscore\">discussion of continuous variables<\/span><\/a>). Therefore the mode of continuous data is normally computed from a grouped frequency distribution. <a href=\"#_idTextAnchor103\"><span class=\"Fig-table-number-underscore\">Table 3.4<\/span><\/a> shows a grouped frequency distribution for a set of credit scores. Since the interval with the highest frequency is 600 to 700, the mode is the middle of that interval (650). Although the mode is not frequently used for continuous data, it is nevertheless an important measure of central tendency as it is the only measure we can use on qualitative or categorical data.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer152\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor103\"><\/a>Table 3.4.<\/span> Grouped frequency distribution.<\/p>\r\n\r\n<table id=\"table022\" class=\"Foster-table\" style=\"width: 404px\"><colgroup> <col class=\"_idGenTableRowColumn-48\" \/> <col class=\"_idGenTableRowColumn-40\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-13\" style=\"width: 203.75px;height: 17px\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Range<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\" style=\"width: 201.75px;height: 17px\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Frequency<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-1\" style=\"width: 203.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">500 to 600<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"width: 201.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-2\" style=\"width: 203.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">600 to 700<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"width: 201.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">6<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-2\" style=\"width: 203.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">700 to 800<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"width: 201.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">5<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-2\" style=\"width: 203.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">800 to 900<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"width: 201.75px;height: 17px\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<h4 class=\"H2\"><a id=\"_idTextAnchor104\"><\/a>More on the Mean and Median<\/h4>\r\n<p class=\"Text-1st\">In the section <a href=\"#_idTextAnchor089\"><span class=\"Hyperlink-underscore\">What Is Central Tendency?<\/span><\/a>, we saw that the center of a distribution could be defined three ways: (1) the point on which a distribution would balance, (2) the value whose average absolute deviation from all the other values is minimized, and (3) the value whose squared deviation from all the other values is minimized. The mean is the point on which a distribution would balance, the median is the value that minimizes the sum of absolute deviations, and the mean is the value that minimizes the sum of the squared deviations.<\/p>\r\n<p class=\"Text\"><a href=\"#_idTextAnchor105\"><span class=\"Fig-table-number-underscore\">Table 3.5<\/span><\/a> shows the absolute and squared deviations of the numbers 2, 3, 4, 9, and 16 from their median of 4 and their mean of 6.8. You can see that the sum of absolute deviations from the median (20) is smaller than the sum of absolute deviations from the mean (22.8). On the other hand, the sum of squared deviations from the median (174) is larger than the sum of squared deviations from the mean (134.8).<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer153\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor105\"><\/a>Table 3.5.<\/span> Absolute and squared deviations from the median of 4 and the mean of 6.8.<\/p>\r\n\r\n<table id=\"table023\" class=\"Foster-table\"><colgroup> <col class=\"_idGenTableRowColumn-49\" \/> <col class=\"_idGenTableRowColumn-50\" \/> <col class=\"_idGenTableRowColumn-50\" \/> <col class=\"_idGenTableRowColumn-51\" \/> <col class=\"_idGenTableRowColumn-12\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-19\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Value<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Absolute Deviation from Median<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Absolute Deviation from Mean<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Squared Deviation from Median<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd ParaOverride-4\">Squared Deviation from Mean<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">4.8<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\r\n<p class=\"Table-body\">23.04<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">3.8<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">14.44<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">2.8<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">7.84<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">9<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">2.2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">25<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">4.84<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">16<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">12<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">9.2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">144<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">84.64<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-8\">\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\r\n<p class=\"Table-body ParaOverride-4\"><span class=\"bold\">Total<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\r\n<p class=\"Table-body\"><span class=\"bold\">20<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\r\n<p class=\"Table-body\"><span class=\"bold\">22.8<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\r\n<p class=\"Table-body\"><span class=\"bold\">174<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\">\r\n<p class=\"Table-body\"><span class=\"bold\">134.80<\/span><\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<p class=\"Text\"><a href=\"#_idTextAnchor106\"><span class=\"Fig-table-number-underscore\">Figure 3.7<\/span><\/a> shows that the distribution balances at the mean of 6.8 and not at the median of 4. The relative advantages and disadvantages of the mean and median are discussed in the section <a href=\"#_idTextAnchor107\"><span class=\"Hyperlink-underscore\">Comparing Measures of Central Tendency<\/span><\/a>.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer154\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor106\"><\/a>Figure 3.7.<\/span> The distribution balances at the mean of 6.8 and not at the median of 4.0. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/44\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Balance Scale Numbered<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer155\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Balance_Scale_Numbered-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">When a distribution is symmetric, then the mean and the median are the same. Consider the following distribution: 1, 3, 4, 5, 6, 7, 9. The mean and median are both 5. The mean, median, and mode are identical in the bell-shaped normal distribution.<\/p>\r\n\r\n<h4 class=\"H2\"><a id=\"_idTextAnchor107\"><\/a>Comparing Measures of Central Tendency<\/h4>\r\n<p class=\"Text-1st\">How do the various measures of central tendency compare with each other? For symmetric distributions, the mean and median are the same value, as is the mode except in bimodal distributions. However, differences among the measures occur with skewed distributions. <a href=\"#_idTextAnchor108\"><span class=\"Fig-table-number-underscore\">Figure 3.8<\/span><\/a> shows the distribution of 642 scores on an introductory psychology test. Notice this distribution has a slight positive skew.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer156\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor108\"><\/a>Figure 3.8.<\/span> A distribution with a positive skew. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/45\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Histogram<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer157\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Histogram-4.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Measures of central tendency are shown in <a href=\"#_idTextAnchor109\"><span class=\"Fig-table-number-underscore\">Table 3.6<\/span><\/a>. Notice they do not differ greatly, with the exception that the mode is considerably lower than the other measures. When distributions have a positive skew, the mean is typically higher than the median, although it may not be in bimodal distributions. For these data, the mean of 91.58 is higher than the median of 90. This pattern holds true for any skew: the mode will remain at the highest point in the distribution, the median will be pulled slightly out into the skewed tail (the longer end of the distribution), and the mean will be pulled the farthest out. Thus, the mean is more sensitive to skew than the median or mode, and in cases of extreme skew, the mean may no longer be appropriate to use.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer158\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor109\"><\/a>Table 3.6.<\/span> Measures of central tendency for the test scores.<\/p>\r\n\r\n<table id=\"table024\" class=\"Foster-table\"><colgroup> <col class=\"_idGenTableRowColumn-39\" \/> <col class=\"_idGenTableRowColumn-39\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd\">Measure<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd\">Value<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\r\n<p class=\"Table-body\">Mode<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\r\n<p class=\"Table-body\">84.00<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">Median<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">90.00<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-11\">\r\n<td class=\"Foster-table Table-body-last Table-body\">\r\n<p class=\"Table-body\">Mean<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\">\r\n<p class=\"Table-body\">91.58<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer159\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<p class=\"Text\"><a href=\"#_idTextAnchor111\"><span class=\"Fig-table-number-underscore\">Table 3.7<\/span><\/a> shows the measures of central tendency for these data. The large skew results in very different values for these measures. No single measure of central tendency is sufficient for data such as these. Fortunately, there is no need to summarize a distribution with a single number. When the various measures differ, our opinion is that you should report the mean and median. Sometimes it is worth reporting the mode as well. In the media, the median is usually reported to summarize the center of skewed distributions. You will hear about median salaries and median prices of houses sold, etc. This is better than reporting only the mean, but it would be informative to hear more statistics.<\/p>\r\n\r\n<h3 class=\"H1\">Spread and Variability<\/h3>\r\n<strong data-start=\"1439\" data-end=\"1480\">Social Justice Example (Wealth Gaps):<\/strong><br data-start=\"1480\" data-end=\"1483\" \/>Two communities may have the same median household income of $50,000. But in Community A, most families cluster closely around that figure, while in Community B, a few very wealthy households contrast sharply with many families living far below $50,000. Both communities share the same median, but their spreads (variability) are very different. From a social justice lens, measures of variability\u2014like range or standard deviation\u2014help uncover whether economic opportunity is equitably shared or concentrated among a privileged few.\r\n<p class=\"Text-1st\">Variability refers to how \u201cspread out\u201d a group of scores is. To see what we mean by spread out, consider the graphs in <a href=\"#_idTextAnchor112\"><span class=\"Fig-table-number-underscore\">Figure 3.10<\/span><\/a>. These graphs represent the scores on two quizzes. The mean score for each quiz is 7.0. Despite the equality of means, you can see that the distributions are quite different. Specifically, the scores on Quiz 1 are more densely packed and those on Quiz 2 are more spread out. The differences among students were much greater on Quiz 2 than on Quiz 1.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer162\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor112\"><\/a>Figure 3.10.<\/span> Bar charts of Quizzes 1 and 2. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/47\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Quiz Score Bar Charts<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer163\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Quiz_Score_Bar_Charts-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">The terms [pb_glossary id=\"646\"]<a id=\"_idTextAnchor113\"><\/a>[\/pb_glossary]<span class=\"key-term\">variability<\/span>, <a id=\"_idTextAnchor114\"><\/a><span class=\"key-term\">spread<\/span>, and <a id=\"_idTextAnchor115\"><\/a><span class=\"key-term\">dispersion<\/span> are synonyms and refer to how spread out a distribution is. Just as in the section on central tendency where we discussed measures of the center of a distribution of scores, in this section we will discuss measures of the variability of a distribution. There are three frequently used measures of variability: range, variance, and standard deviation. In the next few paragraphs, we will look at each of these measures of variability in more detail.<\/p>\r\n\r\n<h4 class=\"H2\">Range<\/h4>\r\n<p class=\"Text-1st\">The range is the simplest measure of variability to calculate, and one you have probably encountered many times in your life. The [pb_glossary id=\"641\"]<a id=\"_idTextAnchor116\"><\/a>[\/pb_glossary]<span class=\"key-term\">range<\/span> is simply the highest score minus the lowest score. Let\u2019s take a few examples. What is the range of the following group of numbers: 10, 2, 5, 6, 7, 3, 4? Well, the highest number is 10, and the lowest number is 2, so 10 \u2212 2 = 8. The range is 8. Let\u2019s take another example. Here\u2019s a dataset with 10 numbers: 99, 45, 23, 67, 45, 91, 82, 78, 62, 51. What is the range? The highest number is 99 and the lowest number is 23, so 99 \u2212 23 = 76; the range is 76. Now consider the two quizzes shown in <a href=\"#_idTextAnchor112\"><span class=\"Fig-table-number-underscore\">Figure 3.10<\/span><\/a>. On Quiz 1, the lowest score is 5 and the highest score is 9. Therefore, the range is 4. The range on Quiz 2 was larger: the lowest score was 4 and the highest score was 10. Therefore the range is 6.<\/p>\r\n<p class=\"Text\">The problem with using range is that it is extremely sensitive to outliers, and one number far away from the rest of the data will greatly alter the value of the range. For example, in the set of numbers 1, 3, 4, 4, 5, 8, and 9, the range is 8 (9 \u2212 1). However, if we add a single person whose score is nowhere close to the rest of the scores, say, 20, the range more than doubles from 8 to 19.<\/p>\r\n<strong data-start=\"2361\" data-end=\"2415\">Social Justice Example (Wage Gaps by Gender\/Race):<\/strong><br data-start=\"2415\" data-end=\"2418\" \/><em data-start=\"2418\" data-end=\"2875\">Suppose we compare wages in two workplaces. In Workplace A, salaries range from $40,000 to $60,000. In Workplace B, most employees earn between $40,000 and $50,000, but one executive earns $300,000. The range in Workplace B is much larger, and the presence of that single outlier masks the fact that women and people of color may be clustered at the lower end of salaries. This shows how looking only at range\u2014or even mean\u2014can obscure systemic inequities.<\/em>\r\n<h5 class=\"H3\">Interquartile Range<\/h5>\r\n<p class=\"Text-1st\">The [pb_glossary id=\"638\"]<a id=\"_idTextAnchor117\"><\/a>[\/pb_glossary]<span class=\"key-term\">interquartile range (IQR)<\/span> is the range of the middle 50% of the scores in a distribution and is sometimes used to communicate where the bulk of the data in the distribution are located. It is computed as follows:<\/p>\r\n<p class=\"Text ParaOverride-4\">IQR = 75th percentile \u2212 25th percentile<\/p>\r\n<p class=\"Text\">For Quiz 1, the 75th percentile is 8 and the 25th percentile is 6. The interquartile range is therefore 2. For Quiz 2, which has greater spread, the 75th percentile is 9, the 25th percentile is 5, and the interquartile range is 4. Recall that in the discussion of box plots, the 75th percentile was called the upper hinge and the 25th percentile was called the lower hinge. Using this terminology, the interquartile range is referred to as the H-spread.<\/p>\r\n\r\n<h4 class=\"H2\">Sum of Squares<\/h4>\r\n<p class=\"Text-1st\">Variability can also be defined in terms of how close the scores in the distribution are to the middle of the distribution. Using the mean as the measure of the middle of the distribution, we can see how far, on average, each data point is from the center. The data on community volunteer hours is shown in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a>.<\/p>\r\n<p class=\"Text\">There are a few things to note about how <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a> is formatted. The raw data scores (<span class=\"italic\">X<\/span>) are always placed in the left-most column. This column is then summed at the bottom (<span class=\"Symbol-sigma CharOverride-10\">\u03a3<\/span><span class=\"italic\">X<\/span>) to facilitate calculating the mean by dividing the sum of <span class=\"italic\">X<\/span> values by the number of scores in the table (<span class=\"italic\">N<\/span>). The mean score is 7.0 (<span class=\"Symbol-sigma CharOverride-10\">\u03a3<\/span><span class=\"italic\">X<\/span>\/<span class=\"italic\">N <\/span>= 140\/20 = 7.0). Once you have the mean, you can easily work your way down the second column calculating the deviation scores (<img class=\"_idGenObjectAttribute-40\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.6-2.png\" alt=\"\" \/>), representing how far each score deviates from the mean, here calculated as the score (<span class=\"italic\">X<\/span> value) minus 7. This column is also summed and has a very important property: it will always sum to 0, or close to zero if you have rounding error due to many decimal places (<span class=\"Symbol-sigma CharOverride-10\">\u03a3<\/span>(<img class=\"_idGenObjectAttribute-40\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.6-2.png\" alt=\"\" \/>) = 0). This step is used as a check on your math to make sure you haven\u2019t made a mistake. If this column sums to 0, you can move on to filling in the third column, which is composed of the squared deviation scores. The deviation scores are squared to remove negative values and appear in the third column <img class=\"_idGenObjectAttribute-41\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.7-2.png\" alt=\"\" \/>. When these values are summed, you have the sum of the squared deviations, or the [pb_glossary id=\"645\"]<a id=\"_idTextAnchor118\"><\/a>[\/pb_glossary]<span class=\"key-term\">sum of squares<\/span> (<span class=\"italic\">SS<\/span>), calculated with the formula <img class=\"_idGenObjectAttribute-42\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8-2.png\" alt=\"\" \/>.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer172\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor119\"><\/a>Table 3.8.<\/span> Calculation of variance for community volunteer hours.<\/p>\r\n\r\n<table id=\"table026\" class=\"Foster-table\"><colgroup> <col class=\"_idGenTableRowColumn-53\" \/> <col class=\"_idGenTableRowColumn-54\" \/> <col class=\"_idGenTableRowColumn-46\" \/> <col class=\"_idGenTableRowColumn-55\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-16\">\r\n<p class=\"Table-col-hd ParaOverride-4\"><span class=\"bold-italic\">X<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-16\">\r\n<p class=\"Table-col-hd ParaOverride-4\"><span class=\"bold-italic\">X<\/span> \u2212 <span class=\"bold-italic\">M<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-16\">\r\n<p class=\"Table-col-hd ParaOverride-4\">(<span class=\"bold-italic\">X<\/span> \u2212 <span class=\"bold-italic\">M<\/span>)<span class=\"superscript _idGenCharOverride-1\">2<\/span><\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd ParaOverride-4\"><span class=\"bold-italic\">X<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span><\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">9<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-1\">\r\n<p class=\"Table-body ParaOverride-4\">81<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">9<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">81<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">9<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">2<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">81<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">8<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">64<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">8<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">64<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">8<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">64<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">8<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">64<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">7<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">49<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">7<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">49<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">7<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">49<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">7<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">49<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">7<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">49<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">6<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">\u22121<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">6<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">\u22121<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">6<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">\u22121<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">6<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">\u22121<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">6<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">\u22121<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">6<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">\u22121<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">\u22122<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">25<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">\u22122<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">4<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\r\n<p class=\"Table-body ParaOverride-4\">25<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-56\">\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-16\">\r\n<p class=\"Table-body ParaOverride-14\"><span class=\"Symbol-sigma-Table CharOverride-10\">\u03a3<\/span><span class=\"italic\">X<\/span> = 140<\/p>\r\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\"><img class=\"_idGenObjectAttribute-43\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8f-2.png\" alt=\"\" \/><\/span> = 19,600<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-16\">\r\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\">\u03a3<\/span>(<span class=\"CharOverride-11\"><img class=\"_idGenObjectAttribute-44\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.61-2.png\" alt=\"\" \/><\/span>) = 0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-17\">\r\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\">\u03a3<\/span><span class=\"CharOverride-11\"><img class=\"_idGenObjectAttribute-45\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.71-2.png\" alt=\"\" \/><\/span> = 30<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-18\">\r\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\"><img class=\"_idGenObjectAttribute-46\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8d-2.png\" alt=\"\" \/><\/span> = 1,010<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-47\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8c-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">The preceding formula is called the definitional formula, as it shows the logic behind the sum of squared deviations calculation. As mentioned earlier, there can be rounding errors in calculating the deviation scores. Also, when the set of scores is large, calculating the deviation scores, squaring the scores, and then summing those values can be tedious. To simplify the sum of squares calculation, the computational formula is used instead. The computational formula is as follows:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-48\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8b-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">The last column in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a> represents the <span class=\"italic\">X<\/span> values squared and then summed\u2014<img class=\"_idGenObjectAttribute-49\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8d1-2.png\" alt=\"\" \/>. At the bottom of the first column, the <img class=\"_idGenObjectAttribute-35\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8e-2.png\" alt=\"\" \/> value is squared\u00ad\u2014<img class=\"_idGenObjectAttribute-50\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8f1-2.png\" alt=\"\" \/>. These are the values used in the computational formula for the sum of squares. As you can see in the calculation below, the <span class=\"italic\">SS<\/span> value is the same for both the definitional formula and the computational formula:<\/p>\r\n<img class=\"alignnone size-full wp-image-125\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/MicrosoftTeams-image-16.png\" alt=\"\" width=\"438\" height=\"69\" \/>\r\n<p class=\"Text\">As we will see, the sum of squares appears again and again in different formulas\u2014it is a very important value, and using the <span class=\"italic\">X<\/span> and <img class=\"_idGenObjectAttribute-52\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8h-2.png\" alt=\"\" \/> columns in this table makes it simple to calculate the <span class=\"italic\">SS<\/span> without error.<\/p>\r\n\r\n<h4 class=\"H2\">Variance<\/h4>\r\n<p class=\"Text-1st\">Now that we have the sum of squares calculated, we can use it to compute our formal measure of average distance from the mean\u2014the variance. The [pb_glossary id=\"647\"]<a id=\"_idTextAnchor120\"><\/a>[\/pb_glossary]<span class=\"key-term\">variance<\/span> is defined as the average squared difference of the scores from the mean. We square the deviation scores because, as we saw in the second column of <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a>, the sum of raw deviations is always 0, and there\u2019s nothing we can do mathematically without changing that.<\/p>\r\n<p class=\"Text\">The population parameter for variance is <span class=\"Symbol\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> (\u201csigma-squared\u201d) and is calculated as:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-53\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.11a-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">We can use the value we previously calculated for <span class=\"italic\">SS<\/span> in the numerator, then simply divide that value by <img class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> to get variance. If we assume that the values in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table\u00a03.8<\/span><\/a> represent the full population, then we can take our value of sum of squares and divide it by <img class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> to get our population variance:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-54\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.12-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">So, on average, scores in this population are 1.5 squared units away from the mean. This measure of spread exhibits much more [pb_glossary id=\"642\"]<a id=\"_idTextAnchor121\"><\/a>[\/pb_glossary]<span class=\"key-term\">robustness<\/span> (a term used by statisticians to mean resilience or resistance to outliers) than the range, so it is a much more useful value to compute. Additionally, as we will see in future chapters, variance plays a central role in inferential statistics.<\/p>\r\n<p class=\"Text\">The sample statistic used to estimate the variance is <span class=\"italic\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> (\u201c<span class=\"italic\">s<\/span>-squared\u201d):<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-55\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.13-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">This formula is very similar to the formula for the population variance with one change: we now divide by <span class=\"italic\">N <\/span>\u2212 1 instead of <img class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/>. The value <span class=\"italic\">N <\/span>\u2212 1 has a special name: the [pb_glossary id=\"636\"]<a id=\"_idTextAnchor122\"><\/a>[\/pb_glossary]<span class=\"key-term\">degrees of freedom<\/span> (abbreviated as <span class=\"italic\">d<\/span><span class=\"italic\">f<\/span>). You don\u2019t need to understand in depth what degrees of freedom are (essentially they account for the fact that we have to use a sample statistic to estimate the mean [<img class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/>] before we estimate the variance) in order to calculate variance, but knowing that the denominator is called <span class=\"italic\">d<\/span><span class=\"italic\">f <\/span>provides a nice shorthand for the variance formula:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-56\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.13a-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">Going back to the values in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a> and treating those scores as a sample, we can estimate the sample variance as:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-57\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.14-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">Notice that this value is slightly larger than the one we calculated when we assumed these scores were the full population. This is because our value in the denominator is slightly smaller, making the final value larger. In general, as your sample size <img class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> gets bigger, the effect of subtracting 1 becomes less and less. Comparing a sample size of 10 to a sample size of 1000; 10 \u2212 1 = 9, or 90% of the original value, whereas 1000 \u2212 1 = 999, or 99.9% of the original value. Thus, larger sample sizes will bring the estimate of the sample variance closer to that of the population variance. This is a key idea and principle in statistics that we will see over and over again: larger sample sizes better reflect the population.<\/p>\r\n\r\n<h4 class=\"H2\">Standard Deviation<\/h4>\r\n<p class=\"Text-1st\">The [pb_glossary id=\"644\"]<a id=\"_idTextAnchor123\"><\/a>[\/pb_glossary]<span class=\"key-term\">standard deviation<\/span> is simply the square root of the variance. This is a useful and interpretable statistic because taking the square root of the variance (recalling that variance is the average squared difference) puts the standard deviation back into the original units of the measure we used. Thus, when reporting descriptive statistics in a study, scientists virtually always report mean and standard deviation. Standard deviation is therefore the most commonly used measure of spread for our purposes, representing the average distance of the scores from the mean.<\/p>\r\n<p class=\"Text\">The population parameter for standard deviation is <span class=\"Symbol\">s<\/span> (\u201csigma\u201d), which, intuitively, is the square root of the variance parameter <span class=\"Symbol\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> (occasionally, the symbols work out nicely that way). The formula is simply the formula for variance under a square root sign:<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-58\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.15-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">The sample statistic follows the same conventions and is given as <span class=\"italic\">s<\/span> in mathematical formulas. (Note that in American Psychological Association [APA] format for reporting results, sample standard deviation is reported using the abbreviation <span class=\"italic\">SD<\/span>.)<\/p>\r\n<p class=\"Equation\"><img class=\"_idGenObjectAttribute-59\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.17-2.png\" alt=\"\" \/><\/p>\r\n<p class=\"Text\">The standard deviation is an especially useful measure of variability when the distribution is normal or approximately normal because the proportion of the distribution within a given number of standard deviations from the mean can be calculated. For example, 68% of the distribution is within one standard deviation (above and below) of the mean and approximately 95% of the distribution is within two standard deviations of the mean, as shown in <a href=\"#_idTextAnchor124\"><span class=\"Fig-table-number-underscore\">Figure 3.11<\/span><\/a>. Therefore, if you had a normal distribution with a mean of 50 and a standard deviation of 10, then 68% of the distribution would be between 50 \u2212 10 = 40 and 50 + 10 = 60. Similarly, about 95% of the distribution would be between 50 \u2212 2 \u00d7 10 = 30 and 50\u00a0+ 2 \u00d7 10 = 70.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer192\" class=\"Side-legend\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor124\"><\/a>Figure 3.11.<\/span> Percentages of the normal distribution. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/48\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Normal Distribution Percentages<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer193\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Normal_Distribution_Percentages-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\"><a href=\"#_idTextAnchor125\"><span class=\"Fig-table-number-underscore\">Figure 3.12<\/span><\/a> shows two normal distributions. The red (left-most) distribution has a mean of 40 and a standard deviation of 5; the blue (right-most) distribution has a mean of 60 and a standard deviation of 10. For the red distribution, 68% of the distribution is between 45 and 55; for the blue distribution, 68% is between 50 and 70. Notice that as the standard deviation gets smaller, the distribution becomes much narrower, regardless of where the center of the distribution (mean) is. <a href=\"#_idTextAnchor126\"><span class=\"Fig-table-number-underscore\">Figure 3.13<\/span><\/a> presents several more examples of this effect.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer194\" class=\"Side-legend\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor125\"><\/a>Figure 3.12.<\/span> Normal distributions with standard deviations of 5 and 10. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/49\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Normal Distributions with Standard Deviations<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer195\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Normal_Distributions_with_Standard_Deviations-2.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer196\" class=\"Legend-below\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor126\"><\/a>Figure 3.13.<\/span> Differences between two datasets. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/50\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Location and Variability Differences<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div><\/div>\r\n<div><\/div>\r\n<div><\/div>\r\n<div id=\"_idContainer197\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Location_and_Variability_Differences-2.png\" alt=\"\" \/><\/div>\r\n<div><strong data-start=\"2361\" data-end=\"2415\">Social Justice Example (Wage Gaps by Gender\/Race):<\/strong><br data-start=\"2415\" data-end=\"2418\" \/>Suppose we compare wages in two workplaces. In Workplace A, salaries range from $40,000 to $60,000. In Workplace B, most employees earn between $40,000 and $50,000, but one executive earns $300,000. The range in Workplace B is much larger, and the presence of that single outlier masks the fact that women and people of color may be clustered at the lower end of salaries. This shows how looking only at range\u2014or even mean\u2014can obscure systemic inequities.<\/div>\r\n<\/div>\r\n<h3 class=\"H1\">Exercises<\/h3>\r\n<ol>\r\n \t<li class=\"Numbered-list-Exercises-1st\">If the mean time to respond to a stimulus is much higher than the median time to respond, what can you say about the shape of the distribution of response times?<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Compare the mean, median, and mode in terms of their sensitivity to extreme scores.<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Your younger brother comes home one day after taking a science test. He says someone at school told him that \u201c60% of the students in the class scored above the median test grade.\u201d What is wrong with this statement? What if he had said \u201c60% of the students scored above the mean?\u201d<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Make up three datasets with five numbers each that have:\r\n<ol>\r\n \t<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">the same mean but different standard deviations.<\/li>\r\n \t<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">the same mean but different medians.<\/li>\r\n \t<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">the same median but different means.<\/li>\r\n<\/ol>\r\n<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Compute the population mean and population standard deviation for the following scores (remember to use the sum of squares table): 5, 7, 8, 3, 4, 4, 2, 7, 1, 6<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">For the following problem, use the following scores: 5, 8, 8, 8, 7, 8, 9, 12, 8, 9, 8, 10, 7, 9, 7, 6, 9, 10, 11, 8\r\n<ol>\r\n \t<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">Create a histogram of these data. What is the shape of this histogram?<\/li>\r\n \t<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">How do you think the three measures of central tendency will compare to each other in this dataset?<\/li>\r\n \t<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">Compute the sample mean, the median, and the mode<\/li>\r\n \t<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">Draw and label lines on your histogram for each of the above values. Do your results match your predictions?<\/li>\r\n<\/ol>\r\n<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Compute the range, sample variance, and sample standard deviation for the following scores: 25,\u00a036, 41, 28, 29, 32, 39, 37, 34, 34, 37, 35, 30, 36, 31, 31<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Using the same values from Problem 7, calculate the range, sample variance, and sample standard deviation, but this time include 65 in the list of values. How did each of the three values change?<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Two normal distributions have exactly the same mean, but one has a standard deviation of 20 and the other has a standard deviation of 10. How would the shapes of the two distributions compare?<\/li>\r\n \t<li class=\"Numbered-list-Exercises\">Compute the sample mean and sample standard deviation for the following scores: \u22128, \u22124, \u22127, \u22126, \u22128, \u22125, \u22127, \u22129, \u22122, 0<\/li>\r\n<\/ol>\r\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<h3 class=\"H1\">Answers to Odd-Numbered Exercises<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ul>\r\n \t<li>1) If the mean is higher, that means it is farther out into the right-hand tail of the distribution. Therefore, we know this distribution is positively skewed.<\/li>\r\n \t<li>3) The median is defined as the value with 50% of scores above it and 50% of scores below it; therefore, 60% of score cannot fall above the median. If 60% of scores fall above the mean, that would indicate that the mean has been pulled down below the value of the median, which means that the distribution is negatively skewed<\/li>\r\n \t<li>5) <img class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/> = 4.80, <span class=\"Symbol\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> = 2.36<\/li>\r\n \t<li>7) Range = 16, <span class=\"italic\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> = 18.40, <span class=\"italic\">s<\/span> = 4.29<\/li>\r\n \t<li>9) If both distributions are normal, then they are both symmetrical, and having the same mean causes them to overlap with one another. The distribution with the standard deviation of 10 will be narrower than the other distribution.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\n&nbsp;","rendered":"<div class=\"textbox textbox--sidebar textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Key Terms<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor099\"><span class=\"Hyperlink-underscore\">arithmetic mean<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor092\"><span class=\"Hyperlink-underscore\">central tendency<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor122\"><span class=\"Hyperlink-underscore\">degrees of freedom<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor115\"><span class=\"Hyperlink-underscore\">dispersion<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor117\"><span class=\"Hyperlink-underscore\">interquartile range (IQR)<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor101\"><span class=\"Hyperlink-underscore\">median<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor102\"><span class=\"Hyperlink-underscore\">mode<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor116\"><span class=\"Hyperlink-underscore\">range<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor121\"><span class=\"Hyperlink-underscore\">robustness<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor114\"><span class=\"Hyperlink-underscore\">spread<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor123\"><span class=\"Hyperlink-underscore\">standard deviation<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor118\"><span class=\"Hyperlink-underscore\">sum of squares<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor113\"><span class=\"Hyperlink-underscore\">variability<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor120\"><span class=\"Hyperlink-underscore\">variance<\/span><\/a><\/p>\n<\/div>\n<\/div>\n<p data-start=\"195\" data-end=\"910\">As we turn from visualizing data to summarizing it with numbers, it\u2019s important to remember that descriptive statistics are not just mathematical tools\u2014they are also tools of social interpretation. Measures like the mean, median, and mode can shape how we understand inequality, fairness, and access. For example, reporting the <em data-start=\"523\" data-end=\"532\">average<\/em> income in a community may hide the fact that most families earn far less than a few wealthy outliers. By contrast, the <em data-start=\"652\" data-end=\"660\">median<\/em> provides a more equitable sense of the \u201ctypical\u201d experience. Likewise, measures of spread such as the range or standard deviation can reveal whether opportunities and resources are distributed evenly across groups or concentrated among only a few.<\/p>\n<p data-start=\"912\" data-end=\"1405\">From a social justice perspective, these concepts remind us that numbers are not neutral: they reflect the lived realities of people\u2019s lives. When we describe the center and variability of data, we are making choices about how to represent communities, whose voices are amplified, and whose experiences might be obscured. Learning to use these descriptive statistics critically allows us to uncover patterns of inequality and ensure that our analyses do not reproduce existing social biases.<\/p>\n<hr data-start=\"1407\" data-end=\"1410\" \/>\n<p class=\"Text-1st\">Now that we have visualized our data to understand its shape, we can begin with numerical analyses. The descriptive statistics presented in this chapter serve to describe the distribution of our data objectively and mathematically\u2014our first step into statistical analysis! The topics here will serve as the basis for everything we do in the rest of the course.<\/p>\n<h3 class=\"H1\"><a id=\"_idTextAnchor089\"><\/a>What Is Central Tendency?<\/h3>\n<p class=\"Text-1st\">What is central tendency, and why do we want to know the central tendency of a group of scores? Let us first try to answer these questions intuitively. Then we will proceed to a more formal discussion.<\/p>\n<p class=\"Text\">Imagine this situation: You are in a class with just four other students, and the five of you took a 5-point pop quiz. Today your instructor is walking around the room, handing back the quizzes. She stops at your desk and hands you your paper. Written in bold black ink on the front is \u201c3\/5.\u201d How do you react? Are you happy with your score of 3 or disappointed? How do you decide? You might calculate your percentage correct, realize it is 60%, and be appalled. But it is more likely that when deciding how to react to your performance, you will want additional information. What additional information would you like?<\/p>\n<p class=\"Text\">If you are like most students, you will immediately ask your classmates, \u201cWhat\u2019d ya get?\u201d and then ask the instructor, \u201cHow did the class do?\u201d In other words, the additional information you want is how your quiz score compares to other students\u2019 scores. You therefore understand the importance of comparing your score to the class distribution of scores. Should your score of 3 turn out to be among the higher scores, then you\u2019ll be pleased after all. On the other hand, if 3 is among the lower scores in the class, you won\u2019t be quite so happy.<\/p>\n<p class=\"Text\">This idea of comparing individual scores to a distribution of scores is fundamental to statistics. So let\u2019s explore it further, using the same example (the pop quiz you took with your four classmates). Three possible outcomes are shown in <a href=\"#_idTextAnchor090\"><span class=\"Fig-table-number-underscore\">Table 3.1<\/span><\/a>. They are labeled \u201cDataset A,\u201d \u201cDataset B,\u201d and \u201cDataset C.\u201d Which of the three datasets would make you happiest? In other words, in comparing your score with your fellow students\u2019 scores, in which dataset would your score of 3 be the most impressive?<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer122\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor090\"><\/a>Table 3.1.<\/span> Three possible datasets for the 5-point make-up quiz.<\/p>\n<table id=\"table019\" class=\"Foster-table\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-44\" \/>\n<col class=\"_idGenTableRowColumn-12\" \/>\n<col class=\"_idGenTableRowColumn-12\" \/>\n<col class=\"_idGenTableRowColumn-14\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\n<td class=\"Foster-table Table-col-hd CellOverride-2\">\n<p class=\"Table-col-hd\">Student<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-2\">\n<p class=\"Table-col-hd ParaOverride-4\">Dataset A<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-2\">\n<p class=\"Table-col-hd ParaOverride-4\">Dataset B<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd ParaOverride-4\">Dataset C<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-1\">\n<p class=\"Table-body\">You<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body\">Ahmed<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">2<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body\">Rosa<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">2<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body\">Tamika<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-2 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">2<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-11\">\n<td class=\"Foster-table Table-body-last Table-body CellOverride-2\">\n<p class=\"Table-body\">Luther<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p class=\"Text\">In Dataset A, everyone\u2019s score is 3. This puts your score at the exact center of the distribution. You can draw satisfaction from the fact that you did as well as everyone else. But of course it cuts both ways: everyone else did just as well as you.<\/p>\n<p class=\"Text\">Now consider the possibility that the scores are described as in Dataset B. This is a depressing outcome even though your score is no different than the one in Dataset A. The problem is that the other four students had higher grades, putting yours below the center of the distribution.<\/p>\n<p class=\"Text\">Finally, let\u2019s look at Dataset C. This is more like it! All of your classmates score lower than you, so your score is above the center of the distribution.<\/p>\n<p><strong data-start=\"423\" data-end=\"470\">Social Justice Example (Income Inequality):<\/strong><br data-start=\"470\" data-end=\"473\" \/><em data-start=\"473\" data-end=\"997\">Consider annual incomes in a city: four families earn $35,000, $38,000, $40,000, and $42,000. A fifth family earns $1,000,000. The mean income is over $200,000, which suggests that the \u201caverage\u201d family is wealthy. But the median income\u2014the middle value, $40,000\u2014better reflects the experience of most families. In social justice work, this distinction matters. Reporting only the mean can mask inequality and give the false impression that everyone is doing well, when in fact most households struggle to meet basic needs.<\/em><\/p>\n<p class=\"Text\">Now let\u2019s change the example in order to develop more insight into the center of a distribution. <a href=\"#_idTextAnchor091\"><span class=\"Fig-table-number-underscore\">Figure 3.1<\/span><\/a> shows the results of an experiment on memory for chess positions. Subjects were shown a chess position and then asked to reconstruct it on an empty chess board. The number of pieces correctly placed was recorded. This was repeated for two more chess positions. The scores represent the total number of chess pieces correctly placed for the three chess positions. The maximum possible score was 89.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer123\" class=\"Side-legend\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor091\"><\/a>Figure 3.1.<\/span> Back-to-back stem-and-leaf display. The left side shows the memory scores of the non-players. The right side shows the scores of the tournament players. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/38\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Memory Scores Back-to-Back Stem and Leaf<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer124\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2021\/12\/Memory_Scores_Back-to-Back_Stem_and_Leaf-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">Two groups are compared. On the left are people who don\u2019t play chess. On the right are people who play a great deal (tournament players). It is clear that the location of the center of the distribution for the non-players is much lower than the center of the distribution for the tournament players.<\/p>\n<p class=\"Text\">We\u2019re sure you get the idea now about the center of a distribution. It is time to move beyond intuition. We need a formal definition of the center of a distribution. In fact, we\u2019ll offer you three definitions! This is not just generosity on our part. There turn out to be (at least) three different ways of thinking about the center of a distribution, all of them useful in various contexts. In the remainder of this section we attempt to communicate the idea behind each concept. In the succeeding sections we will give statistical measures for these concepts of central tendency.<\/p>\n<p>&nbsp;<\/p>\n<h4 class=\"H2\">Definitions of Center<\/h4>\n<p class=\"Text-1st\">Now we explain the three measures of <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_635\"><a id=\"_idTextAnchor092\"><\/a><\/a><span class=\"key-term\">central tendency<\/span>: (1) the point on which a distribution will balance, (2) the value whose average absolute deviation from all the other values is minimized, and (3) the value whose squared deviation from all the other values is minimized.<\/p>\n<h5 class=\"H3\">Balance Scale<\/h5>\n<p class=\"Text-1st\">One definition of central tendency is the point at which the distribution is in balance. <a href=\"#_idTextAnchor093\"><span class=\"Fig-table-number-underscore\">Figure 3.2<\/span><\/a> shows the distribution of the five numbers 2, 3, 4, 9, 16 placed upon a balance scale. If each number weighs one pound, and is placed at its position along the number line, then it would be possible to balance them by placing a fulcrum at a particular point.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer125\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor093\"><\/a>Figure 3.2.<\/span> A balance scale. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/39\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Balance Scale<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer126\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Balance_Scale-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">For another example, consider the distribution shown in <a href=\"#_idTextAnchor094\"><span class=\"Fig-table-number-underscore\">Figure 3.3<\/span><\/a>. It is balanced by placing the fulcrum in the geometric middle.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer127\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor094\"><\/a>Figure 3.3.<\/span> A distribution balanced on the tip of a triangle. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/40\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Balanced Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer128\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Balanced_Distribution-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\"><a href=\"#_idTextAnchor095\"><span class=\"Fig-table-number-underscore\">Figure 3.4<\/span><\/a> illustrates that the same distribution can\u2019t be balanced by placing the fulcrum to the left of center.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer129\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor095\"><\/a>Figure 3.4.<\/span> The distribution is not balanced. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/41\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Unbalanced Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer130\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Unbalanced_Distribution-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\"><a href=\"#_idTextAnchor096\"><span class=\"Fig-table-number-underscore\">Figure 3.5<\/span><\/a> shows an asymmetric distribution. To balance it, we cannot put the fulcrum halfway between the lowest and highest values (as we did in <a href=\"#_idTextAnchor094\"><span class=\"Fig-table-number-underscore\">Figure 3.3<\/span><\/a>). Placing the fulcrum at the \u201chalf way\u201d point would cause it to tip towards the left.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer131\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor096\"><\/a>Figure 3.5.<\/span> An asymmetric distribution balanced on the tip of a triangle. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/42\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Asymmetric Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer132\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Asymmetric_Distribution-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<h5 class=\"H3\">Smallest Absolute Deviation<\/h5>\n<p class=\"Text-1st\">Another way to define the center of a distribution is based on the concept of the sum of the absolute deviations (differences). Consider the distribution made up of the five numbers 2, 3, 4, 9, 16. Let\u2019s see how far the distribution is from 10 (picking a number arbitrarily). <a href=\"#_idTextAnchor097\"><span class=\"Fig-table-number-underscore\">Table 3.2<\/span><\/a> shows the sum of the absolute deviations of these numbers from the number 10.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer133\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor097\"><\/a>Table 3.2.<\/span> An example of the sum of absolute deviations.<\/p>\n<table id=\"table020\" class=\"Foster-table\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-45\" \/>\n<col class=\"_idGenTableRowColumn-46\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\n<td class=\"Foster-table Table-col-hd CellOverride-12\">\n<p class=\"Table-col-hd\">Values<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd\">Absolute Deviations from 10<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\n<p class=\"Table-body\">8<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">7<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">6<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">9<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">16<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">6<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-8\">\n<td class=\"Foster-table Table-body-last Table-body CellOverride-12\">\n<p class=\"Table-body ParaOverride-4\"><span class=\"bold\">Sum<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\">\n<p class=\"Table-body\"><span class=\"bold\">28<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p class=\"Text\">The first row of the table shows that the absolute value of the difference between 2 and 10 is 8; the second row shows that the absolute difference between 3 and 10 is 7, and similarly for the other rows. When we add up the five absolute deviations, we get 28. So, the sum of the absolute deviations from 10 is 28. Likewise, the sum of the absolute deviations from 5 equals 3 + 2 + 1 + 4 + 11 = 21. So, the sum of the absolute deviations from 5 is smaller than the sum of the absolute deviations from 10. In this sense, 5 is closer, overall, to the other numbers than is 10.<\/p>\n<p class=\"Text\">We are now in a position to define a second measure of central tendency, this time in terms of absolute deviations. Specifically, according to our second definition, the center of a distribution is the number for which the sum of the absolute deviations is smallest. As we just saw, the sum of the absolute deviations from 10 is 28 and the sum of the absolute deviations from 5 is 21. Is there a value for which the sum of the absolute deviations is even smaller than 21? Yes. For these data, there is a value for which the sum of absolute deviations is only 20. See if you can find it.<\/p>\n<h5 class=\"H3\">Smallest Squared Deviation<\/h5>\n<p class=\"Text-1st\">We shall discuss one more way to define the center of a distribution. It is based on the concept of the sum of squared deviations (differences). Again, consider the distribution of the five numbers 2, 3, 4, 9, 16. <a href=\"#_idTextAnchor098\"><span class=\"Fig-table-number-underscore\">Table 3.3<\/span><\/a> shows the sum of the squared deviations of these numbers from the number\u00a010.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer134\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor098\"><\/a>Table 3.3.<\/span> An example of the sum of squared deviations.<\/p>\n<table id=\"table021\" class=\"Foster-table\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-45\" \/>\n<col class=\"_idGenTableRowColumn-47\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\n<td class=\"Foster-table Table-col-hd CellOverride-12\">\n<p class=\"Table-col-hd\">Values<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd\">Squared Deviations from 10<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\n<p class=\"Table-body\">64<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">49<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">9<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-12 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">16<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-8\">\n<td class=\"Foster-table Table-body-last Table-body CellOverride-12\">\n<p class=\"Table-body ParaOverride-4\"><span class=\"bold\">Sum<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\">\n<p class=\"Table-body\"><span class=\"bold\">186<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p class=\"Text\">The first row in the table shows that the squared value of the difference between 2 and 10 is 64; the second row shows that the squared difference between 3 and 10 is 49, and so forth. When we add up all these squared deviations, we get 186.<\/p>\n<p class=\"Text\">Changing the target from 10 to 5, we calculate the sum of the squared deviations from 5 as 9 + 4\u00a0+ 1 + 16 + 121 = 151. So, the sum of the squared deviations from 5 is smaller than the sum of the squared deviations from 10. Is there a value for which the sum of the squared deviations is even smaller than 151? Yes, it is possible to reach 134.8. Can you find the target number for which the sum of squared deviations is 134.8?<\/p>\n<p class=\"Text\">The target that minimizes the sum of squared deviations provides another useful definition of central tendency (the last one to be discussed in this section). It can be challenging to find the value that minimizes this sum.<\/p>\n<h3 class=\"H1\">Measures of Central Tendency<\/h3>\n<p class=\"Text-1st\">In the previous section we saw that there are several ways to define central tendency. This section defines the three most common measures of central tendency: the mean, the median, and the mode. The relationships among these measures of central tendency and the definitions given in the previous section will probably not be obvious to you.<\/p>\n<p class=\"Text\">This section gives only the basic definitions of the mean, median and mode. A further discussion of the relative merits and proper applications of these statistics is presented in a <a href=\"#_idTextAnchor104\"><span class=\"Hyperlink-underscore\">later section<\/span><\/a>.<\/p>\n<h4 class=\"H2\">Arithmetic Mean<\/h4>\n<p class=\"Text-1st\">The <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_634\"><a id=\"_idTextAnchor099\"><\/a><\/a><span class=\"key-term\">arithmetic mean<\/span>\u2014the sum of the numbers divided by the number of numbers\u2014is the most common measure of central tendency. The symbol \u201c<img decoding=\"async\" class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/>\u201d (pronounced \u201cmew\u201d) is used for the mean of a population. The symbol <img decoding=\"async\" class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/> is used for the mean of a sample. (In advanced statistics textbooks, the symbol <img decoding=\"async\" class=\"_idGenObjectAttribute-33\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.1-xbar-2.png\" alt=\"ModAbove Upper X bar\" \/>, pronounced \u201cx bar,\u201d may be used to represent the mean of a sample.) The formula for <img decoding=\"async\" class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/> is shown below:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-34\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.1-2.png\" alt=\"mu equals, Start-Frac, sigma-summation, Upper X, Over, Upper N, End-Frac\" \/><\/p>\n<p class=\"Text\">where <img decoding=\"async\" class=\"_idGenObjectAttribute-35\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.15-SigX-2.png\" alt=\"sigma-summation Upper X\" \/> is the sum of all the numbers in the population and <img decoding=\"async\" class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> is the number of numbers in the population.<\/p>\n<p class=\"Text\">The formula for <img decoding=\"async\" class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/> is essentially identical:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-37\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-2.png\" alt=\"Upper M equals, Start-Frac, sigma-summation, Upper X, Over, n, End-Frac\" \/><\/p>\n<p class=\"Text\">where <img decoding=\"async\" class=\"_idGenObjectAttribute-35\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.15-SigX-2.png\" alt=\"sigma-summation Upper X\" \/> is the sum of all the numbers in the sample and <span class=\"italic\">n <\/span>is the number of numbers in the sample. The only distinction between these two equations is whether we are referring to the population (in which case we use <img decoding=\"async\" class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/> and <img decoding=\"async\" class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/>) or a sample of that population (in which case we use <img decoding=\"async\" class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/> and <span class=\"italic\">n<\/span>).<\/p>\n<p class=\"Text\">As an example, the mean of the numbers 1, 2, 3, 6, 8 is 20\/5 = 4 regardless of whether the numbers constitute the entire population or just a sample from the population.<\/p>\n<p class=\"Text\"><a href=\"#_idTextAnchor100\"><span class=\"Fig-table-number-underscore\">Figure 3.6<\/span><\/a> shows the number of touchdown (TD) passes thrown by each of the 31 teams in the National Football League in the 2000 season. The mean number of touchdown passes thrown is 20.45, as shown below.<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-38\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.4a-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">Although the arithmetic mean is not the only \u201cmean\u201d (there is also a geometric mean, a harmonic mean, and many others that are all beyond the scope of this course), it is by far the most commonly used. Therefore, if the term \u201cmean\u201d is used without specifying whether it is the arithmetic mean, the geometric mean, or some other mean, it is assumed to refer to the arithmetic mean.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer149\" class=\"Side-legend\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor100\"><\/a>Figure 3.6.<\/span> Number of touchdown passes. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/43\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Touchdown Passes Raw Data<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer150\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Touchdown_Passes_Raw_Data-4.png\" alt=\"\" \/><\/div>\n<\/div>\n<h4 class=\"H2\">Median<\/h4>\n<p class=\"Text-1st\">The median is also a frequently used measure of central tendency. The <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_639\"><a id=\"_idTextAnchor101\"><\/a><\/a><span class=\"key-term\">median<\/span> is the midpoint of a distribution: the same number of scores is above the median as below it. For the data in <a href=\"#_idTextAnchor100\"><span class=\"Fig-table-number-underscore\">Figure 3.6<\/span><\/a>, there are 31 scores. The 16th highest score (which equals 20) is the median because there are 15 scores below the 16th score and 15 scores above the 16th score. The median can also be thought of as the 50th percentile.<\/p>\n<p class=\"Text\">When there is an odd number of numbers, the median is simply the middle number. For example, the median of 2, 4, and 7 is 4. When there is an even number of numbers, the median is the mean of the two middle numbers. Thus, the median of the numbers 2, 4, 7, 12 is:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-39\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.5-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">When there are numbers with the same values, each appearance of that value gets counted. For example, in the set of numbers 1, 3, 4, 4, 5, 8, and 9, the median is 4 because there are three numbers (1, 3, and 4) below it and three numbers (5, 8, and 9) above it. If we only counted 4 once, the median would incorrectly be calculated at 4.5 (4 + 5, divided by 2). When in doubt, writing out all of the numbers in order and marking them off one at a time from the top and bottom will always lead you to the correct answer.<\/p>\n<h4 class=\"H2\">Mode<\/h4>\n<p class=\"Text-1st\">The <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_640\"><a id=\"_idTextAnchor102\"><\/a><\/a><span class=\"key-term\">mode<\/span> is the most frequently occurring value in the dataset. For the data in <a href=\"#_idTextAnchor100\"><span class=\"Fig-table-number-underscore\">Figure 3.6<\/span><\/a>, the mode is 18 since more teams (4) had 18 touchdown passes than any other number of touchdown passes. With continuous data, such as response time measured to many decimals, the frequency of each value is one since no two scores will be exactly the same (see <a href=\"#_idTextAnchor012\"><span class=\"Hyperlink-underscore\">discussion of continuous variables<\/span><\/a>). Therefore the mode of continuous data is normally computed from a grouped frequency distribution. <a href=\"#_idTextAnchor103\"><span class=\"Fig-table-number-underscore\">Table 3.4<\/span><\/a> shows a grouped frequency distribution for a set of credit scores. Since the interval with the highest frequency is 600 to 700, the mode is the middle of that interval (650). Although the mode is not frequently used for continuous data, it is nevertheless an important measure of central tendency as it is the only measure we can use on qualitative or categorical data.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer152\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor103\"><\/a>Table 3.4.<\/span> Grouped frequency distribution.<\/p>\n<table id=\"table022\" class=\"Foster-table\" style=\"width: 404px\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-48\" \/>\n<col class=\"_idGenTableRowColumn-40\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\" style=\"height: 17px\">\n<td class=\"Foster-table Table-col-hd CellOverride-13\" style=\"width: 203.75px;height: 17px\">\n<p class=\"Table-col-hd ParaOverride-4\">Range<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\" style=\"width: 201.75px;height: 17px\">\n<p class=\"Table-col-hd ParaOverride-4\">Frequency<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-1\" style=\"width: 203.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">500 to 600<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"width: 201.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-2\" style=\"width: 203.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">600 to 700<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"width: 201.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">6<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-2\" style=\"width: 203.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">700 to 800<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"width: 201.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">5<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-13 _idGenCellOverride-2\" style=\"width: 203.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">800 to 900<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"width: 201.75px;height: 17px\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<h4 class=\"H2\"><a id=\"_idTextAnchor104\"><\/a>More on the Mean and Median<\/h4>\n<p class=\"Text-1st\">In the section <a href=\"#_idTextAnchor089\"><span class=\"Hyperlink-underscore\">What Is Central Tendency?<\/span><\/a>, we saw that the center of a distribution could be defined three ways: (1) the point on which a distribution would balance, (2) the value whose average absolute deviation from all the other values is minimized, and (3) the value whose squared deviation from all the other values is minimized. The mean is the point on which a distribution would balance, the median is the value that minimizes the sum of absolute deviations, and the mean is the value that minimizes the sum of the squared deviations.<\/p>\n<p class=\"Text\"><a href=\"#_idTextAnchor105\"><span class=\"Fig-table-number-underscore\">Table 3.5<\/span><\/a> shows the absolute and squared deviations of the numbers 2, 3, 4, 9, and 16 from their median of 4 and their mean of 6.8. You can see that the sum of absolute deviations from the median (20) is smaller than the sum of absolute deviations from the mean (22.8). On the other hand, the sum of squared deviations from the median (174) is larger than the sum of squared deviations from the mean (134.8).<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer153\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor105\"><\/a>Table 3.5.<\/span> Absolute and squared deviations from the median of 4 and the mean of 6.8.<\/p>\n<table id=\"table023\" class=\"Foster-table\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-49\" \/>\n<col class=\"_idGenTableRowColumn-50\" \/>\n<col class=\"_idGenTableRowColumn-50\" \/>\n<col class=\"_idGenTableRowColumn-51\" \/>\n<col class=\"_idGenTableRowColumn-12\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-19\">\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\n<p class=\"Table-col-hd ParaOverride-4\">Value<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\n<p class=\"Table-col-hd ParaOverride-4\">Absolute Deviation from Median<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\n<p class=\"Table-col-hd ParaOverride-4\">Absolute Deviation from Mean<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-14\">\n<p class=\"Table-col-hd ParaOverride-4\">Squared Deviation from Median<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd ParaOverride-4\">Squared Deviation from Mean<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\n<p class=\"Table-body\">2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\n<p class=\"Table-body\">4.8<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-1\">\n<p class=\"Table-body\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\n<p class=\"Table-body\">23.04<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">3.8<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">14.44<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">2.8<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">7.84<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">9<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">2.2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">25<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">4.84<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">16<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">12<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">9.2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-14 _idGenCellOverride-2\">\n<p class=\"Table-body\">144<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">84.64<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-8\">\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\n<p class=\"Table-body ParaOverride-4\"><span class=\"bold\">Total<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\n<p class=\"Table-body\"><span class=\"bold\">20<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\n<p class=\"Table-body\"><span class=\"bold\">22.8<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-14\">\n<p class=\"Table-body\"><span class=\"bold\">174<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\">\n<p class=\"Table-body\"><span class=\"bold\">134.80<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p class=\"Text\"><a href=\"#_idTextAnchor106\"><span class=\"Fig-table-number-underscore\">Figure 3.7<\/span><\/a> shows that the distribution balances at the mean of 6.8 and not at the median of 4. The relative advantages and disadvantages of the mean and median are discussed in the section <a href=\"#_idTextAnchor107\"><span class=\"Hyperlink-underscore\">Comparing Measures of Central Tendency<\/span><\/a>.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer154\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor106\"><\/a>Figure 3.7.<\/span> The distribution balances at the mean of 6.8 and not at the median of 4.0. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/44\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Balance Scale Numbered<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer155\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Balance_Scale_Numbered-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">When a distribution is symmetric, then the mean and the median are the same. Consider the following distribution: 1, 3, 4, 5, 6, 7, 9. The mean and median are both 5. The mean, median, and mode are identical in the bell-shaped normal distribution.<\/p>\n<h4 class=\"H2\"><a id=\"_idTextAnchor107\"><\/a>Comparing Measures of Central Tendency<\/h4>\n<p class=\"Text-1st\">How do the various measures of central tendency compare with each other? For symmetric distributions, the mean and median are the same value, as is the mode except in bimodal distributions. However, differences among the measures occur with skewed distributions. <a href=\"#_idTextAnchor108\"><span class=\"Fig-table-number-underscore\">Figure 3.8<\/span><\/a> shows the distribution of 642 scores on an introductory psychology test. Notice this distribution has a slight positive skew.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer156\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor108\"><\/a>Figure 3.8.<\/span> A distribution with a positive skew. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/45\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Histogram<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer157\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Histogram-4.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">Measures of central tendency are shown in <a href=\"#_idTextAnchor109\"><span class=\"Fig-table-number-underscore\">Table 3.6<\/span><\/a>. Notice they do not differ greatly, with the exception that the mode is considerably lower than the other measures. When distributions have a positive skew, the mean is typically higher than the median, although it may not be in bimodal distributions. For these data, the mean of 91.58 is higher than the median of 90. This pattern holds true for any skew: the mode will remain at the highest point in the distribution, the median will be pulled slightly out into the skewed tail (the longer end of the distribution), and the mean will be pulled the farthest out. Thus, the mean is more sensitive to skew than the median or mode, and in cases of extreme skew, the mean may no longer be appropriate to use.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer158\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor109\"><\/a>Table 3.6.<\/span> Measures of central tendency for the test scores.<\/p>\n<table id=\"table024\" class=\"Foster-table\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-39\" \/>\n<col class=\"_idGenTableRowColumn-39\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd\">Measure<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd\">Value<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\n<p class=\"Table-body\">Mode<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\n<p class=\"Table-body\">84.00<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">Median<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">90.00<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-11\">\n<td class=\"Foster-table Table-body-last Table-body\">\n<p class=\"Table-body\">Mean<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\">\n<p class=\"Table-body\">91.58<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer159\" class=\"Legend-below\">\n<p class=\"Fig-legend\">\n<\/div>\n<\/div>\n<p class=\"Text\"><a href=\"#_idTextAnchor111\"><span class=\"Fig-table-number-underscore\">Table 3.7<\/span><\/a> shows the measures of central tendency for these data. The large skew results in very different values for these measures. No single measure of central tendency is sufficient for data such as these. Fortunately, there is no need to summarize a distribution with a single number. When the various measures differ, our opinion is that you should report the mean and median. Sometimes it is worth reporting the mode as well. In the media, the median is usually reported to summarize the center of skewed distributions. You will hear about median salaries and median prices of houses sold, etc. This is better than reporting only the mean, but it would be informative to hear more statistics.<\/p>\n<h3 class=\"H1\">Spread and Variability<\/h3>\n<p><strong data-start=\"1439\" data-end=\"1480\">Social Justice Example (Wealth Gaps):<\/strong><br data-start=\"1480\" data-end=\"1483\" \/>Two communities may have the same median household income of $50,000. But in Community A, most families cluster closely around that figure, while in Community B, a few very wealthy households contrast sharply with many families living far below $50,000. Both communities share the same median, but their spreads (variability) are very different. From a social justice lens, measures of variability\u2014like range or standard deviation\u2014help uncover whether economic opportunity is equitably shared or concentrated among a privileged few.<\/p>\n<p class=\"Text-1st\">Variability refers to how \u201cspread out\u201d a group of scores is. To see what we mean by spread out, consider the graphs in <a href=\"#_idTextAnchor112\"><span class=\"Fig-table-number-underscore\">Figure 3.10<\/span><\/a>. These graphs represent the scores on two quizzes. The mean score for each quiz is 7.0. Despite the equality of means, you can see that the distributions are quite different. Specifically, the scores on Quiz 1 are more densely packed and those on Quiz 2 are more spread out. The differences among students were much greater on Quiz 2 than on Quiz 1.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer162\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor112\"><\/a>Figure 3.10.<\/span> Bar charts of Quizzes 1 and 2. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/47\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Quiz Score Bar Charts<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer163\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Quiz_Score_Bar_Charts-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">The terms <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_646\"><a id=\"_idTextAnchor113\"><\/a><\/a><span class=\"key-term\">variability<\/span>, <a id=\"_idTextAnchor114\"><\/a><span class=\"key-term\">spread<\/span>, and <a id=\"_idTextAnchor115\"><\/a><span class=\"key-term\">dispersion<\/span> are synonyms and refer to how spread out a distribution is. Just as in the section on central tendency where we discussed measures of the center of a distribution of scores, in this section we will discuss measures of the variability of a distribution. There are three frequently used measures of variability: range, variance, and standard deviation. In the next few paragraphs, we will look at each of these measures of variability in more detail.<\/p>\n<h4 class=\"H2\">Range<\/h4>\n<p class=\"Text-1st\">The range is the simplest measure of variability to calculate, and one you have probably encountered many times in your life. The <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_641\"><a id=\"_idTextAnchor116\"><\/a><\/a><span class=\"key-term\">range<\/span> is simply the highest score minus the lowest score. Let\u2019s take a few examples. What is the range of the following group of numbers: 10, 2, 5, 6, 7, 3, 4? Well, the highest number is 10, and the lowest number is 2, so 10 \u2212 2 = 8. The range is 8. Let\u2019s take another example. Here\u2019s a dataset with 10 numbers: 99, 45, 23, 67, 45, 91, 82, 78, 62, 51. What is the range? The highest number is 99 and the lowest number is 23, so 99 \u2212 23 = 76; the range is 76. Now consider the two quizzes shown in <a href=\"#_idTextAnchor112\"><span class=\"Fig-table-number-underscore\">Figure 3.10<\/span><\/a>. On Quiz 1, the lowest score is 5 and the highest score is 9. Therefore, the range is 4. The range on Quiz 2 was larger: the lowest score was 4 and the highest score was 10. Therefore the range is 6.<\/p>\n<p class=\"Text\">The problem with using range is that it is extremely sensitive to outliers, and one number far away from the rest of the data will greatly alter the value of the range. For example, in the set of numbers 1, 3, 4, 4, 5, 8, and 9, the range is 8 (9 \u2212 1). However, if we add a single person whose score is nowhere close to the rest of the scores, say, 20, the range more than doubles from 8 to 19.<\/p>\n<p><strong data-start=\"2361\" data-end=\"2415\">Social Justice Example (Wage Gaps by Gender\/Race):<\/strong><br data-start=\"2415\" data-end=\"2418\" \/><em data-start=\"2418\" data-end=\"2875\">Suppose we compare wages in two workplaces. In Workplace A, salaries range from $40,000 to $60,000. In Workplace B, most employees earn between $40,000 and $50,000, but one executive earns $300,000. The range in Workplace B is much larger, and the presence of that single outlier masks the fact that women and people of color may be clustered at the lower end of salaries. This shows how looking only at range\u2014or even mean\u2014can obscure systemic inequities.<\/em><\/p>\n<h5 class=\"H3\">Interquartile Range<\/h5>\n<p class=\"Text-1st\">The <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_638\"><a id=\"_idTextAnchor117\"><\/a><\/a><span class=\"key-term\">interquartile range (IQR)<\/span> is the range of the middle 50% of the scores in a distribution and is sometimes used to communicate where the bulk of the data in the distribution are located. It is computed as follows:<\/p>\n<p class=\"Text ParaOverride-4\">IQR = 75th percentile \u2212 25th percentile<\/p>\n<p class=\"Text\">For Quiz 1, the 75th percentile is 8 and the 25th percentile is 6. The interquartile range is therefore 2. For Quiz 2, which has greater spread, the 75th percentile is 9, the 25th percentile is 5, and the interquartile range is 4. Recall that in the discussion of box plots, the 75th percentile was called the upper hinge and the 25th percentile was called the lower hinge. Using this terminology, the interquartile range is referred to as the H-spread.<\/p>\n<h4 class=\"H2\">Sum of Squares<\/h4>\n<p class=\"Text-1st\">Variability can also be defined in terms of how close the scores in the distribution are to the middle of the distribution. Using the mean as the measure of the middle of the distribution, we can see how far, on average, each data point is from the center. The data on community volunteer hours is shown in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a>.<\/p>\n<p class=\"Text\">There are a few things to note about how <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a> is formatted. The raw data scores (<span class=\"italic\">X<\/span>) are always placed in the left-most column. This column is then summed at the bottom (<span class=\"Symbol-sigma CharOverride-10\">\u03a3<\/span><span class=\"italic\">X<\/span>) to facilitate calculating the mean by dividing the sum of <span class=\"italic\">X<\/span> values by the number of scores in the table (<span class=\"italic\">N<\/span>). The mean score is 7.0 (<span class=\"Symbol-sigma CharOverride-10\">\u03a3<\/span><span class=\"italic\">X<\/span>\/<span class=\"italic\">N <\/span>= 140\/20 = 7.0). Once you have the mean, you can easily work your way down the second column calculating the deviation scores (<img decoding=\"async\" class=\"_idGenObjectAttribute-40\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.6-2.png\" alt=\"\" \/>), representing how far each score deviates from the mean, here calculated as the score (<span class=\"italic\">X<\/span> value) minus 7. This column is also summed and has a very important property: it will always sum to 0, or close to zero if you have rounding error due to many decimal places (<span class=\"Symbol-sigma CharOverride-10\">\u03a3<\/span>(<img decoding=\"async\" class=\"_idGenObjectAttribute-40\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.6-2.png\" alt=\"\" \/>) = 0). This step is used as a check on your math to make sure you haven\u2019t made a mistake. If this column sums to 0, you can move on to filling in the third column, which is composed of the squared deviation scores. The deviation scores are squared to remove negative values and appear in the third column <img decoding=\"async\" class=\"_idGenObjectAttribute-41\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.7-2.png\" alt=\"\" \/>. When these values are summed, you have the sum of the squared deviations, or the <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_645\"><a id=\"_idTextAnchor118\"><\/a><\/a><span class=\"key-term\">sum of squares<\/span> (<span class=\"italic\">SS<\/span>), calculated with the formula <img decoding=\"async\" class=\"_idGenObjectAttribute-42\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8-2.png\" alt=\"\" \/>.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer172\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor119\"><\/a>Table 3.8.<\/span> Calculation of variance for community volunteer hours.<\/p>\n<table id=\"table026\" class=\"Foster-table\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-53\" \/>\n<col class=\"_idGenTableRowColumn-54\" \/>\n<col class=\"_idGenTableRowColumn-46\" \/>\n<col class=\"_idGenTableRowColumn-55\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\n<td class=\"Foster-table Table-col-hd CellOverride-16\">\n<p class=\"Table-col-hd ParaOverride-4\"><span class=\"bold-italic\">X<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-16\">\n<p class=\"Table-col-hd ParaOverride-4\"><span class=\"bold-italic\">X<\/span> \u2212 <span class=\"bold-italic\">M<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-16\">\n<p class=\"Table-col-hd ParaOverride-4\">(<span class=\"bold-italic\">X<\/span> \u2212 <span class=\"bold-italic\">M<\/span>)<span class=\"superscript _idGenCharOverride-1\">2<\/span><\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd ParaOverride-4\"><span class=\"bold-italic\">X<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span><\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">9<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-1\">\n<p class=\"Table-body\">2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-1\">\n<p class=\"Table-body ParaOverride-4\">81<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">9<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">81<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">9<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">2<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">81<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">8<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">64<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">8<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">64<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">8<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">64<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">8<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">64<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">7<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">49<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">7<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">49<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">7<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">49<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">7<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">49<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">7<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">49<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">6<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">\u22121<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">6<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">\u22121<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">6<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">\u22121<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">6<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">\u22121<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">6<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">\u22121<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">6<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">\u22121<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body\">\u22122<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">25<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">\u22122<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-16 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">4<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-3 _idGenCellOverride-2\">\n<p class=\"Table-body ParaOverride-4\">25<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-56\">\n<td class=\"Foster-table Table-body-last Table-body CellOverride-16\">\n<p class=\"Table-body ParaOverride-14\"><span class=\"Symbol-sigma-Table CharOverride-10\">\u03a3<\/span><span class=\"italic\">X<\/span> = 140<\/p>\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\"><img decoding=\"async\" class=\"_idGenObjectAttribute-43\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8f-2.png\" alt=\"\" \/><\/span> = 19,600<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-16\">\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\">\u03a3<\/span>(<span class=\"CharOverride-11\"><img decoding=\"async\" class=\"_idGenObjectAttribute-44\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.61-2.png\" alt=\"\" \/><\/span>) = 0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-17\">\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\">\u03a3<\/span><span class=\"CharOverride-11\"><img decoding=\"async\" class=\"_idGenObjectAttribute-45\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.71-2.png\" alt=\"\" \/><\/span> = 30<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-18\">\n<p class=\"Table-body ParaOverride-4\"><span class=\"Symbol-sigma-Table CharOverride-10\"><img decoding=\"async\" class=\"_idGenObjectAttribute-46\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8d-2.png\" alt=\"\" \/><\/span> = 1,010<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-47\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8c-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">The preceding formula is called the definitional formula, as it shows the logic behind the sum of squared deviations calculation. As mentioned earlier, there can be rounding errors in calculating the deviation scores. Also, when the set of scores is large, calculating the deviation scores, squaring the scores, and then summing those values can be tedious. To simplify the sum of squares calculation, the computational formula is used instead. The computational formula is as follows:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-48\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8b-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">The last column in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a> represents the <span class=\"italic\">X<\/span> values squared and then summed\u2014<img decoding=\"async\" class=\"_idGenObjectAttribute-49\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8d1-2.png\" alt=\"\" \/>. At the bottom of the first column, the <img decoding=\"async\" class=\"_idGenObjectAttribute-35\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8e-2.png\" alt=\"\" \/> value is squared\u00ad\u2014<img decoding=\"async\" class=\"_idGenObjectAttribute-50\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8f1-2.png\" alt=\"\" \/>. These are the values used in the computational formula for the sum of squares. As you can see in the calculation below, the <span class=\"italic\">SS<\/span> value is the same for both the definitional formula and the computational formula:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-125\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/MicrosoftTeams-image-16.png\" alt=\"\" width=\"438\" height=\"69\" srcset=\"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2024\/10\/MicrosoftTeams-image-16.png 438w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2024\/10\/MicrosoftTeams-image-16-300x47.png 300w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2024\/10\/MicrosoftTeams-image-16-65x10.png 65w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2024\/10\/MicrosoftTeams-image-16-225x35.png 225w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2024\/10\/MicrosoftTeams-image-16-350x55.png 350w\" sizes=\"auto, (max-width: 438px) 100vw, 438px\" \/><\/p>\n<p class=\"Text\">As we will see, the sum of squares appears again and again in different formulas\u2014it is a very important value, and using the <span class=\"italic\">X<\/span> and <img decoding=\"async\" class=\"_idGenObjectAttribute-52\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.8h-2.png\" alt=\"\" \/> columns in this table makes it simple to calculate the <span class=\"italic\">SS<\/span> without error.<\/p>\n<h4 class=\"H2\">Variance<\/h4>\n<p class=\"Text-1st\">Now that we have the sum of squares calculated, we can use it to compute our formal measure of average distance from the mean\u2014the variance. The <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_647\"><a id=\"_idTextAnchor120\"><\/a><\/a><span class=\"key-term\">variance<\/span> is defined as the average squared difference of the scores from the mean. We square the deviation scores because, as we saw in the second column of <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a>, the sum of raw deviations is always 0, and there\u2019s nothing we can do mathematically without changing that.<\/p>\n<p class=\"Text\">The population parameter for variance is <span class=\"Symbol\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> (\u201csigma-squared\u201d) and is calculated as:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-53\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.11a-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">We can use the value we previously calculated for <span class=\"italic\">SS<\/span> in the numerator, then simply divide that value by <img decoding=\"async\" class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> to get variance. If we assume that the values in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table\u00a03.8<\/span><\/a> represent the full population, then we can take our value of sum of squares and divide it by <img decoding=\"async\" class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> to get our population variance:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-54\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.12-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">So, on average, scores in this population are 1.5 squared units away from the mean. This measure of spread exhibits much more <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_642\"><a id=\"_idTextAnchor121\"><\/a><\/a><span class=\"key-term\">robustness<\/span> (a term used by statisticians to mean resilience or resistance to outliers) than the range, so it is a much more useful value to compute. Additionally, as we will see in future chapters, variance plays a central role in inferential statistics.<\/p>\n<p class=\"Text\">The sample statistic used to estimate the variance is <span class=\"italic\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> (\u201c<span class=\"italic\">s<\/span>-squared\u201d):<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-55\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.13-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">This formula is very similar to the formula for the population variance with one change: we now divide by <span class=\"italic\">N <\/span>\u2212 1 instead of <img decoding=\"async\" class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/>. The value <span class=\"italic\">N <\/span>\u2212 1 has a special name: the <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_636\"><a id=\"_idTextAnchor122\"><\/a><\/a><span class=\"key-term\">degrees of freedom<\/span> (abbreviated as <span class=\"italic\">d<\/span><span class=\"italic\">f<\/span>). You don\u2019t need to understand in depth what degrees of freedom are (essentially they account for the fact that we have to use a sample statistic to estimate the mean [<img decoding=\"async\" class=\"_idGenObjectAttribute-32\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperM-2.png\" alt=\"Upper M\" \/>] before we estimate the variance) in order to calculate variance, but knowing that the denominator is called <span class=\"italic\">d<\/span><span class=\"italic\">f <\/span>provides a nice shorthand for the variance formula:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-56\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.13a-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">Going back to the values in <a href=\"#_idTextAnchor119\"><span class=\"Fig-table-number-underscore\">Table 3.8<\/span><\/a> and treating those scores as a sample, we can estimate the sample variance as:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-57\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.14-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">Notice that this value is slightly larger than the one we calculated when we assumed these scores were the full population. This is because our value in the denominator is slightly smaller, making the final value larger. In general, as your sample size <img decoding=\"async\" class=\"_idGenObjectAttribute-36\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.2-upperN-2.png\" alt=\"Upper N\" \/> gets bigger, the effect of subtracting 1 becomes less and less. Comparing a sample size of 10 to a sample size of 1000; 10 \u2212 1 = 9, or 90% of the original value, whereas 1000 \u2212 1 = 999, or 99.9% of the original value. Thus, larger sample sizes will bring the estimate of the sample variance closer to that of the population variance. This is a key idea and principle in statistics that we will see over and over again: larger sample sizes better reflect the population.<\/p>\n<h4 class=\"H2\">Standard Deviation<\/h4>\n<p class=\"Text-1st\">The <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_137_644\"><a id=\"_idTextAnchor123\"><\/a><\/a><span class=\"key-term\">standard deviation<\/span> is simply the square root of the variance. This is a useful and interpretable statistic because taking the square root of the variance (recalling that variance is the average squared difference) puts the standard deviation back into the original units of the measure we used. Thus, when reporting descriptive statistics in a study, scientists virtually always report mean and standard deviation. Standard deviation is therefore the most commonly used measure of spread for our purposes, representing the average distance of the scores from the mean.<\/p>\n<p class=\"Text\">The population parameter for standard deviation is <span class=\"Symbol\">s<\/span> (\u201csigma\u201d), which, intuitively, is the square root of the variance parameter <span class=\"Symbol\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> (occasionally, the symbols work out nicely that way). The formula is simply the formula for variance under a square root sign:<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-58\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.15-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">The sample statistic follows the same conventions and is given as <span class=\"italic\">s<\/span> in mathematical formulas. (Note that in American Psychological Association [APA] format for reporting results, sample standard deviation is reported using the abbreviation <span class=\"italic\">SD<\/span>.)<\/p>\n<p class=\"Equation\"><img decoding=\"async\" class=\"_idGenObjectAttribute-59\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn3.17-2.png\" alt=\"\" \/><\/p>\n<p class=\"Text\">The standard deviation is an especially useful measure of variability when the distribution is normal or approximately normal because the proportion of the distribution within a given number of standard deviations from the mean can be calculated. For example, 68% of the distribution is within one standard deviation (above and below) of the mean and approximately 95% of the distribution is within two standard deviations of the mean, as shown in <a href=\"#_idTextAnchor124\"><span class=\"Fig-table-number-underscore\">Figure 3.11<\/span><\/a>. Therefore, if you had a normal distribution with a mean of 50 and a standard deviation of 10, then 68% of the distribution would be between 50 \u2212 10 = 40 and 50 + 10 = 60. Similarly, about 95% of the distribution would be between 50 \u2212 2 \u00d7 10 = 30 and 50\u00a0+ 2 \u00d7 10 = 70.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer192\" class=\"Side-legend\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor124\"><\/a>Figure 3.11.<\/span> Percentages of the normal distribution. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/48\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Normal Distribution Percentages<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer193\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Normal_Distribution_Percentages-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\"><a href=\"#_idTextAnchor125\"><span class=\"Fig-table-number-underscore\">Figure 3.12<\/span><\/a> shows two normal distributions. The red (left-most) distribution has a mean of 40 and a standard deviation of 5; the blue (right-most) distribution has a mean of 60 and a standard deviation of 10. For the red distribution, 68% of the distribution is between 45 and 55; for the blue distribution, 68% is between 50 and 70. Notice that as the standard deviation gets smaller, the distribution becomes much narrower, regardless of where the center of the distribution (mean) is. <a href=\"#_idTextAnchor126\"><span class=\"Fig-table-number-underscore\">Figure 3.13<\/span><\/a> presents several more examples of this effect.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer194\" class=\"Side-legend\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor125\"><\/a>Figure 3.12.<\/span> Normal distributions with standard deviations of 5 and 10. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/49\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Normal Distributions with Standard Deviations<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer195\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Normal_Distributions_with_Standard_Deviations-2.png\" alt=\"\" \/><\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer196\" class=\"Legend-below\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor126\"><\/a>Figure 3.13.<\/span> Differences between two datasets. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/50\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Location and Variability Differences<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div><\/div>\n<div><\/div>\n<div><\/div>\n<div id=\"_idContainer197\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Location_and_Variability_Differences-2.png\" alt=\"\" \/><\/div>\n<div><strong data-start=\"2361\" data-end=\"2415\">Social Justice Example (Wage Gaps by Gender\/Race):<\/strong><br data-start=\"2415\" data-end=\"2418\" \/>Suppose we compare wages in two workplaces. In Workplace A, salaries range from $40,000 to $60,000. In Workplace B, most employees earn between $40,000 and $50,000, but one executive earns $300,000. The range in Workplace B is much larger, and the presence of that single outlier masks the fact that women and people of color may be clustered at the lower end of salaries. This shows how looking only at range\u2014or even mean\u2014can obscure systemic inequities.<\/div>\n<\/div>\n<h3 class=\"H1\">Exercises<\/h3>\n<ol>\n<li class=\"Numbered-list-Exercises-1st\">If the mean time to respond to a stimulus is much higher than the median time to respond, what can you say about the shape of the distribution of response times?<\/li>\n<li class=\"Numbered-list-Exercises\">Compare the mean, median, and mode in terms of their sensitivity to extreme scores.<\/li>\n<li class=\"Numbered-list-Exercises\">Your younger brother comes home one day after taking a science test. He says someone at school told him that \u201c60% of the students in the class scored above the median test grade.\u201d What is wrong with this statement? What if he had said \u201c60% of the students scored above the mean?\u201d<\/li>\n<li class=\"Numbered-list-Exercises\">Make up three datasets with five numbers each that have:\n<ol>\n<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">the same mean but different standard deviations.<\/li>\n<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">the same mean but different medians.<\/li>\n<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">the same median but different means.<\/li>\n<\/ol>\n<\/li>\n<li class=\"Numbered-list-Exercises\">Compute the population mean and population standard deviation for the following scores (remember to use the sum of squares table): 5, 7, 8, 3, 4, 4, 2, 7, 1, 6<\/li>\n<li class=\"Numbered-list-Exercises\">For the following problem, use the following scores: 5, 8, 8, 8, 7, 8, 9, 12, 8, 9, 8, 10, 7, 9, 7, 6, 9, 10, 11, 8\n<ol>\n<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">Create a histogram of these data. What is the shape of this histogram?<\/li>\n<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">How do you think the three measures of central tendency will compare to each other in this dataset?<\/li>\n<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">Compute the sample mean, the median, and the mode<\/li>\n<li class=\"Numbered-list-Exercises-sub _idGenParaOverride-1\">Draw and label lines on your histogram for each of the above values. Do your results match your predictions?<\/li>\n<\/ol>\n<\/li>\n<li class=\"Numbered-list-Exercises\">Compute the range, sample variance, and sample standard deviation for the following scores: 25,\u00a036, 41, 28, 29, 32, 39, 37, 34, 34, 37, 35, 30, 36, 31, 31<\/li>\n<li class=\"Numbered-list-Exercises\">Using the same values from Problem 7, calculate the range, sample variance, and sample standard deviation, but this time include 65 in the list of values. How did each of the three values change?<\/li>\n<li class=\"Numbered-list-Exercises\">Two normal distributions have exactly the same mean, but one has a standard deviation of 20 and the other has a standard deviation of 10. How would the shapes of the two distributions compare?<\/li>\n<li class=\"Numbered-list-Exercises\">Compute the sample mean and sample standard deviation for the following scores: \u22128, \u22124, \u22127, \u22126, \u22128, \u22125, \u22127, \u22129, \u22122, 0<\/li>\n<\/ol>\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<h3 class=\"H1\">Answers to Odd-Numbered Exercises<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<ul>\n<li>1) If the mean is higher, that means it is farther out into the right-hand tail of the distribution. Therefore, we know this distribution is positively skewed.<\/li>\n<li>3) The median is defined as the value with 50% of scores above it and 50% of scores below it; therefore, 60% of score cannot fall above the median. If 60% of scores fall above the mean, that would indicate that the mean has been pulled down below the value of the median, which means that the distribution is negatively skewed<\/li>\n<li>5) <img decoding=\"async\" class=\"_idGenObjectAttribute-31\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Eqn2.14-mu-2.png\" alt=\"mu\" \/> = 4.80, <span class=\"Symbol\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> = 2.36<\/li>\n<li>7) Range = 16, <span class=\"italic\">s<\/span><span class=\"superscript _idGenCharOverride-1\">2<\/span> = 18.40, <span class=\"italic\">s<\/span> = 4.29<\/li>\n<li>9) If both distributions are normal, then they are both symmetrical, and having the same mean causes them to overlap with one another. The distribution with the standard deviation of 10 will be narrower than the other distribution.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<div class=\"glossary\"><span class=\"screen-reader-text\" id=\"definition\">definition<\/span><template id=\"term_137_635\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_635\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_634\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_634\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_639\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_639\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_640\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_640\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_646\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_646\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_641\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_641\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_638\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_638\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_645\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_645\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_647\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_647\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_642\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_642\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_636\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_636\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_137_644\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_137_644\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><\/div>","protected":false},"author":7,"menu_order":3,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-137","chapter","type-chapter","status-publish","hentry"],"part":21,"_links":{"self":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/137","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/users\/7"}],"version-history":[{"count":16,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/137\/revisions"}],"predecessor-version":[{"id":968,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/137\/revisions\/968"}],"part":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/parts\/21"}],"metadata":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/137\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/media?parent=137"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapter-type?post=137"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/contributor?post=137"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/license?post=137"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}