
{"id":92,"date":"2021-12-09T22:29:35","date_gmt":"2021-12-09T22:29:35","guid":{"rendered":"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-2\/"},"modified":"2025-09-08T21:05:11","modified_gmt":"2025-09-08T21:05:11","slug":"chapter-2","status":"publish","type":"chapter","link":"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-2\/","title":{"raw":"Chapter 2: Describing Data Using Frequency Distributions and Graphs","rendered":"Chapter 2: Describing Data Using Frequency Distributions and Graphs"},"content":{"raw":"<div class=\"textbox textbox--sidebar textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Key Terms<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor083\"><span class=\"Hyperlink-underscore\">bell curve<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor084\"><span class=\"Hyperlink-underscore\">bimodal distribution<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor066\"><span class=\"Hyperlink-underscore\">box plots<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor047\"><span class=\"Hyperlink-underscore\">categorical variables<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor060\"><span class=\"Hyperlink-underscore\">frequency polygons<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor056\"><span class=\"Hyperlink-underscore\">histogram<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor086\"><span class=\"Hyperlink-underscore\">skew<\/span><\/a><\/p>\r\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor048\"><span class=\"Hyperlink-underscore\">stem-and-leaf display<\/span><\/a><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<p data-start=\"231\" data-end=\"815\">Statistics is more than just numbers\u2014it is a way of telling stories about people, communities, and systems. When used thoughtfully, statistics can uncover patterns of inequality, highlight voices that are often silenced, and guide us toward solutions that promote fairness. For example, frequency tables and graphs do more than summarize data; they can reveal who has access to education, who is disproportionately impacted by the criminal justice system, or how resources are distributed across neighborhoods. In this way, statistics becomes a tool for advocacy, not just analysis.<\/p>\r\n<p data-start=\"817\" data-end=\"1391\">Approaching statistics from a social justice perspective means asking questions about power, representation, and equity. Whose experiences are being measured? Who is left out of the dataset? How might the way we collect, organize, and present data either reinforce stereotypes or challenge them? By connecting statistical methods to real-world issues\u2014such as racial profiling, housing inequality, and disparities in health care\u2014we see how numbers are never neutral. They are deeply tied to human lives, and how we analyze them can influence policy, practice, and progress.<\/p>\r\n\r\n\r\n<hr data-start=\"1393\" data-end=\"1396\" \/>\r\n<p class=\"Text-1st\">Before we can understand our analyses, we must first understand our data. The first step in doing this is using tables, charts, graphs, plots, and other visual tools to see what our data look like. This section examines graphical methods for displaying various results. We\u2019ll learn some general lessons about how to graph data that fall into a number of categories. A later section will consider how to graph numerical data from a frequency distribution.<\/p>\r\n\r\n<h2 class=\"H2\">Frequency Tables<\/h2>\r\n<p class=\"Text-1st\"><span style=\"background-color: #ffffff\">All of the graphical methods shown in this section are derived from frequency tables. <a style=\"background-color: #ffffff\" href=\"#_idTextAnchor038\"><span class=\"Fig-table-number-underscore\">Table 2.1<\/span><\/a> shows a frequency table for the results of a study on community members\u2019 of color experiences with racial profiling; it shows the frequencies of the various response categories. It also shows the relative frequencies, which are the proportion of responses in each category. For example, the relative frequency for \u201cnever experienced racial profiling\u201d of .17 = 85\/500.<\/span><\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<p data-start=\"640\" data-end=\"718\"><strong data-start=\"640\" data-end=\"718\">Table 2.1. Frequency table for reported experiences with racial profiling.<\/strong><\/p>\r\n\r\n<div id=\"_idContainer026\" class=\"_idGenObjectStyleOverride-1\">\r\n<table id=\"table011\" class=\"Foster-table\" style=\"height: 85px\"><colgroup> <col class=\"_idGenTableRowColumn-29\" \/> <col class=\"_idGenTableRowColumn-30\" \/> <col class=\"_idGenTableRowColumn-31\" \/><\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\" style=\"height: 17px\">\r\n<th style=\"height: 17px;width: 159.25px\">Racial Profiling Exper.<\/th>\r\n<th style=\"height: 17px;width: 83.4688px\">\r\n<p class=\"Table-col-hd\">Frequency<\/p>\r\n<\/th>\r\n<th style=\"height: 17px;width: 152.781px\">\r\n<p class=\"Table-col-hd\">Relative Frequency<\/p>\r\n<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<th class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 159.25px\">\r\n<p class=\"Table-body\">Never<\/p>\r\n<\/th>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 83.4688px\">\r\n<p class=\"Table-body\">85<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 152.781px\">\r\n<p class=\"Table-body\">.17<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<th class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 159.25px\">\r\n<p class=\"Table-body\">Occasionally<\/p>\r\n<\/th>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 83.4688px\">\r\n<p class=\"Table-body\">60<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 152.781px\">\r\n<p class=\"Table-body\">.12<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<th class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 159.25px\">\r\n<p class=\"Table-body\">Frequently<\/p>\r\n<\/th>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 83.4688px\">\r\n<p class=\"Table-body\">355<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 152.781px\">\r\n<p class=\"Table-body\">.71<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-8\" style=\"height: 17px\">\r\n<th class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 159.25px\">\r\n<p class=\"Table-body\">Total<\/p>\r\n<\/th>\r\n<td class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 83.4688px\">\r\n<p class=\"Table-body\">500<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 152.781px\">\r\n<p class=\"Table-body\">1.00<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<div>\r\n<h1>Understanding and Creating Frequency Distributions<\/h1>\r\n<\/div>\r\nFrequency distributions are fundamental tools in statistics for organizing and summarizing data. They help researchers transform raw numbers into meaningful patterns and make statistical interpretation easier. This is particularly important in social justice research, where we analyze patterns in data to uncover inequities in education, housing, criminal justice, and other areas. This section will walk through how to create and interpret different types of frequency distributions, using real-world examples that can support data-informed advocacy and awareness.\u00a0 The data is hypothetical but it when using real-word data, the process is the same.\r\n<h2>From Raw Data to Ranked Order<\/h2>\r\nRaw data is the unprocessed list of values as they were collected. For example, let\u2019s look at how many times 27 juvenile offenders were arrested.\r\n\r\nIn raw form, this might be listed as follows: 2, 1, 3, 2, 4, 1, 2, 3, 1, 2, 3, 1, 2, 4, 3, 1, 2, 3, 2, 1, 5, 5, 5, 2, 2, 6, 6.\r\nWhile this shows the data, it\u2019s not easy to analyze. A ranked frequency distribution organizes this data from highest to lowest to help visualize extremes.\r\n<h2>Simple Frequency Distribution<\/h2>\r\nTo make the data easier to interpret, we count how often each number of arrests appears. This is a simple frequency distribution. Start by identifying all unique values (e.g., 1 arrest through 6 arrests).\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Simple Frequency Table: Juvenile Arrests n=27<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Number of Arrests<\/td>\r\n<td>Frequency<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>1<\/td>\r\n<td>6<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>2<\/td>\r\n<td>9<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>3<\/td>\r\n<td>5<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>4<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>5<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>6<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n&nbsp;<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nIn the table above, we see that 9 juveniles were arrested twice, while only 2 were arrested six times. This table gives us a quick sense of how arrest frequencies are distributed among the sample.\r\n<h2>Grouped Frequency Distribution<\/h2>\r\nSometimes we want to simplify further by grouping values into intervals. This is especially useful when we have a wide range of data. In our case, we can group arrests into three intervals: 1\u20132, 3\u20134, and 5\u20136. We then total the number of offenders whose arrest count falls into each group.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Grouped Frequency Table: Juvenile Arrests<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Arrest Interval<\/td>\r\n<td>Frequency<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>1\u20132<\/td>\r\n<td>15<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>3\u20134<\/td>\r\n<td>7<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>5\u20136<\/td>\r\n<td>5<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n&nbsp;<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nThis grouped table summarizes the same data in broader categories. Now, we can say that most juveniles (15) had between 1 and 2 arrests, while only 5 had 5 or more. Grouping helps when data has variability or when we want a quick snapshot of broader patterns.\r\n<h3>Why This Matters in Social Justice Research<\/h3>\r\nUnderstanding how to organize data into frequency distributions is essential for social justice statistics. For instance, frequency tables can be used to show how often different racial or socioeconomic groups experience arrests, access education, or face housing instability. Creating these tables allows advocates and researchers to identify patterns of inequality and communicate them clearly to policymakers or the public.\r\n\r\n&nbsp;\r\n<div>\r\n<h2>Creating a Grouped Frequency Distribution<\/h2>\r\n<\/div>\r\nGrouped frequency distributions help summarize large datasets by organizing scores into intervals, making it easier to identify patterns and trends. In this section, we\u2019ll walk through how to construct a grouped frequency table and explain the components such as apparent limits, real limits, midpoints, relative frequency, cumulative frequency, and cumulative relative frequency. We\u2019ll also show how to convert relative frequencies into percentages for easier interpretation.\r\n<h3>Step 1: Decide on Intervals<\/h3>\r\nStart by determining the range of your dataset, which is the highest value minus the lowest value. Then choose how many intervals you want. Divide the range by the number of intervals to get the width of each class interval. For example, if your data ranges from 0 to 49 and you want 10 intervals, each interval will cover 5 units (e.g., 0\u20134, 5\u20139, \u2026, 45\u201349).\u00a0 Generally, we want to make intervals easy to understand by making them siz 5 or 10, depending on the range of the data.\u00a0\u00a0 So, intervals that go from 1-5, 6-10, 11-15 etc. make it easy to organize and understand your data.\r\n<h3>Step 2: Apparent Limits<\/h3>\r\nApparent limits are the values that define the range of each interval as it appears in a table. For example, the interval 10\u201314 means values from 10 to 14 are included in that group.\r\n<h3>Step 3: Real Limits<\/h3>\r\nReal limits are the boundaries that account for the continuity of data. For interval 10\u201314, the real limits are 9.5\u201314.5, meaning it includes any value from 9.5 up to but not including 14.5.\u00a0 Real limits are defined as .5 below the lowest apparent limit and .5 above the highest apparent limit in each category.\r\n<h3>Step 4: Midpoints<\/h3>\r\nThe midpoint of each interval is the average of the lower and upper apparent limits. For example, the midpoint of 10\u201314 is (10 + 14) \/ 2 = 12.\r\n<h3>Step 5: Frequency and Relative Frequency<\/h3>\r\nFrequency (f) is the count of values that fall within each interval. Relative frequency is calculated by dividing each frequency by the total number of data points (n). This gives a proportion of the total for each interval.\r\n<h3>Step 6: Cumulative Frequency and Cumulative Relative Frequency<\/h3>\r\nCumulative frequency (CF) is the total number of values that fall below the upper real limit of each interval. Cumulative relative frequency (CRF) is the cumulative frequency divided by the total number of scores. This tells us the proportion of data below a given point.\r\n<h3>Step 7: Converting Relative Frequencies to Percents<\/h3>\r\nTo convert a relative frequency to a percentage, multiply the value by 100. For example, a relative frequency of 0.125 becomes 12.5%.\u00a0 For example, in the table below, the relative frequency for those who missed 0-4 school abscences is .083.\u00a0 To change that into a percent, it becomes 8.3%.\u00a0\u00a0 Relative frequency columns add up to 1.0 and when converted to percents, adds up to 100%.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Grouped Frequency Table Example: School Absences<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Apparent Limits<\/td>\r\n<td>Real Limits<\/td>\r\n<td>Midpoint<\/td>\r\n<td>F<\/td>\r\n<td>Rel f<\/td>\r\n<td>Cum f<\/td>\r\n<td>Cum Rel f<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>0\u20134<\/td>\r\n<td>\u22120.5\u20134.5<\/td>\r\n<td>2<\/td>\r\n<td>4<\/td>\r\n<td>.083<\/td>\r\n<td>4<\/td>\r\n<td>.083<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>5\u20139<\/td>\r\n<td>4.5\u20139.5<\/td>\r\n<td>7<\/td>\r\n<td>8<\/td>\r\n<td>.167<\/td>\r\n<td>12<\/td>\r\n<td>.250<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>10\u201314<\/td>\r\n<td>9.5\u201314.5<\/td>\r\n<td>12<\/td>\r\n<td>3<\/td>\r\n<td>.063<\/td>\r\n<td>15<\/td>\r\n<td>.313<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>15\u201319<\/td>\r\n<td>14.5\u201319.5<\/td>\r\n<td>17<\/td>\r\n<td>3<\/td>\r\n<td>.063<\/td>\r\n<td>18<\/td>\r\n<td>.376<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>20\u201324<\/td>\r\n<td>19.5\u201324.5<\/td>\r\n<td>22<\/td>\r\n<td>6<\/td>\r\n<td>.125<\/td>\r\n<td>24<\/td>\r\n<td>.501<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>25\u201329<\/td>\r\n<td>24.5\u201329.5<\/td>\r\n<td>27<\/td>\r\n<td>4<\/td>\r\n<td>.083<\/td>\r\n<td>28<\/td>\r\n<td>.584<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>30\u201334<\/td>\r\n<td>29.5\u201334.5<\/td>\r\n<td>32<\/td>\r\n<td>6<\/td>\r\n<td>.125<\/td>\r\n<td>34<\/td>\r\n<td>.709<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>35\u201339<\/td>\r\n<td>34.5\u201339.5<\/td>\r\n<td>37<\/td>\r\n<td>3<\/td>\r\n<td>.063<\/td>\r\n<td>37<\/td>\r\n<td>.772<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>40\u201344<\/td>\r\n<td>39.5\u201344.5<\/td>\r\n<td>42<\/td>\r\n<td>4<\/td>\r\n<td>.083<\/td>\r\n<td>41<\/td>\r\n<td>.855<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>45\u201349<\/td>\r\n<td>44.5\u201349.5<\/td>\r\n<td>47<\/td>\r\n<td>7<\/td>\r\n<td>.146<\/td>\r\n<td>48<\/td>\r\n<td>1.000<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<h4 class=\"H2\">Pie Charts<\/h4>\r\n<p class=\"Text-1st\">The pie chart in <a href=\"#_idTextAnchor039\"><span class=\"Fig-table-number-underscore\">Figure 2.1<\/span><\/a> shows the results of the amount of racial profiling experienced. In a pie chart, each category is represented by a slice of the pie. The area of the slice is proportional to the percentage of responses in the category. This is simply the relative frequency multiplied by 100<span style=\"background-color: #ffffff\">. <span class=\"Fig-table-number\" style=\"text-align: initial;font-size: 1.125rem;background-color: #ffffff\"><a id=\"_idTextAnchor039\" style=\"background-color: #ffffff\"><\/a>Figure 2.1.<\/span><span style=\"text-align: initial;font-size: 1.125rem;background-color: #ffffff\"> Pie chart of racial profiling experienced illustrating frequencies of previous racial profiling: 71% of participants reported frequently being racially profiled.<\/span><\/span><\/p>\r\n\r\n<div class=\"_idGenObjectStyleOverride-1\">\r\n\r\n<img class=\"alignnone wp-image-865\" src=\"http:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-300x300.png\" alt=\"Pie chart reflecting frequencies of racial profiling (never: 17%, Occasionally: 12%, and frequently: 71) \" width=\"397\" height=\"397\" \/>\r\n\r\n<\/div>\r\n<p class=\"Text\">Pie charts are effective for displaying the relative frequencies of a small number of categories. They are not recommended, however, when you have a large number of categories. Pie charts can also be confusing when they are used to compare the outcomes of two different surveys or experiments. In an influential book on the use of graphs, Edward Tufte asserted, \u201cThe only worse design than a pie chart is several of them.\u201d\u00b9<span class=\"superscript CharOverride-24\">\r\n<\/span><\/p>\r\n\r\n<div class=\"textbox textbox--sidebar\"><span class=\"superscript CharOverride-24\">\u00b9<\/span> <span class=\"CharOverride-18\">Tufte, E. R. (1983). <\/span><span class=\"italic CharOverride-18\">The visual display of quantitative information<\/span><span class=\"CharOverride-18\"> (p. 178). Graphics Press.<\/span><\/div>\r\n<p class=\"Text\">Here is another important point about pie charts. If they are based on a small number of observations, it can be misleading to label the pie slices with percentages. <span style=\"background-color: #ffffff\">For example, if just 5 people had been interviewed about the amount of racial profiling experienced being never, and 3 participants reported frequently, it would be misleading to display a pie chart slice showing .60. With so few people interviewed, such a large percentage of racially profiled users might easily have occurred since chance can cause large errors with small samples. In this case, it is better to alert the user of the pie chart to the actual numbers involved. The slices should therefore be labeled with the actual frequencies observed (e.g., 3) instead of with percentages.<\/span><\/p>\r\n\r\n<h4 class=\"H2\">Bar Charts<\/h4>\r\n<p class=\"Text-1st\">Bar charts can also be used to represent frequencies of different categories. A bar chart of the amount of racial profiling experienced shown in <a href=\"#_idTextAnchor040\"><span class=\"Fig-table-number-underscore\">Figure 2.2<\/span><\/a>. Participants experience (never, occasionally, frequently) is shown on the <span class=\"italic\">x<\/span>-axis and the frequencies (Number of Respondents) are shown on the <span class=\"italic\">y<\/span>-axis.\u00a0Typically, the <span class=\"italic\">y<\/span>-axis shows the number of observations in each category rather than the percentage of observations in each category as is typical in pie charts.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer029\" class=\"Basic-Text-Frame\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\">\u00a0<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer030\" class=\"_idGenObjectStyleOverride-1\"><img class=\"alignnone wp-image-864 \" src=\"http:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-300x225.png\" alt=\"Bar chart reflecting frequencies of racial profiling (never: 85, Occasionally: 60, and frequently: 355) \" width=\"507\" height=\"380\" \/><\/div>\r\n<\/div>\r\n<h4 class=\"H2\">Comparing Distributions<\/h4>\r\n<p class=\"Text-1st\">Often we need to compare the results of different surveys, or of different conditions within the same overall survey. In this case, we are comparing the \u201cdistributions\u201d of responses between the surveys or conditions. Bar charts are often excellent for illustrating differences between two distributions. <a href=\"#_idTextAnchor041\"><span class=\"Fig-table-number-underscore\">Figure 2.3<\/span><\/a> A community organization surveyed 500 individuals to examine disparities in access to mental health services based on household income. Respondents were asked whether they had <em data-start=\"452\" data-end=\"469\">adequate access<\/em> or <em data-start=\"473\" data-end=\"492\">inadequate access<\/em> to mental health services. The results were categorized by income level.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer031\" class=\"Basic-Text-Frame\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor041\"><\/a>Figure 2.3.<\/span> A bar chart of the number of people's access to health serviced based on Income level<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer032\" class=\"_idGenObjectStyleOverride-1\"><img class=\"wp-image-863 alignnone\" src=\"http:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-300x225.png\" alt=\"\" width=\"519\" height=\"389\" \/><\/div>\r\n<\/div>\r\n<h4 class=\"H2\">Some Graphical Mistakes to Avoid<\/h4>\r\n<p class=\"Text-1st\">Don\u2019t get fancy! People sometimes add features to graphs that don\u2019t help to convey their information. For example, three-dimensional bar charts such as the one shown in <a href=\"#_idTextAnchor042\"><span class=\"Fig-table-number-underscore\">Figure 2.4<\/span><\/a> are usually not as effective as their two-dimensional counterparts.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer033\" class=\"Basic-Text-Frame\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor042\"><\/a>Figure 2.4<\/span>. Charts like this are less effective. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/4\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Mac Bar Chart 3D<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licenced under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer034\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Mac_Bar_Chart_3D-3.png\" alt=\"A less-effective version of Figure 2.2, showing a three-deminstional bar chart. In this version, it is difficult to determine the value represented by each bar.\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Here is another way that fanciness can lead to trouble. Instead of plain bars, it is tempting to substitute meaningful images. For example, <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a> presents the iMac data using pictures of computers. The heights of the pictures accurately represent the number of buyers, yet <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a> is misleading because the viewer\u2019s attention will be captured by areas. The areas can exaggerate the size differences between the groups. In terms of percentages, the ratio of previous Macintosh owners to previous Windows owners is about 6 to 1. But the ratio of the two areas in <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a> is about 35 to 1. A biased person wishing to hide the fact that many Windows owners purchased iMacs would be tempted to use <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a>.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer035\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor043\"><\/a>Figure 2.5.<\/span> . <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/5\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Mac Bar Chart Lie Factor<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">. \u201c<\/span><a href=\"https:\/\/www.flickr.com\/photos\/albaco\/14852028844\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Apple iMac G3 (1998)<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by albaco\/Flickr is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/2.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 2.0<\/span><\/span><\/a><span class=\"Fig-source\">; image was brightened and background was removed.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer036\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Mac_Bar_Chart_Lie_Factor-3.png\" alt=\"A less-effective version of Figure 2.2, showing a bar chart in which the bars are replaced by images of iMacs scaled so that their heights reach the desired values. In this version, the image representing previous Macintosh owners is far larger than the other two populations, which may bias the viewer against those populations.\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Edward Tufte coined the term [pb_glossary id=\"630\"]<a id=\"_idTextAnchor044\"><\/a>[\/pb_glossary]<span class=\"key-term\">lie factor<\/span> to refer to the ratio of the size of the effect shown in a graph to the size of the effect shown in the data. He suggests that lie factors greater than 1.05 or less than 0.95 produce unacceptable distortion.<\/p>\r\n<p class=\"Text\">Another distortion in bar charts results from setting the baseline to a value other than zero. The baseline is the bottom of the <span class=\"italic\">y<\/span>-axis, representing the least number of cases that could have occurred in a category. Normally, but not always, this number should be zero. <a href=\"#_idTextAnchor045\"><span class=\"Fig-table-number-underscore\">Figure 2.6<\/span><\/a> shows the iMac data with a baseline of 50. Once again, the differences in areas suggests a different story than the true differences in percentages. The number of Windows-switchers seems minuscule compared to its true value of 12%.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer037\" class=\"Basic-Text-Frame\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor045\"><\/a>Figure 2.6.<\/span> A redrawing of <a href=\"#_idTextAnchor040\"><span class=\"Fig-table-number-underscore\">Figure 2.2<\/span><\/a> with a baseline of 50. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/6\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Mac Bar Chart Baseline 50<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer038\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Mac_Bar_Chart_Baseline_50-3.png\" alt=\"A less-effective version of Figure 2.2, showing a bar chart in which the y-axis begins at 50 instead of 0. In this version, the bar heights tell a story that is skewed against the smallest group, making the viewer think there were far fewer iMac buyers who previously owned a Windows computer than there actually were.\" \/><\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer040\" class=\"_idGenObjectStyleOverride-1\"><\/div>\r\n<\/div>\r\n<h4 class=\"H2\">Summary<\/h4>\r\n<p class=\"Text-1st\">Pie charts and bar charts can both be effective methods of portraying data. Bar charts are better when there are more than just a few categories and for comparing two or more distributions. Be careful to avoid creating misleading graphs.<\/p>\r\n\r\n<h3 class=\"H1\">Graphing Quantitative Variables<\/h3>\r\n<p class=\"Text-1st\">As discussed in the section on variables in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-1\/\"><span class=\"Hyperlink-underscore\">Chapter 1<\/span><\/a>, quantitative variables are variables measured on a numeric scale. Height, weight, response time, subjective rating of pain, temperature, and score on an exam are all examples of quantitative variables. Quantitative variables are distinguished from qualitative variables (sometimes called [pb_glossary id=\"627\"]<a id=\"_idTextAnchor047\"><\/a>[\/pb_glossary]<span class=\"key-term\">categorical variables<\/span> or nominal variables), such as favorite color, religion, city of birth, and favorite sport, in which there is no ordering or measuring involved.<\/p>\r\n<p class=\"Text\">There are many types of graphs that can be used to portray distributions of quantitative variables. The upcoming sections cover the following types of graphs: (1) stem-and-leaf displays, (2)\u00a0histograms, (3) frequency polygons, (4) box plots, (5) bar charts, (6) line graphs, (7) dot plots, and (8) scatter plots (discussed in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-12\/\"><span class=\"Hyperlink-underscore\">Chapter 12<\/span><\/a>). Some graph types, such as stem-and-leaf displays, are best-suited for small to moderate amounts of data, whereas others, such as histograms, are best-suited for large amounts of data. Graph types such as box plots are good at depicting differences between distributions. Scatter plots are used to show the relationship between two variables.<\/p>\r\n\r\n<h4 class=\"H2\">Stem-and-Leaf Displays<\/h4>\r\n<p class=\"Text-1st\">A [pb_glossary id=\"632\"]<a id=\"_idTextAnchor048\"><\/a>[\/pb_glossary]<span class=\"key-term\">stem-and-leaf display<\/span> is a graphical method of displaying data. It is particularly useful when your data are not too numerous. In this section, we will explain how to construct and interpret this kind of graph.<\/p>\r\n<p class=\"Text\">As usual, we will start with an example. Consider <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>, which shows the number of touchdown passes (TD passes) thrown by each of the 31 teams in the National Football League during the 2000 season.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer041\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor049\"><\/a>Figure 2.8.<\/span> Number of touchdown passes. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/8\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Touchdown Passes Raw Data<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer042\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Touchdown_Passes_Raw_Data-3.png\" alt=\"A list of raw values representing the number of touchdown passes by each of the 31 teams in the NFL during the 2000 season. The values, arranged in descending order, begin with 37, 33, 33, and 32, and end with 12, 12, 9, and 6.\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">A stem-and-leaf display of the data is shown in <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a>. The left portion of <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a> contains the stems. They are the numbers 3, 2, 1, and 0, arranged as a column to the left of the bars. Think of these numbers as 10s digits. A stem of 3, for example, can be used to represent the 10s digit in any of the numbers from 30 to 39. The numbers to the right of the bar are leaves, and they represent the 1s digits. Every leaf in the graph therefore stands for the result of adding the leaf to 10 times its stem.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer043\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor050\"><\/a>Figure 2.9.<\/span> Stem-and-leaf display of the number of touchdown passes. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/9\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Touchdown Passes Stem and Leaf<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer044\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Touchdown_Passes_Stem_and_Leaf-3.png\" alt=\"A stem and leaf display showing the number of touchdown passes by each of the 31 teams. The first row has a stem of 3 and leaves of 2, 3, 3, and 7; that row represents the numbers 32, 33, 33, and 37.\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">To make this clear, let us examine <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a> more closely. In the top row, the four leaves to the right of stem 3 are 2, 3, 3, and 7. Combined with the stem, these leaves represent the numbers 32, 33, 33, and 37, which are the numbers of TD passes for the first four teams in <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>. The next row has a stem of 2 and 12 leaves. Together, they represent 12 data points, namely, two occurrences of 20\u00a0TD passes, three occurrences of 21 TD passes, three occurrences of 22 TD passes, one occurrence of 23\u00a0TD passes, two occurrences of 28 TD passes, and one occurrence of 29 TD passes. We leave it to you to figure out what the third row represents. The fourth row has a stem of 0 and two leaves. It stands for the last two entries in <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>, namely 9 TD passes and 6 TD passes. (The latter two numbers may be thought of as 09 and 06.)<\/p>\r\n<p class=\"Text\">One purpose of a stem-and-leaf display is to clarify the shape of the distribution. You can see many facts about TD passes more easily in <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a> than in <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>. For example, by looking at the stems and the shape of the plot, you can tell that most of the teams had between 10 and 29 passing TDs, with a few having more and a few having less. The precise numbers of TD passes can be determined by examining the leaves.<\/p>\r\n\r\n<h4 class=\"H2\">Histograms<\/h4>\r\n<p class=\"Text-1st\">A [pb_glossary id=\"629\"]<a id=\"_idTextAnchor056\"><\/a>[\/pb_glossary]<span class=\"key-term\">histogram<\/span> is a graphical method for displaying the shape of a distribution. It is particularly useful when there are a large number of observations. We begin with an example consisting of the scores of 642 students on a psychology test. The test consists of 197 items, each graded as \u201ccorrect\u201d or \u201cincorrect.\u201d The students\u2019 scores ranged from 46 to 167.<\/p>\r\n<p class=\"Text\">The first step is to create a frequency table. Unfortunately, a simple frequency table would be too big, containing over 100 rows. To simplify the table, we group scores together as shown in <a href=\"#_idTextAnchor057\"><span class=\"Fig-table-number-underscore\">Table 2.2<\/span><\/a>.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer070\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor057\"><\/a>Table 2.2.<\/span> Grouped frequency distribution of psychology test scores.<\/p>\r\n\r\n<table id=\"table012\" class=\"Foster-table\" style=\"height: 238px\"><colgroup> <col class=\"_idGenTableRowColumn-32\" \/> <col class=\"_idGenTableRowColumn-32\" \/> <col class=\"_idGenTableRowColumn-1\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-7\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-col-hd\">Interval\u2019s Lower Limit<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-7\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-col-hd\">Interval\u2019s Upper Limit<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-col-hd\">Class Frequency<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-1\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">39.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-1\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">49.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">3<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">49.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">59.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">10<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">59.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">69.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">53<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">69.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">79.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">107<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">79.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">89.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">147<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">89.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">99.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">130<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">99.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">109.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">78<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">109.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">119.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">59<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">119.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">129.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">36<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">129.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">139.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">11<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">139.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">149.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">6<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">149.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">159.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-11\" style=\"height: 17px\">\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-7\" style=\"height: 17px;width: 147.562px\">\r\n<p class=\"Table-body\">159.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-7\" style=\"height: 17px;width: 145.766px\">\r\n<p class=\"Table-body\">169.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 107.531px\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<p class=\"Text\">To create this table, the range of scores was broken into intervals, called class intervals. The first interval is from 39.5 to 49.5, the second from 49.5 to 59.5, etc. Next, the number of scores falling into each interval was counted to obtain the class frequencies. There are 3 scores in the first interval, 10 in the second, etc.<\/p>\r\n<p class=\"Text\">Class intervals of width 10 provide enough detail about the distribution to be revealing without making the graph too \u201cchoppy.\u201d More information on choosing the widths of class intervals is presented later in this section. Placing the limits of the class intervals midway between two numbers (e.g., 49.5) ensures that every score will fall in an interval rather than on the boundary between intervals.<\/p>\r\n<p class=\"Text\">In a histogram, the class frequencies are represented by bars. The height of each bar corresponds to its class frequency. A histogram of these data is shown in <a href=\"#_idTextAnchor058\"><span class=\"Fig-table-number-underscore\">Figure 2.15<\/span><\/a>.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer071\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor058\"><\/a>Figure 2.15.<\/span> Histogram of scores on a psychology test. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/15\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Histogram<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer072\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Histogram-3.png\" alt=\"A histogram of scores on a psychology test, with most scores in the center of the distribution and a positive skew.\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">The histogram makes it plain that most of the scores are in the middle of the distribution, with fewer scores in the extremes. You can also see that the distribution is not symmetric: the scores extend farther to the right than they do to the left. The distribution is therefore said to be skewed. (We\u2019ll have more to say about shapes of distributions in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a>.)<\/p>\r\n<p class=\"Text\">In our example, the observations are whole numbers. Histograms can also be used when the scores are measured on a more continuous scale such as the length of time (in milliseconds) required to perform a task. In this case, there is no need to worry about fence sitters since they are improbable. (It would be quite a coincidence for a task to require exactly 7 seconds, measured to the nearest thousandth of a second.) We are therefore free to choose whole numbers as boundaries for our class intervals, for example, 4000, 5000, etc. The class frequency is then the number of observations that are greater than or equal to the lower bound, and strictly less than the upper bound. For example, one interval might hold times from 4000 to 4999 milliseconds. Using whole numbers as boundaries avoids a cluttered appearance, and is the practice of many computer programs that create histograms. Note also that some computer programs label the middle of each interval rather than the end points.<\/p>\r\n<p class=\"Text\">Histograms can be based on relative frequencies instead of actual frequencies. Histograms based on relative frequencies show the proportion of scores in each interval rather than the number of scores. In this case, the <span class=\"italic\">y<\/span>-axis runs from 0 to 1 (or somewhere in between if there are no extreme proportions). You can change a histogram based on frequencies to one based on relative frequencies by (a) dividing each class frequency by the total number of observations, and then (b) plotting the quotients on the <span class=\"italic\">y<\/span>-axis (labeled as proportion).<\/p>\r\n<p class=\"Text\">There is more to be said about the widths of the class intervals, sometimes called [pb_glossary id=\"625\"]<a id=\"_idTextAnchor059\"><\/a>[\/pb_glossary]<span class=\"key-term\">bin widths<\/span>. Your choice of bin width determines the number of class intervals. This decision, along with the choice of starting point for the first interval, affects the shape of the histogram. The best advice is to experiment with different choices of width, and to choose a histogram according to how well it communicates the shape of the distribution.<\/p>\r\n\r\n<h4 class=\"H2\">Frequency Polygons<\/h4>\r\n<p class=\"Text-1st\"><span class=\"key-term\">[pb_glossary id=\"628\"]<a id=\"_idTextAnchor060\"><\/a>[\/pb_glossary]Frequency polygons<\/span> are a graphical device for understanding the shapes of distributions. They serve the same purpose as histograms, but are especially helpful for comparing sets of data. Frequency polygons are also a good choice for displaying cumulative frequency distributions.<\/p>\r\n<p class=\"Text\">To create a frequency polygon, start just as for histograms, by choosing a class interval. Then draw an <span class=\"italic\">x<\/span>-axis representing the values of the scores in your data. Mark the middle of each class interval with a tick mark, and label it with the middle value represented by the class. Draw the <span class=\"italic\">y<\/span>-axis to indicate the frequency of each class. Place a point in the middle of each class interval at the height corresponding to its frequency. Finally, connect the points. You should include one class interval below the lowest value in your data and one above the highest value. The graph will then touch the <span class=\"italic\">x<\/span>-axis on both sides.<\/p>\r\n<p class=\"Text\">The frequency distribution of 642 psychology test scores, shown in <a href=\"#_idTextAnchor061\"><span class=\"Fig-table-number-underscore\">Table 2.3<\/span><\/a>, was used to create the frequency polygon shown in <a href=\"#_idTextAnchor062\"><span class=\"Fig-table-number-underscore\">Figure 2.16<\/span><\/a>.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer073\" class=\"_idGenObjectStyleOverride-1\">\r\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor061\"><\/a>Table 2.3.<\/span> Frequency distribution of psychology test scores.<\/p>\r\n\r\n<table id=\"table013\" class=\"Foster-table\"><colgroup> <col class=\"_idGenTableRowColumn-33\" \/> <col class=\"_idGenTableRowColumn-33\" \/> <col class=\"_idGenTableRowColumn-34\" \/> <col class=\"_idGenTableRowColumn-35\" \/> <\/colgroup>\r\n<thead>\r\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\r\n<td class=\"Foster-table Table-col-hd CellOverride-8\">\r\n<p class=\"Table-col-hd\">Lower Limit<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-8\">\r\n<p class=\"Table-col-hd\">Upper Limit<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd CellOverride-8\">\r\n<p class=\"Table-col-hd\">Count<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-col-hd\">\r\n<p class=\"Table-col-hd\">Cumulative Count<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">29.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">39.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-1\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">39.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">49.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">3<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">3<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">49.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">59.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">10<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">13<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">59.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">69.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">53<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">66<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">69.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">79.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">107<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">173<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">79.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">89.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">147<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">320<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">89.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">99.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">130<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">450<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">99.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">109.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">78<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">528<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">109.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">119.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">59<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">587<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">119.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">129.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">36<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">623<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">129.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">139.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">11<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">634<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">139.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">149.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">6<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">640<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">149.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">159.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">641<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">159.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">169.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\r\n<p class=\"Table-body\">1<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\r\n<p class=\"Table-body\">642<\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr class=\"Foster-table _idGenTableRowColumn-11\">\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-8\">\r\n<p class=\"Table-body\">169.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-8\">\r\n<p class=\"Table-body\">170.5<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body CellOverride-8\">\r\n<p class=\"Table-body\">0<\/p>\r\n<\/td>\r\n<td class=\"Foster-table Table-body-last Table-body\">\r\n<p class=\"Table-body\">642<\/p>\r\n<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<p class=\"Text\">The first label on the <span class=\"italic\">x<\/span>-axis is 35. This represents an interval extending from 29.5 to 39.5. Since the lowest test score is 46, this interval has a frequency of 0. The point labeled 45 represents the interval from 39.5 to 49.5. There are three scores in this interval. There are 147 scores in the interval that surrounds 85.<\/p>\r\n<p class=\"Text\">You can easily discern the shape of the distribution from <a href=\"#_idTextAnchor062\"><span class=\"Fig-table-number-underscore\">Figure 2.16<\/span><\/a>. Most of the scores are between 65 and 115. It is clear that the distribution is not symmetric inasmuch as good scores (to the right) trail off more gradually than poor scores (to the left). In the terminology of <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a> (where we will study shapes of distributions more systematically), the distribution is skewed.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer074\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor062\"><\/a>Figure 2.16.<\/span> Frequency polygon for the psychology test scores. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/17\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Frequency Polygon<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer075\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Frequency_Polygon-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">A cumulative frequency polygon for the same test scores is shown in <a href=\"#_idTextAnchor063\"><span class=\"Fig-table-number-underscore\">Figure 2.17<\/span><\/a>. The graph is the same as before except that the <span class=\"italic\">y<\/span> value for each point is the number of students in the corresponding class interval plus all numbers in lower intervals. For example, there are no scores in the interval labeled \u201c35,\u201d three in the interval \u201c45,\u201d and 10 in the interval \u201c55.\u201d Therefore, the <span class=\"italic\">y<\/span> value corresponding to \u201c55\u201d is 13. Since 642 students took the test, the cumulative frequency for the last interval is 642.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer076\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor063\"><\/a>Figure 2.17.<\/span> Cumulative frequency polygon for the psychology test scores. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/16\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Cumulative Frequency Polygon<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer077\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Cumulative_Frequency_Polygon-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Frequency polygons are useful for comparing distributions. This is achieved by overlaying the frequency polygons drawn for different datasets. <a href=\"#_idTextAnchor064\"><span class=\"Fig-table-number-underscore\">Figure 2.18<\/span><\/a> provides an example. The data come from a task in which the goal is to move a computer cursor to a target on the screen as fast as possible. On 20 of the trials, the target was a small rectangle; on the other 20, the target was a large rectangle. Time to reach the target was recorded on each trial. The two distributions (one for each target) are plotted together in <a href=\"#_idTextAnchor064\"><span class=\"Fig-table-number-underscore\">Figure 2.18<\/span><\/a>. The figure shows that, although there is some overlap in times, it generally took longer to move the cursor to the small target than to the large one.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer078\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor064\"><\/a>Figure 2.18.<\/span> Overlaid frequency polygons for the cursor task. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/18\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Cursor Task Frequency Polygons<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer079\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Cursor_Task_Frequency_Polygons-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">It is also possible to plot two cumulative frequency distributions in the same graph. This is illustrated in <a href=\"#_idTextAnchor065\"><span class=\"Fig-table-number-underscore\">Figure 2.19<\/span><\/a> using the same data from the cursor task. The difference in distributions for the two targets is again evident.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer080\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor065\"><\/a>Figure 2.19.<\/span> Overlaid cumulative frequency polygons for the cursor task. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/19\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Cursor Task Cumulative Frequency Polygons<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer081\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Cursor_Task_Cumulative_Frequency_Polygons-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<h4 class=\"H2\"><a id=\"_idTextAnchor075\"><\/a>Bar Charts<\/h4>\r\n<p class=\"Text-1st\">In the <a href=\"#_idTextAnchor037\"><span class=\"Hyperlink-underscore\">section on qualitative variables<\/span><\/a>, we saw how bar charts could be used to illustrate the frequencies of different categories. For example, as we saw earlier in this chapter, the bar chart shown in <a href=\"#_idTextAnchor040\"><span class=\"Fig-table-number-underscore\">Figure 2.2<\/span><\/a> shows how many purchasers of iMac computers were previous Macintosh users, previous Windows users, and new computer purchasers.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer099\" class=\"_idGenObjectStyleOverride-1\"><\/div>\r\n<\/div>\r\n<p class=\"Text\">Bar charts are particularly effective for showing change over time. <a href=\"#_idTextAnchor077\"><span class=\"Fig-table-number-underscore\">Figure 2.27<\/span><\/a>, for example, shows the percent increase in the Consumer Price Index (CPI) over four three-month periods. The fluctuation in inflation is apparent in the graph.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer100\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor077\"><\/a>Figure 2.27.<\/span> Percent change in the CPI over time. Each bar represents percent increase for the three months ending at the date indicated. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/27\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Percent Change in CPI<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer101\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Percent_Change_in_CPI-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Bar charts are often used to compare the means of different experimental conditions. <a href=\"#_idTextAnchor078\"><span class=\"Fig-table-number-underscore\">Figure 2.28<\/span><\/a> shows the mean time it took one person to move the cursor to either a small target or a large target. On average, more time was required for small targets than for large ones.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer102\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor078\"><\/a>Figure 2.28.<\/span> Bar chart showing the means for the two conditions. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/28\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Means of Two Conditions<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer103\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Means_of_Two_Conditions-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer104\" class=\"Legend-w-space-after\"><\/div>\r\n<\/div>\r\n<h4 class=\"H2\">Line Graphs<\/h4>\r\n<p class=\"Text-1st\">A line graph is a bar graph with the tops of the bars represented by points joined by lines (the rest of the bar is suppressed). For example, <a href=\"#_idTextAnchor077\"><span class=\"Fig-table-number-underscore\">Figure 2.27<\/span><\/a>, which was presented in the section on bar charts, shows changes in the Consumer Price Index (CPI) over time. A line graph of these same data is shown in <a href=\"#_idTextAnchor080\"><span class=\"Fig-table-number-underscore\">Figure 2.30<\/span><\/a>. Although the figures are similar, the line graph emphasizes the change from period to period.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer106\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor080\"><\/a>Figure 2.30.<\/span> A line graph of the percent change in the CPI over time. Each point represents percent increase for the three months ending at the date indicated. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/30\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Percent Change in CPI Line Graph<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer107\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Percent_Change_in_CPI_Line_Graph-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Line graphs are appropriate only when both the <span class=\"italic\">x<\/span>- and <span class=\"italic\">y<\/span>-axes display ordered (rather than qualitative) variables. Although bar charts can also be used in this situation, line graphs are generally better at comparing changes over time. <a href=\"#_idTextAnchor081\"><span class=\"Fig-table-number-underscore\">Figure 2.31<\/span><\/a>, for example, shows percent increases and decreases in five components of the CPI. The figure makes it easy to see that medical costs had a steadier progression than the other components. Although you could create an analogous bar chart, its interpretation would not be as easy.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer108\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor081\"><\/a>Figure 2.31.<\/span> A line graph of the percent change in five components of the CPI over time. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/31\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Percent Change in CPI x5 Line Graph<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer109\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Percent_Change_in_CPI_x5_Line_Graph-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<h4 class=\"H2\">The Shape of Distribution (SKEWED DISTRIBUTIONS)<\/h4>\r\n<p class=\"Text-1st\">Finally, it is useful to present discussion on how we describe the shapes of distributions, which we will revisit in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a> to learn how different shapes affect our numerical descriptors of data and distributions.<\/p>\r\n<p class=\"Text\">The primary characteristic we are concerned about when assessing the shape of a distribution is whether the distribution is symmetrical or skewed. A symmetrical distribution, as the name suggests, can be cut down the center to form two mirror images. Although in practice we will never get a perfectly symmetrical distribution, we would like our data to be as close to symmetrical as possible for reasons we delve into in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a>. Many types of distributions are symmetrical, but by far the most common and pertinent distribution at this point is the normal distribution, shown in <a href=\"#_idTextAnchor083\"><span class=\"Fig-table-number-underscore\">Figure 2.32<\/span><\/a>. Notice that although the symmetry is not perfect (for instance, the bar just to the right of the center is taller than the one just to the left), the two sides are roughly the same shape. The normal distribution has a single peak, known as the center, and two tails that extend out equally, forming what is known as a bell shape or, as we will soon note, a normal curve.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer110\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor083\"><\/a>Figure 2.32.<\/span> A symmetrical distribution. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/32\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Symmetrical Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer111\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Symmetrical_Distribution-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Symmetrical distributions can also have multiple peaks. <a href=\"#_idTextAnchor085\"><span class=\"Fig-table-number-underscore\">Figure 2.33<\/span><\/a> shows a [pb_glossary id=\"624\"]<a id=\"_idTextAnchor084\"><\/a>[\/pb_glossary]<span class=\"key-term\">bimodal distribution<\/span>, named for the two peaks that lie roughly symmetrically on either side of the center point. As we will see in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a>, this is not a particularly desirable characteristic of our data, and, worse, this is a relatively difficult characteristic to detect numerically. Thus, it is important to visualize your data before moving ahead with any formal analyses.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer112\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor085\"><\/a>Figure 2.33.<\/span> A bimodal distribution. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/33\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Bimodal Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer113\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Bimodal_Distribution-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<p class=\"Text\">Distributions that are not symmetrical also come in many forms, more than can be described here. The most common asymmetry to be encountered is referred to as [pb_glossary id=\"631\"]<a id=\"_idTextAnchor086\"><\/a>[\/pb_glossary]<span class=\"key-term\">skew<\/span>, in which one of the two tails of the distribution is disproportionately longer than the other. This property can affect the value of the averages we use in our analyses and make them an inaccurate representation of our data, which causes many problems.<\/p>\r\n<p class=\"Text\">Skew can either be positive or negative (also known as right or left, respectively), based on which tail is longer. It is very easy to get the two confused at first; many students want to describe the skew by where the bulk of the data (larger portion of the histogram, known as the body) is placed, but the correct determination is based on which tail is longer. You can think of the tail as an arrow; whichever direction the arrow is pointing is the direction of the skew. <a href=\"#_idTextAnchor087\"><span class=\"Fig-table-number-underscore\">Figure 2.34<\/span><\/a> shows positive (right) and negative (left) skew, respectively.<\/p>\r\n\r\n<div class=\"_idGenObjectLayout-2\">\r\n<div id=\"_idContainer114\" class=\"Legend-w-space-after\">\r\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor087\"><\/a>Figure 2.34.<\/span> Positively skewed (A) and negatively skewed (B) distributions. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/34\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Skewed Distributions<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"_idGenObjectLayout-1\">\r\n<div id=\"_idContainer115\" class=\"_idGenObjectStyleOverride-1\"><img class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Skewed_Distributions-3.png\" alt=\"\" \/><\/div>\r\n<\/div>\r\n<h3 class=\"H1\"><\/h3>\r\n\"<a href=\"https:\/\/xkcd.com\/833\">Convincing<\/a>\" by Randall Munroe\/xkcd.com is licensed under <a href=\"https:\/\/creativecommons.org\/licenses\/by-nc\/2.5\/\">CC BY-NC 2.5<\/a>.)\r\n\r\n<a href=\"https:\/\/xkcd.com\/833\/\"><img src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/convincing-3.png\" alt=\"\" \/><\/a>","rendered":"<div class=\"textbox textbox--sidebar textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Key Terms<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor083\"><span class=\"Hyperlink-underscore\">bell curve<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor084\"><span class=\"Hyperlink-underscore\">bimodal distribution<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor066\"><span class=\"Hyperlink-underscore\">box plots<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor047\"><span class=\"Hyperlink-underscore\">categorical variables<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor060\"><span class=\"Hyperlink-underscore\">frequency polygons<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor056\"><span class=\"Hyperlink-underscore\">histogram<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor086\"><span class=\"Hyperlink-underscore\">skew<\/span><\/a><\/p>\n<p class=\"Key-terms\"><a href=\"#_idTextAnchor048\"><span class=\"Hyperlink-underscore\">stem-and-leaf display<\/span><\/a><\/p>\n<\/div>\n<\/div>\n<p data-start=\"231\" data-end=\"815\">Statistics is more than just numbers\u2014it is a way of telling stories about people, communities, and systems. When used thoughtfully, statistics can uncover patterns of inequality, highlight voices that are often silenced, and guide us toward solutions that promote fairness. For example, frequency tables and graphs do more than summarize data; they can reveal who has access to education, who is disproportionately impacted by the criminal justice system, or how resources are distributed across neighborhoods. In this way, statistics becomes a tool for advocacy, not just analysis.<\/p>\n<p data-start=\"817\" data-end=\"1391\">Approaching statistics from a social justice perspective means asking questions about power, representation, and equity. Whose experiences are being measured? Who is left out of the dataset? How might the way we collect, organize, and present data either reinforce stereotypes or challenge them? By connecting statistical methods to real-world issues\u2014such as racial profiling, housing inequality, and disparities in health care\u2014we see how numbers are never neutral. They are deeply tied to human lives, and how we analyze them can influence policy, practice, and progress.<\/p>\n<hr data-start=\"1393\" data-end=\"1396\" \/>\n<p class=\"Text-1st\">Before we can understand our analyses, we must first understand our data. The first step in doing this is using tables, charts, graphs, plots, and other visual tools to see what our data look like. This section examines graphical methods for displaying various results. We\u2019ll learn some general lessons about how to graph data that fall into a number of categories. A later section will consider how to graph numerical data from a frequency distribution.<\/p>\n<h2 class=\"H2\">Frequency Tables<\/h2>\n<p class=\"Text-1st\"><span style=\"background-color: #ffffff\">All of the graphical methods shown in this section are derived from frequency tables. <a style=\"background-color: #ffffff\" href=\"#_idTextAnchor038\"><span class=\"Fig-table-number-underscore\">Table 2.1<\/span><\/a> shows a frequency table for the results of a study on community members\u2019 of color experiences with racial profiling; it shows the frequencies of the various response categories. It also shows the relative frequencies, which are the proportion of responses in each category. For example, the relative frequency for \u201cnever experienced racial profiling\u201d of .17 = 85\/500.<\/span><\/p>\n<div class=\"_idGenObjectLayout-1\">\n<p data-start=\"640\" data-end=\"718\"><strong data-start=\"640\" data-end=\"718\">Table 2.1. Frequency table for reported experiences with racial profiling.<\/strong><\/p>\n<div id=\"_idContainer026\" class=\"_idGenObjectStyleOverride-1\">\n<table id=\"table011\" class=\"Foster-table\" style=\"height: 85px\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-29\" \/>\n<col class=\"_idGenTableRowColumn-30\" \/>\n<col class=\"_idGenTableRowColumn-31\" \/><\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\" style=\"height: 17px\">\n<th style=\"height: 17px;width: 159.25px\">Racial Profiling Exper.<\/th>\n<th style=\"height: 17px;width: 83.4688px\">\n<p class=\"Table-col-hd\">Frequency<\/p>\n<\/th>\n<th style=\"height: 17px;width: 152.781px\">\n<p class=\"Table-col-hd\">Relative Frequency<\/p>\n<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<th class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 159.25px\">\n<p class=\"Table-body\">Never<\/p>\n<\/th>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 83.4688px\">\n<p class=\"Table-body\">85<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 152.781px\">\n<p class=\"Table-body\">.17<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<th class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 159.25px\">\n<p class=\"Table-body\">Occasionally<\/p>\n<\/th>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 83.4688px\">\n<p class=\"Table-body\">60<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 152.781px\">\n<p class=\"Table-body\">.12<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<th class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 159.25px\">\n<p class=\"Table-body\">Frequently<\/p>\n<\/th>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 83.4688px\">\n<p class=\"Table-body\">355<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 152.781px\">\n<p class=\"Table-body\">.71<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-8\" style=\"height: 17px\">\n<th class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 159.25px\">\n<p class=\"Table-body\">Total<\/p>\n<\/th>\n<td class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 83.4688px\">\n<p class=\"Table-body\">500<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 152.781px\">\n<p class=\"Table-body\">1.00<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<div>\n<h1>Understanding and Creating Frequency Distributions<\/h1>\n<\/div>\n<p>Frequency distributions are fundamental tools in statistics for organizing and summarizing data. They help researchers transform raw numbers into meaningful patterns and make statistical interpretation easier. This is particularly important in social justice research, where we analyze patterns in data to uncover inequities in education, housing, criminal justice, and other areas. This section will walk through how to create and interpret different types of frequency distributions, using real-world examples that can support data-informed advocacy and awareness.\u00a0 The data is hypothetical but it when using real-word data, the process is the same.<\/p>\n<h2>From Raw Data to Ranked Order<\/h2>\n<p>Raw data is the unprocessed list of values as they were collected. For example, let\u2019s look at how many times 27 juvenile offenders were arrested.<\/p>\n<p>In raw form, this might be listed as follows: 2, 1, 3, 2, 4, 1, 2, 3, 1, 2, 3, 1, 2, 4, 3, 1, 2, 3, 2, 1, 5, 5, 5, 2, 2, 6, 6.<br \/>\nWhile this shows the data, it\u2019s not easy to analyze. A ranked frequency distribution organizes this data from highest to lowest to help visualize extremes.<\/p>\n<h2>Simple Frequency Distribution<\/h2>\n<p>To make the data easier to interpret, we count how often each number of arrests appears. This is a simple frequency distribution. Start by identifying all unique values (e.g., 1 arrest through 6 arrests).<\/p>\n<table>\n<tbody>\n<tr>\n<td>Simple Frequency Table: Juvenile Arrests n=27<\/td>\n<\/tr>\n<tr>\n<td>\n<table>\n<tbody>\n<tr>\n<td>Number of Arrests<\/td>\n<td>Frequency<\/td>\n<\/tr>\n<tr>\n<td>1<\/td>\n<td>6<\/td>\n<\/tr>\n<tr>\n<td>2<\/td>\n<td>9<\/td>\n<\/tr>\n<tr>\n<td>3<\/td>\n<td>5<\/td>\n<\/tr>\n<tr>\n<td>4<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>5<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>6<\/td>\n<td>2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>In the table above, we see that 9 juveniles were arrested twice, while only 2 were arrested six times. This table gives us a quick sense of how arrest frequencies are distributed among the sample.<\/p>\n<h2>Grouped Frequency Distribution<\/h2>\n<p>Sometimes we want to simplify further by grouping values into intervals. This is especially useful when we have a wide range of data. In our case, we can group arrests into three intervals: 1\u20132, 3\u20134, and 5\u20136. We then total the number of offenders whose arrest count falls into each group.<\/p>\n<table>\n<tbody>\n<tr>\n<td>Grouped Frequency Table: Juvenile Arrests<\/td>\n<\/tr>\n<tr>\n<td>\n<table>\n<tbody>\n<tr>\n<td>Arrest Interval<\/td>\n<td>Frequency<\/td>\n<\/tr>\n<tr>\n<td>1\u20132<\/td>\n<td>15<\/td>\n<\/tr>\n<tr>\n<td>3\u20134<\/td>\n<td>7<\/td>\n<\/tr>\n<tr>\n<td>5\u20136<\/td>\n<td>5<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>This grouped table summarizes the same data in broader categories. Now, we can say that most juveniles (15) had between 1 and 2 arrests, while only 5 had 5 or more. Grouping helps when data has variability or when we want a quick snapshot of broader patterns.<\/p>\n<h3>Why This Matters in Social Justice Research<\/h3>\n<p>Understanding how to organize data into frequency distributions is essential for social justice statistics. For instance, frequency tables can be used to show how often different racial or socioeconomic groups experience arrests, access education, or face housing instability. Creating these tables allows advocates and researchers to identify patterns of inequality and communicate them clearly to policymakers or the public.<\/p>\n<p>&nbsp;<\/p>\n<div>\n<h2>Creating a Grouped Frequency Distribution<\/h2>\n<\/div>\n<p>Grouped frequency distributions help summarize large datasets by organizing scores into intervals, making it easier to identify patterns and trends. In this section, we\u2019ll walk through how to construct a grouped frequency table and explain the components such as apparent limits, real limits, midpoints, relative frequency, cumulative frequency, and cumulative relative frequency. We\u2019ll also show how to convert relative frequencies into percentages for easier interpretation.<\/p>\n<h3>Step 1: Decide on Intervals<\/h3>\n<p>Start by determining the range of your dataset, which is the highest value minus the lowest value. Then choose how many intervals you want. Divide the range by the number of intervals to get the width of each class interval. For example, if your data ranges from 0 to 49 and you want 10 intervals, each interval will cover 5 units (e.g., 0\u20134, 5\u20139, \u2026, 45\u201349).\u00a0 Generally, we want to make intervals easy to understand by making them siz 5 or 10, depending on the range of the data.\u00a0\u00a0 So, intervals that go from 1-5, 6-10, 11-15 etc. make it easy to organize and understand your data.<\/p>\n<h3>Step 2: Apparent Limits<\/h3>\n<p>Apparent limits are the values that define the range of each interval as it appears in a table. For example, the interval 10\u201314 means values from 10 to 14 are included in that group.<\/p>\n<h3>Step 3: Real Limits<\/h3>\n<p>Real limits are the boundaries that account for the continuity of data. For interval 10\u201314, the real limits are 9.5\u201314.5, meaning it includes any value from 9.5 up to but not including 14.5.\u00a0 Real limits are defined as .5 below the lowest apparent limit and .5 above the highest apparent limit in each category.<\/p>\n<h3>Step 4: Midpoints<\/h3>\n<p>The midpoint of each interval is the average of the lower and upper apparent limits. For example, the midpoint of 10\u201314 is (10 + 14) \/ 2 = 12.<\/p>\n<h3>Step 5: Frequency and Relative Frequency<\/h3>\n<p>Frequency (f) is the count of values that fall within each interval. Relative frequency is calculated by dividing each frequency by the total number of data points (n). This gives a proportion of the total for each interval.<\/p>\n<h3>Step 6: Cumulative Frequency and Cumulative Relative Frequency<\/h3>\n<p>Cumulative frequency (CF) is the total number of values that fall below the upper real limit of each interval. Cumulative relative frequency (CRF) is the cumulative frequency divided by the total number of scores. This tells us the proportion of data below a given point.<\/p>\n<h3>Step 7: Converting Relative Frequencies to Percents<\/h3>\n<p>To convert a relative frequency to a percentage, multiply the value by 100. For example, a relative frequency of 0.125 becomes 12.5%.\u00a0 For example, in the table below, the relative frequency for those who missed 0-4 school abscences is .083.\u00a0 To change that into a percent, it becomes 8.3%.\u00a0\u00a0 Relative frequency columns add up to 1.0 and when converted to percents, adds up to 100%.<\/p>\n<table>\n<tbody>\n<tr>\n<td>Grouped Frequency Table Example: School Absences<\/td>\n<\/tr>\n<tr>\n<td>\n<table>\n<tbody>\n<tr>\n<td>Apparent Limits<\/td>\n<td>Real Limits<\/td>\n<td>Midpoint<\/td>\n<td>F<\/td>\n<td>Rel f<\/td>\n<td>Cum f<\/td>\n<td>Cum Rel f<\/td>\n<\/tr>\n<tr>\n<td>0\u20134<\/td>\n<td>\u22120.5\u20134.5<\/td>\n<td>2<\/td>\n<td>4<\/td>\n<td>.083<\/td>\n<td>4<\/td>\n<td>.083<\/td>\n<\/tr>\n<tr>\n<td>5\u20139<\/td>\n<td>4.5\u20139.5<\/td>\n<td>7<\/td>\n<td>8<\/td>\n<td>.167<\/td>\n<td>12<\/td>\n<td>.250<\/td>\n<\/tr>\n<tr>\n<td>10\u201314<\/td>\n<td>9.5\u201314.5<\/td>\n<td>12<\/td>\n<td>3<\/td>\n<td>.063<\/td>\n<td>15<\/td>\n<td>.313<\/td>\n<\/tr>\n<tr>\n<td>15\u201319<\/td>\n<td>14.5\u201319.5<\/td>\n<td>17<\/td>\n<td>3<\/td>\n<td>.063<\/td>\n<td>18<\/td>\n<td>.376<\/td>\n<\/tr>\n<tr>\n<td>20\u201324<\/td>\n<td>19.5\u201324.5<\/td>\n<td>22<\/td>\n<td>6<\/td>\n<td>.125<\/td>\n<td>24<\/td>\n<td>.501<\/td>\n<\/tr>\n<tr>\n<td>25\u201329<\/td>\n<td>24.5\u201329.5<\/td>\n<td>27<\/td>\n<td>4<\/td>\n<td>.083<\/td>\n<td>28<\/td>\n<td>.584<\/td>\n<\/tr>\n<tr>\n<td>30\u201334<\/td>\n<td>29.5\u201334.5<\/td>\n<td>32<\/td>\n<td>6<\/td>\n<td>.125<\/td>\n<td>34<\/td>\n<td>.709<\/td>\n<\/tr>\n<tr>\n<td>35\u201339<\/td>\n<td>34.5\u201339.5<\/td>\n<td>37<\/td>\n<td>3<\/td>\n<td>.063<\/td>\n<td>37<\/td>\n<td>.772<\/td>\n<\/tr>\n<tr>\n<td>40\u201344<\/td>\n<td>39.5\u201344.5<\/td>\n<td>42<\/td>\n<td>4<\/td>\n<td>.083<\/td>\n<td>41<\/td>\n<td>.855<\/td>\n<\/tr>\n<tr>\n<td>45\u201349<\/td>\n<td>44.5\u201349.5<\/td>\n<td>47<\/td>\n<td>7<\/td>\n<td>.146<\/td>\n<td>48<\/td>\n<td>1.000<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h4 class=\"H2\">Pie Charts<\/h4>\n<p class=\"Text-1st\">The pie chart in <a href=\"#_idTextAnchor039\"><span class=\"Fig-table-number-underscore\">Figure 2.1<\/span><\/a> shows the results of the amount of racial profiling experienced. In a pie chart, each category is represented by a slice of the pie. The area of the slice is proportional to the percentage of responses in the category. This is simply the relative frequency multiplied by 100<span style=\"background-color: #ffffff\">. <span class=\"Fig-table-number\" style=\"text-align: initial;font-size: 1.125rem;background-color: #ffffff\"><a id=\"_idTextAnchor039\" style=\"background-color: #ffffff\"><\/a>Figure 2.1.<\/span><span style=\"text-align: initial;font-size: 1.125rem;background-color: #ffffff\"> Pie chart of racial profiling experienced illustrating frequencies of previous racial profiling: 71% of participants reported frequently being racially profiled.<\/span><\/span><\/p>\n<div class=\"_idGenObjectStyleOverride-1\">\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-865\" src=\"http:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-300x300.png\" alt=\"Pie chart reflecting frequencies of racial profiling (never: 17%, Occasionally: 12%, and frequently: 71)\" width=\"397\" height=\"397\" srcset=\"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-300x300.png 300w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-1024x1024.png 1024w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-150x150.png 150w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-768x768.png 768w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-1536x1536.png 1536w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-65x65.png 65w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-225x225.png 225w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1-350x350.png 350w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_pie_chart_hd-1.png 1800w\" sizes=\"auto, (max-width: 397px) 100vw, 397px\" \/><\/p>\n<\/div>\n<p class=\"Text\">Pie charts are effective for displaying the relative frequencies of a small number of categories. They are not recommended, however, when you have a large number of categories. Pie charts can also be confusing when they are used to compare the outcomes of two different surveys or experiments. In an influential book on the use of graphs, Edward Tufte asserted, \u201cThe only worse design than a pie chart is several of them.\u201d\u00b9<span class=\"superscript CharOverride-24\"><br \/>\n<\/span><\/p>\n<div class=\"textbox textbox--sidebar\"><span class=\"superscript CharOverride-24\">\u00b9<\/span> <span class=\"CharOverride-18\">Tufte, E. R. (1983). <\/span><span class=\"italic CharOverride-18\">The visual display of quantitative information<\/span><span class=\"CharOverride-18\"> (p. 178). Graphics Press.<\/span><\/div>\n<p class=\"Text\">Here is another important point about pie charts. If they are based on a small number of observations, it can be misleading to label the pie slices with percentages. <span style=\"background-color: #ffffff\">For example, if just 5 people had been interviewed about the amount of racial profiling experienced being never, and 3 participants reported frequently, it would be misleading to display a pie chart slice showing .60. With so few people interviewed, such a large percentage of racially profiled users might easily have occurred since chance can cause large errors with small samples. In this case, it is better to alert the user of the pie chart to the actual numbers involved. The slices should therefore be labeled with the actual frequencies observed (e.g., 3) instead of with percentages.<\/span><\/p>\n<h4 class=\"H2\">Bar Charts<\/h4>\n<p class=\"Text-1st\">Bar charts can also be used to represent frequencies of different categories. A bar chart of the amount of racial profiling experienced shown in <a href=\"#_idTextAnchor040\"><span class=\"Fig-table-number-underscore\">Figure 2.2<\/span><\/a>. Participants experience (never, occasionally, frequently) is shown on the <span class=\"italic\">x<\/span>-axis and the frequencies (Number of Respondents) are shown on the <span class=\"italic\">y<\/span>-axis.\u00a0Typically, the <span class=\"italic\">y<\/span>-axis shows the number of observations in each category rather than the percentage of observations in each category as is typical in pie charts.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer029\" class=\"Basic-Text-Frame\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\">\u00a0<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer030\" class=\"_idGenObjectStyleOverride-1\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-864\" src=\"http:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-300x225.png\" alt=\"Bar chart reflecting frequencies of racial profiling (never: 85, Occasionally: 60, and frequently: 355)\" width=\"507\" height=\"380\" srcset=\"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-300x225.png 300w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-1024x768.png 1024w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-768x576.png 768w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-1536x1152.png 1536w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-2048x1536.png 2048w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-65x49.png 65w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-225x169.png 225w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/racial_profiling_bar_chart_hd-350x263.png 350w\" sizes=\"auto, (max-width: 507px) 100vw, 507px\" \/><\/div>\n<\/div>\n<h4 class=\"H2\">Comparing Distributions<\/h4>\n<p class=\"Text-1st\">Often we need to compare the results of different surveys, or of different conditions within the same overall survey. In this case, we are comparing the \u201cdistributions\u201d of responses between the surveys or conditions. Bar charts are often excellent for illustrating differences between two distributions. <a href=\"#_idTextAnchor041\"><span class=\"Fig-table-number-underscore\">Figure 2.3<\/span><\/a> A community organization surveyed 500 individuals to examine disparities in access to mental health services based on household income. Respondents were asked whether they had <em data-start=\"452\" data-end=\"469\">adequate access<\/em> or <em data-start=\"473\" data-end=\"492\">inadequate access<\/em> to mental health services. The results were categorized by income level.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer031\" class=\"Basic-Text-Frame\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor041\"><\/a>Figure 2.3.<\/span> A bar chart of the number of people&#8217;s access to health serviced based on Income level<\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer032\" class=\"_idGenObjectStyleOverride-1\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-863 alignnone\" src=\"http:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-300x225.png\" alt=\"\" width=\"519\" height=\"389\" srcset=\"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-300x225.png 300w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-1024x768.png 1024w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-768x576.png 768w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-1536x1152.png 1536w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-2048x1536.png 2048w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-65x49.png 65w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-225x169.png 225w, https:\/\/pressbooks.palomar.edu\/introtostats\/wp-content\/uploads\/sites\/8\/2021\/12\/mental_health_access_bar_chart_hd-350x263.png 350w\" sizes=\"auto, (max-width: 519px) 100vw, 519px\" \/><\/div>\n<\/div>\n<h4 class=\"H2\">Some Graphical Mistakes to Avoid<\/h4>\n<p class=\"Text-1st\">Don\u2019t get fancy! People sometimes add features to graphs that don\u2019t help to convey their information. For example, three-dimensional bar charts such as the one shown in <a href=\"#_idTextAnchor042\"><span class=\"Fig-table-number-underscore\">Figure 2.4<\/span><\/a> are usually not as effective as their two-dimensional counterparts.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer033\" class=\"Basic-Text-Frame\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor042\"><\/a>Figure 2.4<\/span>. Charts like this are less effective. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/4\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Mac Bar Chart 3D<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licenced under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer034\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Mac_Bar_Chart_3D-3.png\" alt=\"A less-effective version of Figure 2.2, showing a three-deminstional bar chart. In this version, it is difficult to determine the value represented by each bar.\" \/><\/div>\n<\/div>\n<p class=\"Text\">Here is another way that fanciness can lead to trouble. Instead of plain bars, it is tempting to substitute meaningful images. For example, <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a> presents the iMac data using pictures of computers. The heights of the pictures accurately represent the number of buyers, yet <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a> is misleading because the viewer\u2019s attention will be captured by areas. The areas can exaggerate the size differences between the groups. In terms of percentages, the ratio of previous Macintosh owners to previous Windows owners is about 6 to 1. But the ratio of the two areas in <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a> is about 35 to 1. A biased person wishing to hide the fact that many Windows owners purchased iMacs would be tempted to use <a href=\"#_idTextAnchor043\"><span class=\"Fig-table-number-underscore\">Figure 2.5<\/span><\/a>.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer035\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor043\"><\/a>Figure 2.5.<\/span> . <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/5\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Mac Bar Chart Lie Factor<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">. \u201c<\/span><a href=\"https:\/\/www.flickr.com\/photos\/albaco\/14852028844\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Apple iMac G3 (1998)<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by albaco\/Flickr is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/2.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 2.0<\/span><\/span><\/a><span class=\"Fig-source\">; image was brightened and background was removed.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer036\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Mac_Bar_Chart_Lie_Factor-3.png\" alt=\"A less-effective version of Figure 2.2, showing a bar chart in which the bars are replaced by images of iMacs scaled so that their heights reach the desired values. In this version, the image representing previous Macintosh owners is far larger than the other two populations, which may bias the viewer against those populations.\" \/><\/div>\n<\/div>\n<p class=\"Text\">Edward Tufte coined the term <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_630\"><a id=\"_idTextAnchor044\"><\/a><\/a><span class=\"key-term\">lie factor<\/span> to refer to the ratio of the size of the effect shown in a graph to the size of the effect shown in the data. He suggests that lie factors greater than 1.05 or less than 0.95 produce unacceptable distortion.<\/p>\n<p class=\"Text\">Another distortion in bar charts results from setting the baseline to a value other than zero. The baseline is the bottom of the <span class=\"italic\">y<\/span>-axis, representing the least number of cases that could have occurred in a category. Normally, but not always, this number should be zero. <a href=\"#_idTextAnchor045\"><span class=\"Fig-table-number-underscore\">Figure 2.6<\/span><\/a> shows the iMac data with a baseline of 50. Once again, the differences in areas suggests a different story than the true differences in percentages. The number of Windows-switchers seems minuscule compared to its true value of 12%.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer037\" class=\"Basic-Text-Frame\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor045\"><\/a>Figure 2.6.<\/span> A redrawing of <a href=\"#_idTextAnchor040\"><span class=\"Fig-table-number-underscore\">Figure 2.2<\/span><\/a> with a baseline of 50. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/6\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Mac Bar Chart Baseline 50<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer038\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Mac_Bar_Chart_Baseline_50-3.png\" alt=\"A less-effective version of Figure 2.2, showing a bar chart in which the y-axis begins at 50 instead of 0. In this version, the bar heights tell a story that is skewed against the smallest group, making the viewer think there were far fewer iMac buyers who previously owned a Windows computer than there actually were.\" \/><\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer040\" class=\"_idGenObjectStyleOverride-1\"><\/div>\n<\/div>\n<h4 class=\"H2\">Summary<\/h4>\n<p class=\"Text-1st\">Pie charts and bar charts can both be effective methods of portraying data. Bar charts are better when there are more than just a few categories and for comparing two or more distributions. Be careful to avoid creating misleading graphs.<\/p>\n<h3 class=\"H1\">Graphing Quantitative Variables<\/h3>\n<p class=\"Text-1st\">As discussed in the section on variables in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-1\/\"><span class=\"Hyperlink-underscore\">Chapter 1<\/span><\/a>, quantitative variables are variables measured on a numeric scale. Height, weight, response time, subjective rating of pain, temperature, and score on an exam are all examples of quantitative variables. Quantitative variables are distinguished from qualitative variables (sometimes called <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_627\"><a id=\"_idTextAnchor047\"><\/a><\/a><span class=\"key-term\">categorical variables<\/span> or nominal variables), such as favorite color, religion, city of birth, and favorite sport, in which there is no ordering or measuring involved.<\/p>\n<p class=\"Text\">There are many types of graphs that can be used to portray distributions of quantitative variables. The upcoming sections cover the following types of graphs: (1) stem-and-leaf displays, (2)\u00a0histograms, (3) frequency polygons, (4) box plots, (5) bar charts, (6) line graphs, (7) dot plots, and (8) scatter plots (discussed in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-12\/\"><span class=\"Hyperlink-underscore\">Chapter 12<\/span><\/a>). Some graph types, such as stem-and-leaf displays, are best-suited for small to moderate amounts of data, whereas others, such as histograms, are best-suited for large amounts of data. Graph types such as box plots are good at depicting differences between distributions. Scatter plots are used to show the relationship between two variables.<\/p>\n<h4 class=\"H2\">Stem-and-Leaf Displays<\/h4>\n<p class=\"Text-1st\">A <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_632\"><a id=\"_idTextAnchor048\"><\/a><\/a><span class=\"key-term\">stem-and-leaf display<\/span> is a graphical method of displaying data. It is particularly useful when your data are not too numerous. In this section, we will explain how to construct and interpret this kind of graph.<\/p>\n<p class=\"Text\">As usual, we will start with an example. Consider <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>, which shows the number of touchdown passes (TD passes) thrown by each of the 31 teams in the National Football League during the 2000 season.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer041\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor049\"><\/a>Figure 2.8.<\/span> Number of touchdown passes. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/8\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Touchdown Passes Raw Data<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer042\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Touchdown_Passes_Raw_Data-3.png\" alt=\"A list of raw values representing the number of touchdown passes by each of the 31 teams in the NFL during the 2000 season. The values, arranged in descending order, begin with 37, 33, 33, and 32, and end with 12, 12, 9, and 6.\" \/><\/div>\n<\/div>\n<p class=\"Text\">A stem-and-leaf display of the data is shown in <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a>. The left portion of <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a> contains the stems. They are the numbers 3, 2, 1, and 0, arranged as a column to the left of the bars. Think of these numbers as 10s digits. A stem of 3, for example, can be used to represent the 10s digit in any of the numbers from 30 to 39. The numbers to the right of the bar are leaves, and they represent the 1s digits. Every leaf in the graph therefore stands for the result of adding the leaf to 10 times its stem.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer043\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor050\"><\/a>Figure 2.9.<\/span> Stem-and-leaf display of the number of touchdown passes. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/9\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Touchdown Passes Stem and Leaf<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer044\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Touchdown_Passes_Stem_and_Leaf-3.png\" alt=\"A stem and leaf display showing the number of touchdown passes by each of the 31 teams. The first row has a stem of 3 and leaves of 2, 3, 3, and 7; that row represents the numbers 32, 33, 33, and 37.\" \/><\/div>\n<\/div>\n<p class=\"Text\">To make this clear, let us examine <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a> more closely. In the top row, the four leaves to the right of stem 3 are 2, 3, 3, and 7. Combined with the stem, these leaves represent the numbers 32, 33, 33, and 37, which are the numbers of TD passes for the first four teams in <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>. The next row has a stem of 2 and 12 leaves. Together, they represent 12 data points, namely, two occurrences of 20\u00a0TD passes, three occurrences of 21 TD passes, three occurrences of 22 TD passes, one occurrence of 23\u00a0TD passes, two occurrences of 28 TD passes, and one occurrence of 29 TD passes. We leave it to you to figure out what the third row represents. The fourth row has a stem of 0 and two leaves. It stands for the last two entries in <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>, namely 9 TD passes and 6 TD passes. (The latter two numbers may be thought of as 09 and 06.)<\/p>\n<p class=\"Text\">One purpose of a stem-and-leaf display is to clarify the shape of the distribution. You can see many facts about TD passes more easily in <a href=\"#_idTextAnchor050\"><span class=\"Fig-table-number-underscore\">Figure 2.9<\/span><\/a> than in <a href=\"#_idTextAnchor049\"><span class=\"Fig-table-number-underscore\">Figure 2.8<\/span><\/a>. For example, by looking at the stems and the shape of the plot, you can tell that most of the teams had between 10 and 29 passing TDs, with a few having more and a few having less. The precise numbers of TD passes can be determined by examining the leaves.<\/p>\n<h4 class=\"H2\">Histograms<\/h4>\n<p class=\"Text-1st\">A <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_629\"><a id=\"_idTextAnchor056\"><\/a><\/a><span class=\"key-term\">histogram<\/span> is a graphical method for displaying the shape of a distribution. It is particularly useful when there are a large number of observations. We begin with an example consisting of the scores of 642 students on a psychology test. The test consists of 197 items, each graded as \u201ccorrect\u201d or \u201cincorrect.\u201d The students\u2019 scores ranged from 46 to 167.<\/p>\n<p class=\"Text\">The first step is to create a frequency table. Unfortunately, a simple frequency table would be too big, containing over 100 rows. To simplify the table, we group scores together as shown in <a href=\"#_idTextAnchor057\"><span class=\"Fig-table-number-underscore\">Table 2.2<\/span><\/a>.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer070\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor057\"><\/a>Table 2.2.<\/span> Grouped frequency distribution of psychology test scores.<\/p>\n<table id=\"table012\" class=\"Foster-table\" style=\"height: 238px\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-32\" \/>\n<col class=\"_idGenTableRowColumn-32\" \/>\n<col class=\"_idGenTableRowColumn-1\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\" style=\"height: 17px\">\n<td class=\"Foster-table Table-col-hd CellOverride-7\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-col-hd\">Interval\u2019s Lower Limit<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-7\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-col-hd\">Interval\u2019s Upper Limit<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-col-hd\">Class Frequency<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-1\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">39.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-1\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">49.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">3<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">49.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">59.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">10<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">59.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">69.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">53<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">69.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">79.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">107<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">79.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">89.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">147<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">89.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">99.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">130<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">99.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">109.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">78<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">109.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">119.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">59<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">119.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">129.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">36<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">129.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">139.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">11<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">139.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">149.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">6<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">149.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-7 _idGenCellOverride-2\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">159.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-11\" style=\"height: 17px\">\n<td class=\"Foster-table Table-body-last Table-body CellOverride-7\" style=\"height: 17px;width: 147.562px\">\n<p class=\"Table-body\">159.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-7\" style=\"height: 17px;width: 145.766px\">\n<p class=\"Table-body\">169.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\" style=\"height: 17px;width: 107.531px\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p class=\"Text\">To create this table, the range of scores was broken into intervals, called class intervals. The first interval is from 39.5 to 49.5, the second from 49.5 to 59.5, etc. Next, the number of scores falling into each interval was counted to obtain the class frequencies. There are 3 scores in the first interval, 10 in the second, etc.<\/p>\n<p class=\"Text\">Class intervals of width 10 provide enough detail about the distribution to be revealing without making the graph too \u201cchoppy.\u201d More information on choosing the widths of class intervals is presented later in this section. Placing the limits of the class intervals midway between two numbers (e.g., 49.5) ensures that every score will fall in an interval rather than on the boundary between intervals.<\/p>\n<p class=\"Text\">In a histogram, the class frequencies are represented by bars. The height of each bar corresponds to its class frequency. A histogram of these data is shown in <a href=\"#_idTextAnchor058\"><span class=\"Fig-table-number-underscore\">Figure 2.15<\/span><\/a>.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer071\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor058\"><\/a>Figure 2.15.<\/span> Histogram of scores on a psychology test. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/15\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Histogram<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer072\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Histogram-3.png\" alt=\"A histogram of scores on a psychology test, with most scores in the center of the distribution and a positive skew.\" \/><\/div>\n<\/div>\n<p class=\"Text\">The histogram makes it plain that most of the scores are in the middle of the distribution, with fewer scores in the extremes. You can also see that the distribution is not symmetric: the scores extend farther to the right than they do to the left. The distribution is therefore said to be skewed. (We\u2019ll have more to say about shapes of distributions in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a>.)<\/p>\n<p class=\"Text\">In our example, the observations are whole numbers. Histograms can also be used when the scores are measured on a more continuous scale such as the length of time (in milliseconds) required to perform a task. In this case, there is no need to worry about fence sitters since they are improbable. (It would be quite a coincidence for a task to require exactly 7 seconds, measured to the nearest thousandth of a second.) We are therefore free to choose whole numbers as boundaries for our class intervals, for example, 4000, 5000, etc. The class frequency is then the number of observations that are greater than or equal to the lower bound, and strictly less than the upper bound. For example, one interval might hold times from 4000 to 4999 milliseconds. Using whole numbers as boundaries avoids a cluttered appearance, and is the practice of many computer programs that create histograms. Note also that some computer programs label the middle of each interval rather than the end points.<\/p>\n<p class=\"Text\">Histograms can be based on relative frequencies instead of actual frequencies. Histograms based on relative frequencies show the proportion of scores in each interval rather than the number of scores. In this case, the <span class=\"italic\">y<\/span>-axis runs from 0 to 1 (or somewhere in between if there are no extreme proportions). You can change a histogram based on frequencies to one based on relative frequencies by (a) dividing each class frequency by the total number of observations, and then (b) plotting the quotients on the <span class=\"italic\">y<\/span>-axis (labeled as proportion).<\/p>\n<p class=\"Text\">There is more to be said about the widths of the class intervals, sometimes called <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_625\"><a id=\"_idTextAnchor059\"><\/a><\/a><span class=\"key-term\">bin widths<\/span>. Your choice of bin width determines the number of class intervals. This decision, along with the choice of starting point for the first interval, affects the shape of the histogram. The best advice is to experiment with different choices of width, and to choose a histogram according to how well it communicates the shape of the distribution.<\/p>\n<h4 class=\"H2\">Frequency Polygons<\/h4>\n<p class=\"Text-1st\"><span class=\"key-term\"><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_628\"><a id=\"_idTextAnchor060\"><\/a><\/a>Frequency polygons<\/span> are a graphical device for understanding the shapes of distributions. They serve the same purpose as histograms, but are especially helpful for comparing sets of data. Frequency polygons are also a good choice for displaying cumulative frequency distributions.<\/p>\n<p class=\"Text\">To create a frequency polygon, start just as for histograms, by choosing a class interval. Then draw an <span class=\"italic\">x<\/span>-axis representing the values of the scores in your data. Mark the middle of each class interval with a tick mark, and label it with the middle value represented by the class. Draw the <span class=\"italic\">y<\/span>-axis to indicate the frequency of each class. Place a point in the middle of each class interval at the height corresponding to its frequency. Finally, connect the points. You should include one class interval below the lowest value in your data and one above the highest value. The graph will then touch the <span class=\"italic\">x<\/span>-axis on both sides.<\/p>\n<p class=\"Text\">The frequency distribution of 642 psychology test scores, shown in <a href=\"#_idTextAnchor061\"><span class=\"Fig-table-number-underscore\">Table 2.3<\/span><\/a>, was used to create the frequency polygon shown in <a href=\"#_idTextAnchor062\"><span class=\"Fig-table-number-underscore\">Figure 2.16<\/span><\/a>.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer073\" class=\"_idGenObjectStyleOverride-1\">\n<p class=\"Table-title\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor061\"><\/a>Table 2.3.<\/span> Frequency distribution of psychology test scores.<\/p>\n<table id=\"table013\" class=\"Foster-table\">\n<colgroup>\n<col class=\"_idGenTableRowColumn-33\" \/>\n<col class=\"_idGenTableRowColumn-33\" \/>\n<col class=\"_idGenTableRowColumn-34\" \/>\n<col class=\"_idGenTableRowColumn-35\" \/> <\/colgroup>\n<thead>\n<tr class=\"Foster-table _idGenTableRowColumn-5\">\n<td class=\"Foster-table Table-col-hd CellOverride-8\">\n<p class=\"Table-col-hd\">Lower Limit<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-8\">\n<p class=\"Table-col-hd\">Upper Limit<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd CellOverride-8\">\n<p class=\"Table-col-hd\">Count<\/p>\n<\/td>\n<td class=\"Foster-table Table-col-hd\">\n<p class=\"Table-col-hd\">Cumulative Count<\/p>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-1\">\n<p class=\"Table-body\">29.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-1\">\n<p class=\"Table-body\">39.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-1\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-1\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">39.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">49.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">3<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">3<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">49.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">59.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">10<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">13<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">59.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">69.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">53<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">66<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">69.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">79.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">107<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">173<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">79.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">89.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">147<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">320<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">89.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">99.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">130<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">450<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">99.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">109.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">78<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">528<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">109.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">119.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">59<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">587<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">119.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">129.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">36<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">623<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">129.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">139.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">11<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">634<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">139.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">149.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">6<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">640<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-6\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">149.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">159.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">641<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-7\">\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">159.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">169.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body CellOverride-8 _idGenCellOverride-2\">\n<p class=\"Table-body\">1<\/p>\n<\/td>\n<td class=\"Foster-table Table-body _idGenCellOverride-2\">\n<p class=\"Table-body\">642<\/p>\n<\/td>\n<\/tr>\n<tr class=\"Foster-table _idGenTableRowColumn-11\">\n<td class=\"Foster-table Table-body-last Table-body CellOverride-8\">\n<p class=\"Table-body\">169.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-8\">\n<p class=\"Table-body\">170.5<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body CellOverride-8\">\n<p class=\"Table-body\">0<\/p>\n<\/td>\n<td class=\"Foster-table Table-body-last Table-body\">\n<p class=\"Table-body\">642<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p class=\"Text\">The first label on the <span class=\"italic\">x<\/span>-axis is 35. This represents an interval extending from 29.5 to 39.5. Since the lowest test score is 46, this interval has a frequency of 0. The point labeled 45 represents the interval from 39.5 to 49.5. There are three scores in this interval. There are 147 scores in the interval that surrounds 85.<\/p>\n<p class=\"Text\">You can easily discern the shape of the distribution from <a href=\"#_idTextAnchor062\"><span class=\"Fig-table-number-underscore\">Figure 2.16<\/span><\/a>. Most of the scores are between 65 and 115. It is clear that the distribution is not symmetric inasmuch as good scores (to the right) trail off more gradually than poor scores (to the left). In the terminology of <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a> (where we will study shapes of distributions more systematically), the distribution is skewed.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer074\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor062\"><\/a>Figure 2.16.<\/span> Frequency polygon for the psychology test scores. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/17\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Frequency Polygon<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer075\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Frequency_Polygon-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">A cumulative frequency polygon for the same test scores is shown in <a href=\"#_idTextAnchor063\"><span class=\"Fig-table-number-underscore\">Figure 2.17<\/span><\/a>. The graph is the same as before except that the <span class=\"italic\">y<\/span> value for each point is the number of students in the corresponding class interval plus all numbers in lower intervals. For example, there are no scores in the interval labeled \u201c35,\u201d three in the interval \u201c45,\u201d and 10 in the interval \u201c55.\u201d Therefore, the <span class=\"italic\">y<\/span> value corresponding to \u201c55\u201d is 13. Since 642 students took the test, the cumulative frequency for the last interval is 642.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer076\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor063\"><\/a>Figure 2.17.<\/span> Cumulative frequency polygon for the psychology test scores. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/16\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Psychology Test Scores Cumulative Frequency Polygon<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer077\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Psychology_Test_Scores_Cumulative_Frequency_Polygon-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">Frequency polygons are useful for comparing distributions. This is achieved by overlaying the frequency polygons drawn for different datasets. <a href=\"#_idTextAnchor064\"><span class=\"Fig-table-number-underscore\">Figure 2.18<\/span><\/a> provides an example. The data come from a task in which the goal is to move a computer cursor to a target on the screen as fast as possible. On 20 of the trials, the target was a small rectangle; on the other 20, the target was a large rectangle. Time to reach the target was recorded on each trial. The two distributions (one for each target) are plotted together in <a href=\"#_idTextAnchor064\"><span class=\"Fig-table-number-underscore\">Figure 2.18<\/span><\/a>. The figure shows that, although there is some overlap in times, it generally took longer to move the cursor to the small target than to the large one.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer078\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor064\"><\/a>Figure 2.18.<\/span> Overlaid frequency polygons for the cursor task. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/18\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Cursor Task Frequency Polygons<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer079\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Cursor_Task_Frequency_Polygons-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">It is also possible to plot two cumulative frequency distributions in the same graph. This is illustrated in <a href=\"#_idTextAnchor065\"><span class=\"Fig-table-number-underscore\">Figure 2.19<\/span><\/a> using the same data from the cursor task. The difference in distributions for the two targets is again evident.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer080\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor065\"><\/a>Figure 2.19.<\/span> Overlaid cumulative frequency polygons for the cursor task. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/19\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Cursor Task Cumulative Frequency Polygons<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer081\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Cursor_Task_Cumulative_Frequency_Polygons-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<h4 class=\"H2\"><a id=\"_idTextAnchor075\"><\/a>Bar Charts<\/h4>\n<p class=\"Text-1st\">In the <a href=\"#_idTextAnchor037\"><span class=\"Hyperlink-underscore\">section on qualitative variables<\/span><\/a>, we saw how bar charts could be used to illustrate the frequencies of different categories. For example, as we saw earlier in this chapter, the bar chart shown in <a href=\"#_idTextAnchor040\"><span class=\"Fig-table-number-underscore\">Figure 2.2<\/span><\/a> shows how many purchasers of iMac computers were previous Macintosh users, previous Windows users, and new computer purchasers.<\/p>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer099\" class=\"_idGenObjectStyleOverride-1\"><\/div>\n<\/div>\n<p class=\"Text\">Bar charts are particularly effective for showing change over time. <a href=\"#_idTextAnchor077\"><span class=\"Fig-table-number-underscore\">Figure 2.27<\/span><\/a>, for example, shows the percent increase in the Consumer Price Index (CPI) over four three-month periods. The fluctuation in inflation is apparent in the graph.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer100\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor077\"><\/a>Figure 2.27.<\/span> Percent change in the CPI over time. Each bar represents percent increase for the three months ending at the date indicated. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/27\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Percent Change in CPI<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer101\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Percent_Change_in_CPI-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">Bar charts are often used to compare the means of different experimental conditions. <a href=\"#_idTextAnchor078\"><span class=\"Fig-table-number-underscore\">Figure 2.28<\/span><\/a> shows the mean time it took one person to move the cursor to either a small target or a large target. On average, more time was required for small targets than for large ones.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer102\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor078\"><\/a>Figure 2.28.<\/span> Bar chart showing the means for the two conditions. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/28\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Means of Two Conditions<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer103\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Means_of_Two_Conditions-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer104\" class=\"Legend-w-space-after\"><\/div>\n<\/div>\n<h4 class=\"H2\">Line Graphs<\/h4>\n<p class=\"Text-1st\">A line graph is a bar graph with the tops of the bars represented by points joined by lines (the rest of the bar is suppressed). For example, <a href=\"#_idTextAnchor077\"><span class=\"Fig-table-number-underscore\">Figure 2.27<\/span><\/a>, which was presented in the section on bar charts, shows changes in the Consumer Price Index (CPI) over time. A line graph of these same data is shown in <a href=\"#_idTextAnchor080\"><span class=\"Fig-table-number-underscore\">Figure 2.30<\/span><\/a>. Although the figures are similar, the line graph emphasizes the change from period to period.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer106\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor080\"><\/a>Figure 2.30.<\/span> A line graph of the percent change in the CPI over time. Each point represents percent increase for the three months ending at the date indicated. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/30\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Percent Change in CPI Line Graph<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer107\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Percent_Change_in_CPI_Line_Graph-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">Line graphs are appropriate only when both the <span class=\"italic\">x<\/span>&#8211; and <span class=\"italic\">y<\/span>-axes display ordered (rather than qualitative) variables. Although bar charts can also be used in this situation, line graphs are generally better at comparing changes over time. <a href=\"#_idTextAnchor081\"><span class=\"Fig-table-number-underscore\">Figure 2.31<\/span><\/a>, for example, shows percent increases and decreases in five components of the CPI. The figure makes it easy to see that medical costs had a steadier progression than the other components. Although you could create an analogous bar chart, its interpretation would not be as easy.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer108\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor081\"><\/a>Figure 2.31.<\/span> A line graph of the percent change in five components of the CPI over time. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/31\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Percent Change in CPI x5 Line Graph<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer109\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Percent_Change_in_CPI_x5_Line_Graph-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<h4 class=\"H2\">The Shape of Distribution (SKEWED DISTRIBUTIONS)<\/h4>\n<p class=\"Text-1st\">Finally, it is useful to present discussion on how we describe the shapes of distributions, which we will revisit in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a> to learn how different shapes affect our numerical descriptors of data and distributions.<\/p>\n<p class=\"Text\">The primary characteristic we are concerned about when assessing the shape of a distribution is whether the distribution is symmetrical or skewed. A symmetrical distribution, as the name suggests, can be cut down the center to form two mirror images. Although in practice we will never get a perfectly symmetrical distribution, we would like our data to be as close to symmetrical as possible for reasons we delve into in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a>. Many types of distributions are symmetrical, but by far the most common and pertinent distribution at this point is the normal distribution, shown in <a href=\"#_idTextAnchor083\"><span class=\"Fig-table-number-underscore\">Figure 2.32<\/span><\/a>. Notice that although the symmetry is not perfect (for instance, the bar just to the right of the center is taller than the one just to the left), the two sides are roughly the same shape. The normal distribution has a single peak, known as the center, and two tails that extend out equally, forming what is known as a bell shape or, as we will soon note, a normal curve.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer110\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor083\"><\/a>Figure 2.32.<\/span> A symmetrical distribution. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/32\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Symmetrical Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer111\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Symmetrical_Distribution-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">Symmetrical distributions can also have multiple peaks. <a href=\"#_idTextAnchor085\"><span class=\"Fig-table-number-underscore\">Figure 2.33<\/span><\/a> shows a <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_624\"><a id=\"_idTextAnchor084\"><\/a><\/a><span class=\"key-term\">bimodal distribution<\/span>, named for the two peaks that lie roughly symmetrically on either side of the center point. As we will see in <a href=\"https:\/\/pressbooks.palomar.edu\/introtostats\/chapter\/chapter-3\/\"><span class=\"Hyperlink-underscore\">Chapter 3<\/span><\/a>, this is not a particularly desirable characteristic of our data, and, worse, this is a relatively difficult characteristic to detect numerically. Thus, it is important to visualize your data before moving ahead with any formal analyses.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer112\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor085\"><\/a>Figure 2.33.<\/span> A bimodal distribution. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/33\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Bimodal Distribution<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer113\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Bimodal_Distribution-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<p class=\"Text\">Distributions that are not symmetrical also come in many forms, more than can be described here. The most common asymmetry to be encountered is referred to as <a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_92_631\"><a id=\"_idTextAnchor086\"><\/a><\/a><span class=\"key-term\">skew<\/span>, in which one of the two tails of the distribution is disproportionately longer than the other. This property can affect the value of the averages we use in our analyses and make them an inaccurate representation of our data, which causes many problems.<\/p>\n<p class=\"Text\">Skew can either be positive or negative (also known as right or left, respectively), based on which tail is longer. It is very easy to get the two confused at first; many students want to describe the skew by where the bulk of the data (larger portion of the histogram, known as the body) is placed, but the correct determination is based on which tail is longer. You can think of the tail as an arrow; whichever direction the arrow is pointing is the direction of the skew. <a href=\"#_idTextAnchor087\"><span class=\"Fig-table-number-underscore\">Figure 2.34<\/span><\/a> shows positive (right) and negative (left) skew, respectively.<\/p>\n<div class=\"_idGenObjectLayout-2\">\n<div id=\"_idContainer114\" class=\"Legend-w-space-after\">\n<p class=\"Fig-legend\"><span class=\"Fig-table-number\"><a id=\"_idTextAnchor087\"><\/a>Figure 2.34.<\/span> Positively skewed (A) and negatively skewed (B) distributions. <span class=\"Fig-source\">(\u201c<\/span><a href=\"https:\/\/irl.umsl.edu\/oer-img\/34\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">Skewed Distributions<\/span><\/span><\/a><span class=\"Fig-source\">\u201d by Judy Schmitt is licensed under <\/span><a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><span class=\"Fig-source\"><span class=\"Hyperlink-underscore\">CC BY-NC-SA 4.0<\/span><\/span><\/a><span class=\"Fig-source\">.)<\/span><\/p>\n<\/div>\n<\/div>\n<div class=\"_idGenObjectLayout-1\">\n<div id=\"_idContainer115\" class=\"_idGenObjectStyleOverride-1\"><img decoding=\"async\" class=\"_idGenObjectAttribute-19\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/Skewed_Distributions-3.png\" alt=\"\" \/><\/div>\n<\/div>\n<h3 class=\"H1\"><\/h3>\n<p>&#8220;<a href=\"https:\/\/xkcd.com\/833\">Convincing<\/a>&#8221; by Randall Munroe\/xkcd.com is licensed under <a href=\"https:\/\/creativecommons.org\/licenses\/by-nc\/2.5\/\">CC BY-NC 2.5<\/a>.)<\/p>\n<p><a href=\"https:\/\/xkcd.com\/833\/\"><img decoding=\"async\" src=\"https:\/\/pressbooks.palomar.edu\/wp-content\/uploads\/sites\/8\/2024\/10\/convincing-3.png\" alt=\"\" \/><\/a><\/p>\n<div class=\"glossary\"><span class=\"screen-reader-text\" id=\"definition\">definition<\/span><template id=\"term_92_630\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_630\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_92_627\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_627\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_92_632\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_632\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_92_629\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_629\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_92_625\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_625\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_92_628\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_628\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_92_624\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_624\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_92_631\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_92_631\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><\/div>","protected":false},"author":7,"menu_order":2,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-92","chapter","type-chapter","status-publish","hentry"],"part":21,"_links":{"self":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/92","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/users\/7"}],"version-history":[{"count":28,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/92\/revisions"}],"predecessor-version":[{"id":980,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/92\/revisions\/980"}],"part":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/parts\/21"}],"metadata":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapters\/92\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/media?parent=92"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/pressbooks\/v2\/chapter-type?post=92"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/contributor?post=92"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.palomar.edu\/introtostats\/wp-json\/wp\/v2\/license?post=92"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}