PSDA - Making Inferences From Data Lesson

satprep_LessonTopBanner.png 

Making Inferences from Data

Data tells a story and provides information from which informed conclusions may be drawn. Different types of organization methods are used for the data depending on the type of data to be shown and analyzed. Organization methods could be linear, bar, tables. Information may be only words initially. Data organization will allow you to draw conclusions or make conjectures concerning your business operations, and allow you to foresee changes needed.

Linear Graph Data Inferences

Data displayed on graphs provide information from which further data or analysis may be obtained. Use the Amusement Ride Graph below to answer questions concerning the data shown on the graph by the data points and the conclusions drawn from connecting the data points. For this example, the number of people at the line grew linearly between all points. The ride opened at 9 a.m. Rain occurred during the day.

image of graph amusement ride. number of people in line at different time. 9am: 50 people, 10am: 60 people, 11am: 75 people, 12 noon: 100 people, 1pm: 90 people, 2pm: 15 people, 3pm: 20 people, 4pm: 70 people, 5pm: 0 people

Amusement Park Bar Graph

Now let's look the bar graph of the Amusement Ride data below, a different method of displaying data. Does the bar graph provide the same perspective of the data?   No, this graph only shows the number of people in line at each hour with no visual connection between the time periods. Yes we can still see when the times were increasing and decreasing. However, the connectivity of the line graph provides an idea of what happens during the times in between the hours and this is not clear on the the bar graph.  

image of amusement park bar graph with number of people in line at different times.
9am: 50 people
10am : 60 people
11am: 75 people
Noon: 100 people
1pm: 90 people
2pm: 10 people
3pm: 20 people
4pm: 70 people
5pm: 0 people

A bar graph is better with static or fixed data versus flexible, changeable data that is shown in the line graph above.

Here is a bar graph with static data.

ABC High School's basketball team has played 6 of their 20 games for the season. Scores for ABC and their opponent are shown in the bar chart below.

image bar graph ABC High vs. Opponents.

XYZ Company manufactures widgets for various customers throughout the world. The daily shipping schedule requires widgets to be shipped in bundles of 5 widgets. The number of widget bundles shipped for Monday through Friday for a period of 4 weeks is shown in the table below. Create a frequency table (histogram) of the data for the number of shipped bundles. For example, Week 1 on Monday, 12 bundles of 5 widgets are shipped, meaning 60 widgets are shipped.

M T W TH F

Week 1

12

8

7

6

8

Week 2

7

6

8

10

11

Week 3

13

7

10

12

10

Week 4

12

12

8

8

7

 

To create a frequency table, a table of the number of times the widget bundle occurs, the number of widget bundles shipped will be used as the horizontal axis and the number of times shipped the vertical axis.  Changing the type of graph allows a different perspective of the information. Changing to a frequency table provides a synopsis of common amounts shipped.

image graph widget bundles shipped over a week period

 

Scatter Plots

Scatter plot graphs are just that, the data points on a graph, unconnected, ready for interpretation. These points provide the relationship between the horizontal data and the vertical data and are plotted based on the (x, y) values.

Questions that are asked about scatter plot graphs usually look for positive and negative correlation and a regression index r.  Data in scatter plots are graphed with two variables. The key to answering scatter plot questions is to determine whether there is a correlation (a match) between the data that suggests an outcome for future data.

Equations of the lines for scatter plots are called Regression Lines of Best Fit. The "line" that is requested could take on linear, exponential, or other math lines that connect data. The lines may be done by hand, but a graphing utility is best as the graphing utility can provide the regression index quickly when requested. Check your graphing utility to know the procedure for finding these regression lines and correlation index.

Let's look at a few scatter plots and answer some basic questions of understanding.   For each of the graphs, estimate the line of best fit.   In other words, where do you see a line or curved line going through the points so that the line is close to the maximum number of points?   Once the line is drawn, determine the correlation and the strength of the correlation, the regression index r.  

Positive correlation - a positive slope, use r correlated to visual sighting of slope

  • visually sighted as with a correlation r = 1 if the points are grouped closely to the line shape, the strength of the correlation is strong
  • visually sighted r = .5 if the points are follow the path of the line but are loosely associated with the line

Negative correlation - a negative slope, use r correlated to visual sighting of slope

  • visually sighted as with a correlation r = −1 if the points are grouped closely to the line shape, the  strength of the correlation is strong
  • visually sighted r = −.5 if the points are follow the path of the line but are loosely associated with the line

No correlation - there is not a pattern positive or negative that can be found for the points plotted

  • visually sighted as with a correlation r = 0, the points are scattered on the graph

 

SATPrep_LessonBottomBanner.pngIMAGES CREATED BY GAVS OR OPENSOURCE