FSR - Statistical Investigations Lesson

Statistical Investigations

Introduction

Statistical investigations are used every day for a variety of reasons. Statistical studies use a combination of mathematical and scientific techniques to answer questions about human behavior, the effectiveness of medical treatments, the safety and reliability of machinery and products, and so on. Consider the following examples of two different types of statistical investigations. Data can tell a story and help you predict future trends! It's important to note that this "story" may only be valid if the data set is big enough to represent the population. The sample size impacts the precision with which estimates of the population parameters can be made. In today's times, there is data everywhere. This vast amount of data, known as “Big Data”, can be used to make decisions in various business enterprises and in financial decision-making. 


Observational Study

An observational study is a type of research in which data is collected about some characteristic of the population. The data can be collected by observation, survey, interview, or other means. The following approaches are examples of an observational study:

Radio rating services sometimes collect data on listenership by asking participants to record the date, time, and station each time they listen to the radio. Other rating services distribute monitoring devices that automatically record this information anytime the participant has the radio turned on. Still, others call participants and ask them about their listening habits. The data are then compiled so that advertisers know which stations are the most popular at specific times during the day.


Experimental Study

In an experimental study, the researcher separates the participants into one or more groups and applies some sort of treatment. After treatment, the variable of interest is measured and the results are compared. A group of participants that the treatment group is being compared to is called the control group. Researchers are often concerned that participants in a study show improvement simply because they are in the study and not because they are receiving effective treatment. This is called the placebo effect.

A 17-year-old student designed a science fair project with 72 mice randomly assigned to three groups: hard rock music, Mozart, and no music at all (called a control group). The mice in the first two groups were exposed to music 10 hours a day. Three times a week, all of the groups were timed as they ran through a maze. An analysis of results showed that the 24 mice in the no-music group averaged about a 5-minute improvement in their maze completion time, while the Mozart mice improved by 8.5 minutes. The hard rock mice actually got slower—an average of four times slower!

Another interesting fact: The student had to start his experiment over because all the hard-rock mice killed each other. None of the classical mice did that. Wertz, M. 1998.

What are the treatment and the variable of interest in this case?

Answer: The treatment is the type of music. The variable of interest is the time it takes to run the maze. 


Answering Questions Leads to Experimentation

Consider this situation: "This unopened bag of chips is half empty. I wonder if it really contains 28.3 grams as the package says?"

This type of informal question or observation is the beginning of many investigations. Informal questions can turn into more formal problem statements or research questions.

For example, you may decide to investigate whether there is a scandal in the potato chip industry by checking the following: spuds.jpg

Do spud potato chips contain an average of 28.3g of chips per bag?

Suppose you conduct the investigation into Spud Potato Chips and find that the mean weight of the chips in your sample is 25 grams, rather than 28.3 grams ( x = 25 grams). Do you think that a difference of 3.3 grams between the actual and advertised weights is large enough that it needs to be reported? If so, how do you report this information, and to whom?


Statistical Studies 

Statistical studies are designed with carefully selected measures that ensure (within error margins) that, if the sample is well selected and the study is well designed and conducted, the mean and other measures of the sample are likely to be similar to the corresponding measures of the population being studied. Sometimes, if the population is small (such as high school seniors in a small town), it may be possible that the sample studied is the entire population. However, often a sample is a smaller subset of a population (such as a research question that might target the entire population of high school seniors in a state or in the nation).

In a blind study, participants do not know whether they are receiving the treatment or the placebo. This is crucial for trying to control the placebo effect. Using a placebo is worthless if the participants know they are getting it.

There is often a concern that if researchers know who is getting the placebo, it may affect their ability to fairly assess a treatment. For example, researchers may subconsciously check for improvement more carefully. A physical therapist may push the patient who is receiving the treatment to work harder, while not encouraging the control group patient as much. In a double-blind study, only a third person not involved in the assessment knows the participants who are receiving the treatment and those who are in the control group. After data has been collected, this person then identifies each patient file as treatment or control.

Recall the Spud Potato Chips scenario from earlier. We hypothesized that the true mean weight of bags of Spud's might be less than the 28.3 grams advertised on the bags. What do we mean by "the population of Spud Potato Chips" that we are interested in testing? The population of interest is all bags of potato chips, not just those being sold at the school. 

If we only test the bags at the school, it is likely that all of those bags came from the same shipment. Perhaps that shipment was undersized for some reason (for example, faulty equipment that has since been fixed). We should at least select bags from various places around town. The best possible scenario is testing bags from around the country.


Random Sampling

Random sampling means that an unbiased strategy was used to select the participants in the statistical investigation. Random assignment of treatments means that a strategy was used to determine who gets an active treatment and who gets a placebo or the order in which treatments are given. In both situations, we can use a random number table or generator to select participants. There are several different types of random number generators including your graphing calculator.  

Adapted from Course materials (III.A Student Activity Sheets 1, 4, & 5) for AMDM developed under the leadership of the Charles A. Dana Center, in collaboration with the Texas Association of Supervisors of Mathematics and with funding from Greater Texas Foundation.    

IMAGES CREATED BY GAVS