Home » Simulation Example » What We Learned

Web Visualization: Sampling from a Normal distribution

What We Learned

The sampling distribution is a complex concept. For instance, from our teaching, we have found that students often confuse the histogram of values from a random sample and the histogram of sample means from many random samples. They often confuse the sample standard deviation and the standard error of the mean. This visualization was carefully designed to reduce that confusion.

Comments about what we learn from trialing the resource:

  • Students benefit from some description of the purpose of the animation at the start of the tutorial. When we inserted a description, we found students could more easily focus on the important features of the tutorial. Without the description, some students were confused (“… initially I don’t know what the next steps so I’m just following and I don’t know what’s going on at first.”). The description is now a part of the tutorial.
  • Students need clear instructions in order to take repeated samples, with language carefully chosen. To help students, we dropped our terminology “calculate many means” and instead used “calculate means for many samples”.
  • In the tutorial, when students repeatedly sample one individual to form a random sample of size 10, the countdown of the number of fish left to be sampled appears. This reinforces the idea that the students are creating one sample of 10 fish.
  • In the tutorial, students are asked to repeatedly take a sample of size 10 and calculate the mean, with each sample mean dropping down to the histogram of the sampling distribution. After students have had sufficient time to understand the difference between the top and bottom histograms, they can use the “means for many samples” button, to speed up the activity.
  • The “Sampling from a Normally distributed population” simulation was effective in reinforcing some fundamental ideas about mean, standard deviation and histograms. All of the students interviewed commented in some way about how the simulation helped visualize the effect certain parameters had on the distribution. For instance, students liked to see how changing the population standard deviation or mean affected the resulting histogram. Visualizing each fish being measured then dropping down to create the histogram was especially helpful and made the concepts seem less abstract.
    • “Intuitive. Shows how mean and standard deviation relate… Really helps to see how sample size affects the mean. Same with standard deviation.”

      “Really useful. In class, that's what he [instructor] tried to explain. Not clear to me [at that time]. … Really good. Illustrated the points it was trying to make.”

  • Students should be required to use the resource “Sampling from a Normally distributed population” before using later animations (Confidence Intervals and Central Limit Theorem). From our interviews, we found students who interacted with the Sampling from Normal resource prior to the CLT or Confidence Interval web visualization had a better understanding of how the samples are drawn and what the population mean represents with respect to the example. Students using only the Confidence Interval applet during our second set of interviews requested more explanation about the population mean and standard deviation at the end of the tutorial, suggesting they did not understand the basic premise of the simulation.

How We Learned

In the Fall of 2015, we conducted a pilot study to understand how students interact with the resource. We selected six students from an introductory statistics course for second year students in a variety of majors in the Faculty of Science. During 50-minute long individual interviews, each student was exposed to two web visualizations, first the “Sampling from a Normally distributed population” and then the “Confidence intervals of the mean” web visualization. Students were asked to “think-aloud” while exploring the visualizations. To probe student understanding, we asked questions at the beginning and end of the interviews. The post-questions were taken from the NSF-funded Web ARTIST project.

We then implemented some changes based on what we saw, and conducted a second set of interview in January 2016. The students were registered in an introductory statistics course intended for third year biology majors. These interviews followed the same format as the first set (pre-questions, “think-aloud” exploration, post questions). However students were given only one of the two visualizations to allow for more time during the interviews. Four students were selected to try the “Sampling from a Normally distributed population” web visualization.