This just in from Mrs. Baldwin: Check out this amazing data visualization site!
Ms. Baldwin just sent in a couple photos from her AP Statistics class!
Juniors in the AP Statistics class had the opportunity, this year, to work on a project in the style of a statistical consultant. Students used real data from the 50 States and had their choice of variables to examine. Many of the projects looked at patterns and relationships among various social indicators such as crime, income, education, etc. We looked for trends and associations using a sophisticated statistical programming language called “R”. This was an ambitious task, but the students jumped right in and turned in some of their finest work! Please enjoy the following excerpts from their work.
Ty Lee – Education trends in the 50 States
This histogram visualizes the distribution of the percent of the state population with a bachelor’s degree or higher. Height of boxes indicate the number of states that fall into the range from the left end to the right end. The state with lowest rate was West Virginia, while the highest was District of Columbia. The shape of this histogram seems to indicate that the underlying population is normal; the graph is uni-modal, not skewed, and looks similar to a bell curve.
Cade Zawacki – Statistical Analysis Casts Doubt on a Claim about Gun Violence
“If you look at all the fiascos that have occurred, 99 percent of them have been by Democrats pulling their guns out and shooting people,” Kiehne said. “So I don’t think you have a problem with the Republicans.”
(Does NOT include outliers CO + CT)
(Observed – Expected) ^2 / (Expected)
Red: 0.6195867769 +
Chi-Squared cdf(0.9017620301,e99,1) = 0.3423096854
With a Chi-Squared value of about 0.902, we can expect to see results like this about 34% of the time – assuming that mass shootings happen randomly in any state. Thus, we cannot conclude that there is a relationship between mass shootings and political affiliation.
Depicted above: Chi-Squared cdf(0.9017620301,e99,1) = 0.3423096854
[These results suggests that it is highly likely that the political inclinations of the State have no effect on the occurrence or frequency of mass shootings. This student pointed out that the data limit our conclusions, to a certain degree since we do not have data for the political preferences of individual shooters.]
Norio Chan and Simon Lu – No Significant Difference in Assault Rates in Red and Blue States
The graph and the results of the t-test show that we cannot reject the null hypothesis of “no difference in assault rates”. There is no strong evidence to suggest that assault rates are different in blue states and red states.
Gleb Paschenko – Murder Rates Decrease when High School Graduation Rates Increase
Although some outliers are present, the association seems to follow a moderately strong negative linear pattern. In states where high school graduation rate is higher, the murder rate per 100,000 is lower.
James Kim – Are Harsher Sentences Associated with Crime Reduction?
Correlation Between Strength of Sentence/Real Execution Rate and Crime Rate
The very high P-Value suggests that there probably is no relationship between the real execution rate and the murder rate. [We might be inclined to believe that those states with more severe penalties might see a reduction in crime. The data was not able to demonstrate that this was the case.]
Emma Lawrence and Sam Atkins – High School Graduation Rates in Red States and Blue State
If we use a significance level of .1 then we can reject the null hypothesis of no difference in high school graduation rates due to the p-value being less than .1. There is some evidence to suggest that high school graduation rates are higher in Blue States. [They are certainly more consistent.]
In a world of uncertainty, the mind looks for deterministic explanations. Statistics students have learned to talk about uncertainty, supported by quantitative measures, with confidence. We learned that, too often, our preconceived ideas about a population or claims in the media rest on very little data, indeed. Students, you have been entrusted with a powerful tool – use it for the good of humankind!
Mrs. Baldwin’s AP Stats classes have been working on several projects. Here’s a peek at one, as described by one of her students:
“This histogram visualizes distribution of percentage of the state population with a bachelor’s degree or higher. Height of boxes indicate the number of states that fall into the range from the left end to the right end of the rectangle. The state with lowest rate was West Virginia, while the highest was District of Columbia. The distribution of this histogram seems to be normal; the graph is uni-modal, not skewed, and looks similar to a bell curve.”
Here’s what Mrs. Baldwin has to say about the sleep data she collected over the last 6 weeks:
“As good statisticians, we of course recognize that our data are being collected through voluntary sampling. This is less of a problems when the data are easy to gather and we give several opportunities for people to participate. The data should represent the Williston community fairly well, but likely underrepresent faculty and students who don’t find themselves on the Math floor of the Schoolhouse.
We can see that our community of Williston students and faculty shows a lot of variation. We have a low of one hour of sleep and a high of 13 hours. A typical member of the community gets about 7 hours with the observations becoming less and less common as they extend above or below that. Since there are about as many high extremes as there are low extremes and there is a single peak, we call the shape of the picture “unimodal” and “symmetric” (we could fold the graph in half and get about the same picture on either side). Our observations about the shape suggest that the underlying distribution could very well be our good friend The Normal Distribution. We’ll see you again soon, Normal Distribution, bye for now…”
How many hours of sleep did you get last night?