
Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Material Type: Notes; Class: APPLIED STATISTICS II; Subject: Statistics; University: Arkansas State University; Term: Unknown 1989;
Typology: Study notes
1 / 1
This page cannot be seen from the preview
Don't miss anything!
Stat 4473 – Data Analysis
Example: To determine whether waste discharged by a chemical plant is polluting the local river, the river water was sampled at two locations – one upstream and one downstream from the discharge site. Independent water samples of sizes n 1 = 10 and n 2 = 15, respectively, were selected from the upstream and downstream locations. The concentration level (ppm) of a suspected chemical pollutant was determined in each water sample, with the following results:
Upstream 24.5, 29.7, 20.4, 28.5, 25.3, 21.8, 20.2, 21.0, 21.9, 22.
Downstream 32.8, 30.4, 32.3, 26.4, 27.8, 26.9, 29.0, 31.5, 31.2, 26.7, 25.6, 25.1, 32.8, 34.3, 35.
Is there sufficient evidence that the chemical plant is polluting the river? If yes, explore the level of the pollution by estimating the difference in the average concentrations of the chemical pollutant upstream and downstream of the river.
Write-up
Ho: : 1! : 2 = 0 H 1 : : 1! : 2 < 0 where : 1 = mean concentration upstream : 2 = mean concentration downstream
Assumptions check: Boxplots show no outliers in either data set. For the downstream data, statistical tests for normality revealed no evidence that the nearly normal assumption is violated. The data appears reasonably symmetric in the stem plot. However, for the upstream data, statistical tests for normality show some evidence that the data do not come from a normal population (p-value = .0624 for the Kolmorogorov-Smirnov test with Ho: data is a sample from a normal distribution vs. H (^) a : data is not a sample from a normal distribution.) The stem plot appears right-skewed. The two-sample t procedures are quite robust against non-normality. We will continue the analysis with the two-sample t methods, and compare the results with those from an alternative nonparametric method.
(Upstream)
(Downstream)
t obs = !4. df = 19. p-value =.
Reject Ho in favor of H 1. There is very strong evidence that the mean concentration of the pollutant downstream of the discharge site is higher than the mean concentration upstream of the discharge site.
A 95% confidence interval for : 2! : 1 (downstream! upstream) is (3.4739, 9.1861) ppm. We conclude with 95% confidence that the average downstream pollution level is between 3.47 and 9.19 ppm more than that prevailing in the upstream waters.