This text explains the process of hypothesis testing in statistics with examples and calculations. It covers topics like t-tests, confidence intervals, and ANOVA. The subject is Introduction to Statistics and the course code is STATS 101/101G/108. The text is relevant for students in the second semester of the year 2018.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
1 Hypothesis Testing in Statistics Course Name: IntroductiontoStatistics Course Code: STATS101/101G/108 Assignment:3 Semester: Second Year 2018
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
2 Answer a)Sample Size = 18 batteries Independent or treatmentvariable was two possible choice of batteries, Energizer batteries or Ultra-cell batteries. The response or dependent variable was the battery life in the electronic game in hours Independent t-test was used to test the difference in battery hours of the two types of batteries. Let the average battery life for an Energizer battery isμ1and average life for Ultra-cell battery isμ2. The Energizer batteries (M = 8.28 hours, SD = 0.22 hours, n = 9) were observed to be to some extent better than the Ultra-cell batteries (M = 8.24 hours, SD = 0.16 hours, n = 9).
3 Independent t-test to test the difference in battery life: 1.The difference in average battery life of Energizer and Ultra-cell batteries =μd=μ1−μ2 2.Null Hypothesis: H0:μd=μ1−μ2=0 3.Two tail Alternate Hypothesis: HA:μd=μ1−μ2≠0 4.From the provided samples, difference in battery life wasx1 − −x2 − = 8.28 - 8.24 = 0.036 5.Standarderrorofthemeanofthedifferenceinbatterylife=0.0905(t‐ procedurestoolonCanvas) The test statistic = t=estimate−hypothesisedvalue standarderror=0.0356−0 0.0905=0.39with degrees of freedom = min (9 – 1, 9 – 1) = 8. 6.The P-value for two tail =P(|t|>0.39)=0.7044(t‐procedurestool) 7.P-value explanation: The calculated p-value was 0.7044 and was greater than 0.05. Hence, enough evidence for rejecting the null hypothesis was not present. The observed difference of 0.036 hours of life times for the two types of batteries was not statistically significant at 5% level of significance. Consequently, the null hypothesis failed to get rejected, and it was concluded that difference in life time between the two types of batteries was almost equal to zero. 8.At5%levelofsignificance,estimatedconfidenceintervalforμdwas =(μ1−μ2)±t−multiplier∗StandardError=0.0356±2.306∗0.0905=[−0.173,0.244] (The t-multiplier at 5% level of significance was calculated from t‐proceduretool). 9.Confidence interval explanation: With 95% probability, it was inferred that the average battery hours of Energizer batteries would be roughly 0.17 hours less than that of Ultra- cell batteries. Similarly, battery hours of Energizer batteries would be roughly 0.24 hours more than average battery hours of Ultra-cell batteries. The result was practically significant from also. 10.Conclusion: The sample of the battery hours for playing an electronic game did not have enough evidence to establish any significant difference in battery life of Energizer and Ultra-cell batteries.
4 b)The confidence interval at 95% level of significance estimated that the value of the population mean of difference in life hours of the two types of batteries was not within the boundaries. The hypothesized value of the difference in average battery life of the two types of batteries was assumed to be zero, and the non inclusion of population parameter in the confidence interval indicated that the conclusion from p-value was true. Answer a)The sampling situation for analysing the difference in proportion was the case of one sample with four different response categories.
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
5 b)The scenario was tested by t-test for difference between two proportions as follows. 1.It was assumed thatp1, andp2denotes the proportions of responses for supporting the legislation, and refuting it. Hence,p1-p2= Difference of the proportions 2.Null hypothesis: H0:p ^¿d ¿=p1-p2= 0 3.Two tailed Alternate hypothesis: HA:p1-p2¿0 4.Estimated difference:p1 ^¿ ¿-p2 ^¿ ¿= 384 500−79 500=0.61 5.The t-test statistic was, t=estimateddifference−hypothesizeddifference standarderror Estimated difference = 0.61, hypothesized difference = 0 Standard error of mean = 0.0333 at 5% level of significance (t‐procedurestool). The test statistic,t=0.61−0 0.0333=18.318with degrees of freedom =∞ 6.Two tail P-value =P(|t|>18.318)=0.0000(from t‐procedures) 7.Explanation of P-value: The p-value was less than 0.05, at 5% level of significance. Hence, there was very strong enough evidence to accept the alternate hypothesis opposed to the null hypothesis. Hence, the null hypothesis was rejected based on the evidence from the difference in proportions of adultNewZealandersin support and against the legislation. 8.The confidence interval was calculated as CI=¿¿ The t-estimate = 1.96 was calculated from t‐procedurestool. 9.Interpretation of Confidence Interval: With 95% confidence, the estimated value of difference in proportions of support and against views of the NewZealanders was anywhere between 0.545 and 0.675. The support was 54.47% to 67.52% higher than against views, which was also practically significant.
6 10.Conclusion: Thenullhypothesiswasrejectedat5%levelofsignificance.Theresponse ofNewZealanders in favor of the legislation was significantly greater than that of the views against.
7 Answer: a) (i) The hypothesized difference in productivity score was considered to be zero. (ii) At 5% level of significance, the estimated difference in productivity scores in Case1was greater than the hypothesized value. The standard error of the sampling distribution was 0.682, and the deviation from average difference in scores constructed the confidence interval signifying that the hypothesized value was not close to the estimated value. b)At 5% level of significance, all the cases excluding Case 3 demonstrated that the sample mean difference was statistically significant. c)Practically significant and non significant cases were as follows. (i) Case 1: Practically significant (ii) Case 4 and Case5: Not practically significant d)Case 2 and Case 3: Nothing could be concluded, as lower limits of the confidence interval were less than 10, the desired minimum change of productivity scores. e)The mean difference was noteworthy at 5% level of significance. With 95% certainty, it was assessed that mean difference of the two payout frameworks would be somewhere close to 0.96 hours to 5.58 hours. The real distinction of the methods for the two pay-out frameworks would be as low as 0.96 hours and high as 5.58 hours. A statistical significance (Diff = 3.27, p < 0.05) was noted for the Case 6 scenario, and Bonus pay-out framework would be favored.Theresultsofcase6wasnotpracticallysignificant,andconsideringthe administration's thought about the distinction in efficiency score the organization might want to adhere to its past model for pay-out.
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
8 Answer 4 a)(i) Units: 12 short stories from the section of mystery, ironic, and literary. (ii) Treatment: Enclosure or omission of spoiler paragraph. (iii) Response variable: The enjoyment rating of the readers b)(i) The following graphs of the two treatments and their difference were constructed using iNZight tool.
9 Figure1: Side-by-Side Box Plots for the Two Treatments Figure2: Box Plot for Difference in Treatment Scores
10 (ii) The side-by-side box plots reflected that median of enjoyment scores was undoubtedly greater with inclusion of the spoiler paragraph compared to enjoyment score from story without such paragraph. The median of enjoyment score was almost 7.0 for stories with spoiler, while the median of enjoyment score was almost 6.0 for the stories without spoiler. Distribution of enjoyment scores was highly left skewed for stories with spoiler paragraph, whereas, stories without spoiler paragraph had comparatively less skewed distribution of enjoyment scores. The distribution of difference in the enjoyment scores was also somewhat left skewed. The median of difference in enjoyment scores was approximately 0.5. c)The difference between average enjoyment scores for the two treatments was verified by t- test as follows. Let the average enjoyment score with spoiler wasμ1and average enjoyment score without spoiler wasμ2. The average enjoyment score with spoiler (M = 6.217, SD = 1.22, n = 12) were observed to be to some extent better than the average enjoyment score without spoiler (M = 5.725, SD = 1.26, n = 12). 1.Parameter: Letμ1andμ2are the average population enjoyment scores with and without spoiler paragraph in stories. Letμd=μ1-μ2is the difference in average enjoyment scores. 2.Null hypothesis: H0: (μd=μ1-μ2= 0) 3.The two-tailed Alternate hypothesis: HA: (μd=μ1-μ2≠0) 4.Estimate:Differenceintwoaverageenjoymentscores= x − 1−x − 2=6.217−5.725=0.4917≃0.492 5.Test statistic:t= (x − 1−x − 2)−0 SEM=0.4917−0 0.1003=4.900where standard error of mean = SEM = 0.1003 (from SPSS) with degrees of freedom = min (12-1, 12-1) = 11. 6.Calculated two tail P-value =P(|t|>0.492)=0.000(from SPSS)
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
11 7.Explanation of P-value: There was a statistical significant difference between the average enjoyment scores for stories with and without spoiler paragraph. The evidence was very strongly against the null hypothesis at 5% level of significance. Hence, the average enjoyment in reading stories with spoiler paragraph was significantly greater compared to the without spoiler story enjoyment scores. 8.Confidence Interval: At95%significancelevel,approximateconfidenceintervalforμdwas CI=(μd)±tmultiplier∗SE=[0.27,0.71](from SPSS output) 9.Confidence Interval elucidation: With 95% confidence it was inferred that the average enjoyment score with spoiler would be roughly 0.27 less than and 0.71 more than average enjoyment score without spoiler. The result was practically significant (Bluman, 2009). 10.Conclusion: Very strong evidence was there to establish difference in enjoyment scores. The null hypothesis was rejected at 5% significance level, stating that average enjoyment score with spoiler was greater than that of the stories without spoiler. d)Dependent variable: Difference of enjoyment scores with and without spoiler paragraph. i.The difference scores were continuous in nature. ii.There were two categorical values of the treatment attribute. iii.No outlier scores were present iv.Normally distributed enjoyment scores was noted. Shapiro-Wilk test (SW = 0.955, p = 0.712) ascertained that the differences of enjoyment scores were normally distributed. Therefore, the fourth condition for dependent t-test was also satisfied.
12
13 Answer a)(i) Units: 1000cyclistswho finished the cycling event of 180kilometerride event in the years 2010 to 2017. (ii) Treatment: Four age groups in the study (iii) Response variable: The timetocompleteK2. b)(i) The following graphs of the four age groups were constructed using iNZight tool. Figure3: Distribution of Time to Complete K2 for Four Age Categories (ii) Time taken by the cyclists to finish the K2 Road Cycle Classic was observed to increment with the age of the cyclist. From the median of the spread, and the histogram plots, it was apparent that old age people were less competent in finishing the race early, when compared to the younger age groups. Outlier timings were visible for each age group. Probably, some cyclists were completing the race too yearly, or finishing the event with maximum timing (Kim, 2015).
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
14 c)One-way ANOVA SPSS output for F-test below. Table1: ANOVA Output from SPSS Sum of SquaresdfMean SquareFSig. Between Groups64.327321.44228.104.000 Within Groups759.914996.763 Total824.241999 d)Assumptions for ANOVA: i.Event completion time of four age groups was independent of each other, ii.Completion time for the cyclists of all the four age groups was normally distributed iii.The variances of the four age groups were significantly equal e)Smallest standard deviation was for 18-34 age group (M = 6.62, SD = 0.76) and the highest standard deviation of time was noted for the 55+ age group (M = 7.32, SD = 0.91). The required ratio was calculated as 0.76 0.91=0.84. f)The Validity of the F-test was assured by establishing the assumptions of ANOVA (Lord, & Novick, 2008). i.Independent Observations: Completion times of the cyclists of four age groups were independent of each other. ii.Symmetric distribution: Completion times for the age groups was evidently normal (refer to Figure 6). iii.Normality: Shapiro Wilk test indicated normal distribution of completion times for the four age groups (Appendix – Table 7). iv.Assumption of homogeneity: From Levene’s test, the variances of the four age groups was found to be equal (L = 2.498, p = 0.058). (Appendix – Table 8) (Nordstokke, & Zumbo, 2010). g)(i) Null hypothesis: Variances in average event completion time for the four age groups were equal: H0:(σ1 2=σ2 2=σ3 2=σ4 2)
15 (ii) Alternate hypothesis: Variances in average event completion time for the four age groups were significantly different: HA:(σ1 2≠σ2 2≠σ3 2≠σ4 2) (iii) At 5% level of significance, difference in average time taken by the cyclists for completion of the event was statistically significant (F = 28.104, p < 0.05) for the four age groups. From the descriptive values, the youngest cyclists’ age group of 18 -34 years was found to be the fastest in completing the event, and the tendency was also practically significant. h)(i) Tukey post hoc analysis was done to identify the difference in average completion time between the age groups of 18-34 and 35-44. There was no statistical significant evidence (MD = - 0.336, p = 0.982) of existence of any difference in completion time, at 5% level of significance.The95%confidenceintervalwas[-0.2665,0.1992].Hence,with95% confidence it was claimed that the average time of the cyclists of 18-34 (years) age group was 0.27 hours less than that of the 35-44 years age group. Again, 18-34 years of the cyclists’ estimated completion time was 0.20 hours greater than that of the cyclists of 35-44 years (Lau, Sussman, Espinosa, Katalay, & Allam, 2017). (ii) At 5% level of significance, pair wise difference in average completion time was seen for age groups of 18-34 and 45-44, 35-44 and 45-54, 35-44 and 55+, and 45-54 and 55+. (iii) At 5% level of significance, the cyclists of 55+ years were the slowest of the four age groups. i)The cyclists took least of 5.45 hours and most extreme of 11.29 hours to finish the K2 occasion. Age was the most significant factor in completing the race early. Cyclists of 18 years to 44 years finished the occasion with relatively indistinguishable fastest time. Cyclists over 55 years were the slowest, when contrasted with members of other age gatherings. Some cyclists were there in each age group, who either finished the race very slowly or very early with statistical significance.
16
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
17 a)(i) Scenario 1: Free and Sex Scenario 2: Spent and First Scenario 3: Purchase and Age Scenario 4: Quantity Other Quantity Stone Scenario 5: Spent and Shop (ii) Variables mapped with type of data as below. Table2: Type Details Mapping with Variables b)The different scenarios and their matched tools have been in the following table.
18 Table3: Different Scenarios Matched with Appropriate Tools c)The scenarios have been mapped against the statistical tests.
19 Table4: Scenario Comparison with Appropriate test ScenarioTest Scenario 1E Scenario 2D Scenario 3F Scenario 4C Scenario 5F
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
20 References Bluman, A. G. (2009).Elementary statistics: A step by step approach. New York: McGraw-Hill Higher Education. Kim, T. K. (2015). T test as a parametric statistic.Korean journal of anesthesiology,68(6), 540- 546. Lau, Y. T., Sussman, L., Espinosa, E. P., Katalay, S., & Allam, B. (2017). Characterization of hemocytes from different body fluids of the eastern oyster Crassostrea virginica.Fish & shellfish immunology,71, 372-379. Lord, F. M., & Novick, M. R. (2008).Statistical theories of mental test scores. IAP. Nordstokke, D. W., & Zumbo, B. D. (2010). A new nonparametric Levene test for equal variances.Psicologica:InternationalJournalofMethodologyandExperimental Psychology,31(2), 401-430.
21 Appendix: Table5: Normality Check of the Timing of the Cyclists – SPSS Outputs Age Category Kolmogorov-SmirnovaShapiro-Wilk StatisticdfSig.StatisticdfSig. Timein hours 18-34.115142.000.930142.000 35-44.122271.000.872271.000 45-54.094408.000.943408.000 55+.101179.000.958179.000 a.Lilliefors Significance Correction Table6: Levene’s Homogeneity Test of Variances