Hypothesis Testing: STATS 101 Assignment, Second Semester 2018

Verified

Added on  2023/06/04

|21
|3304
|310
Homework Assignment
AI Summary
Document Page
1
Hypothesis Testing in Statistics
Course Name: Introduction to Statistics
Course Code: STATS 101/101G/108
Assignment: 3
Semester: Second
Year 2018
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
2
Answer
a) Sample Size = 18 batteries
Independent or treatment variable was two possible choice of batteries, Energizer
batteries or Ultra-cell batteries. The response or dependent variable was the battery life in
the electronic game in hours
Independent t-test was used to test the difference in battery hours of the two types of
batteries.
Let the average battery life for an Energizer battery is μ1 and average life for Ultra-cell
battery is μ2 . The Energizer batteries (M = 8.28 hours, SD = 0.22 hours, n = 9) were
observed to be to some extent better than the Ultra-cell batteries (M = 8.24 hours, SD =
0.16 hours, n = 9).
Document Page
3
Independent t-test to test the difference in battery life:
1. The difference in average battery life of Energizer and Ultra-cell batteries = μd=μ1μ2
2. Null Hypothesis: H0: μd=μ1μ2=0
3. Two tail Alternate Hypothesis: HA: μd=μ1μ20
4. From the provided samples, difference in battery life was x1

x2

= 8.28 - 8.24 = 0.036
5. Standard error of the mean of the difference in battery life = 0.0905 (t‐
procedures tool on Canvas)
The test statistic =
t = estimatehypothesised value
s tan dard error = 0. 03560
0. 0905 =0 . 39 with degrees of
freedom = min (9 – 1, 9 – 1) = 8.
6. The P-value for two tail = P (|t|> 0 .39 ) =0 .7044 (t‐procedures tool)
7. P-value explanation: The calculated p-value was 0.7044 and was greater than 0.05.
Hence, enough evidence for rejecting the null hypothesis was not present. The observed
difference of 0.036 hours of life times for the two types of batteries was not statistically
significant at 5% level of significance. Consequently, the null hypothesis failed to get
rejected, and it was concluded that difference in life time between the two types of
batteries was almost equal to zero.
8. At 5% level of significance, estimated confidence interval for μd was
= ( μ1μ2 ) ±t multiplierS tan dard Error=0. 0356±2. 3060 . 0905= [ 0 . 173 , 0. 244 ]
(The t-multiplier at 5% level of significance was calculated from t‐procedure tool).
9. Confidence interval explanation: With 95% probability, it was inferred that the average
battery hours of Energizer batteries would be roughly 0.17 hours less than that of Ultra-
cell batteries. Similarly, battery hours of Energizer batteries would be roughly 0.24 hours
more than average battery hours of Ultra-cell batteries. The result was practically
significant from also.
10. Conclusion: The sample of the battery hours for playing an electronic game did not have
enough evidence to establish any significant difference in battery life of Energizer and
Ultra-cell batteries.
Document Page
4
b) The confidence interval at 95% level of significance estimated that the value of the
population mean of difference in life hours of the two types of batteries was not within
the boundaries. The hypothesized value of the difference in average battery life of the
two types of batteries was assumed to be zero, and the non inclusion of population
parameter in the confidence interval indicated that the conclusion from p-value was true.
Answer
a) The sampling situation for analysing the difference in proportion was the case of one
sample with four different response categories.
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
5
b) The scenario was tested by t-test for difference between two proportions as follows.
1. It was assumed that p1 , and p2 denotes the proportions of responses for supporting the
legislation, and refuting it.
Hence, p1 - p2 = Difference of the proportions
2. Null hypothesis: H0: p
^¿d
¿ = p1 - p2 = 0
3. Two tailed Alternate hypothesis: HA: p1 - p2 ¿ 0
4. Estimated difference: p1
^¿
¿ - p2
^¿
¿ =
384
500 79
500 =0 .61
5. The t-test statistic was,
t = estimated differencehypothesized difference
s tan dard error
Estimated difference = 0.61, hypothesized difference = 0
Standard error of mean = 0.0333 at 5% level of significance (tprocedures tool).
The test statistic, t= 0. 610
0 . 0333 =18 .318 with degrees of freedom =
6. Two tail P-value = P (|t|> 18. 318 ) =0. 0000 (from tprocedures)
7. Explanation of P-value:
The p-value was less than 0.05, at 5% level of significance. Hence, there was very strong
enough evidence to accept the alternate hypothesis opposed to the null hypothesis. Hence,
the null hypothesis was rejected based on the evidence from the difference in proportions
of adult New Zealanders in support and against the legislation.
8. The confidence interval was calculated as
CI =¿ ¿
The t-estimate = 1.96 was calculated from tprocedures tool.
9. Interpretation of Confidence Interval:
With 95% confidence, the estimated value of difference in proportions of support and
against views of the New Zealanders was anywhere between 0.545 and 0.675. The
support was 54.47% to 67.52% higher than against views, which was also practically
significant.
Document Page
6
10. Conclusion:
The null hypothesis was rejected at 5% level of significance. The response
of New Zealanders in favor of the legislation was significantly greater than that of the
views against.
Document Page
7
Answer:
a)
(i) The hypothesized difference in productivity score was considered to be zero.
(ii) At 5% level of significance, the estimated difference in productivity scores in Case1was
greater than the hypothesized value. The standard error of the sampling distribution was
0.682, and the deviation from average difference in scores constructed the confidence
interval signifying that the hypothesized value was not close to the estimated value.
b) At 5% level of significance, all the cases excluding Case 3 demonstrated that the sample
mean difference was statistically significant.
c) Practically significant and non significant cases were as follows.
(i) Case 1: Practically significant
(ii) Case 4 and Case 5: Not practically significant
d) Case 2 and Case 3: Nothing could be concluded, as lower limits of the confidence interval
were less than 10, the desired minimum change of productivity scores.
e) The mean difference was noteworthy at 5% level of significance. With 95% certainty, it was
assessed that mean difference of the two payout frameworks would be somewhere close to
0.96 hours to 5.58 hours. The real distinction of the methods for the two pay-out frameworks
would be as low as 0.96 hours and high as 5.58 hours. A statistical significance (Diff = 3.27,
p < 0.05) was noted for the Case 6 scenario, and Bonus pay-out framework would be
favored. The results of case 6 was not practically significant, and considering the
administration's thought about the distinction in efficiency score the organization might want
to adhere to its past model for pay-out.
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
8
Answer 4
a) (i) Units: 12 short stories from the section of mystery, ironic, and literary.
(ii) Treatment: Enclosure or omission of spoiler paragraph.
(iii) Response variable: The enjoyment rating of the readers
b) (i) The following graphs of the two treatments and their difference were constructed
using iNZight tool.
Document Page
9
Figure 1: Side-by-Side Box Plots for the Two Treatments
Figure 2: Box Plot for Difference in Treatment Scores
Document Page
10
(ii) The side-by-side box plots reflected that median of enjoyment scores was undoubtedly
greater with inclusion of the spoiler paragraph compared to enjoyment score from story
without such paragraph. The median of enjoyment score was almost 7.0 for stories with
spoiler, while the median of enjoyment score was almost 6.0 for the stories without spoiler.
Distribution of enjoyment scores was highly left skewed for stories with spoiler paragraph,
whereas, stories without spoiler paragraph had comparatively less skewed distribution of
enjoyment scores.
The distribution of difference in the enjoyment scores was also somewhat left skewed. The
median of difference in enjoyment scores was approximately 0.5.
c) The difference between average enjoyment scores for the two treatments was verified by t-
test as follows. Let the average enjoyment score with spoiler was μ1 and average enjoyment
score without spoiler was μ2 . The average enjoyment score with spoiler (M = 6.217, SD =
1.22, n = 12) were observed to be to some extent better than the average enjoyment score
without spoiler (M = 5.725, SD = 1.26, n = 12).
1. Parameter: Let μ1 and μ2 are the average population enjoyment scores with and without
spoiler paragraph in stories. Let μd=μ1 - μ2 is the difference in average enjoyment scores.
2. Null hypothesis: H0: ( μd=μ1 - μ2 = 0)
3. The two-tailed Alternate hypothesis: HA: ( μd=μ1 - μ20 )
4. Estimate: Difference in two average enjoyment scores =
x

1x

2=6 . 2175 . 725=0. 49170 . 492
5. Test statistic: t=
( x

1x

2 )0
SEM = 0 . 49170
0 . 1003 =4 . 900 where standard error of mean = SEM =
0.1003 (from SPSS) with degrees of freedom = min (12-1, 12-1) = 11.
6. Calculated two tail P-value = P (|t|>0 . 492 ) =0 .000 (from SPSS)
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
11
7. Explanation of P-value:
There was a statistical significant difference between the average enjoyment scores for
stories with and without spoiler paragraph. The evidence was very strongly against the null
hypothesis at 5% level of significance. Hence, the average enjoyment in reading stories
with spoiler paragraph was significantly greater compared to the without spoiler story
enjoyment scores.
8. Confidence Interval:
At 95% significance level, approximate confidence interval for μd was
CI = ( μd ) ±t multiplierSE=[ 0 . 27 , 0. 71 ] (from SPSS output)
9. Confidence Interval elucidation: With 95% confidence it was inferred that the average
enjoyment score with spoiler would be roughly 0.27 less than and 0.71 more than average
enjoyment score without spoiler. The result was practically significant (Bluman, 2009).
10. Conclusion: Very strong evidence was there to establish difference in enjoyment scores.
The null hypothesis was rejected at 5% significance level, stating that average enjoyment
score with spoiler was greater than that of the stories without spoiler.
d) Dependent variable: Difference of enjoyment scores with and without spoiler paragraph.
i. The difference scores were continuous in nature.
ii. There were two categorical values of the treatment attribute.
iii. No outlier scores were present
iv. Normally distributed enjoyment scores was noted. Shapiro-Wilk test (SW = 0.955, p =
0.712) ascertained that the differences of enjoyment scores were normally distributed.
Therefore, the fourth condition for dependent t-test was also satisfied.
Document Page
12
Document Page
13
Answer
a) (i) Units: 1000 cyclists who finished the cycling event of 180 kilometer ride event in the
years 2010 to 2017.
(ii) Treatment: Four age groups in the study
(iii) Response variable: The time to complete K2.
b) (i) The following graphs of the four age groups were constructed using iNZight tool.
Figure 3: Distribution of Time to Complete K2 for Four Age Categories
(ii) Time taken by the cyclists to finish the K2 Road Cycle Classic was observed to increment
with the age of the cyclist. From the median of the spread, and the histogram plots, it was
apparent that old age people were less competent in finishing the race early, when compared to
the younger age groups. Outlier timings were visible for each age group. Probably, some cyclists
were completing the race too yearly, or finishing the event with maximum timing (Kim, 2015).
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
14
c) One-way ANOVA SPSS output for F-test below.
Table 1: ANOVA Output from SPSS
Sum of Squares df Mean Square F Sig.
Between Groups 64.327 3 21.442 28.104 .000
Within Groups 759.914 996 .763
Total 824.241 999
d) Assumptions for ANOVA:
i. Event completion time of four age groups was independent of each other,
ii. Completion time for the cyclists of all the four age groups was normally distributed
iii. The variances of the four age groups were significantly equal
e) Smallest standard deviation was for 18-34 age group (M = 6.62, SD = 0.76) and the highest
standard deviation of time was noted for the 55+ age group (M = 7.32, SD = 0.91). The
required ratio was calculated as
0. 76
0. 91 =0 . 84 .
f) The Validity of the F-test was assured by establishing the assumptions of ANOVA (Lord, &
Novick, 2008).
i. Independent Observations: Completion times of the cyclists of four age groups were
independent of each other.
ii. Symmetric distribution: Completion times for the age groups was evidently normal (refer to
Figure 6).
iii. Normality: Shapiro Wilk test indicated normal distribution of completion times for the four
age groups (Appendix – Table 7).
iv. Assumption of homogeneity: From Levene’s test, the variances of the four age groups was
found to be equal (L = 2.498, p = 0.058). (Appendix – Table 8) (Nordstokke, & Zumbo,
2010).
g) (i) Null hypothesis: Variances in average event completion time for the four age groups were
equal: H0: ( σ1
2=σ2
2=σ 3
2=σ4
2 )
Document Page
15
(ii) Alternate hypothesis: Variances in average event completion time for the four age
groups were significantly different: HA: ( σ1
2σ2
2σ 3
2σ4
2 )
(iii) At 5% level of significance, difference in average time taken by the cyclists for
completion of the event was statistically significant (F = 28.104, p < 0.05) for the four age
groups. From the descriptive values, the youngest cyclists’ age group of 18 -34 years was
found to be the fastest in completing the event, and the tendency was also practically
significant.
h) (i) Tukey post hoc analysis was done to identify the difference in average completion time
between the age groups of 18-34 and 35-44. There was no statistical significant evidence
(MD = - 0.336, p = 0.982) of existence of any difference in completion time, at 5% level of
significance. The 95% confidence interval was [-0.2665, 0.1992]. Hence, with 95%
confidence it was claimed that the average time of the cyclists of 18-34 (years) age group
was 0.27 hours less than that of the 35-44 years age group. Again, 18-34 years of the
cyclists’ estimated completion time was 0.20 hours greater than that of the cyclists of 35-44
years (Lau, Sussman, Espinosa, Katalay, & Allam, 2017).
(ii) At 5% level of significance, pair wise difference in average completion time was seen
for age groups of 18-34 and 45-44, 35-44 and 45-54, 35-44 and 55+, and 45-54 and 55+.
(iii) At 5% level of significance, the cyclists of 55+ years were the slowest of the four age
groups.
i) The cyclists took least of 5.45 hours and most extreme of 11.29 hours to finish the K2
occasion. Age was the most significant factor in completing the race early. Cyclists of 18
years to 44 years finished the occasion with relatively indistinguishable fastest time. Cyclists
over 55 years were the slowest, when contrasted with members of other age gatherings.
Some cyclists were there in each age group, who either finished the race very slowly or very
early with statistical significance.
Document Page
16
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
17
a) (i)
Scenario 1: Free and Sex
Scenario 2: Spent and First
Scenario 3: Purchase and Age
Scenario 4: Quantity Other Quantity Stone
Scenario 5: Spent and Shop
(ii) Variables mapped with type of data as below.
Table 2: Type Details Mapping with Variables
b) The different scenarios and their matched tools have been in the following table.
Document Page
18
Table 3: Different Scenarios Matched with Appropriate Tools
c) The scenarios have been mapped against the statistical tests.
Document Page
19
Table 4: Scenario Comparison with Appropriate test
Scenario Test
Scenario 1 E
Scenario 2 D
Scenario 3 F
Scenario 4 C
Scenario 5 F
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
20
References
Bluman, A. G. (2009). Elementary statistics: A step by step approach. New York: McGraw-Hill
Higher Education.
Kim, T. K. (2015). T test as a parametric statistic. Korean journal of anesthesiology, 68(6), 540-
546.
Lau, Y. T., Sussman, L., Espinosa, E. P., Katalay, S., & Allam, B. (2017). Characterization of
hemocytes from different body fluids of the eastern oyster Crassostrea virginica. Fish &
shellfish immunology, 71, 372-379.
Lord, F. M., & Novick, M. R. (2008). Statistical theories of mental test scores. IAP.
Nordstokke, D. W., & Zumbo, B. D. (2010). A new nonparametric Levene test for equal
variances. Psicologica: International Journal of Methodology and Experimental
Psychology, 31(2), 401-430.
Document Page
21
Appendix:
Table 5: Normality Check of the Timing of the Cyclists – SPSS Outputs
Age
Category
Kolmogorov-Smirnova Shapiro-Wilk
Statistic df Sig. Statistic df Sig.
Time in
hours
18-34 .115 142 .000 .930 142 .000
35-44 .122 271 .000 .872 271 .000
45-54 .094 408 .000 .943 408 .000
55+ .101 179 .000 .958 179 .000
a. Lilliefors Significance Correction
Table 6: Levene’s Homogeneity Test of Variances
chevron_up_icon
1 out of 21
circle_padding
hide_on_mobile
zoom_out_icon
logo.png

Your All-in-One AI-Powered Toolkit for Academic Success.

Available 24*7 on WhatsApp / Email

[object Object]