Confidence Intervals & Hypothesis Testing in Statistical Analysis

Verified

Added on 2023/06/05

AI Summary

This assignment solution covers problems related to hypothesis testing and confidence intervals in statistics. It includes calculating sample means and standard deviations to construct confidence intervals and performing t-tests to validate claims about population means. The solution discusses the importance of the Central Limit Theorem when dealing with non-normally distributed data and emphasizes using the standard error of the mean for accurate population mean estimation. The problems involve assessing whether sample data supports or rejects a given hypothesis about population parameters, highlighting the practical application of statistical methods in data analysis. This document is available on Desklib, a platform offering a wide range of study resources including past papers and solved assignments for students.

Part - A – Problems
1

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser

1. Answer:
a. Sample mean
x
−
=745+752+754 +745+748+753+746+742+745+748+741+746
12 =747 . 08 ml
Sample standard deviation
s=∑
i=1
12
√ ( xi−747 . 08 ) 2
12 =4 .12 ml
, where xi s are the sample observations
At 95% confidence level the value of t=2. 20 (two tail)
Population mean = μ=750 ml
Hence, confidence interval is
x
−
±t∗ s
√ n =747 . 08±2 . 20∗4 . 12
√ 12 =[744 . 75 ,749 . 41 ]
b. Assuming the data to be normally distributed the claim is tested.
Null Hypothesis: H 0: ( μ=750 )
Alternate Hypothesis: HA : ( μ≠750 )
Let level of significance is 5% and the critical value of the test statistic (t-stat) is
found at (12-1) = 11 degrees of freedom as tcrit =2. 20
2

Now calculated value of the t-stat is
tcal= x
−
−750
4 .12
√ 12
=747 .08−750
4 .12
√ 12
=−2. 45
The p-value is P (|t|>2 . 45 ) =0. 032 (two tailed) and hence the null hypothesis is
rejected at 5% level of significance, as |tcal|>|t crit| and p-value is less than 0.05.
c. The population mean of 750 ml was outside the confidence interval [744 . 75 ,749 . 41 ]
implying that the interval estimator from sample mean is unable to assess the
population mean. Hence, outcome of the hypothesis testing (rejection of null
hypothesis) is true (Bethea).
3

2. Answer
a. If X1 , X2 , X3 , .. .. . ., X n denote a normally distributed simple random sample, then
each Xi ( ∀ i ) is independent and identically distributed. Hence, Xi ( ∀ i ) are also
normally distributed with mean μ and standard deviation σ .
Now
P (−zα
2
≤Z≤zα
2 )=1−α
=> P (−zα
2
≤ Xi−μ
σ ≤z α
2 )=1−α , for any i=1(1 )n
=> P (−σ∗zα
2
≤Xi−μ≤σ∗zα
2 )=1−α
=> P (−σ∗zα
2
≤μ−X i≤σ∗zα
2 )=1−α
=> P ( X i−σ∗zα
2
≤μ≤Xi + σ∗zα
2 )=1−α
So the estimated interval is
Xi±σ∗zα
2 with ( 1−α ) 100 % confidence for the
population mean.
b. The estimator in the second equation should be used for practical purpose. Standard
deviation is an estimator of the variability of the observations. But, Standard error
of mean (SEM) is the estimate of variability of means of sample. Therefore, SEM
precisely measures the accuracy of prediction of population mean by sample mean.
Hence, SEM should be used in estimating the confidence interval more
appropriately. Hence the second equation is the proper choice (Ferguson).
c. If the X is not normally distributed, then the estimated interval (equation 1) for
population mean will not be exact. Mean of samples are generally different from
each other, and in case of non normal distribution the confidence intervals will not
be able to assess the population, thus creating biasness in the estimation. Therefore,
the sampling mean acts as an unbiased estimator in case of non normal distribution
(Using Central Limit theorem) (Dudley).
d. According to Central Limit theorem, even if the random variable X does not follow
normal distribution, the sample mean approximately follows a normal for a large
sample ( n>30 ) . This argument facilitates the use of the second equation as a
measure of confidence interval for a large dataset.
4

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser

Reference
Dudley, R. M. Uniform Central Limit Theorems. Cambridge University Press, 2014.
Ferguson, Thomas S. A Course in Large Sample Theory. Routledge, 2017.
Bethea, Robert M. Statistical methods for engineers and scientists. Routledge, 2018.
5