This document contains solved assignments for the Statistics for Business course (STA101). It includes calculations and interpretations of covariance, correlation, hypothesis testing, confidence intervals, measures of central tendency, and probability calculations.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Assessment 1 - Assignment Unit:STA101 – Statistics for Business Student Name: Student Number: Course Instructor: Date: 19thJanuary 2019
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Question 1: A sample of eight observations of variables x (years of experience) and y (salary in $1,000s) is shown below: x53792468 y2023151127211714 a.Calculate and interpret the covariance between x and y. Answer Cov(X,Y)= ∑ i=1 n (Xi−X)(Yi−Y) n X= ∑ i=1 n Xi n=5+3+7+9+2+4+6+8 8=44 8=5.5 Y= ∑ i=1 n Yi n=20+23+15+11+27+21+17+14 8=148 8=18.5 Cov(X,Y)= ∑ i=1 n (Xi−X)(Yi−Y) n ¿(5−5.5)(20−18.5)+…+(8−5.5)(14−18.5) 8=−89 8=−11.125 b.Give a possible reason that the covariance is negative. Answer A possible reason as to why the covariance is negative is that the relationship between x and y is negative. X and Y move in an inverse direction. c.Calculate the coefficient of correlation, and comment on the relationship between x and y. Answer r=Cov(X,Y) SD(X)SD(Y)
SD(X)=√∑ i=1 n (Xi−X) 2 n−1=√(5−5.5)2+(3−5.5)2+(6−5.5)2+(8−5.5)2 7=√42 7=√6=2.4495 SD(Y)=√∑ i=1 n (Yi−Y) 2 n−1=√(20−18.5)2+(23−18.5)2+(17−18.5)2+(14−18.5)2 7=√192 7=√27.42857=5.23 r=Cov(X,Y) SD(X)SD(Y)=−11.125 2.4495∗5.237229=−0.86721 d.Give a possible reason that the correlation is negative. Answer The possiblereason thatthe correlationisnegative isthe fact thatthe relationship between x and y is negative. X and Y move in an inverse direction. Question 2: A company claims that 10% of the users of a certain sinus drug experience drowsiness. In clinical studies of this sinus drug, 81 of the 900 subjects experienced drowsiness. a.We want to test their claim and find out whether the actual percentage is not 10%. State the appropriate null and alternative hypotheses. Answer H0: The proportion of users who use sinus drug and experience drowsiness is 10% HA: The proportion of users who use sinus drug and experience drowsiness is not 10% This can also be written as follows; H0:p=0.1 HA:p≠0.1 b.Is there enough evidence at the 5% significance level to infer that the competitor is correct? Answer We compute the Z statistics as follows; Z=^p−p √p(1−p) n
^p=81 900=0.09 Z=^p−p √p(1−p) n =0.09−0.1 √0.1(1−0.1) 900 =−1.000 The P-Value is 0.317311. Since the p-value is greater than the 5% level of significance, we fail to reject the null hypothesis and conclude that there is noenough evidence at the 5% significance level to infer that the competitor is correct. c.Construct a 95% confidence interval estimate of the population proportion of the users of this allergy drug who experience drowsiness. Answer C.I:^p±Zα/2√^p(1−^p) n ^p=81 900=0.09 C.I:^p±Zα/2√^p(1−^p) n→0.09±1.96∗ √0.09(1−0.09) 900 →0.09±1.96∗ √0.09(1−0.09) 900 →0.09±1.96∗0.009539 →0.09±0.018697 Lower limit:0.09−0.018697=0.071303 Upper limit:0.09+0.018697=0.108697 d.Explain how to use this confidence interval to test the hypotheses. Answer To test for hypothesis we look at whether the interval contains the 10%. As can be seen, the 95% confidence interval is between 0.0713 (7.13%) and 0.1087 (10.87%); the interval contain the 10%. Since the 10% is contained in the interval, the null hypothesis cannot be rejected. Question3: Below are monthly rents paid by 30 students who live off campus. a.Find the mean, median, and mode.
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Answer Mean,x= ∑ i=1 n xi n=730+730+…+720+700 30=21740 30=725 Median=15thvalue+16thvalue 2=720+720 2=1440 2=720 Mode=mostfrequent=730 b.Do the measures of central tendency agree? Explain. Answer Yes the measures of central tendency seem to agree. This is because the values for the three measures are almost equal hence we conclude that they agree. c.Calculate the standard deviation. Answer SD=√∑ i=1 n (xi−x)2 n−1=√(730−725)2+(730−725)2+…+(720−725)2+(700−725)2 30−1=√378750 29=√13060.34=1 d.Are there outliers or unusual data values? Answer We compute the inner fences Q1=662.5 Q3=755.0 Interquartilerange(IQR)=Q3−Q1→755.0−662.5=92.5 Upper boundary:755.0+1.5∗92.5=893.75 Lower boundary:662.5−1.5∗92.5=523.75 Values such as 1030, 930 and 500 are outside the boundary hence they can be regarded as outliers or unusual data values. Thus we can confidently say that there are outliers or unusual data values. e.Using the Empirical Rule, do you think the data could be from a normal population? Answer μ±1SD 725±1∗114.2814 725−114.2814=610.7186
725+114.2814=839.2814 From the data, 70% fall within one standard deviation of the mean. μ±2SD 725±2∗114.2814 725−228.5628=496.4372 725+228.5628=953.5628 From the data, 96.7% fall within two standard deviation of the mean. μ±3SD 725±3∗114.2814 725−342.8442=382.1558 725+342.8442=1067.8442 From the data, 100.0% fall within three standard deviation of the mean. According to the empirical rule it is supposed to be 68-95-99 7. Where 68% of the data fall with one standard deviation of the mean; 95% fall within two standard deviation of the mean and 99.7% fall within three standard deviation of the mean. Since the data is not fromtheempiricalrule,thegivedatasetcanbethoughtoffollowinganormal distribution. 730730730930700570 6901,030740620720670 560740650660850930 600620760690710500 730800820840720700 Question 4: Three messenger services deliver to a small town in Oregon. Service A has 60% of all the scheduled deliveries, service B has 30%, and service C has the remaining 10%. Their on-time rates are 80%, 60%, and 40% respectively. Define event O as a service delivers a package on time. a.CalculateP(AandO).
Answer P(A∧O)=P(A)P(O|A)=0.6∗0.8=0.48 b.Calculate the probability that a package was delivered on time. Answer P(O)=P(O∧A)+P(O∧B)+P(O∧C) ¿P(A)∗P(O∨A)+P(B)∗P(O∨B)+P(C)∗P(O∨C) ¿0.6∗0.8+0.3∗0.6+0.1∗0.4=0.48+0.18+0.04 ¿0.7=70% Thus probabilitythat a package was delivered on time is 70% (0.7). c.If a package was delivered on time, what is the probability that it was serviceA? Answer P(A∨O)=P(A∧O) P(O) ¿P(O∨A)∗P(A) P(O) ¿0.8∗0.6 0.7 ¿0.686=68.6% d.If a package was delivered 40 minutes late, what is the probability that it was serviceB? Answer P(B∨O)=P(B∧O) P(O) ¿P(O∨B)∗P(B) P(O) ¿(1−P(O∨B))∗P(B) (1−P(O))
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
¿(1−0.6)∗0.3 (1−0.7) ¿0.4∗0.3 0.3 ¿0.4=40% e.If a package was delivered 40 minutes late, what is the probability that it was serviceC? Answer P(C∨O)=P(C∧O) P(O) ¿P(O∨C)∗P(C) P(O) ¿(1−P(O∨C))∗P(C)¿¿ (1−P(O)) ¿(1−0.4)∗0.1 (1−0.7) ¿0.6∗0.1 0.3 ¿0.2=20%