Statistics Exam: Frequency Distribution, Regression Analysis, ANOVA, and Sales Data
Verified
Added on 2023/06/04
|7
|1353
|141
AI Summary
This exam covers topics such as constructing frequency distribution, regression analysis, ANOVA, and sales data analysis. It includes sample data sets and Excel outputs for better understanding.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Statistics Student Name: Instructor Name: Course Number: 25 September 2018
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Question 1 (7 marks) Below you are given the examination scores of 20 students (data set also provided in accompanying MS Excel file). a.Construct a frequency distribution, cumulative frequency distribution, relative frequency distribution, cumulative relative frequency distribution and percent frequency distribution for the data set using a class width of 10.(5 marks) Answer ClassFrequencyCumulative frequency Relative frequency Cumulative relative frequency Percent frequency 50-59330.150.1515% 60-69250.10.2525% 70-795100.250.550% 80-894140.20.770% 90-996200.31100% b.Construct a histogram showing the percent frequency distribution of the examination scores. Comment on the shape of the distribution.(2 marks) Answer The shape of the histogram shows a kind of left skewed distribution, this shows that the distribution is negatively skewed. However, the skewness is not severe.
Question 2 (8 marks) Shown below is a portion of a computer output for a regression analysis relating supply (Y in thousands of units) and unit price (X in thousands of dollars) a.What has been the sample size for this problem? (1 mark) Answer Sample size = 39+1+1 = 41 b.Determine whether or not supply and unit price are related. Use α = 0.05. (2 marks) Answer We obtain the t-value tvalue=Beta StandardError=0.029 0.021=1.3810 The t-critical value is 2.022; this value is greater than the computed value hence the null hypothesis is not rejected. This means thatsupply and unit price are not related at α = 0.05. c.Compute the coefficient of determination and fully interpret its meaning. Be very specific. (2 marks) Answer R2=EE TSS=354.689 354.689+7035.262=354.689 7389.951=0.047996 The coefficient of determination is 0.048; this implies that unit price only explains 4.8% of the variation in the dependent variable. d.Compute the coefficient of correlation and explain the relationship between supply and unit price. (2 marks) Answer Coefficientofcorrelation=√R2=R Coefficientofcorrelation=√0.047996=0.2191 The coefficient of correlation is 0.21921; this indicates that there is a weak positive relationship between supply and unit price. e.Predict the supply (in units) when the unit price is $50,000. (1 mark) Answer The regression equation model is; y=54.076+0.029(X) So whenX=$50,000 Supply would be;
y=54.076+0.029(50000)=1504.076≈1505 Thus the supply is approximately 1505. Question 3 (6 marks) Allied Corporation wants to increase the productivity of its line workers. Four different programs have been suggested to help increase productivity. Twenty employees, making up a sample, have been randomly assigned to one of the four programs and their output for a day's work has been recorded. You are given the results below (data set also provided in accompanying MS Excel file). a.Construct an ANOVA table. (3 marks) Answer The following is the ANOVA Table Anova: Single Factor SUMMARY GroupsCountSum Averag e Varianc e Program A5725145525 Program B5675135425 Program C5950190312.5 Program D5750150637.5 ANOVA Source of VariationSSdfMSFP-valueF crit Between Groups875032916.6676.1403510.005573.238872 Within Groups760016475 Total1635019 b.As the statistical consultant to Allied, what would you advise them? Use a .05 level of significance. (3 marks) Answer Looking at the above results, it can be seen that the p-value is 0.00557 (a value less thanα= 0.05). With this, we therefore reject the null hypothesis and conclude that the productivity varies based on the program. Results further showed that Program D had significantly higher productivity than any other program. As the statistical consultant to Allied, I would advise them to consider program D since more productivity would be realized from this program.
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Question 4 (9 marks) A company has recorded data on the weekly sales for its product (y), the unit price of the competitor's product (x1), and advertising expenditures (x2). The data resulting from a random sample of 7 weeks follows. Use Excel's Regression Tool to answer the following questions (data set also provided in accompanying MS Excel file). a.What is the estimated regression equation? Show the regression output. (2 marks) Answer SUMMARY OUTPUT Regression Statistics Multiple R0.877814 R Square0.770558 Adjusted R Square0.655837 Standard Error1.83741 Observations7 Coefficient s Standard Errort StatP-value Lower 95% Upper 95% Intercept3.5976154.0522440.8878080.424805-7.6532214.84845 Price41.3200213.337363.0980650.0362894.28956778.35048 Advertising0.0132420.3275920.0404220.969694-0.89630.922782 The estimated regression equation is given below; Sales=3.5976+41.3200(Price)+0.01324(Advertising) b.Determine whether the model is significant overall. Use α = 0.10. (2 marks) Answer ANOVA dfSSMSF Significanc e F Regressio n245.3528422.676426.7168010.052644 Residual413.50433.376075
Total658.85714 As can be seen I the above table, the p-value for the F-statistics is 0.0526 (a value less than α = 0.10), this leads to rejection of the null hypothesis and thus we conclude that the overall model is significant at α = 0.10. c.Determine if competitor’s price and advertising is individually significantly related to sales. Use α = 0.10. (2 marks) Answer Coefficient s Standard Errort StatP-value Lower 95% Upper 95% Intercept3.5976154.0522440.8878080.424805-7.6532214.84845 Price41.3200213.337363.0980650.0362894.28956778.35048 Advertising0.0132420.3275920.0404220.969694-0.89630.922782 The p-value for the price is 0.036(a value less than α = 0.10), this leads to rejection of the null hypothesis and thus we conclude thatcompetitor’s price is individually significantly related to salesat α = 0.10. The p-value for the advertising is 0.9697(a value greater than α = 0.10), this leads to non- rejection of the null hypothesis and thus we conclude thatadvertising is individually not significantly related to salesat α = 0.10. d.Based on your answer to part (c), drop any insignificant independent variable(s) and re-estimate the model. What is the new estimated regression equation? (2 marks) Answer We drop the advertising and the new results as shown below; SUMMARY OUTPUT Regression Statistics Multiple R0.877761 R Square0.770464 Adjusted R Square0.724557 Standard Error1.643765 Observations7 ANOVA dfSSMSF Significanc e F Regressio n145.3473345.3473316.783110.009385
Residual513.509812.701963 Total658.85714 Coefficient s Standard Errort StatP-value Lower 95% Intercep t3.5817883.6082150.9926760.366447-5.69342 Price41.6030510.155214.0967190.00938515.49825 The estimated regression equation is given below; Sales=3.5818+41.6031(Price) e.Interpret the slope coefficient(s) of the model from part (d). (1 marks) Answer The slope coefficient for the competitor’s price is 41.6031; this implies that a unit increase in the competitor’s price would result to an increase in the sales made by 41.6031. Similarly, a unit decrease in the competitor’s price would result to a decrease in the sales made by 41.6031.