Statistics - Frequency Distribution, Linear Regression, ANOVA, Multiple Regression
Verified
Added on  2023/06/05
|10
|1171
|154
AI Summary
This article covers topics like frequency distribution, linear regression, ANOVA, and multiple regression in statistics. It includes tables and figures to explain the concepts.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Running Head: STATISTICS Statistics Name of the student: Name of the university: Course ID:
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
2STATISTICS Table of tables Table 1: The frequency distribution table of examination scores of 20 selected students..............2 Table 2: First linear regression model.............................................................................................3 Table 3: One-way ANOVA table....................................................................................................4 Table 4: Multiple regression model.................................................................................................6 Table 5: First linear regression model.............................................................................................7 Table of Figures Figure 1: Histogram of frequency in percentages of examination scores of 20 selected students. .2
3STATISTICS Answer 1. Answer 1. a) Class intervals Class boundaries Frequencie s Cumulative frequencies Relative frequencie s Cumulative relative frequencies Percent frequencies 50-5949.5-59.5330.150.1515% 60-6959.5-69.5250.100.2510% 70-7969.5-79.55100.250.525% 80-8979.5-89.54140.200.720% 90-9989.5-99.56200.30130% Total201.00100% Table1: The frequency distribution table of examination scores of 20 selected students Answer 1. b) 50-5960-6970-7980-8990-99 0% 5% 10% 15% 20% 25% 30% 35% 15% 10% 25% 20% 30% Histogram of Examination Scores Classintervalsofscore Frequenciesinpercenatges Figure1: Histogram of frequency in percentages of examination scores of 20 selected students The shape of the distribution of Examination score shows-
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
4STATISTICS The distribution of examination score is left-skewed. For this distribution, mode>median>mode. The right tail of the distribution is longer than its left tail. Answer 2. Table2: First linear regression model Answer 2. a) The sample size is (40+1) = 41. Answer 2. b) In this linear regression model, the p-value of ‘Unit Price’ (X) (0.175156156) is greater than the level of significance (0.05). Hence, the null hypothesis of insignificant relationship
5STATISTICS between independent and dependent variable cannot be rejected with 95% probability. The statistical and linear relationship between ‘Unit Price (X)’ and ‘Supply (Y)’ could be established. Answer 2. c) The ‘Co-efficient of determination’ of the model is found to be0.047996. It shows that 4.799% variability of ‘Unit Price (X)’ is explained by ‘Supply (Y)’. Answer 2. d) Pearson’s co-efficient of correlation (r) is0.21908that refers a weak positive correlation between ‘Unit price’ and ‘Supply’ as (r is in-between 0.0 to 0.3). Therefore, these two variables are not significantly linked. Answer 2. e) The linear regression model is estimated as, Y =0.029*X + 54.076 For unit price (X) = $50,000, then,Y = [54.076 + (0.029*50000)] = 1504.076. The supply is predicted to be 1504.076 units. Answer 3. Answer 3. a)
6STATISTICS Table3: One-way ANOVA table Answer 3. b) At the 5% level of significance, the hypotheses are stated as- Null hypothesis (H0):The mean scores of all the four programs are equal to each other. Alternative hypothesis (HA):There exists at-least one inequality in the mean scores of the four programs. The p-value of the F-statistic (6.140351) is 0.00557 that is less than level of significance (5%). Hence, the null hypothesis could be with rejected at 95% probability and the alternative hypothesis cannot be rejected. It could be concluded that the average productivity of workers of Program C has higher than other three programs. It is recommended that the line programmers of Program C are more acceptable because of its are more effectiveness. Answer 4. The company records the data of weekly sales of products (y) as response factor; unit price of the competitor’s product (x1) and advertising expenditures (x2) as predictor factor.
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
7STATISTICS Table4: Multiple regression model Answer 4. a) The Predicted regression equation is- ‘Sales’ (Y) =3.597615086 + 41.32002219 * ‘Price’ (X1) + 0.013241819 * ‘Advertising’ (X2). Answer 4. b) The significance level of the F-test is assumed to be 10%. The p-value of the multiple regression model is0.052643614; it is less than 0.1. Hence, the null hypothesis could be rejected
8STATISTICS with 90% probability. Therefore, the multiple regression model is found to be statistically significant at 10% level of significance. Answer 4. c) The p-values of the predictors are respectively- ‘Price’ (0.036289) and ‘Advertising’ (0.969694). Thep-value of ‘Advertising’ is greater than 5% and the p-value of ‘Price’ is less than 5%. The p-values indicates that ‘Price’ is the significant factor and ‘Advertising’ is the insignificant factor. Table5: First linear regression model
9STATISTICS Answer 4. d) ‘Advertising’ being an insignificant factor, it is dropped from the multiple regression model. Now, the regression model considers only ‘Sales’ as dependent variable and ‘Price’ as independent variable. The newly structured linear regression model finds the estimated equation- ‘Sales (Y)’ =3.58178844 + 41.6030534 * ‘Price (X1)’. Answer 4. e) The slope(β) of the linearregressionmodelis41.6030534. It showsa positive association between the predictor ‘Price’ and the response ‘Sales’. For 1 unit growth in ‘Price’, the ‘Sales’ would grow by 41.6 units. Similarly, for 1 unit decrease in ‘Price’, the ‘Sales’ would reduce by 41.6 units.