The solution includes the calculation of frequency distribution, relative frequency distribution, percent frequency distribution, ANOVA, coefficient of determination, correlation coefficient, and hypothesis testing. The output is in JSON format.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Solution of the Assignment Question 1: Here X : Furniture order (in nearest $) Min. Obs. = Min{ X1, X2, …, X50}¿123 Max. Obs. = Max{ X1, X2, …, X50}¿490 So, Range = Max. Obs. – Min. Obs. = 490 -123 = 367 As class width is 50, Number of classes = (Range) / (class width) = 367 / 50 = 7.340 So we did 7.37≈8 classes. We did the classes such that first class include min. obs. and last class include max. obs. We construct the 8 classes as follows: 120–170170-220220-270 270- 320320-370370-420420-470470-520 Frequency distribution: Frequency of particular data point means the number of times that data points occurs in the dataset. Frequency of the class is the number of data points in that class. Following table shows the frequency distribution of dataset given for above mentioned classes: Page1of15
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
ClassesFrequency 120-1708 170-22015 220-27012 270-3204 320-3705 370-4202 420-4702 470-5202 Total50 Relative Frequency distribution: Relative frequency = Class Frequency / Total number of observations ClassesRelative Frequency 120- 1700.16 170- 2200.30 220- 2700.24 270- 3200.08 320- 3700.10 370- 4200.04 420- 4700.04 470- 5200.04 Total1.00 Page2of15
Percent Frequency distribution: Percent frequency = Relative frequency × 100 ClassesPercent Frequency 120-17016 170-22030 220-27024 270-3208 320-37010 370-4204 420-4704 470-5204 Total100 (b) Page3of15
Figure : Percent Frequency Histogram We can observe that positive skewness in the data from percent frequency histogram. (c) Mean Mean is always good measure of location. For our data, Mean = 251.46 Question 2: Simple linear regression: Where we have only one predictor variable. Here the Response variable is Demand and Predictor (independent) variable is unit price. We denote, Y: Demand & X: Unit Price (a) H0: Response variable and predictor variable is not related. Page4of15
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Vs H0: Response variable and predictor variable is related. To test this hypothesis, we construct the ANOVA for simple linear regression: Source of Variation Sum of Squares Degrees of FreedomMean SquareF RegressionSSReg1 MSReg = SSReg F Cal = MSReg / MSRes ResidualSSResn-2 MSRes = SSRes / (n-2) TotalTSSn-1 Given, n – 1 =47, i.e. n = 48 SSReg = 5048.818, SSRes = 3132.661, SST = 8181.479 So, MSReg = SSReg = 5048.818, MSRes = SSRes / (n-2) = 3132.661 / 46 = 68.10 F Cal = SREg / MSRes = 5048.818 / 68.10 = 74.14 Completed ANOVA of the simple linear regression is ANOVA Sourcesof Variation Degrees of FreedomSSMSSF-Value Regression1 5048.818 5048.81874.14 Page5of15
Residual463132.66168.10 Total478181.479 Decision Criteria to decide whether to reject null hypothesis (H0) or not: If we observe that F-Cal > Critical value of F then we reject the null hypothesis otherwise fail to reject null hypothesis. Under null hypothesis, F-Cal follows F distribution with (1, 46) degrees of freedom. where 1 is for numerator degrees of freedom and 46 is denominator degrees of freedom. Table value of F distribution with (1, 46) degrees of freedom atα= 0.05 is F(0.05, 1,46)= 4.052 By comparing F-Cal and table F value, we observe that F-Cal = 74.14 > F(0.05, 1,46)= 4.052 , so we reject null hypothesisatα= 0.05. i.e. we have strong evidence that demand and unit price are related to each other. (b) Coefficient of determination: When we observe the strong correlation between response variable and predictor variable we fit the regression model. Out of total variation in response variable is explained by predictor variable. When variation explained by the predictor variable is more, we say that model fit is good. Page6of15
Coefficient of determination is the proportion of variation explained by the predictor variable from the total variation. Coefficient of determination is denoted by R2and given by Coefficient of determination = SSReg / TSS where TSS is total sum of square. Alternatively it is defined as Coefficient of determination = 1 - SSRes / TSS Coefficient of determination (R2) = 1 - 3132.661 / 8181.479 = 0.6171 It means that 61.71% variation in response (Demand) is explained by the predictor variable (unit price). (c) Correlation coefficient: Correlation coefficient is denoted by r. We know that Coefficient of determination i.e. R2is the square of correlation coefficient (r). So we calculate the r by square root of R2. From slope of X, we can say that there is negative correlation between X and Y. So, r = -√R2 r = -√0.617103= - 0.7856 Relationship between demand and unit price: Page7of15
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
We observed that correlation coefficient is -0.7856, suggest that demand and unit price is strongly negatively related with each other. We can say that when unit price decreases the demand increases and when unit price increases demand will decreases. Question 3: Here we have three treatments. Letμ1is the population mean of first treatment,μ2is the population mean of second treatment Andμ3is the population mean of third treatment. Our null and alternative hypothesis are H0:μ1=μ2=μ3 Vs H1: At least one ofμidifferent. i = 1, 2, 3 For testing the above null hypothesis vs alternative hypothesis, we need to have complete the ANOVA. We have 3 treatments and total observations are n then for testing the above hypothesis, ANOVA is. Source of Variation Sum of SquaresDegrees of FreedomMean SquareF Page8of15
Between TreatmentsSST2MST = SST / 2 Cal F = MST / MSE Within Treatment (Error)SSEn-3 MSE = SSE / (n - 3) TotalTSSn-1 Given: n-1 = 23 i.e. n = 24 i.e. sample size for each treatment is 24 / 3 = 8. SST = 390.58 SSE = 158.40 TSS = 548.98. Total degrees of freedom for error = n - 3 = 24 - 3 = 21 MST = SST / 2 = 390.58 / 2 = 195.290 MSE = SSE / 21 = 158.4 / 21 = 7.54 Cal F = MST / MSE = 195.290 / 7.543 = 25.89 By using the above calculated value we complete the ANOVA as : Source of VariationSum of SquaresDegrees of FreedomMean SquareF Between Treatments390.582195.2925.89 Within Treatment (Error)158.4217.54 Total548.9823 Decision Criteria to decide whether to reject null hypothesis (H0) or not: Page9of15
If we observe that F-Cal > Critical value of F then we reject the null hypothesis otherwise fail to reject null hypothesis. Under null hypothesis, F-Cal follows F distribution with (2, 21) degrees of freedom. where 2 is for numerator degrees of freedom and 21 is denominator degrees of freedom. Table value of F distribution with (2, 21) degrees of freedom atα= 0.05 is F(0.05, 2, 21)= 3.47 By comparing F-Cal and table F value, we observe that F-Cal = 25.89 > F(0.05, 2, 21)= 3.47 , so we reject null hypothesisatα= 0.05.i.e. all the treatment means are not same. At least one of the treatment mean is different. Question 4: Response variable: Y : number of mobile phone sold per day. Predictor variables: X1 :Price in $ 1000 X2 : number of advertising spots (a) From the given output we can write regression equation for multiple when price (X1) and number of advertising spots (X2) are regressed over Y: Y=intercept+slopeofX1×X1+slopeofX2×X2 Intercept = 0.8051, Slope of X1 = 0.4977 and Slope of X2 = 0.4733 Page10of15
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
So,Y=0.8051+0.4977×X1+0.4733×X2 (b) Here we are interested in testing the following null and alternative hypothesis: Null Hypothesis: There is no significant relationship between number of mobile phone sold per day and independent variables Price in $ 1000 and number of advertising spots Vs Alternative Hypothesis: There is no significant relationship between number of mobile phone sold per day and independent variables Price in $ 1000 and number of advertising spots Our first step is construction of complete ANOVA We have two independent variables and 7 observations are taken so ANOVA Source of Variation Sum of Squares Degrees of FreedomMean SquareF RegressionSSReg2 MSReg = SSReg / 2 F Cal = MSReg / MSRes ResidualSSRes4 MSRes = SSE / 4 TotalTSS6 Given: SSreg = 40.7, SSRes = 1.016, TSS = 41.716 MSReg = SSreg / 2 = 40.7 / 2 = 20.35 MSRes = SSres / 4 =1.016 / 4 = 0.254 F Cal = MSReg / MSRes = 20.35 / 0.254 = 80.12 Page11of15
Complete ANOVA is given below by computing the remaining values Source of VariationSum of Squares Degrees of FreedomMean SquareF Regression40.7220.35080.12 Residual1.01640.254 Total41.7166 Decision Criteria to decide whether to reject null hypothesis (H0) or not: If we observe that F-Cal > Critical value of F then we reject the null hypothesis otherwise fail to reject null hypothesis. Under null hypothesis, F-Cal follows F distribution with (2, 4) degrees of freedom. where 2 is for numerator degrees of freedom and 4 is denominator degrees of freedom. Table value of F distribution with (2, 4) degrees of freedom atα= 0.05 is F(0.05, 2, 4)= 6.7 By comparing F-Cal and table F value, we observe that F-Cal = 80.12 > F(0.05, 2, 4)= 6.7 , so we reject null hypothesisatα= 0.05. i.e There is significant relationship between number of mobile phone sold per dayand independent variables (Price in $ 1000, number of advertising spots). (c) i) Here we test H0:β1=0againstH1:β1≠0 Cal t for testing this null hypothesis is Cal−t=^β1 SE(^β1)=0.4977 0.4617=1.078 Page12of15
Under Null hypothesi, Cal-t follows t distribution with 4 degrees of freedom. Critical values oft4atα= 0.05 is Critical t value = 2.78 Decision criteria: Reject H0if|Cal−t|>CriticaltValue Now |1.078| < 2.78 means we fail to reject H0. ii) Here we test H0:β2=0vsH1:β2≠0 Cal-t for testing this hypothesis is Cal−t=^β2 SE(^β2)=0.4733 0.0387=12.22997 Under Null hypothesi, Cal-t follows t distribution with 4 degrees of freedom. Critical values oft4atα= 0.05 is Critical t value = 2.78 Decision criteria: Reject H0if|Cal−t|>CriticaltValue So |12.22997| > 2.78 So we reject Null hypothesis. (d) Interpretation of Slope of X2: Page13of15
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
If price is fixed (i.e. X1 is fixed), then each change in one unit in number of advertising spot (X2), there is 0.4733numbers of mobile phone sold per day (Y). (e) Given: X1 = 20 and X2 = 10 Then Y= 0.8051 + 0.4977 × X1 + 0.4733 × X2=0.8051 + 0.4977 × 20 + 0.4733 × 10 = 15.492≈15 15 (approximately) mobile phones expected to sell in a particular day if company charges $ 20000 and used 10 advertising spots. References i)Bickel, P.J. and Doksum, K.A., 2015.Mathematical statistics: basic ideas and selected topics, volume I(Vol. 117). CRC Press. ii)Box, G.E., Hunter, J.S. and Hunter, W.G., 2005.Statistics for experimenters: design, innovation, and discovery(Vol. 2). New York: Wiley-Interscience. iii)Chatterjee, S. and Hadi, A.S., 2015.Regression analysis by example. John Wiley & Sons. iv)DeGroot, M.H. and Schervish, M.J., 2012.Probability and statistics. Pearson Education. v)Draper, N.R. and Smith, H., 2014.Applied regression analysis(Vol. 326). John Wiley & Sons. vi)Hogg,R.V.andCraig,A.T.,1995.Introductiontomathematicalstatistics.(5"" edition)(pp. 269-278). Upper Saddle River, New Jersey: Prentice Hall. vii)Montgomery, D.C., 2017.Design and analysis of experiments. John wiley & sons. Page14of15
viii)Moyé,L.A.,Chan,W.,&Kapadia,A.S.(2017).Mathematicalstatisticswith applications. CRC Press. ix)Ross, S.M., 2014.Introduction to probability and statistics for engineers and scientists. Academic Press. x)Ryan, T.P., 2008.Modern regression methods(Vol. 655). John Wiley & Sons. Page15of15