Your contribution can guide someone’s learning journey. Share your
documents today.
Surname1 Name Instructor Course Date REGRESSION ANALYSIS a.Correlation matrix IncomeValueYears of education AgeMortgage paymentGender Income1 Value0.719651 Years of education0.18804 4 -0.143721 Age0.24255 5 0.21950 4 0.6208577871 Mortgage payment 0.11767 6 0.36057 2 -0.213125505-0.04411 Gender-0.27441-0.0073-0.061944104-0.18630.1989421331 Table 1 As can be observed from the correlation matrix above, the independent variables do not show any high level of correlation with each other hence there no chances of multicollinearity. b.Regression equation SUMMARY OUTPUT Regression Statistics Multiple R0.844053985 R Square0.712427129 Adjusted R Square0.636750058 Standard Error0.63367862 Observations25 ANOVA dfSSMSF Significance F Regression518.90103.7802 9.41404203 2 0.00012166 2 Residual197.62940.4015
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Surname2 Total2426.5304 CoefficientsStd Errort StatP-valueLower 95% Upper 95% Intercept28.616149613.20658.92453.1833E-0821.904935.32738 Value0.0316195350.00526.05507.99203E-060.02070.04255 Years of education0.7081650230.26022.7216 0.01354307 90.16361.25278 Age-0.0569360.0341-1.6704 0.11123434 8-0.12830.01441 Mortgage payment-0.0005255010.0014-0.3832 0.70585726 3-0.00340.00235 Gender-0.587650830.2669-2.20180.04023791-1.1463-0.02902 Table 2 Regression equation Income=0.032(value)+0.71(yearsofeduc)−0.06(age)−0.0005(mortgagepayment)−0.59(gender)+28.62 c.The value of R-squared is 0.71.This means that the independent variables are responsible for 71% of the variation that occurs in the response variable (income). d.Prediction for income Income=0.032(value)+0.71(yearsofeduc)−0.06(age)−0.0005(mortgagepayment)−0.59(gender)+28.62 Income=0.032(275000)+0.71(13)−0.06(48)−0.0005(375)−0.59(2)+28.62 Income=8800+9.23−2.88−0.1875−1.18+28.62 Income=$8,833.60 e.Test for the significance of the independent variables (global hypothesis) Hypothesis H0:βi= 0 H1:βi≠ 0 Alpha = 0.01 It can be observed that the p-value for the slope of mortgage payment, gender, age and years of education are greater than 0.01 (level of significance) hence they are not different from zero. f.Test for independence of variables (individual hypothesis) Hypothesis
Surname3 H0:β1= 0; β2= 0; β3= 0; β4= 0; β5= 0 H1:β1≠ 0; β2≠ 0; β3≠ 0; β4≠ 0; β5≠ 0 At 0.01 level of significance From the regression results above, p-values for all independent variables except “value” are greater than the level of significance (0.01) as can be observed from the regression table above. Independent variable “value” has a p-value of 0.00. This means that it is the only significant variable at 0.01 level of significant. Therefore all the independent variables are dropped except “value”. g.The new regression model SUMMARY OUTPUT Regression Statistics Multiple R 0.71965015 9 R Square 0.51789635 2 Adjusted R Square 0.49693532 4 Standard Error 0.74572411 9 Observations25 ANOVA dfSSMSF Significanc e F Regression113.7413.74 24.707583 45.016E-05 Residual2312.7904 0.55610 4 Total2426.5304 Coefficients Std Errort StatP-valueLower 95% Upper 95% Intercept 35.8888613 7 0.82616 8 43.4401 6 1.4016E- 2334.179803 37.5979 2 Value 0.02623498 7 0.00527 8 4.97067 2 5.0162E- 050.0153167 0.03715 3 Table 3
Surname4 The value of R-squared is 0.52. This means that the independent variable (value) is able to explain 52% of the variation that occurs in income. The regression equation is; Income=0.026(value)+35.89 QUESTION 2 I.TRENDLINES . a. linear trend 200420062008201020122014201620182020 Year 0K 10K 20K 30K 40K 50K 60K 70K 80K Sales Sheet 1 The trend of sum of Sales for Year. Figure 1 P-value:< 0.0001 Equation:Sales = 2387.87*Year + -4.73694e+06
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Surname5 Coefficients TermValueStdErrt-valuep-value Year2387.8769.425734.3946< 0.0001 intercept-4.73694e+06139720-33.9032< 0.0001 Logarithmic trend 200420062008201020122014201620182020 Year 0K 10K 20K 30K 40K 50K 60K 70K 80K Sales Sheet 1 The trend of sum of Sales for Year. Figure 2 P-value:< 0.0001 Equation:Sales = 4.80577e+06*log(Year) + -3.64895e+07 Coefficients TermValueStdErrt-valuep-value log(Year)4.80577e+0613924134.5141< 0.0001 intercept-3.64895e+071.05922e+06-34.4493< 0.0001
Surname6 Exponential trend 200420062008201020122014201620182020 Year 0K 10K 20K 30K 40K 50K 60K 70K 80K Sales Sheet 1 The trend of sum of Sales for Year. Figure 3 P-value:< 0.0001 Equation:ln(Sales) = 0.0355143*Year + -60.3491 Coefficients TermValueStdErrt-valuep-value Year0.03551430.001374425.84< 0.0001 intercept-60.34912.76597-21.8184< 0.0001
Surname7 II.I would suggest linear trendline. This is because it has got a higher R-squared value (0.988) compared to exponential and logarithmic trend. III.2022 sales estimate Equation:Sales = 2387.87*Year + -4.73694e+06 Sales = 2387.87*2022 + -4.73694e+06 Sales = 4828273.14 – 4736940 = 91,333.14 The sales estimates are very reasonable given the sales for 2020 was about 85,000. IV.Summary Count:16 SUM(Sales) Sum:1,098,469 Average:68,654.31 Minimum:49,378 Maximum:84,005 Median:69,218.50 V.Dashboard
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.