This document discusses data and business decision making. It covers topics such as the final dataset, chosen numerical and categorical variables, scatter plots, hypothesis testing, and regression models.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Running head: DATA AND BUSINESS DECISION MAKING Data and Business Decision Making Name of the Student Name of the University Course ID
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
1DATA AND BUSINESS DECISION MAKING Table of Contents Answer to question 1.......................................................................................................................2 Answer to question 2.......................................................................................................................2 Answer to question 3.......................................................................................................................3 Answer to question 4.1....................................................................................................................5 Answer to question 4.2....................................................................................................................6 Answer to question 4.3....................................................................................................................6 Answer to question 4.4....................................................................................................................7 Answer to question 4.5....................................................................................................................7 Answer to question 5.1....................................................................................................................8 Answer to question 6.1..................................................................................................................10 Answer to question 6.2..................................................................................................................10 Answer to question 6.3..................................................................................................................11 Answer to question 6.4..................................................................................................................11 Answer to question 6.5..................................................................................................................12 Answer to question 6.6..................................................................................................................13 Answer to question 6.7..................................................................................................................13 Reference.......................................................................................................................................14
2DATA AND BUSINESS DECISION MAKING Answer to question 1 The Final dataset is prepared by removing all the missing observations in order to get consistent and unbiased results. The final dataset has 111 observations. Answer to question 2 The chosen numerical variables are tenure and age. The average value of tenure is 7.76 years and the tenure of respondent lies between 0 and 19 years. The histogram of the tenure shows the right skewed distribution (Godoey and Reich 2019). The average value of age is 33.37 years and the age of respondents lies between 10 and 38 years. The histogram of the age shows the right skewed distribution. tenureage Mean7.756833.3694 Standard Error0.44620.2867 Median833 Mode738 Standard Deviation4.70053.0209 Sample Variance22.09489.1260 Kurtosis-0.9044-1.2466 Skewness0.13180.0635 Range1910 Minimum028 Maximum1938 Sum8613704 Count111111 The chosen categorical variables are marriage and gender. There is 86% of married and 14% single respondents. There is approximately 42% female and 58% of male respondent.
3DATA AND BUSINESS DECISION MAKING MarriageFrequencyFrequency Percentage married9686% single1514% Grand Total111100% GenderFrequencyFrequency Percentage Female4742% Male6458% Grand Total111100% Answer to question 3 The scatter plot shows a strong relationship between IQ and education (Walker and Zhu 2017). The regression result shows that the education has significant impact on IQ as the p-value of the coefficient is significant at 5% significance level.
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
4DATA AND BUSINESS DECISION MAKING 91 01 11 21 31 41 51 61 71 81 9 0 20 40 60 80 100 120 140 160 IQ vs Educati on SUMMARY OUTPUT Regression Statistics Multiple R0.65637241 R Square0.43082474 Adjusted R Square0.425602949 Standard Error11.55835342 Observations111 ANOVA dfSSMSFSignificance F Regression111022.3210611022.3210682.505161345.25971E-15 Residual10914561.91317133.5955337 Total11025584.23423 CoefficientsStandard Errort StatP-valueLower 95%Upper 95% Intercept50.137663426.2334627728.0433083910.0037.7831428662.49218398 educ3.9914318450.4394284389.083235180.003.1204989684.862364723 The scatter plot shows a moderate positive correlation between KW and age. The regression result shows that the age has significant impact on KW as the p-value of the coefficient of age is significant at 5% significance level.
5DATA AND BUSINESS DECISION MAKING 2527293133353739 0 10 20 30 40 50 60 KW vs Age SUMMARY OUTPUT Regression Statistics Multiple R0.3827247 R Square0.146478196 Adjusted R Square0.13864772 Standard Error6.604072379 Observations111 ANOVA dfSSMSFSignificance F Regression1815.8466008815.846600818.706169263.38941E-05 Residual1094753.90114743.61377199 Total1105569.747748 CoefficientsStandard Errort StatP-valueLower 95%Upper 95% Intercept8.0884876336.9836207821.1582083110.249-5.75282171821.92979698 age0.9015059050.2084376394.325062920.0000.488389281.31462253 Answer to question 4.1 wage Mean1100.7477 Standard Error41.5113 Median1027 Mode1000 Standard Deviation437.3485 901 1134 1368 667 1602 1836 More 433 2070 2303 2537 0 5 10 15 20 25 30 0.00% 20.00% 40.00% 60.00% 80.00% 100.00% 120.00% Histogram Bin Frequency
6DATA AND BUSINESS DECISION MAKING The threshold value is decided by using the median as the distribution is right skewed not normally distributed. Answer to question 4.2 The below graph depicts that the number of male respondents is higher than the female in both thecasesof high and lowwage.The numberof high and lowwage earnersare approximately same across male and female. Answer to question 4.3
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
7DATA AND BUSINESS DECISION MAKING FemaleMaleGrand Total High Wage223355 Low Wage253156 Total4764111 FemaleMaleGrand Total High Wage0.1980.2970.495 Low Wage0.2250.2790.505 Total0.4230.5771.000 FemaleMaleGrand Total High Wage0.4680.5160.495 Low Wage0.5320.4840.505 Total1.0001.0001.000 Gender Wage Joint Probability Wage Gender Marginal Probability Gender Wage Answer to question 4.4 Probability of female or low wage is47+56−25 111=0.7027. Probability of being a female earning a low wage is25 111=0.22523. Answer to question 4.5
8DATA AND BUSINESS DECISION MAKING FemaleMaleGrand Total High Wage223355 Low Wage253156 Total4764111 FemaleMaleGrand Total High Wage23.28831.71255 Low Wage23.71232.28856 Total4764111 FemaleMaleGrand Total High Wage0.0710.0520.124 Low Wage0.0700.0510.121 Total0.1410.1040.245 H_0 = Ther is no association between male and high wage H_1 = Ther is an association between male and high wage degrees of freedom= (rows-1)*(columns-1)1*1 Crtical value at alpha=0.053.841 sigNo Gender Wage Expected Gender Wage (O-E)^2/E Gender Wage The above test shows that the null hypothesis cannot be rejected which says males do not tend to earn higher wage than the females. Answer to question 5.1 Hypothesis Test for μ (Mean) Hypotheses Null Hypothesisμ=900 Alternative Hypothesisμ≠900 Level of significanceα0.05
9DATA AND BUSINESS DECISION MAKING Critical Value (s)1.9600 Population Standard Deviation437.3485 Sample Data Sample Mean1100.748 Sample Size111 Standard Error of the Mean41.51 ZSample Statistic4.835981 p-value0.000003 The z-stat for the sample is 4.836 and the critical value of z-stat is 1.96 at 0.05 significance level. The test rule says the greater value of z-stat of the sample than the critical value at a given significance level is enough to reject the null hypothesis (Johnes 2018). Hence, the alternative hypothesis is accepted here which says the sample mean is not equal to 900. Confidence Interval for mean (m) Data Population Standard Deviation 437.348 5 Sample mean 1100.74 8 Sample Size111 Confidence Level95% Intermediate Calculations Standard error of the mean41.5113 Z Value1.9600 Sampling Error/Margin of Error81.3621 Confidence Interval Interval Lower Limit1019.39 Interval Upper Limit1182.11 The 95% confidence interval of the mean value is (1019.39, 1182.11). This says that the average monthly wage is not same for the current and the previous year.
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
10DATA AND BUSINESS DECISION MAKING Answer to question 6.1 The null hypothesis, H0: The mean coefficient is 0. The alternative hypothesis, H0: The mean coefficient is not 0 Regression Statistics Multiple R0.9405 R Square0.8846 Adjusted R Square0.8755 Standard Error403.9371 Observations111 ANOVA dfSSMSFSignificance F Regression1137584600.6137584600.6843.22300.0000 Residual11017948166.42163165.1493 Total111155532767 CoefficientsStandard Errort StatP-valueLower 95%Upper 95% Intercept0.0000#N/A#N/A#N/A#N/A#N/A educ78.48422.702829.03830.000073.127983.8405 The p-value is significant for the coefficient of education with 0.00 p-value. The education has a significant positive impact on wage (Bottema‐Beutel 2016). One unit rise in the value of education will raise the wage by 78.4842. The regression model: wages=78.4842∗educ Answer to question 6.2
11DATA AND BUSINESS DECISION MAKING Regression Statistics Multiple R0.9414 R Square0.8861 Adjusted R Square0.8771 Standard Error401.2297 Observations111 ANOVA dfSSMSFSignificance F Regression1137824386.7137824386.69856.130.00 Residual11017708380.31160985.28 Total111155532767 CoefficientsStandard Errort StatP-valueLower 95%Upper 95% Intercept0#N/A#N/A#N/A#N/A#N/A IQ10.41820.356129.25970.00009.712611.1238 The p-value is less than 0.05 for the coefficient of IQ. The IQ is significant and has a positive impact on wage. One unit rise in the value of IQ will raise the wage by 10.4182. The regression model: wages=10.4182∗IQ Answer to question 6.3 The value of R2is the goodness of fit for the model. Here in case of education as an explanatory variable, R2is 0.8846 which indicates that the education can explain 88.46% of the variance in wage (Edo and Rapoport 2019). On the other hand, the model with IQ as an explanatory variable, R2is 0.8861 which indicates that the IQ can explain 88.61% of the variance in wage. Here, the value of R2is higher for IQ which means IQ is the better predictor of wage (Wang 2016). Answer to question 6.4
12DATA AND BUSINESS DECISION MAKING SUMMARY OUTPUT Regression Statistics Multiple R0.1461 R Square0.0213 Adjusted R Square0.0124 Standard Error434.6367 Observations111 ANOVA dfSSMSFSignificance F Regression1449017.82449017.822.380.13 Residual10920591087.12188909.0562 Total11021040104.94 CoefficientsStandard Errort StatP-valueLower 95%Upper 95% Intercept1278.2647122.309510.45110.00001035.85131520.6780 exper-15.750910.2165-1.54170.1260-35.99964.4978 The p-value of the coefficient of work experience is greater than 0.05 which indicates that it has no significant impact on wage. The regression model: wages=1278.647−15.7509∗exper Answer to question 6.5
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
13DATA AND BUSINESS DECISION MAKING SUMMARY OUTPUT Regression Statistics Multiple R0.940 R Square0.884 Adjusted R Square0.875 Standard Error405.201 Observations111 ANOVA dfSSMSFSignificance F Regression1137472089.60137472089.60837.280.00 Residual11018060677.40164187.98 Total111155532767.00 CoefficientsStandard Errort StatP-valueLower 95% Intercept0#N/A#N/A#N/A#N/A KW28.670.9928.940.0026.70 The p-value of the coefficient of KW scores is less than 0.05 which implies that it has significant impact on wage. The regression model: wages=28.67∗KW Answer to question 6.6 The KW score is the better predictor as the coefficient of this variable is significant and the value of R2is better. Answer to question 6.7 The link between wage and education is strong. The goodness of fit is better in case of the model of wage and education compared to the model of wage and work experience.
14DATA AND BUSINESS DECISION MAKING Reference Bottema‐Beutel, K., 2016. Associations between joint attention and language in autism spectrum disorder and typical development: A systematic review and meta‐regression analysis.Autism Research,9(10), pp.1021-1035. Edo, A. and Rapoport, H., 2019. Minimum wages and the labor market effects of immigration. Labour Economics,61, p.101753. Godoey, A. and Reich, M., 2019. Minimum Wage Effects in Low-Wage Areas. Working Paper# 106-19. Johnes, G., 2018. A sporting chance: on the impact of sports participation on subsequent earnings.Economics Bulletin,38(1), pp.146-151. Walker, I. and Zhu, Y., 2017. University selectivity and the graduate wage premium: Evidence from the UK. Wang, Z., 2016. Wage growth, ability sorting, and location choice at labor-force entry: New evidence from US Census data.Journal of Urban Economics,96, pp.112-120.