Data Analytics MG4002
Table of Contents
Abstract
Introduction
Conceptual Framework
Methodology
Data Analysis and Results
Discussion and Conclusions
Abstract

A data analysis was performed to test whether a statistically significant relationship exists between the dependent variable and each of the independent variables. All results were obtained from SPSS output. The analysis finds insufficient evidence to conclude that a statistically significant relationship exists between the variables family important and child qualities: feeling of responsibility. There is likewise insufficient evidence to conclude that a statistically significant relationship exists between the variables work important and child qualities: feeling of responsibility. In other words, no statistically significant relationship was found between the dependent variable and either independent variable, so we cannot use a linear regression model for the prediction of the dependent variable child qualities: feeling of responsibility.

Introduction

It is important to check the relationship between variables, because this tells us the nature of the relationship, and further decisions about the dependent and independent variables can be based on it. Checking relationships for prediction purposes is especially important because the results can inform policy decisions. Here, we examine the relationship between three variables: Child Qualities: Feeling of Responsibility, Family Important, and Work Important. For this study, we take family important and work important as the two independent variables and child qualities: feeling of responsibility as the dependent variable.

We first compute the correlation coefficients and then check whether they are statistically significant. Both a parametric and a non-parametric technique are used to measure the extent of the relationship between the given variables: Pearson's correlation coefficient r and Spearman's rank correlation coefficient (Spearman's rho). Let us see this study in detail.
Conceptual Framework

For this research study, we have to find the relationship between the dependent variable child qualities: feeling of responsibility and the independent variables family important and work important. First, we check whether a statistically significant relationship exists between child qualities: feeling of responsibility and family important. We then check whether a statistically significant relationship exists between child qualities: feeling of responsibility and work important.

Dependent variable (response variable): Child qualities: feeling of responsibility
Independent variables (predictors): Family important, Work important

Based on these variables, we state two hypotheses.

Hypothesis 1
Null hypothesis: H0: There is no statistically significant relationship between the two variables family important and child qualities: feeling of responsibility.
Alternative hypothesis: Ha: There is a statistically significant relationship between the two variables family important and child qualities: feeling of responsibility.

Hypothesis 2
Null hypothesis: H0: There is no statistically significant relationship between the two variables work important and child qualities: feeling of responsibility.
Alternative hypothesis: Ha: There is a statistically significant relationship between the two variables work important and child qualities: feeling of responsibility.

If, after testing these two hypotheses, there is insufficient evidence to conclude that a statistically significant relationship exists between the dependent variable and the independent variables, then we cannot use a linear regression model. The dependent variable is binary (dichotomous), i.e. it has only two possible responses, so in this case we will use a logistic regression model for the prediction of the dependent variable child qualities: feeling of responsibility.
Methodology

For this research study, the statistical data analysis is carried out in SPSS. First we obtain the frequency distributions of the three variables, and then we examine the relationships between them using the Pearson correlation coefficient and the Spearman correlation coefficient. We first examine the relationship between the dependent variable child qualities: feeling of responsibility and the independent variable family important, and then the relationship between child qualities: feeling of responsibility and the independent variable work important.

Decisions about the null hypotheses are based on the corresponding P-values in the SPSS output. Both tests use a 5% level of significance: we reject the null hypothesis if the P-value is less than alpha = 0.05 and do not reject it otherwise. If there is no statistically significant evidence of a linear relationship between the dependent and independent variables, we cannot use a linear regression model for the prediction of the dependent variable, and another regression model is needed. In the given data the dependent variable takes only two values, so we will use a binary logistic regression model for the prediction of the dependent variable child qualities: feeling of responsibility. Let us see this data analysis in detail.

Data Analysis and Results

In this section, we analyse the given data in SPSS. We begin with the frequency distributions of the three variables Family important, Work important, and Child qualities: feeling of responsibility.
For this study, the two independent variables are family important and work important, and the dependent variable is Child qualities: feeling of responsibility. The frequency distribution of the variable family important is given below:

Statistics: Family important
N    Valid      1038
     Missing       3

Family important
                                                  Frequency   Percent   Valid Percent   Cumulative Percent
Valid     Very important                                981      94.2            94.5                 94.5
          Rather important                               44       4.2             4.2                 98.7
          Not very important                             10       1.0             1.0                 99.7
          Not at all important                            3        .3              .3                100.0
          Total                                        1038      99.7           100.0
Missing   Missing; Not asked by the interviewer           2        .2
          No answer                                       1        .1
          Total                                           3        .3
Total                                                  1041     100.0

From the frequency distribution table, 3 of the 1041 participants have missing values. Most participants said that family is very important: 981 respondents, or 94.5% of the valid responses. A further 44 respondents said that family is rather important, 10 said that it is not very important, and 3 said that it is not at all important.

Next, we look at the frequency distribution of the independent variable work important. The SPSS output for this frequency distribution is given below:

Statistics: Work important
N    Valid       940
     Missing     101
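The Percent, Valid Percent and Cumulative Percent columns of the Family important table can be recomputed from the raw counts. The following is only a minimal sketch in Python rather than SPSS; the function name `frequency_table` is our own and is not part of the original analysis. Percent uses all 1041 cases as the base, while Valid Percent and Cumulative Percent use only the 1038 valid cases.

```python
def frequency_table(counts, n_missing):
    """Return rows of (label, frequency, percent, valid percent, cumulative percent).

    counts    -- dict of category label -> frequency among valid cases
    n_missing -- number of missing cases (included in the base for "percent" only)
    """
    n_valid = sum(counts.values())
    n_total = n_valid + n_missing
    rows = []
    cumulative = 0
    for label, freq in counts.items():
        cumulative += freq
        rows.append((
            label,
            freq,
            round(100 * freq / n_total, 1),        # Percent: base is all cases
            round(100 * freq / n_valid, 1),        # Valid Percent: base is valid cases
            round(100 * cumulative / n_valid, 1),  # Cumulative Percent over valid cases
        ))
    return rows


# Counts taken from the Family important frequency table.
family_important = {
    "Very important": 981,
    "Rather important": 44,
    "Not very important": 10,
    "Not at all important": 3,
}

family_rows = frequency_table(family_important, n_missing=3)
for row in family_rows:
    print(row)
```

The same function can be applied to the other two variables by passing their counts and number of missing cases.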
Work important
                                                  Frequency   Percent   Valid Percent   Cumulative Percent
Valid     Very important                                353      33.9            37.6                 37.6
          Rather important                              365      35.1            38.8                 76.4
          Not very important                            108      10.4            11.5                 87.9
          Not at all important                          114      11.0            12.1                100.0
          Total                                         940      90.3           100.0
Missing   Missing; Not asked by the interviewer          98       9.4
          No answer                                       2        .2
          Don't know                                      1        .1
          Total                                         101       9.7
Total                                                  1041     100.0

There are 101 missing values and 940 valid responses. Of the valid responses, 353 respondents (37.6%) said that work is very important, 365 (38.8%) said that work is rather important, 108 (11.5%) said that work is not very important, and 114 (12.1%) said that work is not at all important.

Next, we look at the frequency distribution of the variable Child qualities: feeling of responsibility. The SPSS output for this frequency distribution is given below:
Statistics: Child qualities: feeling of responsibility
N    Valid      1041
     Missing       0

Child qualities: feeling of responsibility
                          Frequency   Percent   Valid Percent   Cumulative Percent
Valid   Mentioned               626      60.1            60.1                 60.1
        Not mentioned           415      39.9            39.9                100.0
        Total                  1041     100.0           100.0

From this table, 626 respondents (60.1%) mentioned feeling of responsibility as a child quality, and 415 respondents (39.9%) did not mention it.

We now turn to the correlational analysis of the relationships between the three variables. The two hypotheses are tested in turn.

Hypothesis 1
Null hypothesis: H0: There is no statistically significant relationship between the two variables family important and child qualities: feeling of responsibility.
Alternative hypothesis: Ha: There is a statistically significant relationship between the two variables family important and child qualities: feeling of responsibility.

For this test, we use a 5% level of significance. The SPSS results for this test are given below:
Correlations (Pearson)
                                                                     Family important   Child qualities: feeling of responsibility
Family important                             Pearson Correlation                    1                                         .024
                                             Sig. (2-tailed)                                                                  .441
                                             N                                   1038                                         1038
Child qualities: feeling of responsibility   Pearson Correlation                 .024                                            1
                                             Sig. (2-tailed)                     .441
                                             N                                   1038                                         1041

From the SPSS output, the Pearson correlation coefficient between family important and child qualities: feeling of responsibility is 0.024, which indicates a very low, negligible correlation between these two variables. The P-value is 0.441, which is greater than the alpha value 0.05, so we do not reject the null hypothesis that there is no statistically significant relationship between family important and child qualities: feeling of responsibility. There is insufficient evidence to conclude that a statistically significant relationship exists between these two variables.

Next, we look at the non-parametric Spearman's correlation coefficient between the same two variables. The SPSS output is given below:

Correlations (Spearman's rho)
                                                                         Family important   Child qualities: feeling of responsibility
Family important                             Correlation Coefficient                1.000                                         .020
                                             Sig. (2-tailed)                            .                                         .525
                                             N                                       1038                                         1038
Child qualities: feeling of responsibility   Correlation Coefficient                 .020                                        1.000
                                             Sig. (2-tailed)                         .525                                            .
                                             N                                       1038                                         1041
From the Spearman output, the correlation coefficient between family important and child qualities: feeling of responsibility is 0.020, which is negligible. The P-value is 0.525, which is greater than the alpha value 0.05, so we do not reject the null hypothesis that there is no statistically significant relationship between these two variables. There is insufficient evidence to conclude that a statistically significant relationship exists between family important and child qualities: feeling of responsibility.

Hypothesis 2
Null hypothesis: H0: There is no statistically significant relationship between the two variables work important and child qualities: feeling of responsibility.
Alternative hypothesis: Ha: There is a statistically significant relationship between the two variables work important and child qualities: feeling of responsibility.

For this test, we use a 5% level of significance. The SPSS results for this test are given below:

Correlations (Pearson)
                                                                     Child qualities: feeling of responsibility   Work important
Child qualities: feeling of responsibility   Pearson Correlation                                              1            -.009
                                             Sig. (2-tailed)                                                                .779
                                             N                                                             1041              940
Work important                               Pearson Correlation                                          -.009                1
                                             Sig. (2-tailed)                                               .779
                                             N                                                              940              940

From this table, the Pearson correlation coefficient between the two variables is -0.009, which is negative but negligible in size. The P-value for this test is 0.779, which is greater than the alpha value 0.05, so we do not reject the null hypothesis that there is no statistically significant relationship between work important and child qualities: feeling of responsibility. There is insufficient evidence to conclude that a statistically significant relationship exists between these two variables.
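The correlation tests used in this section can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the SPSS procedure itself: all function names are our own, Spearman's coefficient is computed as the Pearson correlation of mid-ranks (tied values share the average rank), and the two-tailed P-value uses a normal approximation to the t distribution, which is reasonable here only because the samples are large (940 and 1038 cases). Because the coefficients in the output are rounded to three decimals, the approximate P-values agree with SPSS only to about two decimals.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def midranks(values):
    """Rank values from 1 to n; tied values receive the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value is tied with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        average_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = average_rank
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho as the Pearson correlation of the mid-ranks."""
    return pearson_r(midranks(x), midranks(y))


def approx_two_tailed_p(r, n):
    """Approximate two-tailed P-value for H0: no correlation (large-n normal approximation)."""
    t = r * math.sqrt(n - 2) / math.sqrt(1 - r * r)
    return math.erfc(abs(t) / math.sqrt(2))


def decide(p_value, alpha=0.05):
    """Decision rule from the Methodology section."""
    return "reject H0" if p_value < alpha else "do not reject H0"


# Coefficients and sample sizes taken from the two Pearson tables.
for label, r, n in [("family important", 0.024, 1038), ("work important", -0.009, 940)]:
    p = approx_two_tailed_p(r, n)
    print(label, round(p, 2), decide(p))
```

With the raw survey data available, `pearson_r` and `spearman_rho` would be called on the two response columns after dropping cases with a missing value in either variable.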
We now check this relationship using the non-parametric Spearman correlation coefficient. The SPSS output is given below:

Correlations (Spearman's rho)
                                                                         Child qualities: feeling of responsibility   Work important
Child qualities: feeling of responsibility   Correlation Coefficient                                          1.000            -.012
                                             Sig. (2-tailed)                                                      .             .720
                                             N                                                                 1041              940
Work important                               Correlation Coefficient                                          -.012            1.000
                                             Sig. (2-tailed)                                                   .720                .
                                             N                                                                  940              940

From this table, the Spearman's correlation coefficient is -0.012, which is negligible. The P-value is 0.720, which is greater than the alpha value 0.05, so we do not reject the null hypothesis that there is no statistically significant relationship between work important and child qualities: feeling of responsibility. There is insufficient evidence to conclude that a statistically significant relationship exists between these two variables.

Logistic Regression

We now fit a logistic regression model for the prediction of the response variable child qualities: feeling of responsibility. A binary logistic model is used because the response variable has only two possible responses, 'mentioned' and 'not mentioned'. In addition, the relationships between the dependent and independent variables were found not to be statistically significant, so a linear model is not appropriate in this case. The SPSS output for the binary logistic regression model is given below:
Case Processing Summary
Unweighted Cases(a)                             N   Percent
Selected Cases     Included in Analysis       938      90.1
                   Missing Cases              103       9.9
                   Total                     1041     100.0
Unselected Cases                                0        .0
Total                                        1041     100.0
a. If weight is in effect, see classification table for the total number of cases.

Dependent Variable Encoding
Original Value    Internal Value
Mentioned                      0
Not mentioned                  1

Classification Table(a,b), Step 0
Observed: Child qualities:        Predicted    Predicted        Percentage
feeling of responsibility         Mentioned    Not mentioned    Correct
Mentioned                               561                0         100.0
Not mentioned                           377                0            .0
Overall Percentage                                                    59.8
a. Constant is included in the model.
b. The cut value is .500
Variables in the Equation
                       B    S.E.     Wald   df   Sig.   Exp(B)
Step 0   Constant  -.397    .067   35.622    1   .000     .672

Variables not in the Equation
                                 Score   df   Sig.
Step 0   Variables   V4           .538    1   .463
                     V8           .095    1   .758
         Overall Statistics       .645    2   .724

Omnibus Tests of Model Coefficients
                  Chi-square   df   Sig.
Step 1   Step           .638    2   .727
         Block          .638    2   .727
         Model          .638    2   .727

Model Summary
Step   -2 Log likelihood   Cox & Snell R Square   Nagelkerke R Square
1            1263.377(a)                   .001                  .001
a. Estimation terminated at iteration number 3 because parameter estimates changed by less than .001.

Classification Table(a), Step 1
Observed: Child qualities:        Predicted    Predicted        Percentage
feeling of responsibility         Mentioned    Not mentioned    Correct
Mentioned                               560                1          99.8
Not mentioned                           375                2            .5
Overall Percentage                                                    59.9
a. The cut value is .500
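Several figures in the logistic regression output above can be cross-checked by hand. The sketch below is our own illustration (the function names are not SPSS terms) and relies on three standard facts: for a chi-square statistic with exactly 2 degrees of freedom the upper-tail P-value equals exp(-chi-square / 2), Exp(B) is the exponential of the coefficient B, and the overall percentage correct is the share of cases on the diagonal of the classification table.

```python
import math


def chi_square_p_value_df2(chi_square):
    """Upper-tail P-value of a chi-square statistic with exactly 2 degrees of freedom."""
    return math.exp(-chi_square / 2)


def odds_ratio(b):
    """Exp(B): multiplicative change in the odds for a one-unit increase in the predictor."""
    return math.exp(b)


def percent_correct(n_correct, n_total):
    """Overall percentage of correctly classified cases, rounded as in the SPSS table."""
    return round(100 * n_correct / n_total, 1)


# Omnibus test of model coefficients: chi-square = .638, df = 2
print(round(chi_square_p_value_df2(0.638), 3))
# Overall score statistic for variables not in the equation: .645, df = 2
print(round(chi_square_p_value_df2(0.645), 3))
# Exp(B) of the Step 0 constant, B = -.397
print(round(odds_ratio(-0.397), 3))
# Overall percentage correct at Step 0 (561 of 938) and Step 1 (560 + 2 of 938)
print(percent_correct(561, 938), percent_correct(560 + 2, 938))
```

These values reproduce the reported Sig. of .727 and .724, the Exp(B) of .672, and the overall percentages of 59.8 and 59.9.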
Variables in the Equation
                          B    S.E.    Wald   df   Sig.   Exp(B)
Step 1(a)   V4         .154    .208    .547    1   .460    1.166
            V8        -.022    .068    .108    1   .743     .978
            Constant  -.518    .265   3.817    1   .051     .595
a. Variable(s) entered on step 1: V4, V8.

The P-value of 0.000 in the output belongs to the constant of the Step 0 (intercept-only) model, not to the fitted model. The test of the fitted model is the omnibus test of model coefficients, which gives a chi-square of 0.638 with 2 degrees of freedom and a P-value of 0.727. This is greater than the alpha value 0.05, so we do not reject the null hypothesis that the binary logistic regression model is not significant. Neither predictor is individually significant (V4: P-value 0.460; V8: P-value 0.743), the Nagelkerke R Square is only 0.001, and the overall percentage of correctly classified cases changes only from 59.8% at Step 0 to 59.9% at Step 1. There is therefore insufficient evidence that the binary logistic regression model is useful for the prediction of the dependent variable child qualities: feeling of responsibility.

The estimated binary logistic regression equation is given below, where V4 and V8 are the SPSS names of the predictors family important and work important, V14 is the dependent variable child qualities: feeling of responsibility, and p is the predicted probability that V14 takes the internal value 1 (not mentioned):

ln(p / (1 - p)) = -0.518 + 0.154*V4 - 0.022*V8

This equation gives a predicted probability for V14, but because the coefficients are not statistically significant, its predictions are barely better than always predicting the more frequent response.

Discussion and Conclusions

From the statistical data analysis using both parametric and non-parametric correlation coefficients, there is insufficient evidence to conclude that a statistically significant relationship exists between the two variables family important and child qualities: feeling of responsibility. There is also insufficient evidence to conclude that a statistically significant relationship exists between the two variables work important and child qualities: feeling of responsibility. This means that no statistically significant relationship was found between the dependent variable and either independent variable, so we cannot develop a linear regression model for the prediction of the dependent variable child qualities: feeling of responsibility. The binary logistic regression model fitted in its place is not statistically significant either (omnibus test P-value 0.727), so family important and work important do not help to predict child qualities: feeling of responsibility.
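To make the fitted equation from the Logistic Regression section concrete, the following sketch converts the log-odds into a predicted probability and a predicted category. It is an illustration only. The coefficients and the cut value of .500 come from the SPSS output, and the outcome coding (1 = not mentioned) comes from the Dependent Variable Encoding table. The coding of V4 and V8 as 1 (very important) to 4 (not at all important) is an assumption on our part, as are the function names.

```python
import math

# Coefficients from the Step 1 "Variables in the Equation" table.
INTERCEPT = -0.518
B_V4 = 0.154   # family important
B_V8 = -0.022  # work important


def prob_not_mentioned(v4, v8):
    """Predicted probability that the response is 'Not mentioned' (internal value 1)."""
    logit = INTERCEPT + B_V4 * v4 + B_V8 * v8
    return 1 / (1 + math.exp(-logit))


def predicted_category(v4, v8, cut=0.5):
    """Classify using the cut value reported in the classification table."""
    return "Not mentioned" if prob_not_mentioned(v4, v8) > cut else "Mentioned"


# Assumed coding: 1 = very important ... 4 = not at all important.
for v4, v8 in [(1, 1), (1, 4), (4, 1)]:
    print(v4, v8, round(prob_not_mentioned(v4, v8), 3), predicted_category(v4, v8))
```

Under the assumed coding, the predicted probability stays close to the cut value for every combination of answers, which is consistent with the Step 1 classification table predicting 'Mentioned' for almost all cases.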
The three main conclusions of this study, as discussed above, are summarised below:
1. There is insufficient evidence to conclude that a statistically significant relationship exists between the two variables family important and child qualities: feeling of responsibility.
2. There is insufficient evidence to conclude that a statistically significant relationship exists between the two variables work important and child qualities: feeling of responsibility.
3. The binary logistic regression model with family important and work important as predictors is not statistically significant (omnibus test P-value 0.727), so there is insufficient evidence that it is useful for the prediction of the dependent variable child qualities: feeling of responsibility.