Descriptive Analysis and 5-point Analysis of Breast Cancer Incidence Rate
Verified
Added on  2023/01/17
|6
|759
|22
AI Summary
This document provides a descriptive analysis and 5-point analysis of breast cancer incidence rate for females over 50 and under 50. It includes histograms, summary statistics, and t-tests to test for significant differences between actual and modeled rates.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Descriptive analysis Histogram We present the histogram for breast cancer incidence rate for over 50 females. As can be seen, the graph shows that the data is not normally distributed as the shape of the histogram is not bell- shaped curve. Next we present the histogram for breast cancer incidence rate for under 50 females. As can be seen, the graph shows that the data is not normally distributed as the shape of the histogram is not bell-shaped curve.
5-point analysis The table below gives the 5-point analysis for the breast cancer incidence rate for the over 50. As can be seen, the average rate is 339.5 with the median rate being 352.9 while the maximum and minimum rates being 397.2 and 262.3 respectively. Is there significant difference in the actual rate and modelled rate for over 50? We sought to test whether there exists any significant difference in the actual rate and modelled rate. The following hypothesis was tested. Null hypothesis (H0): There is no significant difference in the actual rate and the modelled rate for the breast cancer incidence for female over 50. Alternative hypothesis (HA): There is significant difference in the actual rate and the modelled rate for the breast cancer incidence for female over 50. A t-test was performed to test the hypothesis at 5% level of significance. The results are presented below; > t.test(Rate,Modeled.Rate) Welch Two Sample t-test data:Rate and Modeled.Rate t = -0.0051, df = 79.993, p-value = 0.9959 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -17.2235617.13538 sample estimates: mean of x mean of y >summary(Rate) Min. 1st Qu.Median Mean 3rd Qu.Max. 262.3333.5352.9 339.5362.9397.2
As can be seen from the above, the p-value is given as 0.9959 (a value greater than 5% level of significance), we therefore fail to reject the null hypothesis and conclude that there is no significant difference in the actual rate and the modelled rate for the breast cancer incidence for female over 50. Is there significant difference in the actual rate and modelled rate for under 50? We sought to test whether there exists any significant difference in the actual rate and modelled rate. The following hypothesis was tested. Null hypothesis (H0): There is no significant difference in the actual rate and the modelled rate for the breast cancer incidence for female under 50. Alternative hypothesis (HA): There is significant difference in the actual rate and the modelled rate for the breast cancer incidence for female under 50. A t-test was performed to test the hypothesis at 5% level of significance. The results are presented below; As can be seen from the above, the p-value is given as 0.9908 (a value greater than 5% level of significance), we therefore fail to rejectthenullhypothesisandconcludethatthereisno > t.test(Rate,Modeled.Rate) Welch Two Sample t-test data:Rate and Modeled.Rate t = -0.0115, df = 79.676, p-value = 0.9908 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -1.0303121.018448 sample estimates: mean of x mean of y 43.3035343.30947
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
significant difference in the actual rate and the modelled rate for the breast cancer incidence for female under 50. Appendix R codes over50<-read.csv("C:\\Users\\310187796\\Desktop\\breastcancerover50female.csv") str(over50) attach(over50) par(mfrow=c(1,2)) hist(Rate, main="Histogram for the breast cancer incidencerate for over 50", col="red", data=over50) summary(Rate, data=over50) t.test(Rate,Modeled.Rate, data=over50) under50<-read.csv("C:\\Users\\310187796\\Desktop\\breastcancerunder50female.csv") str(under50) attach(under50) hist(Rate, main="Histogram for the breast cancer incidence rate for under 50", col="green", data=under50) summary(Rate, data=under50) t.test(Rate,Modeled.Rate, data=under50)