Data Analysis and Visualization

Verified

Added on  2023/01/04

|16
|2415
|97
AI Summary
This study report focuses on the statistical analysis of COVID-19 infection rates among white and BAME individuals in the UK. The report includes a comprehensive data collection and analysis to assess the impact of the outbreak on different categories of patients. Various hypotheses are tested using statistical tests.

Contribute Materials

Your contribution can guide someone’s learning journey. Share your documents today.
Document Page
Data Analysis and Visualization

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Contents
INTRODUCTION.......................................................................................................................................3
MAIN BODY..............................................................................................................................................3
Analysis...................................................................................................................................................3
Practical application and deployment......................................................................................................9
CONCLUSION.........................................................................................................................................15
REFERENCES..........................................................................................................................................16
Document Page
INTRODUCTION
The study-report is centered on statistical analysis of data of infection rates of COVID–19 differs
across white individuals and BAME throughout the United Kingdom. The data in the study is
focused on 50 distinct white versus BAME patients. Study provides a comprehensive data
collection to assess which categories of patients are most impacted by COVID-19 outbreak. In
attempt to accomplish so, a range of hypotheses are equipped which are tested on the basis of
requisite statistical tests.
MAIN BODY
Analysis
Description of data set.
The array of data sets is made up of about 50 people from different age groups. They vary in age
from 20 to 50 years. This set of data demonstrates the incidence of illness in various classes of
white and BAME people. The purpose of the analysis of information is to assess the proportion
of infection rates among all classes of persons, particularly white versus BAME (Blacks,
Asians and Minority Cultural and linguistic) individuals. Here the descriptive review of the data
is carried out in the following manner:
Analysis of historical statistics to better clarify the changes that have taken place in an entity is
descriptive review (Peres‐Neto, Dray, and ter Braak, 2017). The application of a collection of
past data to render comparisons is seen in Descriptive Analyses.
Test of descriptive statistics
Statistics
White
patient BAME
N Valid 50 50
Missing 0 0
Mean 36.180 36.740
Document Page
Median 36.000 34.500
Mode 20.0a 50.0
Std. Deviation 9.4192 9.3171
Variance 88.722 86.809
a. Multiple modes exist. The smallest
value is shown
White patient
Frequency Percent
Valid
Percent
Cumulative
Percent
Valid 20.0 4 8.0 8.0 8.0
22.0 2 4.0 4.0 12.0
25.0 2 4.0 4.0 16.0
26.0 1 2.0 2.0 18.0
27.0 2 4.0 4.0 22.0
28.0 1 2.0 2.0 24.0
29.0 1 2.0 2.0 26.0
30.0 2 4.0 4.0 30.0
32.0 1 2.0 2.0 32.0
33.0 4 8.0 8.0 40.0
34.0 2 4.0 4.0 44.0
35.0 3 6.0 6.0 50.0
37.0 2 4.0 4.0 54.0
38.0 2 4.0 4.0 58.0
40.0 4 8.0 8.0 66.0
42.0 1 2.0 2.0 68.0
43.0 4 8.0 8.0 76.0
44.0 3 6.0 6.0 82.0

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
45.0 1 2.0 2.0 84.0
48.0 2 4.0 4.0 88.0
49.0 1 2.0 2.0 90.0
50.0 4 8.0 8.0 98.0
55.0 1 2.0 2.0 100.0
Total 50 100.0 100.0
BAME
Frequency Percent
Valid
Percent
Cumulative
Percent
Valid 20.0 2 4.0 4.0 4.0
22.0 1 2.0 2.0 6.0
23.0 1 2.0 2.0 8.0
24.0 1 2.0 2.0 10.0
26.0 1 2.0 2.0 12.0
27.0 2 4.0 4.0 16.0
28.0 2 4.0 4.0 20.0
29.0 2 4.0 4.0 24.0
30.0 6 12.0 12.0 36.0
31.0 1 2.0 2.0 38.0
32.0 3 6.0 6.0 44.0
33.0 1 2.0 2.0 46.0
34.0 2 4.0 4.0 50.0
35.0 1 2.0 2.0 52.0
39.0 1 2.0 2.0 54.0
40.0 3 6.0 6.0 60.0
41.0 1 2.0 2.0 62.0
43.0 1 2.0 2.0 64.0
Document Page
44.0 4 8.0 8.0 72.0
45.0 3 6.0 6.0 78.0
46.0 1 2.0 2.0 80.0
47.0 3 6.0 6.0 86.0
50.0 7 14.0 14.0 100.0
Total 50 100.0 100.0
Document Page
Development of appropriate hypothesis:
Hypothesis – A research-oriented hypothesis corresponds to valid, explicit and quantifiable
theory or mathematical inference about the potential implications of community-specific
community-based research study, including the perceived disparities among populations on a
particular factor either correlations.
H0: Here are substantial discrepancy in the prevalence of infections in both white patients as
well as BAME.

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
H1: There really is no substantial variation in the incidence of infections in the white patients as
well as BAME.
H0: There's indeed a link between the incidence of infections in white people and BAME.
H1: Here's no correlation between the rate/level of infections in white patients as well as BAME.
Techniques to perform the analysis:
There are a variety of methodologies and approaches for conducting the review, as well
as ANOVA test also is one of the effective approaches for calculating a large variation in a
specific data set. With the exception of such, association analysis may be a critical tool for
determining the relationship between the data collection of the white patients including BAMEs.
One-sample T-test could be employed to verify if the data collected is substantially
distinct from some degree of importance. -Everyone renders statements as to
how confidence intervals indicating μ is related to any degree of significance level-M.
Also, t-test is a form of descriptive review employed to determine if there are significant
contrasts here between various variables that could be applied to such attributes (Green,
2016). It is often used if the data sets perform as specified and may have unidentified
differences, such as the data sets recorded as a consequence of flipping dice hundred
times. Also a t-test is being employed as a fundamental test method which enables the
assessment of a group-specific inferences.
Correlation—In stats, any statistical relationship, either reciprocal or non-reciprocal,
amongst two or multiple factors or multivariate information is connection or dependence.
In the largest possible sense, correlations are any significant correlation, but it typically
represents the degree to which a set of requirements are linearly connected. Correlations
are useful since it is important to suggest a logical connection that could be easily used.
For example, demonstrate the relation between generating potential and climate that the
utility company would generate less electricity on a mild day. And then, in this case,
when cold weather allows people to have more energy for central heating, there is still a
causal correlation. In particular, although the structure of a partnership is not likely to be
sufficient for the presence of a causality that runs.
Document Page
Techniques to present the result of analysis:
In the report, there is no consistent method or process to resolve the data assessment. It is
necessary to note, however, that appropriate tables and figures for the documentation of specific
results from various types of studies should be given. -As in the above section, it is important to
include two modes of interpretation, ANOVA and comparisons, using the related maps and
figures in project report such that clients can recognize the essence of infection rates between
white patient as-well-as BAME in the United Kingdom (Meyer, Van Witteloostuijn, and
Beugelsdijk, 2017). For a detailed analysis of the inflation rate of white people and BAMEs, here
are just a number of graphs and maps:
Practical application and deployment.
Application of techniques:
Testing of hypothesis one-
At the level of 0.10
One-Sample Statistics
Statistic
Bootstrapa
Bias Std. Error
90% Confidence
Interval
Lower Upper
White
patient
N 50
Mean 36.180 .017 1.309 33.941 38.239
Std. Deviation 9.4192 -.1335 .7203 8.1647 10.4345
Std. Error
Mean 1.3321
BAME N 50
Mean 36.740 -.009 1.333 34.441 38.880
Std. Deviation 9.3171 -.1038 .5523 8.3032 10.1280
Document Page
Std. Error
Mean 1.3176
a. Unless otherwise noted, bootstrap results are based on 1000 bootstrap samples
One-Sample Test
Test Value = 0
t df
Sig. (2-
tailed)
Mean
Difference
90% Confidence Interval of
the Difference
Lower Upper
White
patient 27.161 49 .000 36.1800 33.947 38.413
BAME 27.883 49 .000 36.7400 34.531 38.949
Bootstrap for One-Sample Test
Mean
Difference
Bootstrapa
Bias Std. Error
Sig. (2-
tailed)
90% Confidence
Interval
Lower Upper
White
patient 36.1800 .0175 1.3088 .001 33.9410 38.2390
BAME 36.7400 -.0092 1.3328 .001 34.4410 38.8800
a. Unless otherwise noted, bootstrap results are based on 1000 bootstrap samples
At the level of 0.01:
One-Sample Statistics

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Statistic
Bootstrapa
Bias Std. Error
90% Confidence
Interval
Lower Upper
White
patient
N 50
Mean 36.180 .029 1.322 34.041 38.499
Std. Deviation 9.4192 -.1412 .7189 8.1234 10.4846
Std. Error
Mean 1.3321
BAME N 50
Mean 36.740 -.016 1.314 34.561 38.820
Std. Deviation 9.3171 -.1231 .5529 8.2384 10.0995
Std. Error
Mean 1.3176
a. Unless otherwise noted, bootstrap results are based on 1000 bootstrap samples
One-Sample Test
Test Value = 0
t df
Sig. (2-
tailed)
Mean
Difference
99.9% Confidence Interval
of the Difference
Lower Upper
White
patient 27.161 49 .000 36.1800 31.517 40.843
BAME 27.883 49 .000 36.7400 32.128 41.352
Bootstrap for One-Sample Test
Mean Bootstrapa
Document Page
Difference Bias Std. Error
Sig. (2-
tailed)
90% Confidence
Interval
Lower Upper
White
patient 36.1800 .0294 1.3218 .001 34.0410 38.4990
BAME 36.7400 -.0160 1.3143 .001 34.5610 38.8200
a. Unless otherwise noted, bootstrap results are based on 1000 bootstrap samples
At the level of 0.02
One-Sample Statistics
Statistic
Bootstrapa
Bias Std. Error
90% Confidence
Interval
Lower Upper
White
patient
N 50
Mean 36.180 .004 1.258 34.140 38.299
Std. Deviation 9.4192 -.1164 .6726 8.1665 10.4398
Std. Error
Mean 1.3321
BAME N 50
Mean 36.740 -.008 1.305 34.501 38.840
Std. Deviation 9.3171 -.1215 .5635 8.1801 10.1315
Std. Error
Mean 1.3176
a. Unless otherwise noted, bootstrap results are based on 1000 bootstrap samples
Document Page
One-Sample Test
Test Value = 0
t df
Sig. (2-
tailed)
Mean
Difference
98% Confidence Interval of
the Difference
Lower Upper
White
patient 27.161 49 .000 36.1800 32.976 39.384
BAME 27.883 49 .000 36.7400 33.571 39.909
Bootstrap for One-Sample Test
Mean
Difference
Bootstrapa
Bias Std. Error
Sig. (2-
tailed)
90% Confidence
Interval
Lower Upper
White
patient 36.1800 .0043 1.2584 .001 34.1400 38.2990
BAME 36.7400 -.0078 1.3048 .001 34.5010 38.8400
a. Unless otherwise noted, bootstrap results are based on 1000 bootstrap samples
Testing of hypothesis two:
Correlation analysis-
Correlations
White
patient BAME

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
White
patient
Pearson
Correlation 1 .072
Sig. (2-tailed) .621
N 50 50
BAME Pearson
Correlation .072 1
Sig. (2-tailed) .621
N 50 50
Interpretation of results.
Descriptive analysis- In view of the above-mentioned descriptive review of the aforementioned
data set, it could be observed that the mean value figure is 36.18 whereas the standard deviation
figure is 9.42. Therefore, it can be argued there's no correlation among the rates of infection in
case of white-patients as well as BAME. Which, when mean and standard deviations values
differ from one another, it could be concluded that data-sets just aren't accessible to one another.
Hypothesis one:
At such stage of 0.10-On basis of the aforementioned conducted single sample t-test, it could be
reported that the scale of the variance of implications or P is null or 0.00 that is less than 0.05.
Therefore, it can be perceived that herein null hypothesis becomes accurate however there
is substantial difference among the infection rates in case of white patients as well as BAME
at 0.10 point.
At point of 0.01-Similar with this, at either level of the 0.01, this could be reported that p value
is identical as that with 0.001, here, which is lower than 0.05. In this case, as a result, it could be
determined that assessed null hypothesis here is true as well as that there is substantial
discrepancy among infection rates of white patient’s case and BAME at 0.01.
Document Page
At the threshold of 0.02 – here, in additions to this at level of 0.02 also there is a large
discrepancy of 0.00 and is lower than the level of 0.05 that further indicates that there in test
is null hypotheses. And this can be stated that there is considerable discrepancy among the
rates of infection among white patients’ case and level of BAMEs at 0.02 (Wang, Zhao, Hastie
and Owen, 2017).
Testing of hypothesis two:
Throughout the second hypotheses, a correlation has also been identified. On the bases of the
aforementioned tables, it can be concluded that Pearson correlation level is 0.72, and is greater
than 0.6. This may also be argued that there is favorable and beneficial correlation here between
rates of infection among in case of white patients as well as BAMEs. Apart from that, magnitude
of meaning differential is 0.621 which indicates that there no null hypothesis since the level is
greater than 0.05 (Zackay, Ofek and Gal-Yam, 2016).
CONCLUSION
On the bases of the aforementioned project study, it can be inferred that there is substantial
discrepancy between the rates of infections of white patients' case and BAME. Much like there
is connection between rates of infection of the white people and BAMEs. This indicates
that infection rate proportion is similar to other part of both cases white citizens and BAMEs.
Document Page
REFERENCES
Peres‐Neto, P. R., Dray, S., and ter Braak, C. J., 2017. Linking trait variation to the environment:
critical issues with community‐weighted mean correlation resolved by the fourth‐corner
approach. Ecography, 40(7), 806-816.
Green, D., 2016. The Trump hypothesis: Testing immigrant populations as a determinant of
violent and drug‐related crime in the United States. Social Science Quarterly, 97(3), 506-
524.
Meyer, K. E., Van Witteloostuijn, A., and Beugelsdijk, S., 2017. What’s in a p? Reassessing best
practices for conducting and reporting hypothesis-testing research.
Wang, J., Zhao, Q., Hastie, T., and Owen, A. B., 2017. Confounder adjustment in multiple
hypothesis testing. Annals of statistics, 45(5), 1863.
Zackay, B., Ofek, E. O., and Gal-Yam, A., 2016. Proper image subtraction—optimal transient
detection, photometry, and hypothesis testing. The Astrophysical Journal, 830(1), 27.
1 out of 16
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]

Your All-in-One AI-Powered Toolkit for Academic Success.

Available 24*7 on WhatsApp / Email

[object Object]