
Data Analysis Report of Health and Population Statistics of East Asian and Pacific Countries



Added on  2023-06-11

About This Document

This report analyses the health and population statistics of East Asian and Pacific countries from 2001 to 2015. It includes one-variable and two-variable analyses, k-means clustering, and linear regression. The data were collected from the World Bank.

Table of Contents
1 Introduction
1.1 Authorisation and Purpose
Limitations
Scope
Methodology
2 Data Setup
3 Exploratory Data Analysis
3.1 One Variable Analysis
3.1.1 One Variable Analysis – 1
3.1.2 One Variable Analysis – 2
3.1.3 One Variable Analysis – 3
3.2 Two-variable analysis
3.2.1 Two-variable analysis 1
3.2.2 Two-variable analysis 2
4 Advanced analysis
4.1 Clustering
4.1.1 Brief explanation of k-means and clustering
4.1.2 Clustering Analysis
4.2 Linear regression
4.2.1 Brief definition of linear regression
4.2.2 Linear Regression 1
4.2.3 Linear Regression 2
5 Conclusion
6 Reflection
Reference
1 Introduction
1.1 Authorisation and Purpose
The purpose of the present study is to analyse health in the East Asia and Pacific region
over the period 2001 to 2015. The data were collected from the World Bank. The findings
have implications for governments and planners, who can use them to initiate improvements
in the health of the region.
Limitations
The data for the present investigation are taken from the World Bank and pertain only to
the East Asia and Pacific region. The time period chosen for the study is 2001 to 2015.
The analysis is therefore limited to this region and this period.
Scope
The data for the present study contain extensive information on the health of the region.
There are 26 attributes, recorded for the countries of the East Asia and Pacific region
over the period 2001 to 2015. However, the data derived from the World Bank contain many
missing values.
The data are analysed through statistical analysis and the interpretation of graphs. In the
first stage, the data are studied through three one-variable analyses. In the second stage,
two-variable analysis is used. Next, the information is analysed through k-means clustering.
Finally, the relation between two attributes is studied through linear regression.
Methodology
To analyse the health of the East Asia and Pacific region, quantitative information for the
period 2001 to 2015 is studied. The information was gathered from the World Bank.
2 Data Setup
Before the analysis can take place, the data file needs to be loaded into R. When the first
line of code is run, a pop-up window opens and the user is asked to select the location of
the data file. When the file is loaded, the first row is taken as the header. In addition,
the CSV file contains many missing values, recorded as “..”; the first line of code declares
these as missing (NA).
The second stage of the code loads the library files. These libraries are needed to carry
out the statistical tests and to produce the charts and graphs.
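The handling of the missing-value marker can be illustrated without the original file. The sketch below is a minimal, self-contained illustration only: the inline CSV text and the country labels A, B and C are made up, and the only things taken from the report's loading code are the column name SH.IMM.IBCG and the `na.strings = ".."` setting.

```r
# Hypothetical three-row extract (not the report's data); ".." marks a missing value.
csv_text <- "Country,SH.IMM.IBCG
A,95
B,..
C,88"

# header = TRUE treats the first row as column names;
# na.strings = ".." converts every ".." cell to NA.
demo <- read.csv(text = csv_text, header = TRUE, sep = ",", na.strings = "..")

# Because ".." is read as NA, the column is parsed as numbers rather than text.
str(demo)

# Count the missing values in each column.
colSums(is.na(demo))
```

Without the `na.strings` argument, the same column would be read as text, and numeric summaries such as the mean could not be computed on it directly.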
3 Exploratory Data Analysis
3.1 One Variable Analysis
3.1.1 One Variable Analysis – 1
The percentage of one-year-old children immunized with BCG in 2014 is investigated as a
one-variable study. The average percentage of one-year-old children immunized in the
region is 89.88%, with a standard deviation of 9.83%. The minimum and maximum
percentages of children immunized are 70% and 90% respectively. The boxplot shows that
the distribution of immunization across the countries of the region is left-skewed.
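As a rough sketch of how figures of this kind are obtained, the following base-R snippet computes the same type of summary on a made-up vector. The values in `imm` are hypothetical and are not the report's data. The moment-based skewness formula is an assumption used here for illustration; the report itself reads the skew off the boxplot.

```r
# Hypothetical immunization percentages for ten countries (not the report's data).
imm <- c(99, 98, 97, 99, 95, 92, 96, 85, 78, 70)

# Central tendency and spread, ignoring any missing values.
imm_mean <- mean(imm, na.rm = TRUE)
imm_sd   <- sd(imm, na.rm = TRUE)
imm_rng  <- range(imm, na.rm = TRUE)

# Moment-based skewness: a negative value indicates a longer left tail.
imm_skew <- mean((imm - imm_mean)^3) / (mean((imm - imm_mean)^2))^(3 / 2)

print(c(mean = imm_mean, sd = imm_sd,
        min = imm_rng[1], max = imm_rng[2], skew = imm_skew))
```

A few low values pull the mean below the median and stretch the lower whisker of the boxplot, which is the pattern described as left-skewed.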
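# Boxplot of BCG immunization in 2014.
# Data1 is assumed to be the 2014 subset prepared earlier (not shown in this preview).
fill <- "green"
line <- "blue"
# The x axis holds a single dummy category, so it is left unlabelled;
# the y axis carries the immunization percentage.
Plot1 <- ggplot(Data1, aes(x = factor(0), y = SH.IMM.IBCG)) +
  geom_boxplot(fill = fill, colour = line, alpha = 0.7) +
  scale_x_discrete(name = NULL, labels = NULL) +
  scale_y_continuous(name = "Immunization, BCG (% of one-year-old children)") +
  ggtitle("Distribution of Immunization, BCG (% of one-year-old children) in 2014") +
  theme_bw()

# Summary statistics (mean, sd, min, max, skew) from the psych package
describe(Data1$SH.IMM.IBCG)

# Write the plot to a JPEG file
jpeg("Plot1.jpeg")
print(Plot1)
dev.off()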
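# Data setup (Section 2): choose the CSV file interactively.
# The first row is the header; ".." cells are read as missing (NA).
Data <- read.csv(file.choose(), header = TRUE, sep = ",", na.strings = "..")
# Loading required library files
library(data.table)
library(reshape2)
library(psych)
library(factoextra)
library(ggplot2)
library(lattice)
library(dplyr)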