[DOWNLOAD] Descriptive Study Analysis
VerifiedAdded on 2021/04/24
|27
|5656
|154
AI Summary
This assignment provides a descriptive study analysis of three datasets: gender, years worked, and education. The analysis includes frequency tables, percentage calculations, and cumulative percentages for each dataset. It also presents summary statistics such as mean, median, mode, standard deviation, minimum, maximum, percentiles, and mode for each variable.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
P a g e | 1
Predicting Factors of Depression and
Anxiety in Mental Health Nurses: A
Quantitative Cross-Sectional Study
Student Name: Student ID:
Unit Name: Unit ID:
Date Due: Professor Name:
1
Predicting Factors of Depression and
Anxiety in Mental Health Nurses: A
Quantitative Cross-Sectional Study
Student Name: Student ID:
Unit Name: Unit ID:
Date Due: Professor Name:
1
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
P a g e | 2
Chapter 1: Using Research and Statistics in Health Care
1. Find a peer-reviewed quantitative research article and address the following:
a. Full citation for the article selected.
Gherardi-Donato, E. C. D. S., Cardoso, L., Teixeira, C. A. B., Pereira, S. D. S.,
&Reisdorfer, E. (2015). Association between depression and work stress in nursing
professionals with technical education level. Revistalatino-americana de
enfermagem, 23(4), 733-740(Gherardi-Donato 2015).
b. What is the statement of the purpose of the study?
The peer-reviewed quantitative research article analyses relationship amongst
depression with work stress in the domain of nursing profession, who possess
technical education levels. The scholars conducted in the city of state of Sao Paulo.
Approximately 310 nursing professionals were chosen in a random manner to
conduct a cross-sectional study.
c. Name the theoretical or conceptual model/framework used.
The theoretical model or conceptual framework of cross-section study was selected.
A Job Stress Scale or Working Stress Scale of model developed by Karasek was
applied in the study.
d Write out the research questions addressed by the study.
The study addressed several questions through its methods as follows;
Research Question 1: What is the complexity of work level?
Research Question 2: How much time is spent at the institute?
Research Question 3: Are work leaves easily available?
Research Question 4: What is the prevalence of chronic disease?
Research Question 5: Is smoking allowed in the institute?
e. List the main hypotheses that the study is testing (null & alternative)
The study aims at establishing null and alternate hypothesis through its methods.
Null hypothesis (H0): “Work related parameters does not impact stress levels
amongst nursing professionals”.
Alternate hypothesis for the study are;
H1: Complexity of work level affects levels of stress
H2 :Time is spend at the institute impacts stress
H 3: Work leaves availability impacts job stress
H 4: Prevalence of chronic disease affects job stress
H 5: Smoking permission in the institute affects stress
2
Chapter 1: Using Research and Statistics in Health Care
1. Find a peer-reviewed quantitative research article and address the following:
a. Full citation for the article selected.
Gherardi-Donato, E. C. D. S., Cardoso, L., Teixeira, C. A. B., Pereira, S. D. S.,
&Reisdorfer, E. (2015). Association between depression and work stress in nursing
professionals with technical education level. Revistalatino-americana de
enfermagem, 23(4), 733-740(Gherardi-Donato 2015).
b. What is the statement of the purpose of the study?
The peer-reviewed quantitative research article analyses relationship amongst
depression with work stress in the domain of nursing profession, who possess
technical education levels. The scholars conducted in the city of state of Sao Paulo.
Approximately 310 nursing professionals were chosen in a random manner to
conduct a cross-sectional study.
c. Name the theoretical or conceptual model/framework used.
The theoretical model or conceptual framework of cross-section study was selected.
A Job Stress Scale or Working Stress Scale of model developed by Karasek was
applied in the study.
d Write out the research questions addressed by the study.
The study addressed several questions through its methods as follows;
Research Question 1: What is the complexity of work level?
Research Question 2: How much time is spent at the institute?
Research Question 3: Are work leaves easily available?
Research Question 4: What is the prevalence of chronic disease?
Research Question 5: Is smoking allowed in the institute?
e. List the main hypotheses that the study is testing (null & alternative)
The study aims at establishing null and alternate hypothesis through its methods.
Null hypothesis (H0): “Work related parameters does not impact stress levels
amongst nursing professionals”.
Alternate hypothesis for the study are;
H1: Complexity of work level affects levels of stress
H2 :Time is spend at the institute impacts stress
H 3: Work leaves availability impacts job stress
H 4: Prevalence of chronic disease affects job stress
H 5: Smoking permission in the institute affects stress
2
P a g e | 3
f. Define the dependent variable and the main independent variables.
The study was conducted by means of dependent as well as independent variables.
Dependent variables for the study were reporting depression. Independent variables
include psychological (demand), work (control) and social support.
g. Briefly describe the research design.
Research design adopted for the study was descriptive-exploratory approach. In this
approach, primary aim of the scholars is to unearth dearth of information to arrive at
conclusion related to the study. All necessary information related to the study is used
for this procedure.
h. Describe the study sample (e.g., size, socio-demographic characteristics) and
how it was obtained.
For this scope of the study a sample of 338 participants from the hospital employees
were selected. The sample included both male and female participants that worked at
the hospital as nursing technicians or nursing assistants. Age of the participants was
less than 40 years and they belonged to Brazil area.
i. List the statistics used to test the hypotheses
Chi-square test was used to test the hypothesis.
j. Identify the main assumptions and limitations.
The assumption of this study is that nursing technicians or nursing assistants carry out
high demand workloads with relatively less control over their work performance.
This is not always true as nursing professionals are not always controlled or
supervised by professional nurses.
The study suffers from certain limitations as there were transversal data collection
method adopted that might prevent establishing chronological and casual relationship
amongst variables. The study also faces limitations from memory and diagnostic
biases.
Chapter 2: Organizing, Displaying, and Describing Data
3
f. Define the dependent variable and the main independent variables.
The study was conducted by means of dependent as well as independent variables.
Dependent variables for the study were reporting depression. Independent variables
include psychological (demand), work (control) and social support.
g. Briefly describe the research design.
Research design adopted for the study was descriptive-exploratory approach. In this
approach, primary aim of the scholars is to unearth dearth of information to arrive at
conclusion related to the study. All necessary information related to the study is used
for this procedure.
h. Describe the study sample (e.g., size, socio-demographic characteristics) and
how it was obtained.
For this scope of the study a sample of 338 participants from the hospital employees
were selected. The sample included both male and female participants that worked at
the hospital as nursing technicians or nursing assistants. Age of the participants was
less than 40 years and they belonged to Brazil area.
i. List the statistics used to test the hypotheses
Chi-square test was used to test the hypothesis.
j. Identify the main assumptions and limitations.
The assumption of this study is that nursing technicians or nursing assistants carry out
high demand workloads with relatively less control over their work performance.
This is not always true as nursing professionals are not always controlled or
supervised by professional nurses.
The study suffers from certain limitations as there were transversal data collection
method adopted that might prevent establishing chronological and casual relationship
amongst variables. The study also faces limitations from memory and diagnostic
biases.
Chapter 2: Organizing, Displaying, and Describing Data
3
P a g e | 4
1. What is a variable?
Variable is defined as a quantity whose value changes during the experiment or
investigation (MacKinnon, 2008).
2. What is an independent variable?
Independent variable is the direct outcomes of an experiment or manipulative values
of an experiment.
3. What is a dependent variable?
Dependent variables are generated by the help of independent variables. This kind of
variables is generally the effect of independent variables.
4. Define the following measurement scales and give at least one example of a variable
and how they would be measured by each:
a. Nominal Scales
This scale does not allocate any numerical value to the variable but just labels the
variable.
Example: Gender, which is either male or female but cannot have any numerical
value.
Color: This can have different labeling but no numerical value.
b. Ordinal Scales
This scale signifies characteristic of the data but numerical values cannot be assigned
to the variable.
Example: Mood of a person, Satisfaction level: No numerical value but there is an
order which gives a sense of comparative study
c. Interval Scales
This kind of scale is an orderly numerical scale with magnitude of change
(Wasserman, 2013).
Example: Range of a data set as time interval or temperature variation interval.
d. Ratio Scales
Ratio scale gives ordered and numerical data with a clear definition of “zero”.
Inferential statistical can be applied on ratio scaled data.
Example: Standard deviation of a data set.
5. Complete the following sentence: It is usually best to gather data at the highest level
of measurement for research variables because this helps in application of various
statistical tests and conclude from statistical inferences.
4
1. What is a variable?
Variable is defined as a quantity whose value changes during the experiment or
investigation (MacKinnon, 2008).
2. What is an independent variable?
Independent variable is the direct outcomes of an experiment or manipulative values
of an experiment.
3. What is a dependent variable?
Dependent variables are generated by the help of independent variables. This kind of
variables is generally the effect of independent variables.
4. Define the following measurement scales and give at least one example of a variable
and how they would be measured by each:
a. Nominal Scales
This scale does not allocate any numerical value to the variable but just labels the
variable.
Example: Gender, which is either male or female but cannot have any numerical
value.
Color: This can have different labeling but no numerical value.
b. Ordinal Scales
This scale signifies characteristic of the data but numerical values cannot be assigned
to the variable.
Example: Mood of a person, Satisfaction level: No numerical value but there is an
order which gives a sense of comparative study
c. Interval Scales
This kind of scale is an orderly numerical scale with magnitude of change
(Wasserman, 2013).
Example: Range of a data set as time interval or temperature variation interval.
d. Ratio Scales
Ratio scale gives ordered and numerical data with a clear definition of “zero”.
Inferential statistical can be applied on ratio scaled data.
Example: Standard deviation of a data set.
5. Complete the following sentence: It is usually best to gather data at the highest level
of measurement for research variables because this helps in application of various
statistical tests and conclude from statistical inferences.
4
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
P a g e | 5
6. An important description of a variable includes the shape and symmetry of the
distribution of its values. Describe the following:
a. Positive (right -skew)
b. Negative (left-skew)
c. Normal distribution
Normal distribution is a bell shaped symmetric curve (skewness zero) where mean,
median and mode coincides. It is a unimodal distribution. The probability density
function of the distribution is
1
σ √ 2 π e
− ( x −μ ) 2
2σ 2
. This is a continuous distribution.
d. Bimodal distribution
The binomial distribution is a unimodal discrete probability distribution. This
distribution can be symmetric or skewed in nature. The probability mass function is
f (x )=n Cr pr ( 1−p ) n−r
(r=0, 1, 2, 3, 4…n) for parameters n and p.
7. Complete the following sentence: The three measures of central tendency are the
Mean, Median and Mode.
8. Define/describe the following (include a statement about how each are calculated):
a. Mode:
Mode is that data in a dataset which has highest repetition. For example if data set is
{2, 6, 2, 8, 9, 4} then 2 is the mode
b. Median:
Median is the middle most value of a cumulatively arranged dataset. There can be
two medians in absence of a single median value. The given set of data is arrange in
less than or more than type sequence. The middle most value or middle most two
values are taken as median.
c. Mean
5
For right skewed or positive skewed distribution data is more
scattered out at the right tale. Mean and Median are greater
than Mode.
For left skewed or negative skewed distribution data is more
scattered out at the left tale. Mean and Median are lesser than
Mode.
6. An important description of a variable includes the shape and symmetry of the
distribution of its values. Describe the following:
a. Positive (right -skew)
b. Negative (left-skew)
c. Normal distribution
Normal distribution is a bell shaped symmetric curve (skewness zero) where mean,
median and mode coincides. It is a unimodal distribution. The probability density
function of the distribution is
1
σ √ 2 π e
− ( x −μ ) 2
2σ 2
. This is a continuous distribution.
d. Bimodal distribution
The binomial distribution is a unimodal discrete probability distribution. This
distribution can be symmetric or skewed in nature. The probability mass function is
f (x )=n Cr pr ( 1−p ) n−r
(r=0, 1, 2, 3, 4…n) for parameters n and p.
7. Complete the following sentence: The three measures of central tendency are the
Mean, Median and Mode.
8. Define/describe the following (include a statement about how each are calculated):
a. Mode:
Mode is that data in a dataset which has highest repetition. For example if data set is
{2, 6, 2, 8, 9, 4} then 2 is the mode
b. Median:
Median is the middle most value of a cumulatively arranged dataset. There can be
two medians in absence of a single median value. The given set of data is arrange in
less than or more than type sequence. The middle most value or middle most two
values are taken as median.
c. Mean
5
For right skewed or positive skewed distribution data is more
scattered out at the right tale. Mean and Median are greater
than Mode.
For left skewed or negative skewed distribution data is more
scattered out at the left tale. Mean and Median are lesser than
Mode.
P a g e | 6
Mean is the average of a data set. All the data values are multiplied with their
frequencies and summed up. Then the sum is divided by the total number of
observations or total frequency of the data set.
9. Discuss how you would choose the correct measure(s) of central tendency?
Mean is the popular measure of central tendency for a dataset without outlier values.
Median is preferred over Mean for skewed data as well. Mean becomes useless
measure for outlier values and skewed data as it increases the measure and it does not
provides correct information about the data. Both measures are good for close ended
data set. For open ended skewed data set Mode is the preferred measure.
10. Define/describe the following:
a. Measures of dispersion:
Measures of central tendency do not reveal all the characteristic of the given
information. Mean, Median, Mode only reflects the middle most values or general
magnitude. Measure of dispersion reveals the deficiency in uniformity of the data.
Scattering of the given information is measured in dispersion.
b. Range
Range is the length of the interval of the data; explicitly it gives the maximum minus
minimum value of the given data set.
c. Interquartile Range
Interquartile Range measures inconsistency of the data. It divides the data in four
equal portions with rank ordering. The three quarters are denoted by Q1 , Q2 , Q3 where
Q2 is the median. The Interquartile range is measured as Q3−Q1 .
d. Standard Deviation and Variance
Standard deviation is a measure of dispersion. It measures the deviation of the data
from the mean value. Sum of square of the differences from mean are taken and the
average of thus obtained data is calculated. Then by square root of the obtained value
standard deviation is obtained. The formula
σ = √ 1
n ∑
i=1
n
( xi−μ )
2
evaluates the
measure.
6
Mean is the average of a data set. All the data values are multiplied with their
frequencies and summed up. Then the sum is divided by the total number of
observations or total frequency of the data set.
9. Discuss how you would choose the correct measure(s) of central tendency?
Mean is the popular measure of central tendency for a dataset without outlier values.
Median is preferred over Mean for skewed data as well. Mean becomes useless
measure for outlier values and skewed data as it increases the measure and it does not
provides correct information about the data. Both measures are good for close ended
data set. For open ended skewed data set Mode is the preferred measure.
10. Define/describe the following:
a. Measures of dispersion:
Measures of central tendency do not reveal all the characteristic of the given
information. Mean, Median, Mode only reflects the middle most values or general
magnitude. Measure of dispersion reveals the deficiency in uniformity of the data.
Scattering of the given information is measured in dispersion.
b. Range
Range is the length of the interval of the data; explicitly it gives the maximum minus
minimum value of the given data set.
c. Interquartile Range
Interquartile Range measures inconsistency of the data. It divides the data in four
equal portions with rank ordering. The three quarters are denoted by Q1 , Q2 , Q3 where
Q2 is the median. The Interquartile range is measured as Q3−Q1 .
d. Standard Deviation and Variance
Standard deviation is a measure of dispersion. It measures the deviation of the data
from the mean value. Sum of square of the differences from mean are taken and the
average of thus obtained data is calculated. Then by square root of the obtained value
standard deviation is obtained. The formula
σ = √ 1
n ∑
i=1
n
( xi−μ )
2
evaluates the
measure.
6
P a g e | 7
11. To report the results of nominal data which of the following measures of central
tendency would be appropriate:
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Mode
12. To report the results of ordinal data which of the following measures of central
tendency would be appropriate?
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Median and Median
13. To report the results of interval data (e.g., Likert scale) which of the following
measure(s) of central tendency would be appropriate:
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Mean
14. To report the results of ratio scale data which of the following measure(s) of central
tendency would be appropriate:
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Mean and Median for skewed data
15. Identify the following distributions:
_______Unimodal______ __________ ____Bimodal___________
7
11. To report the results of nominal data which of the following measures of central
tendency would be appropriate:
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Mode
12. To report the results of ordinal data which of the following measures of central
tendency would be appropriate?
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Median and Median
13. To report the results of interval data (e.g., Likert scale) which of the following
measure(s) of central tendency would be appropriate:
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Mean
14. To report the results of ratio scale data which of the following measure(s) of central
tendency would be appropriate:
a. Mean
b. Mode
c. Median
d. All of the above
e. None of the above
ANS: Mean and Median for skewed data
15. Identify the following distributions:
_______Unimodal______ __________ ____Bimodal___________
7
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
P a g e | 8
________Right skewed____________ _______Left Skewed_____
16. Identify the following:
_____Histogram_______________ _____Pie Chart_____________________
Chapter 3: Key Principles Underlying Statistical Inference: Probability and
the Normal Distribution
1. Define the following:
a. Sensitivity:
Sensitivity is the ratio of true positive cases and true positive along with false
negative cases. For example
Sensitivity= true positive
true positive+ false negative measures
the probability of true illness situation among the ill persons (Tenenbaum, 2011).
b. Specificity:
Specificity is the ratio of true negative cases and false positive along with true
negative cases. For example
Specificity = true negative
false positive+true negative measures the
probability of no ill condition among the not ill persons.
8
________Right skewed____________ _______Left Skewed_____
16. Identify the following:
_____Histogram_______________ _____Pie Chart_____________________
Chapter 3: Key Principles Underlying Statistical Inference: Probability and
the Normal Distribution
1. Define the following:
a. Sensitivity:
Sensitivity is the ratio of true positive cases and true positive along with false
negative cases. For example
Sensitivity= true positive
true positive+ false negative measures
the probability of true illness situation among the ill persons (Tenenbaum, 2011).
b. Specificity:
Specificity is the ratio of true negative cases and false positive along with true
negative cases. For example
Specificity = true negative
false positive+true negative measures the
probability of no ill condition among the not ill persons.
8
P a g e | 9
c. Predictive Value:
Predictive value can be negative or positive. Positive value gives the ratio of true
positives compared to number of positives. It is nothing but power of a test. Negative
value gives the ratio of true negatives compared to number of negatives. This is type
II error of a test.
d. Efficiency
In hypothesis testing two types of estimation are done. They are Point estimation and
Interval estimation. Efficiency measures the quality of these estimators (Forbes,
2011). Efficiency helps to optimize the sample size for quality of estimation for both
cases.
2. Calculate the sensitivity (Sn), specificity (Sp), positive predictive value (PPV,
negative predictive value (NPV), and efficiency (EFF) for the following summarized
results?
Anemia
Pre
sent
Anemia
Abs
ent
Total
Serum ferritin + (Positive) 650 150 800
Test result - (Negative) 65 1200 1265
Total 715 1350 2065
ANS:
The following table helps to perform the calculation
AnemiaPresent AnemiaAbsent Total
True Result
Positive
(number
)
A
(true
positive)=650
B
(false
positive)=150
Test Positive =
800
Negative
(number
)
C
(false
negative) =65
D
(true
Negative)=1200
Test
Negative=1265
T-Disease
=715
T-non Disease=
1350 Total= 2065
9
c. Predictive Value:
Predictive value can be negative or positive. Positive value gives the ratio of true
positives compared to number of positives. It is nothing but power of a test. Negative
value gives the ratio of true negatives compared to number of negatives. This is type
II error of a test.
d. Efficiency
In hypothesis testing two types of estimation are done. They are Point estimation and
Interval estimation. Efficiency measures the quality of these estimators (Forbes,
2011). Efficiency helps to optimize the sample size for quality of estimation for both
cases.
2. Calculate the sensitivity (Sn), specificity (Sp), positive predictive value (PPV,
negative predictive value (NPV), and efficiency (EFF) for the following summarized
results?
Anemia
Pre
sent
Anemia
Abs
ent
Total
Serum ferritin + (Positive) 650 150 800
Test result - (Negative) 65 1200 1265
Total 715 1350 2065
ANS:
The following table helps to perform the calculation
AnemiaPresent AnemiaAbsent Total
True Result
Positive
(number
)
A
(true
positive)=650
B
(false
positive)=150
Test Positive =
800
Negative
(number
)
C
(false
negative) =65
D
(true
Negative)=1200
Test
Negative=1265
T-Disease
=715
T-non Disease=
1350 Total= 2065
9
P a g e | 10
Sensitivity:
A
A+ C∗100 =
650
715 ∗100 = 90.9090909%
Specificity:
D
D+ B∗100 =
1200
1350 ∗100 = 88.8888889%
Positive Predictive Value:
A
A+ B ∗100 = 81.25%
Negative Predictive Value:
D
D+C ∗100 = 94.8616601%
Efficiency:
A + D
TOTAL ∗100 = 89.58%
3. Describe the clinical performance of the serum ferritin test as a diagnostic tool (from the
above calculations).
ANS: Serrum Ferritin was highly effective as a diagnostic tool as all the statistical
inference data were between 80%-95%. The reagent was highly efficient in testing
the hypothetical conditions with efficiency of almost 90%.
4. Describe the “Normal Distribution”:
Normal distribution is a continuous distribution. The PDF of the distribution is
1
σ √ 2 π e
− ( x −μ ) 2
2σ 2
μ and σ are mean and standard deviation of the distribution. The
probability curve looks like a bell shaped curve. The three measures of central
tendency coincide for normal distribution. The total area under the bell shaped curve
is one. Under the limiting conditions probability−¿ 0 , sample size−¿∞ binomial
distribution tends to normal distribution.
5. Complete this sentence: A sample population curve is more likely to look like the
population curve when sample size is very large (greater than 30)
6. In practice, the normal distribution is among the most important distributions in
statistics for three reasons. List these reasons:
a. Many probability distributions in nature tend to normal distribution by using
Central Limit theorem.
b. Further statistical calculations become easy.
10
Sensitivity:
A
A+ C∗100 =
650
715 ∗100 = 90.9090909%
Specificity:
D
D+ B∗100 =
1200
1350 ∗100 = 88.8888889%
Positive Predictive Value:
A
A+ B ∗100 = 81.25%
Negative Predictive Value:
D
D+C ∗100 = 94.8616601%
Efficiency:
A + D
TOTAL ∗100 = 89.58%
3. Describe the clinical performance of the serum ferritin test as a diagnostic tool (from the
above calculations).
ANS: Serrum Ferritin was highly effective as a diagnostic tool as all the statistical
inference data were between 80%-95%. The reagent was highly efficient in testing
the hypothetical conditions with efficiency of almost 90%.
4. Describe the “Normal Distribution”:
Normal distribution is a continuous distribution. The PDF of the distribution is
1
σ √ 2 π e
− ( x −μ ) 2
2σ 2
μ and σ are mean and standard deviation of the distribution. The
probability curve looks like a bell shaped curve. The three measures of central
tendency coincide for normal distribution. The total area under the bell shaped curve
is one. Under the limiting conditions probability−¿ 0 , sample size−¿∞ binomial
distribution tends to normal distribution.
5. Complete this sentence: A sample population curve is more likely to look like the
population curve when sample size is very large (greater than 30)
6. In practice, the normal distribution is among the most important distributions in
statistics for three reasons. List these reasons:
a. Many probability distributions in nature tend to normal distribution by using
Central Limit theorem.
b. Further statistical calculations become easy.
10
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
P a g e | 11
c. Raw data can be converted to quartiles and percentiles for known mean and
standard deviation values.
7. What is a z-score?
Z-score measures deviation score. The standard deviations of the data from the mean
value of the raw data measured in positive as well as negative direction are basically
z-score.
8. What is the Central Limit Theorem?
According to Central Limit theorem probability distribution of sample mean (mean of
sample means) tends to normal distribution for large number of sample size
irrespective of the actual probability distribution of the population.
9. A normal distribution is characterized by:
a. a bell shape
b. a mean, median, and mode that are equal
c. a total area under the cure above the x-axis that is 1.
d. all of the above.
ANS: All of the above
10. A z-score of 0 corresponds to the
a. mean.
b. SD.
c. Interquartile range.
d. 75th percentile
ANS: Mean
11. The 50th percentile is always the
a. mean.
b. median
c. SD
d. a and b
ANS: Median
12. A sample population curve is more likely to look like the population curve when
a. the bell shape is wide
b. the sample size is small
c. the sample size is greater than 20
d. none of the above.
ANS: None of the above (as the sample size should be greater than 30)
11
c. Raw data can be converted to quartiles and percentiles for known mean and
standard deviation values.
7. What is a z-score?
Z-score measures deviation score. The standard deviations of the data from the mean
value of the raw data measured in positive as well as negative direction are basically
z-score.
8. What is the Central Limit Theorem?
According to Central Limit theorem probability distribution of sample mean (mean of
sample means) tends to normal distribution for large number of sample size
irrespective of the actual probability distribution of the population.
9. A normal distribution is characterized by:
a. a bell shape
b. a mean, median, and mode that are equal
c. a total area under the cure above the x-axis that is 1.
d. all of the above.
ANS: All of the above
10. A z-score of 0 corresponds to the
a. mean.
b. SD.
c. Interquartile range.
d. 75th percentile
ANS: Mean
11. The 50th percentile is always the
a. mean.
b. median
c. SD
d. a and b
ANS: Median
12. A sample population curve is more likely to look like the population curve when
a. the bell shape is wide
b. the sample size is small
c. the sample size is greater than 20
d. none of the above.
ANS: None of the above (as the sample size should be greater than 30)
11
P a g e | 12
13. When raw scores are converted to standard scores, the resulting distributions have a
mean equal to ___zero____ and a standard deviation equal to ____one_______.
14. Scores on a particular test are normally distributed with a mean of 70 and a SD of 15.
Between what two scores would you expect?
a. 68% of the scores to fall between: -84.925 and 84.925( -0.995 to 0.995 for z-
score)
b. 96% of the scores to fall between: -108.25 and 108.25 (-2.55 to 2.55 for z-score)
a. Calculations:
P ( −α≤X≤α ) =0 .68 => P ( X≤α ) =0. 34 => P ( Z≤α−70
15 )=
0 .34 => P ( Z≤α−70
15 ) =P ( Z≤0 . 995 )
=> α −70
15 =0 . 995 => α=84 . 925
B. Calculations:
P ( −α≤X≤α ) =0 . 96=> P ( X ≤α ) =0 . 48 => P ( Z≤ α −70
15 ) =0 . 48 => P ( Z≤ α −70
15 )=P ( Z≤2. 55 )
=> α −70
15 =2 .55 => α=108 .25
Chapter 4: Hypothesis Testing with Inferential Statistics.
1. Write a testable hypothesis and explain the difference between the null and alternative
hypotheses.
ANS: Testable hypothesis: Students marks in semester exams are related to
attendance percentage in class.
The null hypothesis will assume that semester marks are independent of attendance
percentage only to be nullified against an alternate hypothesis (Christensen, 2011).
Now that alternate hypothesis may be one tailed or two tailed depending upon the
experimenter’s choice. In this case a right tailed alternate hypothesis will say that
students get higher marks for better attendance in classes. Null hypothesis is meant to
be rejected under against the alternate hypothesis with the help of test statistic.
2. Define statistical significance and explain the meaning of a p-value.
ANS: Statistical significance level or alpha indicates the probability of uncertainty or
chance of unsuccessful association of sample statistic with population parameter.
Level of significance defines critical region (Härdle, 2012). P-value is the probability
of significance. It denotes the calculated probability value of the test statistic. For a
p-value less than alpha null hypothesis gets rejected.
12
13. When raw scores are converted to standard scores, the resulting distributions have a
mean equal to ___zero____ and a standard deviation equal to ____one_______.
14. Scores on a particular test are normally distributed with a mean of 70 and a SD of 15.
Between what two scores would you expect?
a. 68% of the scores to fall between: -84.925 and 84.925( -0.995 to 0.995 for z-
score)
b. 96% of the scores to fall between: -108.25 and 108.25 (-2.55 to 2.55 for z-score)
a. Calculations:
P ( −α≤X≤α ) =0 .68 => P ( X≤α ) =0. 34 => P ( Z≤α−70
15 )=
0 .34 => P ( Z≤α−70
15 ) =P ( Z≤0 . 995 )
=> α −70
15 =0 . 995 => α=84 . 925
B. Calculations:
P ( −α≤X≤α ) =0 . 96=> P ( X ≤α ) =0 . 48 => P ( Z≤ α −70
15 ) =0 . 48 => P ( Z≤ α −70
15 )=P ( Z≤2. 55 )
=> α −70
15 =2 .55 => α=108 .25
Chapter 4: Hypothesis Testing with Inferential Statistics.
1. Write a testable hypothesis and explain the difference between the null and alternative
hypotheses.
ANS: Testable hypothesis: Students marks in semester exams are related to
attendance percentage in class.
The null hypothesis will assume that semester marks are independent of attendance
percentage only to be nullified against an alternate hypothesis (Christensen, 2011).
Now that alternate hypothesis may be one tailed or two tailed depending upon the
experimenter’s choice. In this case a right tailed alternate hypothesis will say that
students get higher marks for better attendance in classes. Null hypothesis is meant to
be rejected under against the alternate hypothesis with the help of test statistic.
2. Define statistical significance and explain the meaning of a p-value.
ANS: Statistical significance level or alpha indicates the probability of uncertainty or
chance of unsuccessful association of sample statistic with population parameter.
Level of significance defines critical region (Härdle, 2012). P-value is the probability
of significance. It denotes the calculated probability value of the test statistic. For a
p-value less than alpha null hypothesis gets rejected.
12
P a g e | 13
3. Define “power”:
ANS: Power of a test is the probability of rejecting a false null hypothesis. Hence
power of a test = 1 – Type II error.
4. What is a Type I Error?
ANS: Type I error of a statistical test is the probability of rejection of a true null
hypothesis.
5. What is a Type II Error?
ANS: Type II error of a statistical test is the probability of acceptance of a false null
hypothesis.
6. What is the relationship between Type I and Type II Errors?
ANS: Type I error and type II error are inversely related. For a increase in type I error
type II decreases and vice-versa.
7. A type I error occurs when the (indicate the answer):
a. null hypothesis is accepted when it is false
b. null hypothesis is rejected when it is true
c. sample size is too small
d. effect size is not defined in advance.
ANS: b. null hypothesis is rejected when it is true
8. A type II error occurs when the (indicate the answer):
a. null hypothesis is accepted when it is false
b. null hypothesis is rejected when it is true
c. sample size is too small
d. effect size is not defined in advance.
ANS: a. null hypothesis is accepted when it is false
9. The a-level is defined by
a. the probability of making a type I error
b. the probability of making a type II error.
c. the researcher at the start of a study.
d. a and c only
ANS: a. the probability of making a type I error
10. Which of the following is more likely to contain the “true” population value of the
mean?
a. A 90% confidence interval (CI)
b. A 95% CI.
c. A 99% CI
d. All of the above.
13
3. Define “power”:
ANS: Power of a test is the probability of rejecting a false null hypothesis. Hence
power of a test = 1 – Type II error.
4. What is a Type I Error?
ANS: Type I error of a statistical test is the probability of rejection of a true null
hypothesis.
5. What is a Type II Error?
ANS: Type II error of a statistical test is the probability of acceptance of a false null
hypothesis.
6. What is the relationship between Type I and Type II Errors?
ANS: Type I error and type II error are inversely related. For a increase in type I error
type II decreases and vice-versa.
7. A type I error occurs when the (indicate the answer):
a. null hypothesis is accepted when it is false
b. null hypothesis is rejected when it is true
c. sample size is too small
d. effect size is not defined in advance.
ANS: b. null hypothesis is rejected when it is true
8. A type II error occurs when the (indicate the answer):
a. null hypothesis is accepted when it is false
b. null hypothesis is rejected when it is true
c. sample size is too small
d. effect size is not defined in advance.
ANS: a. null hypothesis is accepted when it is false
9. The a-level is defined by
a. the probability of making a type I error
b. the probability of making a type II error.
c. the researcher at the start of a study.
d. a and c only
ANS: a. the probability of making a type I error
10. Which of the following is more likely to contain the “true” population value of the
mean?
a. A 90% confidence interval (CI)
b. A 95% CI.
c. A 99% CI
d. All of the above.
13
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
P a g e | 14
ANS: d. All of the above.
11. If a statistical test is significant, it means that
a. it has important clinical applications
b. the study had acceptable power
c. the null hypothesis was rejected.
d. all of the above are true.
ANS: c. the null hypothesis was rejected.
12. Power can be increased by doing which of the following?
a. Increasing the a-level.
b. increasing the sample size
c. Increasing the effect size
d. All of the above
ANS: d. All of the above
13. Complete the following sentence: The power of a statistical test is its ability to detect
statistically significant differences; it is defined as the probability of rejecting a null
hypothesis when it is false (1- type II error).
14. In empirical research, hypotheses are tested using inferential statistics. Give 2
examples of inferential statistics:
a. Mean marks of all students for the subject Statistics
b. Grade scores of class III students in the subject Environment Science.
15. List and discuss the six-step process for testing hypotheses:
a. Statement of Null hypothesis
b. Statement of Alternate hypothesis
c. Collection of sample data
d. Choice of proper test Statistic.
e. Choice of level of significance
f. Conclusion about null hypothesis from the value of test statistic
16. What is a confidence interval and why would you construct one?
ANS: Confidence interval for a test statistic reflects the amount of uncertainty related
to the sample to assess the population parameter. A 95% confidence interval for a
sample test statistic indicates that there is a 5% uncertainty of assessing the
population parameter which is also known as Level of Significance.
14
ANS: d. All of the above.
11. If a statistical test is significant, it means that
a. it has important clinical applications
b. the study had acceptable power
c. the null hypothesis was rejected.
d. all of the above are true.
ANS: c. the null hypothesis was rejected.
12. Power can be increased by doing which of the following?
a. Increasing the a-level.
b. increasing the sample size
c. Increasing the effect size
d. All of the above
ANS: d. All of the above
13. Complete the following sentence: The power of a statistical test is its ability to detect
statistically significant differences; it is defined as the probability of rejecting a null
hypothesis when it is false (1- type II error).
14. In empirical research, hypotheses are tested using inferential statistics. Give 2
examples of inferential statistics:
a. Mean marks of all students for the subject Statistics
b. Grade scores of class III students in the subject Environment Science.
15. List and discuss the six-step process for testing hypotheses:
a. Statement of Null hypothesis
b. Statement of Alternate hypothesis
c. Collection of sample data
d. Choice of proper test Statistic.
e. Choice of level of significance
f. Conclusion about null hypothesis from the value of test statistic
16. What is a confidence interval and why would you construct one?
ANS: Confidence interval for a test statistic reflects the amount of uncertainty related
to the sample to assess the population parameter. A 95% confidence interval for a
sample test statistic indicates that there is a 5% uncertainty of assessing the
population parameter which is also known as Level of Significance.
14
P a g e | 15
17. Complete the following sentence: Confidence Intervals are typically constructed as
either 95% or 99% Confidence Intervals.
18. Describe significance level (alpha)?
ANS: Level of Significance is the uncertainty associated with sample statistic to
estimate the population parameter. It is nothing but the probability of rejecting the
null hypothesis when the hypothesis is actually true.
19. What is the difference between an Alpha that is one-or one that is two-tailed?
ANS:
20. If a statistical test is significant, it means that
a. it has important clinical applications.
b. the study had acceptable power
c. the null hypothesis was rejected
d. all of the above
ANS: d. all of the above
15
For one tail test the alternate hypothesis is: sample statistic
is either less than or more than the population parameter
but measured separately. Level of significance alpha for a
one tail test assigns the significance in one tail of the
distribution. For example a left test for 5% alpha assigns
probability 0.05 in left test for critical region.
For two tail test the alternate hypothesis is: sample
statistic can be either less than or more than the
population parameter in a single test. Level of
significance alpha for a two tail test assigns the
significance in both tails of the distribution. For example
a both tail test for 5% alpha assigns probability of 0.025
in both tails for the critical region.
17. Complete the following sentence: Confidence Intervals are typically constructed as
either 95% or 99% Confidence Intervals.
18. Describe significance level (alpha)?
ANS: Level of Significance is the uncertainty associated with sample statistic to
estimate the population parameter. It is nothing but the probability of rejecting the
null hypothesis when the hypothesis is actually true.
19. What is the difference between an Alpha that is one-or one that is two-tailed?
ANS:
20. If a statistical test is significant, it means that
a. it has important clinical applications.
b. the study had acceptable power
c. the null hypothesis was rejected
d. all of the above
ANS: d. all of the above
15
For one tail test the alternate hypothesis is: sample statistic
is either less than or more than the population parameter
but measured separately. Level of significance alpha for a
one tail test assigns the significance in one tail of the
distribution. For example a left test for 5% alpha assigns
probability 0.05 in left test for critical region.
For two tail test the alternate hypothesis is: sample
statistic can be either less than or more than the
population parameter in a single test. Level of
significance alpha for a two tail test assigns the
significance in both tails of the distribution. For example
a both tail test for 5% alpha assigns probability of 0.025
in both tails for the critical region.
P a g e | 16
Chapter5: SPSS Exercises for Chapters 1 – 4
Frequency Tables, Descriptive Statistics and Histograms in SPSS
1. a. Create and submit a frequency distribution
Table 1: FREQUENCY DISTRIBUTION FOR DATASET I
Car Parked
Frequency Percent Valid
Percent
Cumulative
Percent
Valid
10 1 4.0 4.0 4.0
14 1 4.0 4.0 8.0
15 2 8.0 8.0 16.0
16 1 4.0 4.0 20.0
18 1 4.0 4.0 24.0
20 1 4.0 4.0 28.0
22 1 4.0 4.0 32.0
24 1 4.0 4.0 36.0
25 1 4.0 4.0 40.0
26 1 4.0 4.0 44.0
27 1 4.0 4.0 48.0
30 1 4.0 4.0 52.0
32 2 8.0 8.0 60.0
37 1 4.0 4.0 64.0
38 1 4.0 4.0 68.0
40 1 4.0 4.0 72.0
46 1 4.0 4.0 76.0
50 1 4.0 4.0 80.0
51 1 4.0 4.0 84.0
55 1 4.0 4.0 88.0
56 1 4.0 4.0 92.0
62 1 4.0 4.0 96.0
81 1 4.0 4.0 100.0
Total 25 100.0 100.0
16
Chapter5: SPSS Exercises for Chapters 1 – 4
Frequency Tables, Descriptive Statistics and Histograms in SPSS
1. a. Create and submit a frequency distribution
Table 1: FREQUENCY DISTRIBUTION FOR DATASET I
Car Parked
Frequency Percent Valid
Percent
Cumulative
Percent
Valid
10 1 4.0 4.0 4.0
14 1 4.0 4.0 8.0
15 2 8.0 8.0 16.0
16 1 4.0 4.0 20.0
18 1 4.0 4.0 24.0
20 1 4.0 4.0 28.0
22 1 4.0 4.0 32.0
24 1 4.0 4.0 36.0
25 1 4.0 4.0 40.0
26 1 4.0 4.0 44.0
27 1 4.0 4.0 48.0
30 1 4.0 4.0 52.0
32 2 8.0 8.0 60.0
37 1 4.0 4.0 64.0
38 1 4.0 4.0 68.0
40 1 4.0 4.0 72.0
46 1 4.0 4.0 76.0
50 1 4.0 4.0 80.0
51 1 4.0 4.0 84.0
55 1 4.0 4.0 88.0
56 1 4.0 4.0 92.0
62 1 4.0 4.0 96.0
81 1 4.0 4.0 100.0
Total 25 100.0 100.0
16
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
P a g e | 17
b. Create and submit a histogram
Figure 1: HISTOGRAM FOR DATASET I
c. Compute the measures of central tendency (i.e., mean, median, and mode),
dispersion (i.e., range, standard deviation) and the Fisher’s measure of skewness of
the distribution and the Fisher’s measure of kurtosis.
Table 2: DESCRIPTIVE STATISTICS COMPUTATION TABLE FOR DATASET I
N Minimu
m
Maximu
m
Mean Std.
Deviatio
n
Variance Skewness Kurtosis
Statisti
c
Statistic Statistic Statisti
c
Statistic Statistic Statis
tic
Std.
Erro
r
Statisti
c
Std.
Erro
r
Nos. of
car
parked
25 10 81 33.68 17.797 316.727 .895 .464 .447 .902
17
b. Create and submit a histogram
Figure 1: HISTOGRAM FOR DATASET I
c. Compute the measures of central tendency (i.e., mean, median, and mode),
dispersion (i.e., range, standard deviation) and the Fisher’s measure of skewness of
the distribution and the Fisher’s measure of kurtosis.
Table 2: DESCRIPTIVE STATISTICS COMPUTATION TABLE FOR DATASET I
N Minimu
m
Maximu
m
Mean Std.
Deviatio
n
Variance Skewness Kurtosis
Statisti
c
Statistic Statistic Statisti
c
Statistic Statistic Statis
tic
Std.
Erro
r
Statisti
c
Std.
Erro
r
Nos. of
car
parked
25 10 81 33.68 17.797 316.727 .895 .464 .447 .902
17
P a g e | 18
d. Results of the frequency distribution:
ANS: The mean numbers of cars parked was 33.68, ranging from 10 to 81 cars. The
standard deviation was 17.797 cars, and the coefficient of variation was
CV = s
x
− ∗100 %
=52 . 84 % The Fisher’s measure of skewness of the distribution was
0.895 (SD = 0.464) and Fisher’s measure of kurtosis was 0.447 (SD = 0.902)
Data Set –II:
a. Create and submit a frequency distribution
Table 3: FREQUENCY DISTRIBUTION FOR DATASET II
18
Blood Cholesterol (in mg/dL)
Levels
N Valid 36
Missing 0
Mean 194.69
Std. Error of Mean 2.656
Median 193.00
Mode 183a
Std. Deviation 15.935
Variance 253.933
Skewness 1.915
Std. Error of Skewness .393
Kurtosis 5.204
Std. Error of Kurtosis .768
Percentiles
25 183.25
50 193.00
75 197.50
d. Results of the frequency distribution:
ANS: The mean numbers of cars parked was 33.68, ranging from 10 to 81 cars. The
standard deviation was 17.797 cars, and the coefficient of variation was
CV = s
x
− ∗100 %
=52 . 84 % The Fisher’s measure of skewness of the distribution was
0.895 (SD = 0.464) and Fisher’s measure of kurtosis was 0.447 (SD = 0.902)
Data Set –II:
a. Create and submit a frequency distribution
Table 3: FREQUENCY DISTRIBUTION FOR DATASET II
18
Blood Cholesterol (in mg/dL)
Levels
N Valid 36
Missing 0
Mean 194.69
Std. Error of Mean 2.656
Median 193.00
Mode 183a
Std. Deviation 15.935
Variance 253.933
Skewness 1.915
Std. Error of Skewness .393
Kurtosis 5.204
Std. Error of Kurtosis .768
Percentiles
25 183.25
50 193.00
75 197.50
P a g e | 19
b. Compute the measures of central tendency (i.e., mean, median, and mode), dispersion
(i.e., range, standard deviation) and the Fisher’s measure of skewness of the
distribution and the Fisher’s measure of kurtosis.
Table 4: DESCRIPTIVE STATISTIC COMPUTATION TABLE FOR DATASET II
N Minimum Maximu
m
Mean Std.
Deviation
Skewness Kurtosis
Statistic Statistic Statistic Statistic Statistic Statistic Std.
Error
Statistic Std.
Error
Blood
Cholestero
l (in
mg/dL)
Levels
36 173 254 194.69 15.935 1.915 .393 5.204 .768
c. Create and submit a histogram
Figure 2: HISTOGRAM FOR DATASET II
19
b. Compute the measures of central tendency (i.e., mean, median, and mode), dispersion
(i.e., range, standard deviation) and the Fisher’s measure of skewness of the
distribution and the Fisher’s measure of kurtosis.
Table 4: DESCRIPTIVE STATISTIC COMPUTATION TABLE FOR DATASET II
N Minimum Maximu
m
Mean Std.
Deviation
Skewness Kurtosis
Statistic Statistic Statistic Statistic Statistic Statistic Std.
Error
Statistic Std.
Error
Blood
Cholestero
l (in
mg/dL)
Levels
36 173 254 194.69 15.935 1.915 .393 5.204 .768
c. Create and submit a histogram
Figure 2: HISTOGRAM FOR DATASET II
19
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
P a g e | 20
d. Results of the frequency distribution:
ANS: The mean of Cholesterol (in mg/dL) level was 194.69 mg/dL, ranging from
173 mg/dL to 254 mg/dL. The standard deviation was 15.935 mg/dL, and the
coefficient of variation was
CV = s
x
− ∗100 %
=8 . 18 % The Fisher’s measure of skewness
of the distribution was 1.915 (SD = 0.393) and Fisher’s measure of kurtosis was
5.204 (SD = 0.768)
2. From the data in the table below, determine the following through the use of SPSS:
a. What percentage of this group is married?
ANS: 53.3% of this group is married.
b. What is the mean age and standard deviation of age?
ANS: Mean age was 38.77 years and standard deviation of 10.227 years
c. How many participants are older than 30 years?
ANS: 19 participants are older than 30 years.
d. What percentage of this group is older than 30 years?
ANS: 61.29% participants are older than 30 years.
e. What percentage of this group is male?
ANS: 52.63% males are of this age group.
f. What are the mean and standard deviation of years of work experience?
ANS: Mean is 15.13 years and standard deviation is 9.028 years of years of work
experience.
g. What percentage of this group has a BS Degree?
ANS: 53.3% of this age group has a BS degree
h. Report the results of the data set (following the example on page 55 of the Kellar
& Kelvin text.
ANS:
The mean age of participants was 38.77 years, ranging from 25 years to 58 years.
The standard deviation was 10.227 years, and the coefficient of variation was
20
d. Results of the frequency distribution:
ANS: The mean of Cholesterol (in mg/dL) level was 194.69 mg/dL, ranging from
173 mg/dL to 254 mg/dL. The standard deviation was 15.935 mg/dL, and the
coefficient of variation was
CV = s
x
− ∗100 %
=8 . 18 % The Fisher’s measure of skewness
of the distribution was 1.915 (SD = 0.393) and Fisher’s measure of kurtosis was
5.204 (SD = 0.768)
2. From the data in the table below, determine the following through the use of SPSS:
a. What percentage of this group is married?
ANS: 53.3% of this group is married.
b. What is the mean age and standard deviation of age?
ANS: Mean age was 38.77 years and standard deviation of 10.227 years
c. How many participants are older than 30 years?
ANS: 19 participants are older than 30 years.
d. What percentage of this group is older than 30 years?
ANS: 61.29% participants are older than 30 years.
e. What percentage of this group is male?
ANS: 52.63% males are of this age group.
f. What are the mean and standard deviation of years of work experience?
ANS: Mean is 15.13 years and standard deviation is 9.028 years of years of work
experience.
g. What percentage of this group has a BS Degree?
ANS: 53.3% of this age group has a BS degree
h. Report the results of the data set (following the example on page 55 of the Kellar
& Kelvin text.
ANS:
The mean age of participants was 38.77 years, ranging from 25 years to 58 years.
The standard deviation was 10.227 years, and the coefficient of variation was
20
P a g e | 21
CV = s
x
− ∗100 %
=26 . 38 % The Fisher’s measure of skewness of the distribution was
0.1645 (SD = 0.427) and Fisher’s measure of kurtosis was -1.229 (SD = 0.833).
The mean number of years worked by the participants was 15.13 years, ranging from
2 years to 27 years. The standard deviation was 9.028 years, and the coefficient of
variation was
CV = s
x
− ∗100 %
=59 . 67 % The Fisher’s measure of skewness of the
distribution was -0.108 (SD = 0.427) and Fisher’s measure of kurtosis was -1.626
(SD = 0.833).
i. Develop one problem statement that the data set could address.
ANS: An organization wants to measure life satisfaction of the residents of place
xyz who are involved in nursing jobs. The organization wants to study the mental
condition of the nurses and performance based on their ages.
j. Develop at least 1 null and alternative hypothesis statements that this data could
address.
ANS: Null Hypothesis:
1. Age has no effect on mental health of the nurses
2. Educational qualification has no effect on stress handling in nursing
Alternate Hypothesis:
1. Mental health of nurses deteriorates with age due to complexity of job
2. Educational qualification improves stress handling and leads to job satisfaction
k. Identify the measurement levels of each variable in the data set (nominal, ordinal,
interval or ratio)
ANS:
ID # Gender Age
(Years)
Marital
Status
Years
Worke
d
Degree
Scal
e
Nomina
l
Interva
l
Nomina
l Interval Nomina
l
21
CV = s
x
− ∗100 %
=26 . 38 % The Fisher’s measure of skewness of the distribution was
0.1645 (SD = 0.427) and Fisher’s measure of kurtosis was -1.229 (SD = 0.833).
The mean number of years worked by the participants was 15.13 years, ranging from
2 years to 27 years. The standard deviation was 9.028 years, and the coefficient of
variation was
CV = s
x
− ∗100 %
=59 . 67 % The Fisher’s measure of skewness of the
distribution was -0.108 (SD = 0.427) and Fisher’s measure of kurtosis was -1.626
(SD = 0.833).
i. Develop one problem statement that the data set could address.
ANS: An organization wants to measure life satisfaction of the residents of place
xyz who are involved in nursing jobs. The organization wants to study the mental
condition of the nurses and performance based on their ages.
j. Develop at least 1 null and alternative hypothesis statements that this data could
address.
ANS: Null Hypothesis:
1. Age has no effect on mental health of the nurses
2. Educational qualification has no effect on stress handling in nursing
Alternate Hypothesis:
1. Mental health of nurses deteriorates with age due to complexity of job
2. Educational qualification improves stress handling and leads to job satisfaction
k. Identify the measurement levels of each variable in the data set (nominal, ordinal,
interval or ratio)
ANS:
ID # Gender Age
(Years)
Marital
Status
Years
Worke
d
Degree
Scal
e
Nomina
l
Interva
l
Nomina
l Interval Nomina
l
21
P a g e | 22
Reference Lists
Christensen, L. B., Johnson, B., Turner, L. A., & Christensen, L. B. (2011). Research methods,
design, and analysis.
Forbes, C., Evans, M., Hastings, N., & Peacock, B. (2011). Statistical distributions. John Wiley
& Sons.
Gherardi-Donato, E. C. D. S., Cardoso, L., Teixeira, C. A. B., Pereira, S. D. S., &Reisdorfer, E.
(2015). Association between depression and work stress in nursing professionals with
technical education level. Revistalatino-americana de enfermagem, 23(4), 733-740.
Härdle, W., & Simar, L. (2012). Applied multivariate statistical analysis.
MacKinnon, D. P. (2008). Introduction to statistical mediation analysis. Routledge.
Tenenbaum, J. B., Kemp, C., Griffiths, T. L., & Goodman, N. D. (2011). How to grow a mind:
Statistics, structure, and abstraction. science, 331(6022), 1279-1285.
Wasserman, L. (2013). All of statistics: a concise course in statistical inference. Springer
Science & Business Media.
22
Reference Lists
Christensen, L. B., Johnson, B., Turner, L. A., & Christensen, L. B. (2011). Research methods,
design, and analysis.
Forbes, C., Evans, M., Hastings, N., & Peacock, B. (2011). Statistical distributions. John Wiley
& Sons.
Gherardi-Donato, E. C. D. S., Cardoso, L., Teixeira, C. A. B., Pereira, S. D. S., &Reisdorfer, E.
(2015). Association between depression and work stress in nursing professionals with
technical education level. Revistalatino-americana de enfermagem, 23(4), 733-740.
Härdle, W., & Simar, L. (2012). Applied multivariate statistical analysis.
MacKinnon, D. P. (2008). Introduction to statistical mediation analysis. Routledge.
Tenenbaum, J. B., Kemp, C., Griffiths, T. L., & Goodman, N. D. (2011). How to grow a mind:
Statistics, structure, and abstraction. science, 331(6022), 1279-1285.
Wasserman, L. (2013). All of statistics: a concise course in statistical inference. Springer
Science & Business Media.
22
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
P a g e | 23
APPENDIX
Age (Years)
Table 5: DESCRIPTIVE STUDY FOR AGE DATASET III
Statistics
Age (Years)
N Valid 30
Missing 0
Mean 38.77
Median 40.33a
Mode 27
Std. Deviation 10.227
Minimum 25
Maximum 58
Percentiles
25 28.50b
50 40.33
75 48.00
Frequency Percent Valid Percent Cumulative
Percent
Valid 25 1 3.3 3.3 3.3
26 2 6.7 6.7 10.0
27 5 16.7 16.7 26.7
30 3 10.0 10.0 36.7
33 1 3.3 3.3 40.0
37 1 3.3 3.3 43.3
39 2 6.7 6.7 50.0
41 1 3.3 3.3 53.3
42 3 10.0 10.0 63.3
43 2 6.7 6.7 70.0
47 1 3.3 3.3 73.3
49 3 10.0 10.0 83.3
23
APPENDIX
Age (Years)
Table 5: DESCRIPTIVE STUDY FOR AGE DATASET III
Statistics
Age (Years)
N Valid 30
Missing 0
Mean 38.77
Median 40.33a
Mode 27
Std. Deviation 10.227
Minimum 25
Maximum 58
Percentiles
25 28.50b
50 40.33
75 48.00
Frequency Percent Valid Percent Cumulative
Percent
Valid 25 1 3.3 3.3 3.3
26 2 6.7 6.7 10.0
27 5 16.7 16.7 26.7
30 3 10.0 10.0 36.7
33 1 3.3 3.3 40.0
37 1 3.3 3.3 43.3
39 2 6.7 6.7 50.0
41 1 3.3 3.3 53.3
42 3 10.0 10.0 63.3
43 2 6.7 6.7 70.0
47 1 3.3 3.3 73.3
49 3 10.0 10.0 83.3
23
P a g e | 24
50 2 6.7 6.7 90.0
51 1 3.3 3.3 93.3
57 1 3.3 3.3 96.7
58 1 3.3 3.3 100.0
Total 30 100.0 100.0
Table 6: DESCRIPTIVE STUDY FOR GENDER DATASET III
Gender
Statistics
Gender
N Valid 30
Missing 0
Mean 1.47
Median 1.47a
Mode 1
Std. Deviation .507
Minimum 1
Maximum 2
Percentiles
25 .b,c
50 1.47
75 1.97
Frequenc
y
Percent Valid Percent Cumulative
Percent
Valid
MALE 16 53.3 53.3 53.3
FEMALE 14 46.7 46.7 100.0
Total 30 100.0 100.0
24
50 2 6.7 6.7 90.0
51 1 3.3 3.3 93.3
57 1 3.3 3.3 96.7
58 1 3.3 3.3 100.0
Total 30 100.0 100.0
Table 6: DESCRIPTIVE STUDY FOR GENDER DATASET III
Gender
Statistics
Gender
N Valid 30
Missing 0
Mean 1.47
Median 1.47a
Mode 1
Std. Deviation .507
Minimum 1
Maximum 2
Percentiles
25 .b,c
50 1.47
75 1.97
Frequenc
y
Percent Valid Percent Cumulative
Percent
Valid
MALE 16 53.3 53.3 53.3
FEMALE 14 46.7 46.7 100.0
Total 30 100.0 100.0
24
P a g e | 25
Table 7: DESCRIPTIVE STUDY FOR YEARS WORKED DATASET III
Statistics
Years Worked
N Valid 30
Missing 0
Mean 15.13
Median 16.67a
Mode 4b
Std. Deviation 9.028
Minimum 2
Maximum 27
Percentiles
25 5.33c
50 16.67
75 24.33
Years Worked
Frequency Percent Valid Percent Cumulative
Percent
Valid 2 1 3.3 3.3 3.3
3 1 3.3 3.3 6.7
4 4 13.3 13.3 20.0
5 2 6.7 6.7 26.7
6 1 3.3 3.3 30.0
7 1 3.3 3.3 33.3
8 1 3.3 3.3 36.7
10 1 3.3 3.3 40.0
14 1 3.3 3.3 43.3
16 2 6.7 6.7 50.0
17 1 3.3 3.3 53.3
19 2 6.7 6.7 60.0
20 3 10.0 10.0 70.0
24 2 6.7 6.7 76.7
25
Table 7: DESCRIPTIVE STUDY FOR YEARS WORKED DATASET III
Statistics
Years Worked
N Valid 30
Missing 0
Mean 15.13
Median 16.67a
Mode 4b
Std. Deviation 9.028
Minimum 2
Maximum 27
Percentiles
25 5.33c
50 16.67
75 24.33
Years Worked
Frequency Percent Valid Percent Cumulative
Percent
Valid 2 1 3.3 3.3 3.3
3 1 3.3 3.3 6.7
4 4 13.3 13.3 20.0
5 2 6.7 6.7 26.7
6 1 3.3 3.3 30.0
7 1 3.3 3.3 33.3
8 1 3.3 3.3 36.7
10 1 3.3 3.3 40.0
14 1 3.3 3.3 43.3
16 2 6.7 6.7 50.0
17 1 3.3 3.3 53.3
19 2 6.7 6.7 60.0
20 3 10.0 10.0 70.0
24 2 6.7 6.7 76.7
25
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
P a g e | 26
25 1 3.3 3.3 80.0
26 4 13.3 13.3 93.3
27 2 6.7 6.7 100.0
Total 30 100.0 100.0
Table 8: DESCRIPTIVE STUDY FOR EDUCATION DATASET III
Education
Frequency Percent Valid Percent Cumulative
Percent
Valid
BS 16 53.3 53.3 53.3
MS 14 46.7 46.7 100.0
Total 30 100.0 100.0
Table 9: DESCRIPTIVE STUDY FOR MARITAL STATUS DATASET III
Statistics
Marital Status
N Valid 30
Missing 0
Mean 1.53
Median 1.53a
Mode 2
Std. Deviation .507
Minimum 1
Maximum 2
Percentiles
25 1.03b
50 1.53
75 .
Marital Status
Frequency Percent Valid Percent Cumulative
Percent
Valid
single 14 46.7 46.7 46.7
married 16 53.3 53.3 100.0
Total 30 100.0 100.0
26
25 1 3.3 3.3 80.0
26 4 13.3 13.3 93.3
27 2 6.7 6.7 100.0
Total 30 100.0 100.0
Table 8: DESCRIPTIVE STUDY FOR EDUCATION DATASET III
Education
Frequency Percent Valid Percent Cumulative
Percent
Valid
BS 16 53.3 53.3 53.3
MS 14 46.7 46.7 100.0
Total 30 100.0 100.0
Table 9: DESCRIPTIVE STUDY FOR MARITAL STATUS DATASET III
Statistics
Marital Status
N Valid 30
Missing 0
Mean 1.53
Median 1.53a
Mode 2
Std. Deviation .507
Minimum 1
Maximum 2
Percentiles
25 1.03b
50 1.53
75 .
Marital Status
Frequency Percent Valid Percent Cumulative
Percent
Valid
single 14 46.7 46.7 46.7
married 16 53.3 53.3 100.0
Total 30 100.0 100.0
26
P a g e | 27
27
27
1 out of 27
Related Documents
Your All-in-One AI-Powered Toolkit for Academic Success.
+13062052269
info@desklib.com
Available 24*7 on WhatsApp / Email
Unlock your academic potential
© 2024 | Zucol Services PVT LTD | All rights reserved.