This document provides an explanation of descriptive statistics, including the construction of a frequency distribution table, histogram, and measures of central tendency and variability. It also discusses the context in which the data points represent a population or a sample.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Running Head: DESCRIPTIVE STATAISTICS 1 DESCRIPTIVE STATAISTICS Name of student: Name of Institution: Date: Author Note:
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
DESCRIPTIVE STATAISTICS Psy2530- Problem Set #1 1)The data presented above could be considered a sample or population. Using the information given in the introductory paragraph, explain to me in what context these 80 data points would represent a population and an example where they would represent a sample. Solution:The data represented above can be considered as a population if the study involves a small group of individuals. For example, the investigation of the existence of gender difference in the level of psychological well- being among the staff members of a school. In the scenario of a staff member of a school, if the total number of staff members in a school is 80, then the data set can be considered as a population. On the other hand, if the investigation involves gender differences in a school with many students and staff members, then the data set can be considered as a sample of the whole population in that school. A sample is a subset of a population(Libman & Zipora, 2010) 2)a)What is the independent variable in this study? Solution: The independent variable is gender (men or women). b)What is the dependent variable? Is it a continuous or discrete variable? What is the scale of measurement for this variable? Solution:The dependent variable is the level of psychological well- being. The dependent variable is continuous. The variable is measured on a numerical scale. 3)Construct a frequency distribution table with columns for a) absolute frequencies, b) cumulative absolute frequencies and c) relative frequencies using all 80 data points. Create a histogram illustrating the absolute frequencies. Be sure to calculate and present all measures of central tendency and variability for this distribution, i.e., mean, median, mode, range, variance, and standard deviation.
DESCRIPTIVE STATAISTICS Solution:Frequency tables forWomen Frequency table for women shows the number of women for each category of age groups. The age groups have been divided into classes with a width of 10. The column of “lower limit” below represents the lower limit of each class that starts from 0, and the “upper limit” shows the upper limit for each class which starts at 19. The absolute frequency is found by simply counting the number of individuals whose ages fall in the age bracket (class). Cumulative frequency has been obtained by adding up the absolute frequencies for each class. Since the total is 80, the percentage (%) has been obtained by dividing each of the values in the cumulative frequency column by 80 and multiplying by 100. Absolute frequency = Count of individuals in each class Cumulative frequency for class n = Cumulative of class (n-1) + Absolute Frequency of class n. For example: Cumulative Frequency for class (70-79) = Cumulative Frequency of class (60-69) + Absolute Frequency of class (70-79). That is; 27 + 10= 37 Lower limitUpper LimitAbsolute Frequency Cumulative Frequency Cumulative % 019000.00% 2029000.00% 3039000.00% 4049225.00% 50597922.50% 6069182767.50% 7079103792.50% 808923997.50% 9099140100.00% More0100.00% Solution:Frequency tables for Men Absolute frequency = Count of individuals in each class Cumulative frequency for class n = Cumulative of class (n-1) + Absolute Frequency of class n. For example: Cumulative Frequency for class (70-79) = Cumulative Frequency of class (60-69) + Absolute Frequency of class (70-79).
DESCRIPTIVE STATAISTICS That is; 27 + 9 = 36 Lower Limitupper limit Absolute Frequency Cumulative Frequency Cumulative % 019000.00% 2029112.50% 3039237.50% 404981127.50% 505961742.50% 6069102767.50% 707993690.00% 808933997.50% 9099140100.00% More0100.00% Solution:Histograms The histogram is a pictorial representation of the frequencies of each each clsas. The histogram represents a representation of class frequency against mid class. Class width= (Lower Limit + Upper Limit)/2 ClassMid- classFrequency 0-199.5Absolute Frequency 20-2924.50 30-3934.50 40-4944.50 50-5954.52 60-6964.57 70-7974.518 808984.510 90-9994.52 More1 0
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
DESCRIPTIVE STATAISTICS 697959498999192939More 0 2 4 6 8 10 12 14 16 18 20 0.00% 20.00% 40.00% 60.00% 80.00% 100.00% 120.00% Histogram showing the level Psychological Well- being of Women Frequency Cumulative % upper limit Frequency ClassMid- classFrequency 0-199.5Absolute Frequency 20-2924.50 30-3934.51 40-4944.52 50-5954.58 60-6964.56 70-7974.510 808984.59 90-9994.53 More1 0
DESCRIPTIVE STATAISTICS 697949598939299919More 0 2 4 6 8 10 12 0.00% 20.00% 40.00% 60.00% 80.00% 100.00% 120.00% Histogram showing theblevel Psychological Well- being of men Frequency Cumulative % upper limit Frequency Solution:Measures of central tendency and variability Mean= (sum of all data points/80) Mean = (70 + 80 + 77 + 66 + 60 + 70 + 65 + 65 + 60 + 50 + 51 + 80 + 60 + 65 + 65 + 64 + 69 + 54 + 61 + 57 + 73 + 43 + 75 + 60 + 90 + 70 + 65 + 55 + 55 + 62 + 70 + 45 + 75 + 65 + 50 + 70 + 68 + 67 + 60 + 79 + 75 + 50 + 58 + 41 + 60 + 72 + 60 + 70 + 40 + 79 + 80 + 55 + 60 + 70+ 50 + 65 + 42+ 48+ 70+ 60+ 80+ 45+ 60+ 40+ 45+ 55+ 65+ 55+ 65+ 62+ 20+ 75+ 78+ 95+ 45+ 60+ 85+ 35+ 35+ 75)/80 = (4966)/80 = 62.075 Median= The middle number. Median is found by arranging the number from the smallest to the largest and pickingmiddle number. Since the total number of data points is even (80), the median will fall in the 40thand 41stdata points. The data points are arranged as follows: The 40thdata point = 62 and the 41stdata point = 64 Therefore; the median = (62+64)/2 = 63
DESCRIPTIVE STATAISTICS PositionData point
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
DESCRIPTIVE STATAISTICS 4365 4465 4565 4665 4765 4865 4965 5065 5166 5267 5368 5469 5570 5670 5770 5870 5970 6070 6170 6270 6372 6473 6575 6675 6775 6875 6975 7077 7178 7279 7379 7480 7580 7680 7780 7885 7990 8095 The modeis the data point with the highest number of appearances. From the data points, it is clear that the data point with the highest number of appearance is 60.
DESCRIPTIVE STATAISTICS The Maximumdata point is the data point with the highest entry which is 95. The minimumis the data point with the lowest entry which is 75 The rangeis the difference between the maximum and minimm Range= Maximum – Minimum = 95- 75 = 20 The varianceis found as follows: First, we get the difference between each data point and the mean (x-mean) Second, we get the square of the difference above; (x-mean)^2 Next, we get the sum of (x-mean)^2 Variance= (Sum of (x-mean)^2)80 =14381.55/80 = 179.77 The Standard deviation =Square root of variance = square root of 179.77 = 13.41 Data point(x)x-mean(x-mean)^2 20-42.0751770.30563 35-27.075733.055625 35-27.075733.055625 40-22.075487.305625 40-22.075487.305625 41-21.075444.155625 42-20.075403.005625 43-19.075363.855625 45-17.075291.555625 45-17.075291.555625 45-17.075291.555625 45-17.075291.555625 48-14.075198.105625
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
DESCRIPTIVE STATAISTICS Solution:Boxplot for men b)What conclusions can be drawn by looking at this graph? Solution:I can see that women have a relatively higher level of phycological well- being than men. Similarly, the data set for women have outliers unlike the data for men that do not have 1an outlier. An outlier is a data point that appears to significantly different from the rest of the data points(Pfanzagl & Johann, 2017). 5)a) Create a stem and leaf display to represent all the data. Solution: #The R code Women<-c(70, 80, 77, 66, 60, 70, 65, 65, 60, 50, 51, 80, 60, 65, 65, 64, 69, 54, 61, 57, 73, 43, 75, 60, 90, 70, 65, 55, 55, 62, 70, 45, 75, 65, 50, 70, 68, 67, 60, 79) Men<-c(75, 50, 58, 41, 60, 72, 60, 70, 40, 79, 80, 55, 60, 70, 50, 65, 42, 48, 70, 60, 80, 45, 60, 40, 45, 55, 65, 55, 65, 62, 20, 75, 78, 95, 45, 60, 85, 35, 35, 75) stem(Women) stem(Men) Solution: Women
DESCRIPTIVE STATAISTICS The decimal point is 1 digit(s) to the right of the | 4 | 3 4 | 5 5 | 0014 5 | 557 6 | 00000124 6 | 5555556789 7 | 000003 7 | 5579 8 | 00 8 | 9 | 0 Solution:Men >stem(Men) The decimal point is one digit(s) to the right of the |1 2 | 0 3 | 55 4 | 00125558 5 | 005558 6 | 0000002555 7 | 000255589 8 | 005 9 | 5 b) What are the strengths and weaknesses of the stem and leaf diagram compared to other types of graphs? Solution:A stem and leaf diagram is more convenient in the determination of mode and median of a data set compared to the other methods(Shang & Han, 2015). A stem and leaf diagram is more convenient in determining the exact values of the outliers in a data set(Walter, 2011).
DESCRIPTIVE STATAISTICS 6)If there are outliers/atypical values in our dataset, explain what would happen to the measures of central tendency and variability (mean, median, mode, standard deviation, variance) if we eliminated these outliers. Which measures are most affected when outliers are excluded? Solution:Eliminating the outliers in a data set will reduce the values of the measures of central tendency and variability. The measures that are most affected by the outliers are the measures of central tendency i: e the mean and the median(Libman & Zipora, 2010). 7)I want to obtain the results of participants on a scale of 0 to 10 rather than 0 to 100, so I divide the score of each participant by 10. a)What is the name of this type of procedure? Solution: Weighted average procedure b)What will happen to the mean and standard deviation of the sample after this change? Solution: The new mean will be the original mean divided by 10 (original mean/10). Similarly, the standard deviation will be the original standard deviation divided by 10. 8)Calculatethemeasuresofcentraltendencyandvariabilityformenandwomen separately. What differences can be noted between groups? How are these measures different from the measurements obtained for the 80 participants put together?What does this say about both groups? Solution:The descriptive statistics table is shown in the table below. The differences that can be noted between the groups are that; 1.The mean value of the psychological well- being of women is higher than that of the men. 2.The standard deviation of well- being of men is higher than that of the women.
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
DESCRIPTIVE STATAISTICS 3.The median value of well- being of women is higher than that of men. Solution: The differences demonstrate that generally, women are perceived to be better psychologically than the men. Similarly, the differences suggest the psychological well- being is not universal among men and women. Mean of Women =(70 + 80 + 77 + 66 + 60 + 70 + 65 + 65 + 60 + 50 + 51 + 80 + 60 + 65 + 65 + 64 + 69 + 54 + 61 + 57 + 73 + 43 + 75 + 60 + 90 + 70 + 65 + 55 + 55 + 62 + 70 + 45 + 75 + 65 + 50 + 70 + 68 + 67 + 60 + 79)/40 =2586/40 =64.64 Mean for men= (75 + 50 + 58 + 41 + 60 + 72 + 60 + 70 + 40 + 79 + 80 + 55 + 60 + 70+ 50 + 65 + 42+ 48+ 70+ 60+ 80+ 45+ 60+ 40+ 45+ 55+ 65+ 55+ 65+ 62+ 20+ 75+ 78+ 95+ 45+ 60+ 85+ 35+ 35+ 75)/40 =2380/40 =59.5 Mode is the data point with the highest number of appearances. The mode for women is 65 The mode for men is 60 WomenMen Mean64.65Mean59.5 Median65Median60 Mode65Mode60 Standard Deviation9.9937Standard Deviation15.9775 Sample Variance99.87436Sample Variance255.2821 Range47Range75 Minimum43Minimum20 Maximum90Maximum95 Sum2586Sum2380 Count40Count40
DESCRIPTIVE STATAISTICS 9)I would like to know if there is a difference between the well-being of men and women. What would be my null hypothesis? My alternative hypothesis? Solution: H0: There is no significant difference in the average well- being of men and women H1: There is a significant difference in the average well- being of men and women Where H0 is the null hypothesis and H1 is the alternative hypothesis. 10)Assume that the distribution of the female and male scores both follow a normal distribution with the mean and variance you found for each group. You pick a woman with a score of 57 and a man with a score of 55. Which of these two people has a better level of well-being compared to other individuals of the same gender? Solution:The woman has a better well- being than the man. The woman has a better well- being because she has a smaller deviation from the mean (3.55), mean for women being 64.55. The man, on the other hand, has a larger standard deviation from the mean (4.5), the mean for men being 59.5 11)Assume that the distribution of all 80 scores has the mean and variance that you have found and that the data has a normal distribution. I randomly choose a participant and find that he/she has a score of 50. What proportion would (or percentage) of the population have a level of well-being greater than (or better than) that person? Solution: The mean for all 80 data points is 62.075 while the standard deviation is 1.5. The difference between the mean and the participant with a score of 50 is 62.075- 50=12.05. To get the number of deviations from the mean, we divide this difference by the value of one deviation and get that it is more than 3.We know that 1swviation is 68.27%, two deviations is 95.45% while three deviations are 99.994. Therefore, we can comfortably say that 99.994 % of the population have a better psychological well- being than this participant.
DESCRIPTIVE STATAISTICS References Libman, & Zipora. (2010). Integrating Real-Life Data Analysis in Teaching Descriptive Statistics: A Constructivist Approach.Journal of Statistics Education. Pfanzagl, & Johann. (2017). Mathematical Statistics || Descriptive Statistics.Springer Series in Statistics. Shang, & Han, L. (2015). Resampling Techniques for Estimating the Distribution of Descriptive Statistics of Functional Data.Communications in Statistics-Simulation and Computation. Walter, K. (2011). Interpreting socio-economic data—a foundation of descriptive statistics. Journal of Statistical Papers, 2.