AQM Assignment 2: Statistical Techniques and Probability Analysis

Verified

Added on  2023/03/17

|19
|1391
|74
Homework Assignment
AI Summary
This document presents a comprehensive solution to Applied Quantitative Methods Assignment 2, focusing on statistical analysis and probability. The solution includes detailed calculations and explanations for various statistical concepts. It begins with frequency tables and histograms to analyze passenger data, followed by measures of central tendency. The assignment then delves into sample vs. population, standard deviation, and interquartile range. Correlation analysis and regression equations are derived to examine relationships between variables. Probability concepts are addressed through various scenarios, including independent events, binomial and Poisson distributions, and normal distributions. Hypothesis testing and the Central Limit Theorem are also explored. The document provides clear step-by-step solutions, making it a valuable resource for students studying statistics and quantitative methods. The assignment covers topics such as frequency distribution, measures of central tendency, standard deviation, interquartile range, correlation, regression analysis, probability calculations, binomial distribution, and normal distribution.
Document Page
APPLIED QUANTITATIVE METHODS
ASSIGNMENT 2
Student Name
[Pick the date]
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Question 1
(a)Frequency table for passengers at Melbourne train station is indicated as
follows.
288 to 457 18 0.300 0.300 372.500
457 to 744 12 0.200 0.500 600.500
744 to 1032 9 0.150 0.650 888.000
1032 to 1319 7 0.117 0.767 1175.500
1319 to 1607 5 0.083 0.850 1463.000
1607 to 1895 2 0.033 0.883 1751.000
1895 to 2182 2 0.033 0.917 2038.500
2182 to 2470 2 0.033 0.950 2326.000
2470 to 2757 1 0.017 0.967 2613.500
2757 to 3044 2 0.033 1.000 2900.500
Total 60 1.000 7.783 16129.000
Classes Frequency Relative
Frequency
Cumulative Relative
Frequency
Class Mid
Point
(b)Frequency histogram
1
Document Page
(c)Determination of measures of central tendency
Sorted data - Ascending
2
Document Page
3
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Sample size = 60
Sum of data values = 58594
Mean = 58594/60 = 976.57
= (733+862)/2 = 797.5
Maximum frequency has observed for 401 and thus, mode would be 401.
4
Document Page
Question 2
(a)The given data corresponds to a sample as only selected data from the
population has been considered in the dataset provided. A year has 52
weeks and thereby the population data would contain attendance
corresponding to each of these weeks which is not the case. Thereby, it
would be regarded as a sample (Shi and Tao, 2015).
(b)Standard deviation for student’s weekly attendance
Standard deviation of sample = 1
N1 σ(x x)2 = 1
71 × (32909.714) = 74.06
(c)The Inter Quartile Range (IQR) for the total number of chocolate bars
sold
Sorted data - Ascending
5
Document Page
1st quartile = 25th percentile = {25*(7+1)/100}th value = 2nd value = 6014
3rd quartile = 75th percentile = {75*(7+1)/100}th value = 6th value = 7223
Inter quartile range = 3rd quartile - 1st quartile = 7223-6014 = 1209
The Interquartile range or IQR is preferred over standard deviation in case
of skewed data as the former (IQR) is not impacted by outliers which is not
true for latter (standard deviation) (Taylor and Cihon, 2017).
(d) The coefficient of correlation can be computed based on the following
formula.
6
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Variables are strongly correlated in a positive manner because the r value is
not only positive but also significantly high and close to 1 (Hastie, Tibshirani
and Friedman, 2014).
Question 3
7
Document Page
(a)Regression line
Least Square Regression Equation
Y = a+bX, where, a = intercept and b = slope
a =
476181676576 339223425707
{(7 1676576) 2 33922} = 1628.68
b =
7 23425707 {339247618}
{(7 (1676576)) 2 33922} = 10.677
Regression equation
y = 1628.689 +10.67 x
Number of chocolate bars sold = 1628.689 + (10.67* Weekly attendance)
For the above regression equation, the intercept value is approximately
1629. This would imply that this is the expected weekly sales of chocolate
8
Document Page
when weekly attendance is zero. The slope coefficient is 10.67 which
implies that a unit change in the weekly attendance would alter tha number
of chocolate bars sale by 10.67 units. Both the given variables would change
in the same direction as slope is positive (Medhi, 2016).
(b)R square
R square = (
R2 = (0.9680)2 = 0.9370
93.7% of the changes in chocolate bars sold will be explained by changes in
weekly attendance of student.
9
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Question 4
(a) P (Holmes OR Grassroots)
10
Document Page
= 35 + 92 + 12
35 + 92 + 54 + 12 = 0.7202
(b) P (External AND Scientific)
= 54
35 + 92 + 54 + 12 = 0.2798
(c) P (Holmes AND Scientific)
= 35
35 + 92 = 0.2756
(d) Training is event X and Recruitment is event Y will be independent
events when
P X.P Y= P X AND Y
PX= 54 + 12
35 + 92 + 54 + 12 = 0.3412
P Y= 35 + 54
35 + 92 + 54 + 12 = 0.4611
P X.P Y= 0.3412 0.4611 = 0.1573
P X AND Y= 0.2576
P(X). P(Y) is not same as P (X and Y) which means Training and
Recruitment” will not be termed as independent events.
11
chevron_up_icon
1 out of 19
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]