ProductsLogo
LogoStudy Documents
LogoAI Grader
LogoAI Answer
LogoAI Code Checker
LogoPlagiarism Checker
LogoAI Paraphraser
LogoAI Quiz
LogoAI Detector
PricingBlogAbout Us
logo

Two-Sample T-Test and Regression Analysis

Verified

Added on  2020/09/17

|20
|3025
|29
AI Summary
This assignment involves solving a two-sample t-test and regression analysis problem set. The first part of the assignment calculates the TCI (Test for Combined Inference) for Sales and Shipping Cost variables, including Kurtosis, Skewness, Range, Minimum, Maximum, Sum, Count, Mean, Standard Deviation, and Z-value. It also determines the Population mean and presents a t-test for two-sample assuming equal variances. The second part performs an ANOVA (Analysis of Variance) analysis on the Order Priority and Shipping Cost variables, including Source of Variation, SS, df, MS, F, P-value, and F crit. Lastly, it conducts a regression analysis on the Sales variable, presenting Regression Statistics, ANOVA, and Coefficients with Standard Error, t Stat, P-value, Lower 95%, Upper 95%, Lower 95.0%, and Upper 95.0%.

Contribute Materials

Your contribution can guide someone’s learning journey. Share your documents today.
Document Page
STATISTICAL ANALYSES

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Document Page
EXECUTIVE SUMMARY
In present research study multiple tools are applied on dataset and it is identified that change in
number of units sold is not bringing any big variation in entire sales amount. It is also identified
that West side is generating higher amount of sales for firms then East side. Thus, focus must be
customers that live in West side of the nation. Sales value is deviating at fast rate which reflect
that sales through transactions very at different level on varied order ID. Finally, it can be
said that sales through different sort of customers is almost equal . thus, attention equally need
to be given on all sort of customers.
Document Page
TABLE OF CONTENTS
INTRODUCTION...........................................................................................................................1
TASK 1............................................................................................................................................1
Preparing sample of 60 days from 2002 data..............................................................................1
TASK 2............................................................................................................................................1
Task 3...............................................................................................................................................5
Calculation of confidence interval...............................................................................................5
TASK 4............................................................................................................................................5
(1)Independent sample t test........................................................................................................5
(2) One way ANOVA..................................................................................................................6
TASK 5............................................................................................................................................6
Coorelation and regression..........................................................................................................6
CONCLUSION................................................................................................................................7
REFERENCE..................................................................................................................................9
APPENDIX....................................................................................................................................10
Summary statistics.........................................................................................................................10
Table 1CI calculation.....................................................................................................................11
Figure 1Freuqnecy distribution for order priority...........................................................................2
Figure 2Frequeny distribution for sort of customers.......................................................................2
Figure 3Frequency distribution for region.......................................................................................3
Figure 4Shipping mode distribution................................................................................................3
Figure 5Shiping cost chart...............................................................................................................5

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
INTRODUCTION
Statistics is the one of the domain that have due significence for the business firms. In
the present research study Australia sales data is analyzed in proper manner. Major purpose of
carrying out research study is to find out areas where most of sales are made and trends that exist
in respect to sales and product categories as well as different sort of customers. In this regard
descriptive statistics is computed and frequency tables are prepared and same are analyzed
through charts. Apart from this, in the report detail discussion is carried out on variables that are
in dataset by considering results obtained on regression analysis and ANOVA as well as t test.
Thus, it can be said that detailed analysis of variables is done in the present research study and it
is expected that reader will obtain detailed information about sales trends that exist in Australia
market.
TASK 1
Preparing sample of 60 days from 2002 data
According to requirement data is sorted and sample of 60 transactions is taken in to
account.
TASK 2
Order
Priority
Frequen
cy
Customer
Segment
Frequen
cy
Regi
on
Frequen
cy Ship Mode
Frequen
cy
Not
specified 8 Consumer 17 E 36 Regular Air 48
Low 10 Corporate 18 W 24
Delivery
Truck 5
Medium 11 Home office 9 Express air 6
High 17 Small business 16
Critical 14
1 | P a g e
Document Page
8
10
11
17
14
Frequency
Not specified Low Medium High Critical
Figure 1Freuqnecy distribution for order priority
Interpretation
Chart is reflecting that there are 17 transactions that comes in category of low priority
and there are 11 transactions that comes in category of medium order priority followed by there
are 14 respondents that comes in category of critical. Apart from this there are 8 respondents that
comes in category of not specified. Hence, it can be said that most of transactions are in high
priority category.
Consumer Corporate Home office Small business
0
2
4
6
8
10
12
14
16
18
20
17 18
9
16
Frequency
Figure 2Frequeny distribution for sort of customers
Interpretation
2 | P a g e
Document Page
It can be observed that there are 17 transactions that fall in category of consumer and 18
comes in clas of corporate. Apart from this, there are 9 respondetnts that are in class of home
office followed by 16 transactions comes in small business category. Overall it can be said that
there is almost equal distributio of transactions in different sort of customers.
36
24
Frequency
E W
Figure 3Frequency distribution for region
Interpretation
Chart is reflecting most of transactions happened in west side of Australia and moderate
number of transactions which are 24 occurred in East side of Australia.
Regular Air
Delieveryb Truck
Express air
0 10 20 30 40 50 60
48
5
6
Frequency
Figure 4Shipping mode distribution
3 | P a g e

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Interpretation
It can be observed that majority of shipments happened through regular air and few of
items are delievered through Express air and Delievery truck. It can be said that majority of
shipment are happened through specific mode.
Interpretation
It can be seen from chart that mean value in case of shipping cost is 9.41 and standard
deviation is 1.67. This means that most of times sales value remain 9.41 and value is deviating at
moderate rate. Sales average value is 1352.11 and standard deviation is 2925.14. Thus, it can be
said that most of times sales of 1352.11 is observed in most of business days and standard
deviation value is 2925.14. Hence, it can be said that sales values are deviating at higher value
then mean value. On average basis two days are taken for shipment of products and standard
deviation value is 1.3. Thus, it can be said that there is significent importance of statistics for the
firms to make business decisions.
2 122 242 362 482 602 722 842 962 10821202132214421562168218021922
$0.00
$5,000.00
$10,000.00
$15,000.00
$20,000.00
$25,000.00
$30,000.00
$35,000.00
Sales
Chart is reflecting that moderately sales level breach the mean value level of sale that is made in
the single day. It can be said that performance is extemely well and good amount of sales is
made in most of transactions.
4 | P a g e
Document Page
2 122 242 362 482 602 722 842 962 10821202132214421562168218021922
$0.00
$20.00
$40.00
$60.00
$80.00
$100.00
$120.00
$140.00
$160.00
Shipping Cost
Figure 5Shiping cost chart
Like sales in case of shipping cost also same thing is observed and it is identified that shipping
cost remain in control asn less number of times average level is breached.
Task 3
Calculation of confidence interval
We are 95% confident that range for sales in respect to home office customers is -619.55
and upper level is 1306 (See Appendix Table 1 CI calculaiton). Population mean for variable is
1585.08 and mean for sample is 1306.00. Hence, it can be said that population mean is out of CI
level and sample mean is also less then population mean (Cramér, 2016). This happened because
samll sample size is taken in present researh study. On other hand, in case of shipping cost
population mean value is 12.45 and for same CI value is between 6.14 and 9.41. Hence, again
population mean is higher then CI range and sample mean comes in upper confidence interval.
Reason behind non inclusion of population mean in range is same.
TASK 4
(1)Independent sample t test
H0: There is no significent mean difference between order priority and shipping cost across
varied categories.
H1: There is significent mean difference between order priority and shipping cost across varied
categories.
5 | P a g e
Document Page
Interpretation
On analysis of facts it can be said that there is no significent mean difference between
order priority and shipping cost (See Appendix Table 2 calculation). Hence, with change in order
priority category which is critical or any other no big change comes in shipping cost.
(2) One way ANOVA
H0: There is no significent mean difference between sales across different regions.
H1: There is significent mean difference between sales across different regions.
Interpretation
ANOVA results clearly reflect that value of level of significene is 0.00<0.05 and this is
indicating that across different regions sales level is significently different (See ANOVA table in
appendix). On average basis sales is made on West regions and variance of only 0.24 is observed
which means that region is deviating at low rate. Average sales value is 1352.11 and variance is
8556450. Thus, sales value is deviating at high rate.
TASK 5
Coorelation and regression
H0: There is no significent mean difference between sales and order quantity.
H1: There is significent mean difference between sales and order quantity.
ANOVA
df SS MS F
Signific
ance F
Regressi
on 1
4372145
3.41
437214
53.41
5.4994
45652
0.02246
5626
Residual 58
4611090
75.1
795015
6.468
Total 59
5048305
28.5
Coeffici
ents
Standard
Error t Stat
P-
value
Lower
95%
Upper
95%
Lower
95.0%
Upper
95.0%
Intercept
-
441.203
8405
846.9265
599
-
0.52094
6988
0.6043
87025
-
2136.51
1543
1254.1
03862
-
2136.51
1543
1254.10
3862
Order
Quantity
63.9707
895
27.27861
104
2.34508
9689
0.0224
65626
9.36671
6831
118.57
48622
9.36671
6831
118.574
8622
6 | P a g e

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
0 10 20 30 40 50 60-$5,000.00
$0.00
$5,000.00
$10,000.00
$15,000.00
$20,000.00
Order Quantity Line Fit Plot
Sales
Predicted Sales
Order Quantity
Sales
Interpretation
Regression is the one of the important method that is used to identify relationship
between variables. In present case there is single dependent variable and independet variables. It
can be observed that dependent variable is sales and independent variable is order quantity. It can
be observed from the table that value of multiple R is 0.29 and this reflect that there is low
coorelation between both variables (See Regression table in Appendix). Hence, with change in
number of units any big change will not comes in sales value. Value of level of significence is
0.02<0.05 and this means that there is no significent mean difference between independent and
dependent variables. Beta value for order quantity is 63.97 which means that with single change
in number of units sold 63.97 points change comes in sales value. Hence, managers need to give
more focus on the units sold in premises or in busuines on daily basis (Gravetter and Wallnau,
2016). This is because if sold units will not be higher then in that case it is expected that good
amount of plunge will not be observed on sales. It is possible that majority of products that are
included in dataset are of low value and due to this reason on sale of less units very low amount
of sales may be observed in the business.Thus, firms need to make large amount of sales in
business in order to earn sufficient amount of profit in the business.
CONCLUSION
On basis of above discussion it is concluded that there is significent importance of
statistical tools for the business firms. By using descriptive statistics tools and regression analysis
like hypothesis tests detailed analysis of variables can be done and in this way research is carried
out in proper manner. It is also concluded that with small change in units big variation does not
comes in sales value. Most of sales is made through West side of Australia and products are most
of times delievered through Regular air vehicle. Hence, it can be said that in order to make
7 | P a g e
Document Page
higher sales large number of units need to be sold in the market. Major limitation of analysis is
that extensive or more deep analysis of research study is not done as by using techniques related
to visualization in depth lots of facts can be identified. Small sample size was taken in the
present research study and it is also major limitation and in future large sample size need to be
taken to avoid such kind of problems.
8 | P a g e
Document Page
REFERENCE
Books and Journals
Cramér, H., 2016. Mathematical methods of statistics. Princeton university press.
Gravetter, F.J. and Wallnau, L.B., 2016. Statistics for the behavioral sciences. Cengage
Learning.
9 | P a g e

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
APPENDIX
Raw data
Orde
r ID
Order
Priority
Order
Quantity Sales
Ship
Mode
Shippin
g Cost
Regio
n
Customer
Segment
Days to
ship
5165
0 3 43
$10,854.8
3 2 $28.06 E Consumer 2
4153
9 3 41 $2,303.71 1 $8.99 W Consumer 1
3536
0 4 4 $41.06 1 $6.28 E Corporate 2
5065
7 3 16 $60.02 3 $1.93 E Corporate 2
4637
2 3 7 $375.57 1 $16.11 E Consumer 0
3564
9 2 25 $174.03 1 $5.74 W Consumer 2
3667
9 4 28 $1,491.83 1 $8.99 E Corporate 3
4745
9 0 10 $197.21 1 $1.49 E Home Office 1
1651
9 2 49 $138.96 1 $0.70 E
Small
Business 2
3056
6 1 34 $225.45 1 $9.54 W
Small
Business 2
1034
2 2 38 $1,679.04 1 $14.45 E Consumer 2
3555
5 2 15 $71.61 1 $4.93 E Corporate 2
3878
4 2 50 $298.12 1 $0.50 W Home Office 2
1667
4 1 49 $1,959.43 1 $4.62 E
Small
Business 7
1888
4 0 5 $131.27 2 $14.36 E Home Office 1
4851
2 0 47 $148.26 3 $0.81 W Consumer 0
1843
2 3 12 $76.42 3 $0.83 E Corporate 2
3814
6 1 45 $341.39 1 $6.16 W
Small
Business 4
5292
9 3 19 $53.94 1 $1.25 W
Small
Business 1
4096
4 4 40 $301.12 1 $8.37 E Corporate 3
10 | P a g e
Document Page
3219
8 2 31 $240.63 1 $5.53 E Consumer 0
6433 2 41 $159.26 1 $0.50 E Home Office 1
3062
6 4 23 $220.82 3 $1.09 E Corporate 1
4013
4 1 1 $9.75 1 $4.72 W
Small
Business 5
1520
5 4 42 $1,753.51 1 $9.20 W Corporate 1
3466
0 0 36 $3,832.37 1 $5.81 W Home Office 2
3155
6 4 35 $269.35 1 $3.68 W Consumer 1
1232
3 0 32 $513.74 1 $10.13 E Corporate 1
2048
6 3 42 $163.98 1 $0.70 E Corporate 1
4524
8 2 38 $154.85 1 $5.13 E Corporate 2
326 3 17 $100.95 1 $3.37 E Consumer 1
5779
9 3 25
$12,343.0
7 2 $69.30 W Consumer 2
5347 2 39 $276.03 1 $5.03 W Corporate 3
2931
8 0 21 $2,954.14 3 $35.00 E Corporate 1
4208
1 1 36 $188.34 1 $5.74 E
Small
Business 5
1680
5 1 26 $172.33 1 $5.74 W Corporate 2
5488
2 4 17 $86.29 1 $0.49 E Consumer 3
1021
0 0 10 $767.34 1 $14.52 W Corporate 1
5421
5 3 14 $321.21 1 $1.99 W
Small
Business 2
1229
3 3 31 $283.17 1 $1.39 E Home Office 1
5120
0 2 29 $1,617.88 1 $5.50 W
Small
Business 3
2790
4 3 8 $460.59 1 $8.99 W Consumer 1
3258
0 3 38 $261.38 1 $8.74 E Corporate 2
5344
3 4 35 $1,477.63 1 $4.62 W Consumer 1
11 | P a g e
Document Page
5100
8 3 20 $269.66 1 $4.59 E Consumer 2
5735
0 3 33 $140.63 1 $5.68 W
Small
Business 1
5657
7 4 30 $7,497.08 2 $54.12 E
Small
Business 1
5085
4 4 27 $137.50 1 $4.95 E Corporate 0
1868
8 0 49
$14,346.7
3 1 $19.99 W Home Office 2
4064
3 4 23 $258.74 1 $5.72 E Home Office 2
2355
8 1 17 $579.71 2 $45.51 E Home Office 5
4947
9 2 46 $5,615.40 2 $26.30 E Consumer 2
3789
1 3 24 $163.36 1 $6.60 E
Small
Business 2
6432 4 30 $311.08 1 $4.50 E Corporate 2
4278
8 4 26 $241.92 1 $2.27 W
Small
Business 2
4208
1 1 3 $32.49 1 $1.39 E
Small
Business 2
2214
9 1 17 $48.24 1 $1.01 E
Small
Business 5
1562
2 3 21 $135.99 1 $9.68 W Consumer 1
1542
8 4 50 $1,444.96 3 $4.00 W
Small
Business 1
1181
5 1 22 $351.27 1 $7.42 E Consumer 2
Summary statistics
Order
Priority
Order
Quantity Sales Ship Mode
Mean
2.31666
6667 Mean
28.0333
3333 Mean
1352.1
10625 Mean 1.3
Standard
Error
0.17546
0116
Standard
Error
1.73725
6534
Standard
Error
377.63
40741
Standard
Error
0.0833
61577
Median 3 Median 28.5 Median
265.36
5 Median 1
Mode 3 Mode 17 Mode #N/A Mode 1
Standard 1.35910 Standard 13.4567 Standard 2925.1 Standard 0.6457
12 | P a g e

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Deviation 8215 Deviation 3125 Deviation 4096 Deviation 16
Sample
Variance
1.84717
5141
Sample
Variance
181.083
6158
Sample
Variance
855644
9.636
Sample
Variance
0.4169
49153
Kurtosis
-
1.08712
0758 Kurtosis
-
0.88860
5217 Kurtosis
10.461
50403 Kurtosis
2.4475
28388
Skewness
-
0.35227
3056 Skewness
-
0.15631
484 Skewness
3.2518
37699 Skewness
1.9693
74856
Range 4 Range 49 Range
14336.
98 Range 2
Minimum 0 Minimum 1 Minimum 9.75 Minimum 1
Maximum 4 Maximum 50 Maximum
14346.
73 Maximum 3
Sum 139 Sum 1682 Sum
81126.
6375 Sum 78
Count 60 Count 60 Count 60 Count 60
Shipping Cost
Customer
Segment Days to ship
Mean 9.4125 Mean 2.4 Mean
1.93333333
3
Standard Error
1.67606934
6 Standard Error 0.15070456 Standard Error
0.17282411
1
Median 5.605 Median 2 Median 2
Mode 8.99 Mode 2 Mode 2
Standard
Deviation
12.9827773
3
Standard
Deviation
1.16735250
2
Standard
Deviation
1.33868980
6
Sample
Variance
168.552507
2
Sample
Variance
1.36271186
4
Sample
Variance
1.79209039
5
Kurtosis
9.58124599
6 Kurtosis
-
1.42291663
6 Kurtosis
3.31805336
7
Skewness
2.96440797
9 Skewness
0.21691534
9 Skewness
1.57254065
9
Range 68.81 Range 3 Range 7
Minimum 0.49 Minimum 1 Minimum 0
Maximum 69.3 Maximum 4 Maximum 7
Sum 564.75 Sum 144 Sum 116
Count 60 Count 60 Count 60
Region
13 | P a g e
Document Page
Mean 1.4
Standard Error 0.06377928
Median 1
Mode 1
Standard
Deviation
0.49403218
2
Sample
Variance
0.24406779
7
Kurtosis
-
1.88873764
9
Skewness
0.41879164
8
Range 1
Minimum 1
Maximum 2
Sum 84
Count 60
Table 1CI calculation
Sales Shipping Cost
Mean $1,306.00 $9.41
Standard deviation 2947.27188 12.98277733
Z value 1.96 1.96
Square root 3 7.745966692
STDEV/SR 982.4239599 1.676069346
Z*STDEV/SR 1925.550961 3.285095919
Mean-(Z*STDEV/SR) -$619.55 $6.13
Mean+(Z*STDEV/
SR) $1,306.00 $9.41
Population mean 1585.082118 12.4500999
Table 2T Test
t-Test: Two-Sample Assuming Equal Variances
Order
Priority
Shipping
Cost
Mean 2.316667 9.4125
14 | P a g e
Document Page
Variance 1.847175 168.5525
Observations 60 60
Pooled Variance 85.19984
Hypothesized Mean
Difference 0
df 118
t Stat -4.21061
P(T<=t) one-tail 2.5E-05
t Critical one-tail 1.65787
P(T<=t) two-tail 5E-05
t Critical two-tail 1.980272
ANOVA output
Anova: Single Factor
SUMMARY
Groups Count Sum
Averag
e
Varian
ce
Region 60 84 1.4
0.2440
68
Sales 60
81126.
64
1352.1
11
85564
50
ANOVA
Source of Variation SS df MS F
P-
value F crit
Between Groups
547325
76 1
547325
76
12.793
29
0.0005
06
3.9214
78
Within Groups
5.05E+
08 118
427822
5
Total
5.6E+0
8 119
Regression analysis
SUMMARY
OUTPUT
Regression Statistics
Multiple R
0.2942
89312
R Square
0.0866
06199
15 | P a g e

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Adjusted R
Square
0.0708
5803
Standard
Error
2819.6
02183
Observations 60
ANOVA
df SS MS F
Signific
ance F
Regression 1
4372145
3.41
437214
53.41
5.4994
45652
0.02246
5626
Residual 58
4611090
75.1
795015
6.468
Total 59
5048305
28.5
Coeffic
ients
Standar
d Error t Stat
P-
value
Lower
95%
Upper
95%
Lower
95.0%
Upper
95.0%
Intercept
-
441.20
38405
846.926
5599
-
0.5209
46988
0.6043
87025
-
2136.51
1543
1254.1
03862
-
2136.5
11543
1254.1
03862
Order
Quantity
63.970
7895
27.2786
1104
2.3450
89689
0.0224
65626
9.36671
6831
118.57
48622
9.3667
16831
118.57
48622
16 | P a g e
1 out of 20
[object Object]

Your All-in-One AI-Powered Toolkit for Academic Success.

Available 24*7 on WhatsApp / Email

[object Object]