MIS771: Descriptive Analytics and Visualisation Report

Verified

Added on  2023/01/10

|25
|3795
|1
Report
AI Summary
This report provides a comprehensive analysis of Mad Dog Craft Beer's data using descriptive analytics and data visualization techniques. The study explores the factors influencing order quantity and customer recommendations for the brewery's pale ale beer. It involves descriptive statistics, factor analysis, and regression modeling to identify key variables such as loyalty, customer type, region, and distribution channel. The report develops several models to estimate order quantity and the likelihood of recommendation, including logistic regression analysis. Furthermore, the report includes a forecasting model for production planning. The findings offer valuable insights for Mad Dog Craft Beer to maintain its competitive advantage in a dynamic market. The report also includes various tables, figures, and appendices to support the analysis and findings.
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
Contents
INTRODUCTION.......................................................................................................................................3
1. DEPENDENT VARIABLES DESCRIPTION....................................................................................4
1. MODELLING QUANTITY ORDERED............................................................................................6
2.1 FACTOR IDENTIFICATION...........................................................................................................6
2.2 MODEL BUILDING.........................................................................................................................6
2.3 INTERACTION EFFECT.................................................................................................................8
2. MODELLING LIKELIHOOD OF RECOMMENDATION................................................................9
3.1 MODEL.............................................................................................................................................9
3.2 PREDICTED PROBABILITIES.....................................................................................................11
3.3 PREDICTED PROBABILITIES VISUALIZATIONS....................................................................11
3. FORECASTING PRODUCTION.....................................................................................................11
CONCLUSION.........................................................................................................................................13
REFERENCES..........................................................................................................................................14
APPENDICES...........................................................................................................................................16
APPENDIX 1........................................................................................................................................16
APPENDIX 2........................................................................................................................................17
APPENDIX 3........................................................................................................................................22
APPENDIX 4........................................................................................................................................24
2
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
INTRODUCTION
The dynamic nature of the modern business environment has made it necessary for businesses to
become dynamic as well (Kiechel, 2010; Sirajuddin, et al., 2017). Research, innovations and
technology have played a major role in making the modern business environment dynamic (Aalst
& Wil, 2011). Thus, for businesses to become dynamic, it is necessary to for them to invest in
research, innovations and technology. Among these three factors, research acts as a foundation
for the other two factors.
Through research, innovators get access to previously done work in the same field thereby
enabling them to develop better innovations (Albright & Winston, 2014). Also through research,
the optimal technology can be identified for a business process (Laudon & Guercio, 2014).
Optimal in this case implying a technology that is efficient will still being up to date. Hence
research can be considered as vital for acquiring and maintaining competitive advantage in the
modern dynamic business environment.
Mad Dog Craft Beer is an Australian based brewery company. This micro-brewery has managed
to establish a competitive advantage with fast growth (both in terms of sales and production) in
its customer bases in Regional Victoria and Melbourne. The company is interested in
maintaining its competitive advantage especially with the increase in competition from new
micro-brewery within its customer bases.
This research paper aims at conducting a research on the Order Quantity and Recommendations
of Mad Dog Craft Beer’s pale ale beer. Inferences can be drawn and actionable
recommendations determined from the analysis on the two variables. By using the inferences and
recommendations, Mad Dog Craft Beer can establish policies to enable it maintain its
3
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
competitive advantage despite the rise in competition from new market entries. This paper will
also aim at developing a model for the forecasting of the production of Mad Dog Craft Beer’s
pale ale beer line. This forecasting model will enable the company to better plan their finances as
well as estimate the expected future profits.
The study paper will look at the descriptive statistics of the Order Quantity and Recommendation
variables, develop models for estimation of these two variables and develop a forecasting model
for the production of pale ale beer line.
1. DEPENDENT VARIABLES DESCRIPTION
The table, Table 1: Dependent Variable Summary Description, below gives the summary
description of the two dependent variables in this research study:
Table 1: Dependent Variable Summary Description
Variable Variable
Description
Nature of
Variable
Variable Type Measurement
Scale
Order_Qty Informs on the
number of pale
ale beers
bottles ordered
by a customer.
Numerical
Variable.
Dependent
Variable.
Ratio Scale.
4
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
Recommendation Informs on
whether a
customer
would
recommend the
pale ale beer to
others.
Categorical
Variable with
two levels; 0
represents No
and 1
represents Yes.
Dependent
Variable.
Nominal Scale.
From Table 3: Descriptive Statistics of Order_Qty and Recommend, we observe that
customers that purchase the pale ale beer are slightly more likely to recommend it to other
people than they are likely not to. 101 customers replied they would recommend the pale ale
beer to other people while 99 said they would not recommend it. The average order quantity
of the pale ale beer was 1.665 with a standard deviation of 0.8932 and mode of 7.2. The
lower quartile (Q1), median, Upper Quartile and Interquartile Range values for Order_Qty
were 7.1, 7.6, 8.2 and 1.1 respectively. The range for the Order_Qty data is 5.6 units. From
the histogram of the Order_Qty, Figure 1: Histogram of Order_Qty, we observe that the data
distribution can be described as skewed to the right (negatively skewed). This is also
confirmed from the value of Skewness = -0.2064 with a kurtosis of 0.5840.
5
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
1. MODELLING QUANTITY ORDERED
2.1 FACTOR IDENTIFICATION
In order to identify the possible factors that affect the quantity ordered variable, Order_Qty,
the Factor Analysis Method was applied. Factor Analysis is a multivariate analysis method
that is used in dimension reduction to obtain a reasonably acceptable number of variables for
analysis (Freedman, 2009; Vicenc, 2017). The Principal Analysis method reduces the size of
a set of independent variables to a small manageable size for the analysis of the impact on the
dependent variable (Han & Jaiwei, 2011; O'Neil & Schutt, 2013).
From Table 4: Eigen Values, we observe that the following factors had an Eigen value
greater than 1; Loyalty (3.810), Cust_Type (2.764), Region (1.722) and Dist_Channel
(1.081). Therefore, we consider the Loyalty, Cust_Type, Region and Dist_Channel as the
possible factors affecting the quantity ordered of the pale ale beer line of the Mad Dog Craft
company. This can also be seen from Figure 2: Scree Plot, where there is a point of inflection
at Eigen value 5, thus we consider factor 1 to 4.
2.2 MODEL BUILDING
The purpose of this model is to estimate the quantity ordered for the pale ale beer line of the
Mad Dog Craft Beer company. The model will consider the possible factors that affect the
Order_Qty variable in modelling the estimation of the quantity ordered for the pale ale beer
line.
The factors that are considered are the Loyalty, Cust_Type, Region and Dist_Channel. These
factors will form the independent variables while the Order_Qty forms the dependent
6
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
variable. The sample is composed of 200 observations of customers on the 5 variables of
interest for the model.
Regression analysis is used in the generation of the model for the estimation of the quantity
ordered for the pale ale beer line. Regression analysis is a statistical analysis technique that
estimates the value of one variable(s), known as the dependent variable(s), using another
variable(s), known as independent variable(s) (Jaulin, 2010). This study specifically applies
the multiple regression analysis technique for estimation. The multiple regression analysis is
a type of regression analysis whereby there are multiple independent variables estimating a
single dependent variable (Jorge, et al., 2013; Witten, 2011).
The first model contained all four independent variables with the summary output given in
Table 5: Model 1. The model had an AIC = -98.88 with the model equation given as below:
Orde rQty =6.866+ 0.0734 Loyalty+0.0182Cus tType 0.1214 Region+ 0.5499 Dis t Channel
The second model contained the following independent variables; Loyalty, Cust_Type and
Region. The summary output is given in Table 6: Model 2. The model had an AIC = -79.90
with the model equation given as below:
Orde rQty =7.120+0.0890 Loyalty0.0570 Cus tType0.1172 Region
The third model contained the following independent variables; Loyalty and Cust_Type with
the summary output given in Table 7: Model 3. The model had an AIC = -76.21 with the
model equation as given below:
Orde rQty =6.866+ 0.0867 Loyalty0.0496 CustType
The fourth model had the following independent variables; Cust_Type, Region and
Dist_Type with the summary output given in Table 8: Model 4. The model had an AIC = -
73.12 with the model equation as given below:
7
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
Orde rQty =7.355+0.0120 Cus tType0.0353 Region+0.7076 Dis tChannel
The fifth model had the following independent variables: Region and Dist_Type with the
summary output given in Table 9: Model 5. The model had an AIC = -75.11 with model
equation as given below:
Orde rQty =7.3620.0362 Region+ 0.7057 Dis tChannel
The sixth model had the following independent variables; Loyalty and Dist_Channel with the
summary output given in Table 10: Model 6. The model had an AIC = -101.7 with the model
equation as given below:
Orde rQty =6.805+ 0.0714 Loyalty+0.5822 Dis tChannel
The sixth model had the lowest AIC = -101.7, this implies that the sixth model was the best
fit model of the six iterations of the estimation model for the quantity ordered of the pale ale
beer line. The test for this model is given in Table 11: Model 6 Prediction. From this table we
observe that the model has a reasonably efficient prediction ability.
2.3 INTERACTION EFFECT
This model contains the following independent variables; Quality and Brand_Image with the
summary output given in Table 12: Model 7. The model had an AIC = 123.5 with the model
equation as given below:
Orde rQty =3.588+ 0.3094 Quality+0.3116 Bran dImage
From the model equation above, a unit change in the quality of the pale ale beer results in a
0.3094 change in the quantity perception of pale ale beer ordered. Also, a unit change in
8
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
brand image perception of the pale ale beer results in a 0.3116 change in the quantity of pale
ale beer ordered. Considering an α = 0.05 level of significance, the p-values of both the
Quality and Brand_Image, 2.61e-14 and 1.54e-10 respectively, are lower than α = 0.05 level
of significance. Therefore, both the independent variables are significant predictors in the
estimation of the Order_Qty variable.
The value of the Adjusted R Squared = 0.3340, this implies that the model explains only
33.40% of the relationship between the variable in the model. Hence, despite having a very
low AIC = 123.5, the low value of the Adjusted R Squared indicates that the model is a poor
fit.
The regression analysis indicate that the quality perception and brand image perceptions are
significant predictors of the quantity of the order of pale ale beer. Since quality perception is
determined by the brand image perception, investing in the brand image for the pale ale beer
line would significantly improve the quantity of the orders of the pale ale beer.
2. MODELLING LIKELIHOOD OF RECOMMENDATION
3.1 MODEL
This modelling process utilizes logistic regression analysis to develop a predictive model for
the estimation of the likelihood that a customer will recommend pale ale beer to other people.
The logistic regression is a regression analysis technique that estimates a dichotomous
categorical response (dependent) variable using independent variables that are both
numerical and categorical (Barbara & Susan, 2014; Hosmer, 2013).
9
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
The model has Recommendation as the dependent variable and the independent variables are;
Distribution Channel, Quality, Brand Image and Shipping Speed. The summary model output
is given in Table 13: Model 8 (Coefficients). The resultant model equation is as given below:
Recommendation=13.28+ 0.9684 Dis tChannel +0.6541 Quality+0.6214 Bran dImage + 1.159 Shippin gSpeed
From the model we observe that a change in the distribution channel results in a 0.9684
change in the likelihood of recommendation, a unit change in the quality perception results in
a 0.6541 change in the likelihood of recommendation, a unit change in the brand image
perception results in a 0.6214 change in the likelihood of recommendation and a unit change
in the shipping speed results in a 1.159 change in the likelihood of recommendation.
The p-values of the variables are 0.0102, 2.1e-05, 0.0015 and 5.7e-05 for Distribution
Channel, Quality, Brand Image and Shipping Speed respectively. Considering an α = 0.05
level of significance, then all the p-values are lower than α = 0.05 level of significance and
hence all the variables are significant predictors of the likelihood of a customer
recommending the pale ale beer to other people.
From the chi-square test in Table 14: Model 8 (Tests), the p-value = 7.35e-18, which is less
than α = 0.05 level of significance. Thus, we conclude that the model is fit and significant for
the estimation of the likelihood of a customer recommending a pale ale beer to other people.
3.2 PREDICTED PROBABILITIES
Table 15:Predicted Probabilities for Model 8 considers a case for 20 customers that
purchased the pale ale beer from Mad Dog Craft Beer company. From this table we observe
that as the quality perception and the brand image perception increase, the likelihood of a
10
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
customer recommending the pale ale beer to other people equally increases. This is true for
both the customers who purchase pale ale beer from sales representative and directly from
the Mad Dog Craft Beer company. Also in both cases, customers that consider the brand
image as poor or as neutral, are less likely to recommend pale ale beer to other people.
3.3 PREDICTED PROBABILITIES VISUALIZATIONS
The plots Figure 3: Predicted Probability of Recommendation for Customers Purchasing
from Sales Representative and Figure 4: Predicted Probability of Recommendation for
Customers that Purchase Directly respectively show the graphs of the predicted probabilities
of recommendation for customers that purchase from sales representatives and customers that
purchase directly from Mad Dog Craft Beer company. The two plots generally have the same
trend and indicate that as the quality perception and the brand image perception increase, the
likelihood of a customer recommending the pale ale beer to other people equally increases.
3. FORECASTING PRODUCTION
The SARIMA model was applied in the forecasting of the production of pale ale beer. The
SARIMA is a special case of an ARIMA model that is applied in instances when there is an
aspect on seasonality in the time series data (Galit, et al., 2018; Shaffer, 2011). The data on
the production of pale ale beer is recorded quarterly, hence introducing an aspect of
seasonality in the time series data for the production. Thus, the SARIMA model is
appropriate for the forecasting of the production of the pale ale beer since it caters for the
seasonality present in the data.
11
Document Page
DESCRIPTIVE ANALYTICS AND VISUALISATION
The summary output of the model is given in Table 16: SARIMA Model. From the summary
output, we observe that the model for forecasting the production of pale ale beer can be given
as follows:
Productio nt =1.703+0.8779 Productio nt10.0452 Productio ns1 + e1.342 et10.6243 es 1
From the model above, we observe that future production has a 0.8779 per unit dependence
on the production of the previous quarter and a -0.0452 per unit dependence on the
production of the previous season (year). The error due to the previous quarter accounts for -
1.342 per unit while the error due to the previous season (year) accounts for -0.6243 per unit.
The SARIMA model forecast for the turnover in production of the pale ale beer line for the
next four quarters as given in Table 2: Forecast Turnover in Production of Pale Ale Beer Line
below:
Table 2: Forecast Turnover in Production of Pale Ale Beer Line
Year Quarter Forecasted Turnover in
Production
2019 Q2 1728.22
Q3 2039.05
Q4 1608.08
2020 Q1 1722.63
CONCLUSION
From the analysis done and the inferences drawn in this research study we can conclude that the
main factors that significantly affect the quantity of order of the pale ale beer line are Loyalty,
Distribution Channel, Quality and Brand Image. However, the two models that have these
12
chevron_up_icon
1 out of 25
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]