This case study analyzes the profit, number of customers, shipping type, customer type, region and category for International Online Book Store. It includes descriptive statistics, two sample t test, one way ANOVA, correlation analysis and regression analysis. Recommendations and implementation plan are also provided.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Data Analytics: A Business Case Study Executive Summary For this case study, we have developed the data. We have data regarding the sale of books (1320 books) in the month. We assume that this data is from company International Online Book Store. We considered the following attributes Book Name, Book Price (in $), Profit (in $), Book Sale Price (in $), Number of customers who bought this books, Shipping Type (Free or Paid), Customer Type (New or Existing), Region (WA and SA) and Book Category (Comics & Graphic Novels, Mystery, Thriller & Suspense, Romance and Literature & Fiction). We presented the total monthly sale amount (in $) and total monthly profit (in $) for shipping type, customer type, region and category.We have presented the descriptive statistics for number of customers forshipping type, customer type, region, and category. We used two sample t test and one way ANOVA for testing the difference between mean number of customers who bought the books for shipping type, customer type, region and category. We studied the correlation between product price, profit, sale price, number of pages and number of customers. We try to fit the regression model for total sale amount. We have also provided recommendations and plan for company. 1
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Table of Contents Sr. No.TopicPage No. 1List of Abbreviations and assumptions made3 2Introduction – What is the problem?4 3Research Methodology5 4Analytical Findings6 5Recommendations to the company11 6 An implementation plan based on the recommendations you have provided 12 7Conclusion12 8List of References14 9Appendix16 2
List of Abbreviations and assumptions made Max: Maximum Min: Minimum SA: South Australia WA: Western Australia 3
Introduction – What is the problem? By the definition of book, “A book is a series of pages assembled for easy portability and reading, as well as the composition contained in it”. In everyone life book plays the important role. Today we can get the bool easily from online store. Book section in every online store is main part of business. Today we purchase the most items online. We can get desired product at home within stipulated time. eCommerce becoming very raising and popular business in every corner of the world. Online shopping is most popular from the all eCommerce business. eCommerce business increasing exponentially in recent decade bring new challenges to the service provider. Businesscompetitionand customersatisfactionarethe most importantfactorsin the eCommerce business. About Data: For this case study, we have developed the data. We have data regarding the sale of books (1320 books) in the month. We assume that this data is from company International Online Book Store. We considered the following attributes i)Book Name ii)Book Price (in $) iii)Profit (in $) iv)Book Sale Price (in $) v)Number of customers who bought this products vi)Shipping Type (Free or Paid) vii)Customer Type (New or Existing) 4
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
viii)Region (WA and SA) ix)Book Category (Comics & Graphic Novels, Mystery, Thriller & Suspense, Romance and Literature & Fiction) We defined following variables for the study objectives as Total Monthly sale amount (in $) = Book Sale Price (in $) × Number of customers Total monthly profit (in $) = Profit (in $) × Number of customers Project Problem: We concentrate on the following i)Profit analysis by shipping type, customer type, region and category. ii)Whether the mean number of customers are significantly different for different shipping type, customer type, region and category. iii)Correlation analysis of variables iv)Regression analysis for total monthly sales Research Methodology Statistical tool and techniques are important aspect of data analysis. In literature there are many statistical tools and techniques are available. But use of proper tools and technique is important part of analysis. For the profit analysis, we presented the total monthly sale amount (in $) and total monthly profit (in $) forshipping type, customer type, region and category.We have presented the descriptive statistics for number of customers forshipping type, customer type, region, and category. We used two sample t test and one way ANOVA for testing the difference between mean number of customers who bought the books for shipping type, 5
customer type, region and category. We studied the correlation between product price, profit, sale price, number of pages and number of customers. We try to fit the regression model for total sale amount. We used Python 3.6.5 and MS-Excel for the data analysis. The sample code are given in appendixes.We used Grus (2015), McKinney (2012), Pedregosa et al. (2011) and Schutt and O'Neil (2013). Analytical Findings In this section, we carried the following Profit Analysis Description Statistics Two sample t test One way ANOVA Correlation Analysis Regression Analysis Profit Analysis: For the profit analysis we have given the total monthly sale amount (in $), total monthly profit (in $)and profit percentage for shipping type, customer type, region and category. We referred Berenson et al. (2012), Black (2009), Groebner et al. (2008), Kvanli et al. (2000) and Mendenhall and Sincich (1993). Profit analysis is represented in Table 1. 6
Table 1: Profit analysis according to for shipping type, customer type, region and category AttributesLevel Total Monthly Sale (in $) Total Monthly Profit (in $) Profit Percentage Shipping TypeFREE8687084889.77% PAID141014138159.80% Customer TypeExisting4239941599.81% New185485181449.78% RegionSA123220121209.84% WA104664101839.73% Category Comics & Graphic Novels32336404912.52% Literature & Fiction9709796409.93% Mystery, Thriller & Suspense20941261112.47% Romance7751060037.74% Total227884223039.79% From Table 1 we can claim that International Online Book Store earns on average 9.79% profit on each laptop. The profit percentage is not significantly different for shipping type, customer type and region. The profit percentage for Comics& Graphic Novels and Mystery, Thriller & Suspense books is more whereas profit percentage on Romance book is less. Descriptive statistics for number of customer: Total monthly sale and profit are proportional to the number of customers. So here we displays the summary statistics for the number of customers who bought the books for shipping type, customer type, region and category. We referred the well-known books for this section such as Bickeland Doksum (2015), Casella and Berger (2002), DeGroot and Schervish (2012), Hodges Jr and Lehmann (2005), Papoulis (1990), Pillers (2002) and Ross (2014). Table 2 displays the size, mean, standard deviation, minimum and maximum of number of customers who bought books. 7
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Table 2: Summary statistics for numbers of customer who bought the books for shipping type, customer type, region and category AttributesLevelSizeMeanStandard DeviationMinMax Shipping Type FREE4115.8252.461114 PAID9094.2872.154112 Customer Type Existing2434.7982.434111 New10774.7592.348114 Region SA7924.2842.181112 WA5285.4892.442114 Category Comics & Graphic Novels2584.4422.238113 Literature & Fiction5924.5962.249113 Mystery, Thriller & Suspense1644.4452.117112 Romance3065.5392.633114 Total13204.7662.363114 From Table2, we observed following i)International Online Book Store get averagely 4.766 customers for each book. ii)Mean number of customers who bought the books at free shipping is more than mean number of customers who bought books by paid shipping. iii)Mean number of new customers is less than mean number of existing customers. iv)Mean number of customers from SA region is less than mean number of customers from WA region. v)Mean number of customers demanding Romance books is more than mean number of customers demanding other category books. Two Sample t-test: Here we are interested to know whether there is significant difference between the mean of number of customers who bought the books for shipping type, customer type and 8
region. Our null hypothesis is that there is no significant difference between mean of number of customers for levels of attributes and alternative hypothesis is that there is significant difference between mean of number of customers for levels of attributes.We used two sample independent test assuming unequal variances. Table 3 displays the value of test statistic and p-value of two sample independent test assuming unequal variances. Table 3: Two sample independent test for shipping type, customer type and region AttributesLevelsTest Statisticp-value Shipping TypeFree and Paid10.920.000 Customer Type New and Existing0.230.817 RegionWA and SA-9.160.000 From Table 3 we observed the following i)There is significant difference in mean number of customers who bought the books at free shipping and who bought at paid shipping. ii)There is no significant difference in mean number of new customers and existing customers who bought the books. iii)There is significant difference in mean number of customers from WA and SA region. One way ANOVA: By using one way ANOVA, we test whether the mean number of customer who bought the books for different category is significantly different or not. Here we test following null and alternative hypothesis as Null Hypothesis: There is no significant difference between the mean of number of customer who bought the books of different category. 9
Alternative Hypothesis: There is significant difference between the mean of number of customer who bought the books of different category. Table 4 shows the value of F statistic and p-value for one way ANOVA. Table 4: Output of one way ANOVA for Category Attribute sLevelF Statistic P Value CategoryComics & Graphic Novels, Mystery, Thriller & Suspense, Romance and Literature & Fiction15.030.000 We observed that there is significant difference between the mean of number of customer who bought the books of different category. Mean number of customers who bought the romance category book is significantly more than other category. Correlation Analysis: Here we studied the correlation between product price, Sale price, profit and number of customers. Table 5 represents the correlation coefficient between product price, Sale price, profit and number of customers. Table 5: Pearson’s correlation coefficient for Product Price, Sale Price, Profit and Numbers of customers Product PriceSale PriceProfitNumbers of customer Product Price10.9900.0260.118 Sale Price0.99010.1660.117 Profit0.0260.16610.006 Numbers of customer0.1180.1170.0061 From Table 5, we observed that i)Product price is positive correlated with sale price, profit and number of customers. 10
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
ii)Sale price is positively related with profit and number of customers. iii)Profit is positively correlated with number of customers but correlation is very low. Regression analysis: We used simple linear regression model for predicting the monthly sale using number of customers who bought the books as predictor variable. Table 6 represents the F Statistics, P value, R2and regression coefficients of simple linear regression. Table 7: Output of Regression Analysis F Statistic 5981.55 P Value0.000 R20.819 Intercept-12.264 Slope38.797 We observed that P Value =0.000 suggests that there is significant relationship between total monthly sale and number of customers who bought the books. We also observed R2as 0.819 suggests that model fitting is very good and adequate. We fitted the following straight line as Total sale (in $) = -12.264 + 38.797 × Number of Customers Recommendations to the company From the data analysis, we observed that i)Mean number of customers who bought the books at free shipping is more than paid shipping suggest that we should give free delivery to most of the books so that number of customers increases. 11
ii)Mean number of customers in SA region is less than WA region suggesting that we should use some marketing strategies in SA region. iii)Mean number of customers for Romance books is more than other category books suggest that every new Romance book must be available in our store and it should be informed to all customers. An implementation plan based on the recommendations you have provided i)We should increase strength of shipping staff so that free shipping will be possible for most of the books. ii)We should adopt the marketing strategies in WA region for SA region so that number of customers increases. We can use more advertising board in SA region. iii)We can implement some offer or facility for SA region. iv)We must target the other region of Australia for selling the books. v)As the demand for Romance book is more we should advertise each Romance book. Conclusions From Profit analysis, claim that International Online Book Store earns on average 9.79% profit on each laptop. The profit percentage is not significantly different for shipping type, customer type and region. The profit percentage for Comics& Graphic Novels and Mystery, Thriller & Suspense books is more whereas profit percentage on Romance book is less.We observed that International Online Book Store get averagely 4.766 customers for each book. Mean number of customers who bought the books at free shipping is more than mean number of customers who bought books by paid shipping. Mean number of new customers is less than mean number of existing customers. Mean number of customers from SA region is less than mean number of customers from WA region. Mean number of 12
customers demanding Romance books is more than mean number of customers demanding other category books. Weobservedthefollowingthereissignificantdifferenceinmeannumberof customers who bought the books at free shipping and who bought at paid shipping. We observed that there is no significant difference in mean number of new customers and existing customers who bought the books. We claimed that there is significant difference in mean number of customers from WA and SA region. We observed that there is significant difference between the mean of number of customer who bought the books of different category. Mean number of customers who bought the romance category book is significantly more than other category. Profit is positively correlated with number of customers but correlation is very low. Regression analysis suggests that there is significant relationship between total monthly sale and number of customers who bought the books. We have also provided recommendations and plan for company. 13
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
List of References Berenson, M., Levine, D., Szabat, K.A. and Krehbiel, T.C., 2012.Basic business statistics: Concepts and applications. Pearson higher education AU. Bickel, P.J. and Doksum, K.A., 2015.Mathematical statistics: basic ideas and selected topics,volume I (Vol. 117). CRC Press. Black, K., 2009.Business statistics: Contemporary decision making. John Wiley & Sons. Casella, G. and Berger, R.L., 2002.Statistical inference (Vol. 2). Pacific Grove, CA: Duxbury. DeGroot, M.H. and Schervish, M.J., 2012.Probability and statistics. Pearson Education. Groebner, D.F., Shannon, P.W., Fry, P.C. and Smith, K.D., 2008.Business statistics. Pearson Education. Grus, J., 2015.Data science from scratch: first principles with python. " O'Reilly Media, Inc.". Hodges Jr, J.L. and Lehmann, E.L., 2005.Basic concepts of probability and statistics. Society for industrial and applied mathematics. Kvanli, A.H., Pavur, R.J. and Guynes, C.S., 2000.Introduction to business statistics. Cincinnati, OH: South-Western. Lee, G.G. and Lin, H.F., 2005. Customer perceptions of e-service quality in online shopping. International Journal of Retail & Distribution Management, 33(2), pp.161-176. McKinney, W., 2012.Python for data analysis: Data wrangling with Pandas, NumPy, and IPython." O'Reilly Media, Inc.". Mendenhall, W. and Sincich, T., 1993.A second course in business statistics: Regression analysis. San Francisco: Dellen. Papoulis, A., 1990.Probability & statistics (Vol. 2). Englewood Cliffs: Prentice-Hall. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V. and Vanderplas, J., 2011. Scikit-learn: MachinelearninginPython.Journalof machinelearningresearch, 12(Oct), pp.2825-2830. Pillers Dobler, C., 2002.Mathematical statistics: Basic ideas and selected topics. Ross, S.M., 2014.Introduction to probability and statistics for engineers and scientists. Academic Press. Schutt, R. and O'Neil, C., 2013.Doing data science: Straight talk from the frontline." O'Reilly Media, Inc.". Wolfinbarger, M. and Gilly, M.C., 2001. Shopping online for freedom, control, and fun. California Management Review, 43(2), pp.34-55. 14
Appendix (Python code) Here we attached some of the code used for this study. Appendix 1: Python code for importing data file importcsv # for importing csvmodule file_name ="BookData.csv"# data file with open(filename, 'r') as csvfile:# reading csv file csvreader =csv.reader(csvfile) # creating a csv reader object Appendix 2: Python code for creating data frame import pandas as pd data= file_name df = pd.DataFrame(data, columns = [‘columns name’]) Appendix 3: Python code for basic statistic df[‘variable’].mean()# for mean df[‘variable’].std()# for standard deviation df[‘variable’].min()# for minimum observation df[‘variable’].max()# for maximum observation df[‘variable’].describe()# for summary statistics Appendix 4: Python code for independent two sample t-test assuming unequal variances import scipy.stats as stats stats.ttest_ind(sample1, sample2, equal_var=False) Appendix 5: Python code for one way ANOVA import scipy.stats as stats stats.f_oneway(var1, var2, var3, var4, var5) Appendix 6: Python code for regression import statsmodels.api as sm y = pd.DataFrame(data, columns= [‘Total_Monthly_Sale’]) X = pd.DataFrame(data, columns= [‘Number_of_Customers’]) model = sm.OLS(y, X)# y is monthly sale and X is number of customers model.summary() 15