Quantitative Analysis: Summary Statistics, Histograms, Scatterplots, and Correlations
Verified
Added on 2023/01/19
|12
|2069
|27
AI Summary
This document provides a quantitative analysis of sales, fresh foods, and specials. It includes summary statistics, histograms, scatterplots, and correlations between the variables.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
Quantitative analysis Student name: Instructor: 1|P a g e
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
NUMBER TWO 2a. Descriptive statistics Statistics SalesFreshfoodsSpecial s NValid200200200 Missing000 Mean33.377812.44687.5546 Std. Error of Mean1.31213.87324.29951 Median24.56009.04006.6300 Mode23.01a.006.00 Std. Deviation18.5562 6 12.349474.23570 Variance344.335152.50917.941 Skewness.6931.597.981 Std. Error of Skewness.172.172.172 Kurtosis-.7472.308.917 Std. Error of Kurtosis.342.342.342 Range83.3961.7721.64 Minimum4.88.001.00 Maximum88.2761.7722.64 Percentile s 2519.13003.92004.7300 5024.56009.04006.6300 7550.755016.58259.8675 a. Multiple modes exist. The smallest value is shown Table 1 Table 1 represents the summary statistics of three variables which are in monetary value. The first variable is sales. This represents the amount that was spent by each of the customers in dollars. The other variable is fresh foods. This represents the amount spent on fresh food in terms of dollars. The last one is specials. This represents the amount of money spent on products which were given 50% discount. As can be observed from the statistics, the mean amount spent on sales is 33.78 dollars. The mean amount spent on fresh foods is 12.44 dollars. Lastly, the mean amount of money spent on specials was 7.55 dollars. The median amount spent on sales is 24.56 dollars. The mean amount spent on fresh foods is 12.44 dollars. Lastly, the mean amount of money spent on specials was 7.55 dollars. 2|P a g e
2b. Histogram and ogive curve 0-5 10-15 20-25 30-35 40-45 50-55 60-65 70-75 80-85 90-95 100-105 110-115 0 5 10 15 20 25 30 35 40 45 50 Histogram - store 2 Sales intervals Frequency Figure 1 The above figure is of a histogram representing the distribution of amount spent on sales in dollars. It can be seen that the distribution depicts two peaks. The first high peak is between 10 and 25 dollars. This indicates that most customers spent between 10 to 25 dollars. The other peak was between 50 dollars and 65 dollars. This indicates that most customers spent between 50 to 65 dollars. It can also be observed that very few people spent more than 65 dollars and less than 10 dollars on sales. 5 15 25 35 45 55 65 75 85 95 105 115 0.00% 10.00% 20.00% 30.00% 40.00% 50.00% 60.00% 70.00% 80.00% 90.00% 100.00% Ogive curve-store 2 upper limits Cummulative frequency Figure 2 3|P a g e
Figure 2 is of an ogive curve also known as the histogram curve. It gives the linear trend of the cumulative frequency of the amount spent on sales. It is different from the histogram in that it plots the cumulative data points and not the data points themselves. It can be observed that it rises steadily from bottom left to right then stabilizes around 90% of the cumulative frequency. Descriptive statistics for amount spent in store 1 STORE 1 Descriptive statistics SalesFresh foodsSpecials Mean47.8619.4811.62 Standard Error1.220.360.19 Median52.9720.1111.44 Mode51.9420.1011.40 Standard Deviation16.404.892.51 Sample Variance268.9823.956.29 Kurtosis11.560.971.37 Skewness1.950.030.94 Range140.2635.5613.46 Minimum13.720.008.24 Maximum153.9835.5621.70 Sum8614.203506.292092.16 Count180180180 Table 2 Table 2 represents the summary statistics of three variables which are in monetary value. The first variable is sales. This represents the amount that was spent by each of the customers in dollars. The other variable is fresh foods. This represents the amount spent on fresh food in terms of dollars. The last one is specials. This represents the amount of money spent on products which were given 50% discount. As can be observed from the statistics, the mean amount spent on sales is 47.86 dollars. The mean amount spent on fresh foods is 19.48 dollars. Lastly, the mean amount of money spent on specials was 11.62 dollars. The median amount spent on sales is 52.97 dollars. The mean amount spent on fresh foods is 20.11 dollars. Lastly, the mean amount of money spent on specials was 11.44 dollars. It can be concluded that the amounts spent on the variables in store 1 were higher than the amounts spent on the variables in store 2. 4|P a g e
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Ogive curve for store 1 20 30 40 50 60 70 80 90 100 110 120 130 140 150 More 0.00% 20.00% 40.00% 60.00% 80.00% 100.00% Ogive Sales Frequency Figure 3 Figure 3 is of an ogive curve also known as the histogram curve. It gives the linear trend of the cumulative frequency of the amount spent on sales in store 1. It is different from the histogram in that it plots the cumulative data points and not the data points themselves. It can be observed that it is generally constant rises steadily from bottom left to right then stabilizes around 95% of the cumulative frequency. 15 25 35 45 55 65 75 85 95 105 115 125 135 145 155 0 10 20 30 40 50 60 70 80 Histogram - store 1 sales upper limit Frequency Figure 3 The above figure is of a histogram representing the distribution of amount spent on sales in dollars. It can be seen that the distribution depicts two peaks. The first high peak is between 30 5|P a g e
and 40 dollars. This indicates that most customers spent between 30 to 40 dollars. The other peak was between 55 dollars and 65 dollars. This indicates that most customers spent between 55 to 65 dollars. It can also be observed that very few people spent more than 70 dollars and less than 25 dollars on sales. 2C. SCATTERPLOTS OF SALES VERSUS FRESH FOOD 0102030405060708090100 0 10 20 30 40 50 60 70 Scatterplot Sales Fresh food Figure 4 Figure 4 above is a scatterplot of fresh food versus sales. It can be observed that there is a pattern produced between the two variables. Below the sales amount of 40 dollars, no particular pattern is evident but beyond 40 dollars, it can be observed that the two variables produce a curve like line. 6|P a g e
2C. SCATTERPLOTS OF SALES VERSUS SPECIALS 0102030405060708090100 0 5 10 15 20 25 Scatterplot Sales Specials Figure 5 Figure 5 above is a scatterplot of specials versus sales. It can be observed that there is a pattern produced between the two variables. Below the sales amount of 40 dollars, no particular pattern is evident but beyond 40 dollars, it can be observed that the two variables have a linear kind of relationship. Pearson Correlation between sales and fresh food Correlations SalesFresh foods Sales Pearson Correlation1.820** Sig. (2-tailed).000 N200200 Fresh foods Pearson Correlation.820**1 Sig. (2-tailed).000 N200200 **. Correlation is significant at the 0.01 level (2-tailed). Table 3 A test of correlation was conducted on sales and fresh foods. Pearson correlation coefficient was employed because the data was large and hence assumed to be normally distributed. The Pearson correlation between the two variables is 0.8. This indicates that there is a very strong and positive 7|P a g e
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
correlation between the two variables. The p-value computed as can be observed is 0.00. This means that the correlation is significant. Pearson Correlation between sales and special Correlations SalesSpecials Sales Pearson Correlation1.700** Sig. (2-tailed).000 N200200 Specials Pearson Correlation.700**1 Sig. (2-tailed).000 N200200 **. Correlation is significant at the 0.01 level (2-tailed). Table 4 A test of correlation was conducted on sales and specials. Pearson correlation coefficient was employed because the data was large and hence assumed to be normally distributed. The Pearson correlation between the two variables is 0.7. This indicates that there is a very strong and positive correlation between the two variables. The p-value computed as can be observed is 0.00. This means that the correlation is significant. 8|P a g e
Covariance between the three variables Correlations SalesSpecialsFresh foods SalesPearson Correlation1.700**.820** Sig. (2-tailed).000.000 Sum of Squares and Cross-products 68522.59 4 10948.03 5 37394.116 Covariance344.33555.015187.910 N200200200 SpecialsPearson Correlation.700**1.871** Sig. (2-tailed).000.000 Sum of Squares and Cross-products 10948.03 5 3570.2869069.047 Covariance55.01517.94145.573 N200200200 Fresh foods Pearson Correlation.820**.871**1 Sig. (2-tailed).000.000 Sum of Squares and Cross-products 37394.11 6 9069.04730349.377 Covariance187.91045.573152.509 N200200200 **. Correlation is significant at the 0.01 level (2-tailed). Table 5 Table 5 above gives the results of covariance between the three variables. The covariance between sales and specials is 55.02. The covariance between sales and fresh foods is 187.91. To add on, the covariance between specials and fresh foods is 45.57. QUESTION THREE 3A. Contingency table SALES High value sales Medium value sales Low value salesTOTAL SPECIAL S High value specials306296257859 Low value specials200190151541 TOTAL5064864081400 9|P a g e
Table 6 The contingency table above is of sales versus specials. The sales were categorized into high value sales, medium value sales and low value sales. On the other hand, the specials were categorized into high value specials and low value specials. 3B. Relative frequency/joint probabilities RELATIVE FREQUENCY/JOINT PROBABILITY SALES High value salesMedium value salesLow value sales TOTA L SPECIAL S High value specials0.2190.2110.1840.614 Low value specials0.1430.1360.1080.386 TOTAL0.3610.3470.2911.000 Table 7 3C. Conditional probability table CONDITIONAL PROBABILITY TABLE SALES High value sales Medium value sales Low value salesTOTAL SPECIAL S High value specials0.3560.3450.2991.000 Low value specials0.3700.3510.2791.000 Table 8 The two variables are independent. This is because the joint probabilities are equal to their corresponding conditional probabilities. For example P(highvaluesales)=0.36 But theP(highvaluesales∨valuespecials)=0.36 The last row of table 7 can be compared with the values in table 8 to confirm the independence of the two variables. 10|P a g e
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
3D Probability tree Figure 6 RELATIVE FREQUENCY/JOINT PROBABILITY SALES High value salesMedium value salesLow value sales TOTA L SPECIAL S High value specials0.2190.2110.1840.614 Low value specials0.1430.1360.1080.386 TOTAL0.3610.3470.2911.000 Table 9 From the table above, it can be said that there is no difference in joint probabilities from the probability tree and relative frequencies as given above. This can be confirmed from the following. P(mediumvaluesales)=0.35 11|P a g e
But theP(mediumvaluesales∨valuespecials)=0.35 The last row of table 7 can be compared with the values in table 8 to confirm the independence of the two variables. 12|P a g e