Data Handling and Business Intelligence: Sales and Profit Analysis

Verified

Added on  2023/06/18

|16
|3368
|303
Report
AI Summary
This report delves into data handling and business intelligence, beginning with an analysis of sales and profit decline in a superstore dataset using Excel for pre-processing. It evaluates Excel's utility in data cleaning, filtering, and visualization, while also acknowledging its limitations in advanced pricing rules and security. The report outlines steps for using Excel to identify sales trends and relationships with profit, highlighting a positive correlation between the two. The second part explores descriptive statistics and K-means clustering using SPSS, interpreting the results to understand homogenous groups based on selected characteristics. Finally, it briefly discusses common data mining methods and compares the pros and cons of using SPSS over Excel for data analysis, emphasizing the importance of data pre-processing and tool selection in business intelligence.
tabler-icon-diamond-filled.svg

Contribute Materials

Your contribution can guide someone’s learning journey. Share your documents today.
Document Page
Data handling and Business Intelligence -2
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
TABLE OF CONTENTS
PART 1............................................................................................................................................3
Determine the decline in the sales/Profit.....................................................................................3
PART 2............................................................................................................................................7
2.1 Presenting the screenshot of clustering i.e. K-means............................................................8
2.2 Common data mining methods............................................................................................12
2.3 Pros and cons of using SPSS over Excel.............................................................................13
REFERENCES..............................................................................................................................15
Document Page
PART 1
Determine the decline in the sales/Profit
Evaluating the use of Excel for pre-processing the data
Data pre-processing is the data mining technique that is used to involves transforming
raw data into an understandable format. Also, it is all about cleaning, instance selection,
normalization, transformation, feature extraction and selection etc. However, in the real-world,
data is often incomplete, inconsistent and also likely to contain many errors. There are many
tools and techniques used for pre-processing the data which include Weka software, Excel etc.
Nariswari and Nugraha (2020) stated that for non-programmers, Microsoft Excel is the great tool
used for pre-processing and handling structure data. It is so because it has many functions and
techniques which makes it easier to clean structure data. Thus, it can be reflected that with the
help range of functions within Excel, can determine the impact or interrelationship between sales
and profit.
In addition to this, Excel is one of the top tools for pre-processing of data due to the use
of variety of functions which includes Pivot table, If-function, Count, Trim etc. In the context of
present dataset, it has been examined that it is a huge data, and for pre-processing, Excel is used
that helps to filter the data in order to properly respond the questions. This in turn assist to
answer the questions and with the help of effective visualization, presentation of the report will
be improved and helps to interpret the data in more effective manner (HINDASAH and
NURYAKIN, 2020). On the critical note, it has been examined that it is difficult to manage
advanced pricing rules and due to lack of control and security, Excel cannot be used by the data
analysis because it might leak the important data which affected the results as well. Besides this,
it is also clearly reflected that in pricing term, the excel cannot be managed the terms easily
because it is difficult to manage advanced pricing rules, this might affect the results as well.
Overall, it has been clearly reflected that with the help of excel, superstore market can be
easily determining the fluctuation within sales due to years. Also, through this, it can be stated
that it helps to analyse the performance of a company over years and which steps need to be
chosen in order to raise the sales (Kohli, Godwin and Urolagin, 2021). So that, it can be stated
that Excel is a strong and effective pre-processing tool that helps in improving the decision and
Document Page
also derive best results accordingly. In the context of present study or chosen dataset, it can be
explained that by using different options within Excel, the superstore can determine the
importance of sales and the trend which can affected the overall performance of the company.
As per the provided data set of the superstore, it has been examined that charts and
graphs has been used as an excel function. Further, the tool is best suitable because it assists to
determine the sales figure over the years mentioned in order to determine whether the sales is
increases by year or not. This in turn also helps in determine the relationship of sales with profit.
However, many secondary research indicates that there is a positive relationship between the
variables. The steps are enumerated below:
In the first step, there is a need to filter all data, which in turn assist to extract the
essential data.
By using filter option, select a range in which sales can be identified. Under this, data
related to sales can be identified easily from the huge data.
By selecting the range, for a specific year, apply the formula of sum in order to determine
total sales for four consecutive years i.e. 2009, 2010, 2011 and 2012.
The same steps apply in the profit side, where total profit of the respective years can be
identified. This in turn helps to examine total sum of the profit and sales in order to
analyze the relationship with both variables.
Further, to drawn the graphs and charts, select the data range and click on insert in order
to present the graphs or charts that helps to make a report more presentable and identify
the relationship directly.
Also, with the help of generating graphs from the selected data, it has been identified that
both have shared a positive relationship with each other and this in turn reflected that
company have to focused upon the products so that sales can be increases that contribute
towards a profit.
Overall, it can be stated that by following the above mentioned steps, it will be easy for the
scholar to determine the answer of the questions and understand any relationship between the
variables as well.
Evaluation of the data-set
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
1 2 3 4
Series1
Series2
2009 2010 2011 2012
1625226.23
476755.974
1320410
1603443
Sales
Series1 Series2
1 2 3 4
0
20000
40000
60000
80000
100000
120000
140000
160000
180000
200000
2009 2010 2011 2012
159438
119645
175687 187456
Profit
Series1 Series2
Interpretation: As per the above, it has been identified that in 2010, the sales of the Superstore
has decline and that is why, the profit of the firm has a direct affected in opposite manner. Also,
there was a gradual increase in sales rate within 2011 and 2012. That is why, it has a direct
impact over the business performance. Further, there is a direct relationship identified within a
sales and profit. So, if there is a fluctuation within a sale, then it has a direct relationship within
the profit.
Document Page
In addition to this, from the dataset of Superstore, it has been identified that in 2009, the
profit of the firm is 159438, whereas the profit might decline in 2010 due to either high expenses
and low inventory. Moreover, the profit of the firm also increases by 175687 in 2011 and in
preceding year, the sales increases by 187456. However, there is an interrelationship with sales,
such that in 2009, the sales of the superstore were 1625226 where in next year, there was a
decline in sale i.e. 476755.974. Apart from this, in next year, 2011, the sales of the company
increases by 1320410 and then by 1603443. Thus, it reflected that there is a need to if the sales
are high, it shares a positive relation with the profit. So, focusing upon the company’s inventory
and quality of products assist to increase the sales of a company.
Šaković Jovanović and et.al., (2020) explained that if the sales is exceed with variable
cost, each additional unit of a sales volume increases by the gross profit as well as net income.
Similarly, the graph also reflected that with the change in sales, the profit margin of the company
also affected. This in turn shows that if the form lower down the cost without impacting revenue
and maintain the sales of a supermarket, then the profit will go up. Therefore, can stated that
product sales inevitably lead to greater profits and also increase in monthly sales volume.
However, profit margin is considering the most important barometer of a company’s health and
it has highly influenced by company’s sales. On the other side, business owners basically strive
to have both profit and revenue, along with this, businesses face many difficulties. Thus, to
minimize the same, it is necessary for the company to ensure about the stock and make strategy
that helps to minimize the negative impact over the businesses.
In addition to this, Wan, Britto and Zhou (2020) also investigated that in some cases, a
company may have a significant revenue but also enjoy very little profit from its sales. The same
can be applied in the year of 2010 where, the business are in loss because there is a sudden
decline in the phase of sales that has a direct impact over the business performance profit.
Nariswari and Nugraha (2020) also supported in their stud that this is usually occur because a
company’s expenses are so high and it create difficulties in order to create strong profit.
However, on the other side, revenue can be slow such that even small expenses can decrease the
profit margin of a firm. That is why, company must monitor the situation frequently in order to
implement the changes so that it does not cause any negative impact over the business
performance. Apart from this, it has been also analyzed that the sales and profit relationship can
Document Page
be analyzed by owner’s fault because they expect too much and due to not fulfilling the same, it
has affected the sales or profit.
Overall, it can be stated that due to positive relationship between the variables i.e. sales
and profit, the businesses or superstores can attain the defined aim of a business. Further, an
entity can have sales or revenue without making a profit, but it cannot have a profit without any
revenue. That is why, this interrelationship reflected that earning profit is not the only motive of
the company, but sustaining the brand image over the business should also contribute to enhance
the sales of a business. Hence, the data set of a superstore reflected that by minimizing the price
will contribute to increase the profit and attract the customers in order to raise the business profit.
PART 2
Descriptive statistics
gender Age Rice
N
Valid 100 100 100
Missing 0 0 0
Mean 1.50 20.35 .60
Median 1.50 19.00 1.00
Mode 1a 22 1
a. Multiple modes exist. The smallest value is
shown
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Gender Age Rice
1.5
20.35
0.61.5
19
1
0
22
1
Descriptive statistic
Mean Median Mode
Interpretation: As per the above descriptive statistic, it has been identified that average number
of selected participants are fall under the category of 20.35 and most of them belong to 22 age
group. On the other side, around 0.60 is the mean of those people who eat rice. Further, 50% of
the male participants are eat rice and majority of them eat rice as compared to female.
gender
Frequency Percent
Valid
Male 50 50.0
Female 50 50.0
Total 100 100.0
Document Page
50; 50%50; 50%
Gender
Male
Female
Interpretation: As per the above graph, it has been identified that 50% of the selected
participants are male and 50% of them are females.
2.1 Presenting the screenshot of clustering i.e. K-means
Step 1: In the first step, first put entire data in input table in SPSS software, as mentioned
below:
Document Page
Step 2: Click on Analyze at file menu and then click on classify. Further click on K-means to
generate the findings
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Step 4: In this, put all the variables into another side, and turn on iterate and classify in order to
generate the results.
Document Page
Step 5: the results are as mentioned below:
Initial Cluster Centers
Cluster
1 2
gender 1 2
Age 13 26
Rice 1 0
Iteration Historya
Iteration Change in Cluster Centers
1 2
1 4.489 2.731
2 .228 .384
3 .000 .000
Document Page
a. Convergence achieved due to no
or small change in cluster centers.
The maximum absolute coordinate
change for any center is .000. The
current iteration is 3. The minimum
distance between initial centers is
13.077.
Final Cluster Centers
Cluster
1 2
gender 1 2
Age 18 24
Rice 1 1
Number of Cases in each
Cluster
Cluster 1 56.000
2 44.000
Valid 100.000
Missing .000
Interpretation: K-means is used in order to determine relatively homogenous groups based
upon the selected characteristic. Similarly, the results generated through the above clearly
reflected that there is a minor change in gender, however rice are consumed by both groups. This
in turn reflected that each cluster is determined on another one which means that there is an
interrelationship between the values of each cluster.
2.2 Common data mining methods
In this modern era, data mining techniques have been developed and used that include
association, classification, clustering and sequential pattern etc. These data mining methods are
as mentioned below:
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Classification analysis: It is one of the most common method where data can be
classified into different sets in order to attain an accurate analysis or prediction. This is mainly
used in banks where owners want to determine who should be offered a loan (Regin, Rajest and
Singh, 2021). This can be also used in the retail sector in order to classify different products
within each department.
Clustering analysis: It is another data mining method where data objects are grouped in a
clusters on the basis of similarities. With the help of this method, a degree of association is
maximal within each cluster can be identified. This is used in medical industry while trying to
test any vaccine over humans or animals who share the same characteristic.
Regression analysis: One of the most common and effective data mining tool which is
used in order to determine the association between the variables like sales and income. Under
this, independent and independent variable can be analyzed that examine the casual; relationship.
This is also used in all type of business and especially to the weather forecasting.
Association Rule Learning: It is also used in all the businesses because it helps in
forecasting customer behavior. This technique helps in determine the relationship behavior
between different variable and also examine the hidden pattern within a data. So, it can be stated
that this technique is highly preferred in sales transactions (Hou and et.al., 2020). For example,
in e-commerce business, this tool has been used in order to identify which product may be
frequently used by the customers that helps to examine the customer buying behavior.
2.3 Pros and cons of using SPSS over Excel
Statistical Package for the Social science is the tool that is used for complex statistical
data analysis and most of the top research agencies also used this tool in order to analyze the
survey data and mine text data (Huang, 2021). This is highly preferred over Excel because it
provide accurate results which can be relied and valid.
Advantages: It is a comprehensive statistical software and many complex statistical tests are
available within this software. Apart from this, the interpretation can be performed easily by
generating the results which can be understand by people. Whereas, results generated through
excel are not actual interpreted (Tong and Shen, 2021). Also, SPSS helps in displaying easily and
Document Page
quickly data tables which cannot be expanded. Therefore, it can be stated that with the help of
results through SPSS the researcher can interpret the results effectually. On the other side, excel
also allow multiple methods of importing and exporting the data and allow it to be integrated into
workflows, but SPSS clearly serves the process of statistics and formulation of data through
manipulation technique.
Disadvantages: the biggest disadvantage of using this method is such that it can be expensive to
purchase and that is why, most of the researcher do not use this method. Also, it is usually
needed training session in order to completely understand the method which might not be
possible for all of them (Khan and et.al., 2021). Therefore, the graph feature used in SPSS are
not as simple as Microsoft excel, so to present the data in effective manner, most of them used
Excel. Also, very large data cannot be analyzed through SPSS, but in case of Excel, data can be
analyzed easily.
Overall, it has been stated that SPSS software has its own benefits and limitation, but
excel allow the user to store information in a tabular format and then interact with variety of
numbers. Thus, with the changing trend, researcher uses SPSS software in order to analyze the
relationship and creates a new insight that helps in businesses.
Document Page
REFERENCES
Books and Journals
HINDASAH, L. and NURYAKIN, N., 2020. The relationship between organizational capability,
organizational learning and financial performance. The Journal of Asian Finance,
Economics, and Business. 7(8). pp.625-633.
Hou, L. and et.al., 2020. Pattern identification and risk prediction of domino effect based on data
mining methods for accidents occurred in the tank farm. Reliability Engineering & System
Safety. 193. p.106646.
Huang, H., 2021. Analysis on Teaching Effectiveness of Primary Mathematics teachers based on
SPSS. International Journal of Social Science and Education Research. 4(4). pp.256-263.
Khan, H. and et.al., 2021. Pros and cons of online course from medical student’s standpoint. The
Professional Medical Journal. 28(03). pp.387-391.
Kohli, S., Godwin, G. T. and Urolagin, S., 2021. Sales Prediction Using Linear and KNN
Regression. In Advances in Machine Learning and Computational Intelligence (pp. 321-
329). Springer, Singapore.
Nariswari, T. N. and Nugraha, N. M., 2020. Profit Growth: Impact of Net Profit Margin, Gross
Profit Margin and Total Assests Turnover. International Journal of Finance & Banking
Studies (2147-4486). 9(4). pp.87-96.
Regin, R., Rajest, S. S. and Singh, B., 2021. Spatial Data Mining Methods Databases and
Statistics Point of Views. Innovations in Information and Communication Technology
Series, pp.103-109.
Šaković Jovanović, J. and et.al., 2020. The relationship between E-commerce and firm
performance: The mediating role of Internet sales channels. Sustainability. 12(17). p.6993.
Tong, J. and Shen, J., 2021, March. Research on Customer Satisfaction under Two Restrictive
Discount Promotion Strategies Based on SPSS analysis software. In 2021 2nd International
Conference on E-Commerce and Internet Technology (ECIT) (pp. 404-409). IEEE.
Wan, X., Britto, R. and Zhou, Z., 2020. In search of the negative relationship between product
variety and inventory turnover. International Journal of Production Economics. 222.
p.107503.
chevron_up_icon
1 out of 16
circle_padding
hide_on_mobile
zoom_out_icon
logo.png

Your All-in-One AI-Powered Toolkit for Academic Success.

Available 24*7 on WhatsApp / Email

[object Object]