TABLE OF CONTENTS INTRODUCTION...........................................................................................................................1 PART 1............................................................................................................................................1 By using data set of superstores analyse profit and sales over years and analyse it by using Excel for pre- processing of data, also analyse and visualize the data........................................1 Demonstration of ways in which data can be practically analysed using Excel functions such as Lookup, Pivot table, graphs and charts...................................................................................6 PART 2............................................................................................................................................9 By using audidealership.csv file show conjunction with Weka with the example of clustering.9 Explanation of commonly used data mining methods that can be used in business. Explain them with real time example......................................................................................................12 Advantages and disadvantages of Weka....................................................................................13 CONCLUSION..............................................................................................................................13 REREFENCES..............................................................................................................................14
INTRODUCTION Data handing can be defined as a process through which data can be handled, stores, disposed off in a secure and safe manner. There are various ways through which data can be handled. Data handling lies within Business Intelligence as it helps an organization to manage, store and analyse their data in order to reach to a conclusion. Business Intelligence can be defined as a technology driven process which is used for data analysis and presentation of information or data which can further be used by organizations to take important business decisions(Tirpude, Karandikar and Welekar, 2020). BI helps in providing historical and current data to be analysed so that predictive views for the business can be generated. Business intelligence with data mining helps organizations to take effective and appropriate decisions and develop future business strategies which not only helps them to achieve their defined goals and objectives but also helps them to enhance their overall revenue and profitability. However, it can also be said that data mining is a part of Business intelligence(Bayer and et. al., 2017). Both of them are fruitful for business in different ways like, data mining helps in analysing patterns within the data. Data mining helps in resolving any kind of issue where as BI helps in decision making. The only difference between BI and data mining is that data mining is used for small data set whereas BI is used for huge data sets. This assignment will lay emphasis on analysis of data of Audi leadership and Superstore data in order to analyse overall sales and profit of the organization. This assignment will also help in explaining different kinds of data mining methods that can be used by organizations and advantages and disadvantages of Weka over Excel. PART 1 By using data set of superstores analyse profit and sales over years and analyse it by using Excel for pre- processing of data, also analyse and visualize the data There are various kinds of tools and technologies that can be used for analysis of a data set and can be used for extracting and analysing useful and important data in an appropriate manner(Sasikala, Kalaiselvi and Scholar, 2016). These tools can be used by organizations for analysing their yearly or quarterly sales or profit so that they can take important decision or develop strategic plans in order to increase sales and profit. Excel is one of the most commonly used data mining tools which can be used by organizations for analysis of their historical and current data so that they can take appropriate decisions. It is one of the most commonly preferred tools because it has various kinds of inbuilt functions that helps in doing calculations, calculate 1
sales, profit, extract and analyse important and required data, search doe a data or relevant value. Most importantly it is used for organizing data or information in a useful manner so that data can be understood and analysed appropriately(Hamed and et. al., 2017). Not only this organization of data also helps in using formulas wherever and whenever required. Excel provides an option to the user to implement required formula and generate graph or charts for better understanding. One of the functions in Excel that can be used by organizations for analysis of their overall or historical sales and profit is Pivot table and graph. Here, data of Superstore data will be analysed in order to analyse decline in sales and profit over years. ï‚·Sum of sales and sum of profit from 2009 to 2012 Row LabelsSum of Sales Sum of Profit Furniture5178590.542117433.03 20091472671.72461804.53 20101252518.4169397.4 20111268656.07850422.45 20121184744.324-4191.35 Office Supplies3752762.1518021.43 20091035399.64177646.27 2010910359.95118143.24 2011796383.7986960.01 20121010618.72135271.91 Technology5984248.182886313.52 20091701825.482194645.22 20101397208.679237376.69 20111364905.113242928.04 20121520308.909211363.57 Grand Total14915600.821521767.98 Figure1Sum of sales and sum of profit from 2009 to 2012 2
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Figure2Sum of sales Figure3Sum of profit Data Interpretation: All the three graphs above clearly explain decline in sales and profit over years. First graphs explain both sum of sales over years and sum of profit over years in one graph so that comparison between both of them can be done. Other two graphs explain sum of sales and profit individually. From all the threw graphs it has been interpreted that highest sales was seen in 2009 in Technology department and highest profit was seen in year 2011 in Technology department. ï‚·Average sales and profit of Technology over years Row Labels Average of Sales Average of Profit Technology2897.941008429.2075157 20093145.703293359.7878373 20102631.278114447.037081 20112916.463917519.0770085 20122895.826493402.5972762 Grand Total2897.941008429.2075157 3
Figure4Average sales and profit of Technology Data Interpretation: From the above graph it can be clearly interpreted that average sales of Technology was lowest in 2010 and average profit of technology was lowest in 2009. ï‚·Average sales and profit of Furniture over years Row Labels Average of Sales Average of Profit Furniture3003.8228268.11660673 20093287.21367137.9565402 20102846.63276421.35772727 20113035.062388120.6278708 20122834.316565-10.02715311 Grand Total3003.8228268.11660673 4
Figure5Average sales and profit of Furniture Data Interpretation: From the above graph it can be interpreted that Average sales of furniture was highest in 2009 and lowest in 2010 whereas it cannot be determined properly for Average profit but from the table it can be interpreted that average profit of furniture was highest in 2009 and lowest in 2012. ï‚·Average sales and profit of office Supplies over years Row Labels Average of Sales Average of Profit Office Supplies814.0481779112.3690738 2009885.7139778151.9643028 2010778.0854274100.9771282 2011716.17247378.20144784 2012871.9747368116.7143313 Grand Total814.0481779112.3690738 Figure6Average sales and profit of office Supplies 5
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Data Interpretation: From the above graph it can be interpreted that average sales of Office supplies were highest in 2012 and lowest in 2012 whereas average profit of Office supplies was highest in 2009 and lowest in 2011. Demonstration of ways in which data can be practically analysed using Excel functions such as Lookup, Pivot table, graphs and charts There are various kinds of in-built function present in Excel. Out of all of those lookup and Pivot table are most commonly used in-built functions of excel. LookupFunction:ThisfunctionhasbeencategorisedinExcelunderLookupand Referencefunctions. This function helps in performing rough match lookup either in one row or in one column(Massaro and et. al., 2019). It is basically used for selecting values from a defined range. It is mostly used in financial calculations in which comparison of two rows or column is required to be done, for this, lookup function can be used. It is basically designed to handle simplest case of horizontal and vertical look up as well. This formula is basically applied to the data which is assembled in a particular order. It is basically used for a large amount of data and in order to find out value of an appropriate data from a huge set of data. The basic syntax used for lookup function for searching a data is =LOOKUP (value, loopup_range, [result_range]) For example: If user wants to find average sales valie in 2011 from the below data then lookup finction can be used Row Labels Average of Sales Average of Profit Furniture3003.8228268.11660673 20093287.21367137.9565402 20102846.63276421.35772727 20113035.062388120.6278708 20122834.316565-10.02715311 =LOOKUP(A5,A3:A6,B3:B6) 3035.062388 6
Pivot Table: Out of all the in-built functions pivot table and pivot graph are one of the most powerful and most commonly used in built functions of Excel(Trivedi and et. al., 2017). It is used to extract required and important data in order to be analysed in a proper manner. ï‚·Sum of profit and sum of sales region wise Row Labels Sum of ProfitSum of Sales Atlantic238960.662014248.204 North Carolina2841.11116376.4835 Northwest Territories8307.0583817.746 Ontario439214.573780242.063 Prarie321160.122837304.602 Quebec140426.651510195.08 West297008.613597549.276 Yukon73849.21975867.371 Grand Total1521767.9814915600.82 Figure7Sum of profit and sum of sales region wise ï‚·Average sales and Average profit region wise Row Labels Average of Sales Average of Profit Atlantic1865.044633221.2598704 North Carolina1473.12004435.96341772 Northwest Territories1420.639763140.7974576 Ontario1749.302204203.2459833 Prarie1663.132826188.2532943 Quebec1933.668476179.8036492 West1806.905713149.1755952 7
Yukon1800.493304136.253155 Grand Total1775.878179181.1844243 Figure8Average sales and Average profit region wise ï‚·Average sales and Average profit of Ontario and Prarie region Row Labels Average of Sales Average of Profit Ontario1749.302204203.2459833 Georgina1961.88645209.2537984 Hanover1550.93624193.4693109 Ontario1778.693177168.9889851 Orangeville1580.510948222.3565569 Waterloo2140.386816275.659791 Prarie1663.132826188.2532943 Manitoba1731.209057172.0392938 Saskachewan1604.004183202.3362103 Grand Total1711.286957196.6316757 8
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Figure9Average sales and Average profit of Ontario and Prarie region PART 2 By using audidealership.csv file show conjunction with Weka with the example of clustering Figure10audidealership data relationship 9
Figure11Hierarchical clustering 10
Figure12Cluster data using the k means algorithm 11
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Explanation of commonly used data mining methods that can be used in business. Explain them with real time example Data mining can be defined as a process which is used by organizations for analysis of raw data for extraction of useful and important information. It is used for analysing patterns within the data and also for resolving any kind of issue faced by the organizations. it can also be used by organizations for gaining more knowledge about their customers, their buying pattern or behaviour so that they can develop effective and appropriate strategies in order to increase their overall sales, revenue and profit(Zhou and et. al., 2017). Some of the most commonly used data mining methods are as follows: Association: This data mining method is used by organizations in order to find correlation between data. It is mostly used by business organizations for identification of hidden patterns within the data so that relationship between selected data can be identified. It is mostly used for decision making and for strategic future planning(Saleem and et. al., 2019). For example: this data mining method can be used by retail sector organizations for identification of their customersbuyingbehaviour or pattern.Suchas: if a supermarketwantstocheckwhat combination of items are mostly sold by them, like, more than 70 percent of customers who buy milk also purchase bread and approximately 18 percent of customers are those who purchase both the items. Classification: This method is used by organizations so that they can distinguish between data or information on the basis of their features, behaviour and many more. This method can also be used by banks, retail sector organizations. With the help of this method organizations can categorise their data on the basis of some set criteria’s. For example, if a bank wants to categorize their loans on the basis of income of customers who took loan and on the basis of risk associated with each loan this method can be used. 12
Clustering Analysis: this method is similar to classification method but with little bit of difference(Kavakiotis and et. al., 2017). This method is used segregate the data in small segments on the basis of similarities and dissimilarities of the data or information. For example: if an automotive organization wants to segregate their customers who have paid for the vehicle in single time and those who have taken loan to purchase vehicle. Prediction: it is a kind of data mining method which is used by organizations so that they can predict future by analysing current and past data. It is one of the most important and valuable method of data mining that can be used by organizations for predicting their future sales, revenue and for many other factors(Mendes and Vilela, 2017). For this many other data mining methods are clubbed together so that future can be predicted such as: trend analysis, pattern matching, relation analysis and classification. It is mostly used by retail sector organizations for prediction of next year sales and profit so that on the basis of this data they can plan their strategies accordingly. Advantages and disadvantages of Weka Weka isthe toll thatmainly used for pre-processing, classification,regression and association rules. It is data mining software that is very helps as this uses collection of machine learning of algorithms. It is the algorithms that can be applied directly to the data and data and java code. However, this is well suited for developing new machine of learning scheme. Therefore, advantages and disadvantage of Weka are termed out in following context as are-: Advantages of Weka-: Weka data mining supports an enterprise that attains its fullest perspective. This is termed out as approach that helps to evaluate how business becoming impacted with particular qualities, supportscompanyentrepreneurand leadstoimprovetheirearnings. In additiontothis, individual have already been utilising weka data mining for the years in different formats. Its supports firm entrepreneurs advance the earnings and steer clear generating entity mistakes down the line. This is one of the attractive computer software that supports an enterprise to evaluate and analyse the whole information in more effective mode. In addition to this, it can be stated that Weka mainly contains tools for data pre-processing, classification, regression, clustering, association rules, and visualisation and this very helpful for business to collect crucial elements. Disadvantages of Weka 13
Despite of all the above mentioned advantage, there are some of the disadvantage such as excessive work intensity that needs investment in high performance team and staff training. The of the disadvantage of Weka is that this experience problem with processing if the amount of data becomes too much. This kind of the situation occurred because data mining tools tries to load all of it into the memory. In order to avoid it, Weka offers the simple command line that makes the things easier to handle large amount of data. There is also difficulty in collecting the data. CONCLUSION Hereby, this can be summarized that data handling and business intelligence are defined as a research area that links to intersection of the computer science, mathematics, artificial intelligence and statistics. However, business intelligence and analytics are much more than the technical advancement as this mainly used to collect and analyse data. This assignment has covered the emphasis on analysis of data of Audi leadership. Furthermore, report has explained different kinds of data mining methods that can be used by organizations. Lastly, advantages and disadvantages of Weka over Excel has been defined. 14
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
REREFENCES Books and Journals Bayer, H., and et. al., 2017. Big data mining and business intelligence trends.Journal of Asian Business Strategy.7(1). p.23. Hamed, M., and et. al., 2017. Using Data Mining and Business Intelligence to Develop Decision Support Systems in Arabic Higher Education Institutions. InModernizing Academic Teaching and Research in Business and Economics(pp. 71-84). Springer, Cham. Kavakiotis, I., and et. al., 2017. Machine learning and data mining methods in diabetes research.Computational and structural biotechnology journal.15. pp.104-116. Massaro, A., and et. al., 2019. A business intelligence platform Implemented in a big data system embedding data mining: a case of study.International Journal of Data Mining & Knowledge Management Process (IJDKP).9(1). pp.1-20. Mendes, R. and Vilela, J.P., 2017. Privacy-preserving data mining: methods, metrics, and applications.IEEE Access.5. pp.10562-10582. Saleem, H., and et. al., 2019. Novel Intelligent Electronic Booking Framework for E-Business with Distributed Computing and Data Mining.IJCSNS.19(4). p.270. Sasikala, D., Kalaiselvi, S. and Scholar, M.P., 2016. Data Mining for Business Intelligence in CRMSystem.InternationalJournalofMultidisciplinaryResearchand Development.3(3). pp.198-200. Tirpude, S., Karandikar, A. and Welekar, R., 2020. An Approach for Environment Vitiation Analysis and Prediction Using Data Mining and Business Intelligence. InSmart Trends in Computing and Communications(pp. 327-338). Springer, Singapore. Trivedi, S.K., and et. al., 2017.Handbook of research on advanced data mining techniques and applications for business intelligence. IGI Global. Zhou, Q., and et. al., 2017, April. An Advanced Inventory Data Mining System for Business Intelligence. In2017 IEEE Third International Conference on Big Data Computing Service and Applications (BigDataService)(pp. 210-217). IEEE. 15