Comprehensive Report on Data Handling, Excel, and Smile Clinic SPSS

Verified

Added on  2023/06/18

|17
|3168
|300
Report
AI Summary
This report delves into data handling techniques, emphasizing the importance of data pre-processing using Excel for sales and profit analysis. It explores the smile_clinic.csv dataset using SPSS, providing insights into gender and age analysis, customer preferences, and cluster analysis. The report also discusses common data mining methods applicable to businesses and evaluates the advantages and disadvantages of using SPSS for data analysis. The findings highlight the significance of data quality, the role of technology in increasing sales and customer retention, and the strategic use of data-driven decision-making in business operations. Desklib offers this report as a valuable resource for students studying data analysis and provides access to a wide range of solved assignments and study materials.
Document Page
Data Handling 2
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
TABLE OF CONTENTS
INTRODUCTION...........................................................................................................................3
MAIN BODY..................................................................................................................................3
PART 1............................................................................................................................................3
Sales/profits over the years and the use of Excel for pre-processing the data.............................3
PART 2............................................................................................................................................7
The smile_clinic.csv provided in conjunction with SPSS...........................................................7
The most common data mining methods...................................................................................14
The advantages/disadvantages of SPSS.....................................................................................14
CONCLUSION..............................................................................................................................15
REFERENCES................................................................................................................................1
Document Page
INTRODUCTION
Data handling is the process for ensuring that research about the data that could stored,
disposed and achieve off while for keeping safe and secure for future aspects. While in such kind
of things this might be during and conclusion for the research and includes many of the
developments about the procedures and policies while for just managing many data and handled
the electronically as well as non-electronically means. This report will explain about the sales
and profits over the years and the use of excel in pre-processing data along with visualising data.
Moreover, make the proper use of charts and graph plus about the smile_clinic.csv provided in
conjunction with SPSS. Furthermore, about the data mining which could be used for the business
and discuss the advantage and disadvantage of the SPSS.
MAIN BODY
PART 1
Sales/profits over the years and the use of Excel for pre-processing the data
The unrefined data contains missing thusly, loud and botches, so this can't among the use
that setup in AI models. The better data pre-handling has extended accuracy of the model. So the
data pre-handling is by and large critical in AI and significant learning models. The all essential
groups are imported for data with high worth and make about their all the more simple way for
computation. The used for group action performed on the datasets (Ariannezhad and et.al.,
2021). This could likewise significant for thinking about the Excel in which they can without
much of a stretch ensure for having some appropriate information assortment and hotspots for
consummation about the given their legitimate information assortment.
Document Page
from the above report been come to think about how the sources have been taken with having the
high deals and benefits. While having such things this could likewise significant for thinking
about how the association is functioning for their client and making their superstore for working
with all the more high. This could likewise fundamental for taking the deals and benefits
information for the association while in which they can undoubtedly come to think about they
can work and deal with their work with having many terms and condition (Karlstetter and et.al.,
2021). This may be significant for taking their appropriate deals about their items and
administrations for which they can undoubtedly ponder numerous procedure so that could assist
them for expanding their business volume with 76% in year and ensure for having their high
assessment about different things. The dataset has expanding all the more high volumes in string
plan. The simulated intelligence model just allowed arithmetical characteristics. The string
changed over into the numerical characteristics regard techniques. While such sort of things this
could likewise significant for thinking about their eachyears deals volumes in which they can
work and ensure about their business volume to be development. 34.56 deals have been taken
their high dynamic places and keeping many significant viewpoints about their sources in which
those terms and condition can work. In such sort of things this could likewise being so useful for
them for about the benefits can expanded while by saving worker for working with numerous
exercises in association. 45% to 89% superstore can undoubtedly their items and keeping all the
more high innovation measure for which having many terms without much of a stretch come to
think about their market technique.
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
From the graph, this could seems that organization need to keep their profit with high
values and easily make sure about their more working things. This might be also important for
taking the employee for working with many things and make sure about their those effective
places for which they can work. This might be also essential for keeping the working process and
procures in which they can easily have their various activities. This might be playing their role
for completion goals and make more successfully towards those places which could having more
termination and sanction about their profits. In keeping more attraction towards their profits
superstore needs to work with more technology and make sure for their sustainability products
and services (List, Sims and Forkel, 2021). Along with this also come to know about their those
things which could keeping them for working with many terms and policies and also keeping
them with more effective terms. Technology must be important for knowing those things and
that could also important for the organization about their customer. As increasing the sales this
could also increased with having many benefits and keeping their customer with long term. As
74% to 86% superstore need to increase their products and services in which the customer can
easily gets attraction and also come to know about their high impact over through which they
can work. This might be also important among the market position for taking many terms and
make sure for their valuable things and that could also be effective. 34.01 this could be profits
which could be increasing and keeping their customer with working more process and make sure
about their more policies . As the organization can also work with employee while by having
many strategy for employee and also working with many new things in organization.
Document Page
This could be appears to be that high successful places and make effective framework for
their functioning framework (Felkel and et.al., 2021). While for keeping 45%of deals volume in
2009-2010 and for this association need to keep their development about the 78% for having the
high sales and for that they can without much of a stretch come to think about the business
volume that could be development. In such sort of exercises for running superstore the
association are needed to high benefits in given the year. While according to having the high
interaction for the client where they can without much of a stretch come to think about their
those successful spots for which client can keep their legitimate fascination. In such sort of
exercises the association are needed to work with having more benefits in informational index.
Organizing reports, particularly execution the executives plan, will uncover when to explore all
the more high data. Overall, data and interpretation happens previous conveying a report record
or part fundamental noticing or judgment work out (Gronau, 2021). Regardless, experts
recommend that orchestrate these cycles component of typical assessment attempts will
additionally foster task learning and help with versatile administration, therefore further
introductory endeavor execution all. It very well may be likewise proposed that interface with
key accomplices unraveling results to guarantee data use and decisions that are reasonable close
by setting. It is reliably significant endorse or guineas pigs and ideas make since assessment and
comprehension. This endorsement ought to be probable with many accomplices, checking the
endeavor individuals themselves.
Document Page
The whole data isolated into part of identical size and subsequently various methods are
performed work. One can replace all in arrangement segment mean or breaking point regards can
be used to complete the task. Since data evacuation plan that used arrangement with gigantic
proportion of all together. While working with colossal volume of data, hypothesis test turned
out to be all the more perseveringly in such cases. It intends to extend the limit expertise and
diminish data storing and study costs. Since data is routinely taken from different sources which
are conventionally not entirely trustworthy and that different associations, the larger part our
time consumed in overseeing data worth issues when managing an AI issue. It is basically crazy
to expect that the data will be great (Singh and et.al, 2021). There may be issues in light of
human slip-up, restrictions of gauge devices, surrenders data assortment measure. A genuine data
all things considered contains uproars and maybe unusable plan which can't be effectively used
for different sorts of models. Data pre-handling is required endeavors for association the data and
making it sensible for which moreover grows the precision and efficiency. Dataset may
associations for different purposes, for instance, accepting need to make for business reason, by
having some of the proper dataset will be particular with dataset required for liver patient. in
such kind of things this might be also shown that employment rate should be also proper for
which the organization and their countries can work.
PART 2
The smile_clinic.csv provided in conjunction with SPSS
Gender and age analysis
Gender
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Frequency Percent Valid
Percent
Cumulative
Percent
Valid
Male 50 50.0 50.0 50.0
Female 50 50.0 50.0 100.0
Total 100 100.0 100.0
Among the 100 participants 50% of people are male and remaining 50 are female. The
mean value of age group is 1.5 which shows that on the basis of gender population can be
divided equally into groups.
Statistics
Gender Age
N Valid 100 100
Missing 0 0
Mean 1.5000 20.3500
Median 1.5000 19.0000
Mode 1.00a 22.00
a. Multiple modes exist. The smallest
value is shown
The average age of customers is of 20 years. The median age group of chosen customers
is 19 years and their mode value is 22 years.
Gender
Frequency Percent Valid Percent Cumulative
Percent
Valid Male 50 50.0 50.0 50.0
Document Page
Female 50 50.0 50.0 100.0
Total 100 100.0 100.0
Age
Frequency Percent Valid Percent Cumulative
Percent
Valid
13.00 5 5.0 5.0 5.0
15.00 5 5.0 5.0 10.0
17.00 8 8.0 8.0 18.0
18.00 13 13.0 13.0 31.0
19.00 20 20.0 20.0 51.0
20.00 5 5.0 5.0 56.0
22.00 21 21.0 21.0 77.0
23.00 1 1.0 1.0 78.0
25.00 12 12.0 12.0 90.0
26.00 10 10.0 10.0 100.0
Total 100 100.0 100.0
The average age of the customers is 22 years old and it comprises nearly 21% of the total
number of customers. The customers of age group 18 years and 25 years also make good
proportion of the population.
Document Page
From the graph it can be clearly seen that majority of the customers are in age group 19
and 22 years and minimum number of customers lies in the age group of 23 years.
Number of customers eating rice
Statistics
Customerseatingriceornot
N Valid 100
Missing 0
Mean .6000
Median 1.0000
Mode 1.00
Customerseatingriceornot
Frequency Percent Valid Percent Cumulative
Percent
Valid
No 40 40.0 40.0 40.0
Yes 60 60.0 60.0 100.0
Total 100 100.0 100.0
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Out of 100 participants 60% customers used to prefer rice and they have it in their daily
meal. Contrary to this 40 people does not include rich in their meal. It is also indicated by the
mean value of 0.60
For k means clustering at first classify option is selected in analysis menu. After this a dialogue
box is opened in which all variables are selected as classify and iterate.
Document Page
On clicking ok output window is opened and results can be interpreted.
Document Page
Quick Cluster
Initial Cluster Centers
Cluster
1 2
Gender 1.00 2.00
Age 13.00 26.00
Customerseatingriceornot 1.00 .00
Iteration Historya
Iteration Change in Cluster Centers
1 2
1 4.489 2.731
2 .228 .384
3 .000 .000
a. Convergence achieved due to no or
small change in cluster centers. The
maximum absolute coordinate change
for any center is .000. The current
iteration is 3. The minimum distance
between initial centers is 13.077.
Final Cluster Centers
Cluster
1 2
Gender 1.46 1.55
Age 17.68 23.75
Customerseatingriceornot .64 .55
Number of Cases in each Cluster
Cluster 1 56.000
2 44.000
Valid 100.000
Missing .000
tabler-icon-diamond-filled.svg

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
The most common data mining methods
Data mining has opened universe of chance for business. This field computational limit
looks incredible numerous private pieces of in grouping and used by association to distinguish
and anticipate purchaser way. Distinctive technique like backslide examination and bundling,
portrayal, and inconsistency assessment are pragmatic to data to know supportive outcomes.
These systems use encoding and backend estimation that take apart in rank and show plans. Here
are the case of the information mining like showcasing, retailing, banking, medication, etc. This
could help the business for thinking about their some significant ideas and ensure for their high
effects over through which business can work (Lastari and Kusriyah, 2021). Information digging
is generally utilized for the association in building more about the showcasing technique, web
based business setting through selling about their items sites and a lot more different things.
Information mining likewise make the business for having the high development and finishing
over through which the course of assessment in keeping the high assessment about those things
which could be more significant. As results for having the high effect and ensure for having their
those ideas wherein hazard can be diminished. This assistance the store for the retailing industry
wherein they can work with having the appropriate correspondence about their more viable
sources (Nguyen and Duong, 2021). Also, offering more items and administrations or make their
more interest high and limits and making their high benefits for which the deals can expanded.
The advantages/disadvantages of SPSS
SPSS that used for quantitative data assessment. It very well may be not having request
line feature yet rather that totally point and click and modestly looks like Microsoft Excel. In
spite of truth that it looks extraordinary arrangement like Excel, it can settlement with greater
enlightening assortments speedier and missing scarcely taking finger. One of the essential fights
about SPSS prohibitive costly to use, with solitary groups. Most top investigation workplaces use
SPSS to cut separated audit data thus they can exploit their evaluation projects (Rioux and Little,
2021). A non-specialized individual that could likewise help them for getting familiar with the
online framework and ensure for having their high assessment about those sources. Benefit can
be : speedy and simple attempt to learn, incredible UI, can without much of a stretch handle the
huge information sum. Impediment can be the most costly, restricted practically, basically the
same as dominate. This may be likewise appears to be that attempt to learn numerous new things
however this could be more costly in utilizing new things. Alongside this additionally ensure for
Document Page
having their numerous things and that may be likewise significant for taking those successful
spots and that may be additionally helpful in business and saving their factor for working with
many terms (Alji and Chougdali, 2021). Data appears to be that could likewise make their high
guidelines for which having those exercises which could make their learning more simpler. In
such sort of exercises this could likewise essential for creating numerous things.
CONCLUSION
From the above report it had been seems that, data handling might be important for the
organization while through which this come to know about those research about various data. In
such kind of things this might be important for knowing about those concepts which could be
important for the employee who are working at their sector. This could keep them for having
many sufficient concepts and make sure about those strategy which should the organization and
keep for their customer for having long term relationship. Mean and mode that could help the
smile_clinic.csv that could know about how they have to usually deal with any situation. This
could also essential for them in knowing about many effective data handling. Various methods of
data mining which could be used in marketing, retailing and various other business purpose.
Advantage and disadvantage of the SPSS that could keep more focus towards the business
context.
Document Page
REFERENCES
Books and journals
Ariannezhad, A. and et.al., 2021. Handling imbalanced data for real-time crash prediction:
application of boosting and sampling techniques. Journal of Transportation Engineering,
Part A: Systems. 147(3). p.04020165.
List, J.M., Sims, N.A. and Forkel, R., 2021. Toward a sustainable handling of interlinear-glossed
text in language documentation. ACM Transactions on Asian and Low-Resource
Language Information Processing. 20(2). pp.1-15.
Singh, G. and et.al, 2021. Handling Non-Local Executions to Improve MapReduce Performance
Using Ant Colony Optimization. IEEE Access. 9. pp.96176-96188.
Lastari, S. and Kusriyah, S., 2021. Police Discretion Policy In Handling Middle/Minor Crimes
(Tipiring) Based On Justice Value. Law Development Journal. 3(3).
Rioux, C. and Little, T.D., 2021. Missing data treatments in intervention studies: What was, what
is, and what should be. International Journal of Behavioral Development. 45(1). pp.51-
58.
Nguyen, N.N. and Duong, A.T., 2021. Comparison of Two Main Approaches for Handling
Imbalanced Data in Churn Prediction Problem. J. Adv. Inf. Technol. Vol, 12, pp.1-7.
Felkel, R. and et.al., 2021. From laboratory to real life: Fraport’s approach to applying artificial
intelligence in airside operations and ground handling. Journal of Airport
Management. 15(3). pp.266-279.
Karlstetter, R. and et.al., 2021, May. Living on the Edge: Efficient Handling of Large Scale
Sensor Data. In 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and
Internet Computing (CCGrid) (pp. 1-10). IEEE.
Alji, M. and Chougdali, K., 2021, June. File Slack Handling Tool. In 2021 International
Conference on Cyber Situational Awareness, Data Analytics and Assessment
(CyberSA) (pp. 1-3). IEEE.
Gronau, N., 2021, July. Modeling the Handling of Knowledge for Industry 4.0. In International
Symposium on Business Modeling and Software Design (pp. 207-223). Springer, Cham.
1
tabler-icon-diamond-filled.svg

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
2
chevron_up_icon
1 out of 17
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]