Data Analysis Techniques: Assignment

Verified

Added on  2021/02/20

|11
|1571
|57
AI Summary

Contribute Materials

Your contribution can guide someone’s learning journey. Share your documents today.
Document Page
DATA ANALYSIS
TECHNIQUES

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
Document Page
Table of Contents
INTRODUCTION...........................................................................................................................1
MAIN BODY...................................................................................................................................1
1. Representation of data in tabular form....................................................................................1
2. Dara representation in charts...................................................................................................1
3. Calculations of mean, median, mode, standard deviation and range......................................3
4. Calculating values of m, c and wind forecast of day 14 and 21..............................................5
CONCLUSION ...............................................................................................................................6
Document Page

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
INTRODUCTION
In statistical term, data analysis is defined as the theoretical and practical methods and
procedures for improving efficiency and profitability (Chen and Yang, 2015). Data visualization
is often used to represent the information so that the helpful patterns in the information can be
easily found that ease the process of decision-making. In this project to better understand the
concept of data analysis wind speed of Liverpool for 10 days have been selected.
In this report different subjects like numeric and chart information, mean, median
calculation, mode, range, standard deviation are calculated. In addition to this m, the utilization
of c and station is also calculated.
MAIN BODY
1. Representation of data in tabular form
In the respective table mention below is the wind speed for the month of may of
Liverpool for 10 consecutive date (Wind speed of Nottingham, 2019).
Date Wind (km/h) (12 Am)
22/05/19 17
23/05/19 16
24/05/19 21
25/05/19 18
26/05/19 37
27/05/19 27
28/05/19 30
29/05/19 21
30/05/19 26
31/05/19 23
2. Dara representation in charts
Bar Graph: Using bars of various heights or intervals, a bar chart or bar graph is being
used to graphically portray data. The collected data is plotted either vertically or horizontally,
enabling spectators to easily create different beliefs and make predictions.
1
Document Page
1
2
3
4
5
6
7
8
9
10
0 5 10 15 20 25 30 35 40
17
16
21
18
37
27
30
21
26
23
Column E
From the above bar graph, it has been determined that on 26/05/2019 the highest wind
flow was recorded which is around 37 km/h. On the other side the lowest wind flow was
recorded on 23/05/2019.
Column chart: This chart is mainly used to compare the value of different categories
with the help of various vertical bar. Comparisons enable business users to evaluate the
performance of every other category comparative to another (Figueres-Esteban, Hughes and Van
Gulijk, 2015). The reader may achieve a detail knowledge about the top and bottom
classifications by figuring them in the respective columns. These graphs are best to be used over
a period of moment for visualization information sets. It is comparatively more normal for the a
specific user to understand modifications that occur over a period of moment.
2
Document Page
1 2 3 4 5 6 7 8 9 10
0
5
10
15
20
25
30
35
40
17 16
21
18
37
27
30
21
26
23
Column E
3. Calculations of mean, median, mode, standard deviation and range
Date Wind (km/h)
22/05/19 17
23/05/19 16
24/05/19 21
25/05/19 18
26/05/19 37
27/05/19 27
28/05/19 30
29/05/19 21
30/05/19 26
31/05/19 23
X 236
Mean 23.6
Median 37
Mode 21
Range 21
Maximum range 37
3

Paraphrase This Document

Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Document Page
Minimum 16
Mean: The numerical mean relates to the average which used to obtain the data's
arithmetic mean. It is calculated by adding total of the observation information, afterwards
dividing the sum by the amount of points.
Formula of mean:
∑ X / N
= 236 / 10
= 23.6
Mode: The model value is defined as the frequently occurring value within a discrete
unorganized variable series. In case if there are only limited variable within a series than the
value of mode can be zero and on the other side if there are more number of repeated value can
be mode of respective series. From the collected data of wind speed the mode value is 21 km/h.
Median: It is the sample with a distinct random sample which is depended conditions of
distribution whether it is even or odd (Marks, 2015). If the proportion of items is odd, the mid-
term valuation is the median, on the other side If the number of observation with series is even,
the median is the sum of the two range.
Formula of median:
When data series is odd= ( N +1 ) / 2
When data series is even= (N / 2)
Thus the median value is 23 km/h.
Range: The range of a specific distribution that contain number of different random
variables is the variation among the minimum and maximum value.
Formula of range:
Max – Min
= 37-16
= 21
Standard deviation: It is the measurement of the distribution in its mean of a collection
of information. It estimates a distribution's complete variation, the greater the absorption or
variation, the higher the standard deviation and the higher the scale of the value of variation from
its mean (Gatobu, Arocha and Hoffman-Goetz, 2016). It is also defined as square root-mean
deviation because it is obtained form square root of the variation that is mainly extracted from
4
Document Page
arithmetic mean. Standard deviation is often used in financial terms-to assess the potential risks
associated in an investment tool. This offers investors with a numerical foundation for making
choices about their financial sector investment.
Dates Wind (km/h) x- mean (x-m)2
22/05/2019 17 -6.6 43.56
23/05/2019 16 -7.6 57.76
24/05/2019 21 -2.6 6.76
25/05/2019 18 -5.6 31.36
26/05/2019 37 13.4 179.56
27/05/2019 27 3.4 11.56
28/05/2019 30 6.4 40.96
29/05/2019 21 -2.6 6.76
30/05/2019 26 2.4 5.76
31/05/2019 23 -0.6 0.36
Total 384.4
Mean 23.6
Variance 38.44
STDEV 6.2
Formula of standard deviation: √ (variance)
Variance = {∑(x – mean) 2 / N}
= 384.4/10
Thus variance = 38.44
Standard deviation is √ 38.44
Standard deviation = 6.2
4. Calculating values of m, c and wind forecast of day 14 and 21.
Days Wind (km/h) X2 ∑xy
1 17 1 17
2 16 4 64
3 21 9 189
4 18 16 288
5 37 25 925
5
Document Page
6 27 36 972
7 30 49 1470
8 21 64 1344
9 26 81 2106
10 23 100 2300
∑x= 55 ∑y= 236 ∑X2=385 ∑xy=9675
This model helps in determining the value of m in y = mx + c by taking the following steps:-
1. Value of M: M= N∑xy- ∑x∑y / N∑ X2 - (∑x)2
= 10*9675 – (55*236)/ 10*385- (55)2
= 96750- 12980/ 3850- 3025
= 83770/ 825
=101.53 of 101
2. Value of c: ∑y- m ∑x/ N
=236- 101*55 /10
= 319.5 or (319)
3. With the help of calculated 'm' and 'c' values wind speed is forecasted
Forecast wind for 14 day Y= mx+c
Y= 101*14+(-319)= 45.625
= So the forecasted value of day 14 is 45.625 or 46 km/h.
Forecast wind for 21: Y= mx+c
Y= 101*21+ (-319)
= 30.03 , thus the wind speed for day 21 will be approx 30 km/h
CONCLUSION
From the above report, it has been concluded that to recognize and evaluate functional
information and pattern, data is being is obtained and classified that help in easy analysis
according to organisational needs. With the help of different statistical method the value of
mean, mode, median, range and standard deviation have been measured that ease in decision
making. By applying liner forecasting model the value of day 14 and 21 has been determined.
6

Secure Best Marks with AI Grader

Need help grading? Try our AI Grader for instant feedback on your assignments.
Document Page
REFERENCES
Books and Journals:
Chen, Y. and Yang, Z. J., 2015. Message formats, numeracy, risk perceptions of alcohol-
attributable cancer, and intentions for binge drinking among college students. Journal of
drug education. 45(1). pp.37-55.
Figueres-Esteban, M., Hughes, P. and Van Gulijk, C., 2015, September. The role of data
visualization in railway big data risk analysis. In Proceedings of the 25th European
Safety and Reliability Conference, ESREL 2015 (pp. 2877-2882). CRC Press/Balkema.
Gatobu, S. K., Arocha, J. F. and Hoffman-Goetz, L., 2016. Numeracy, health numeracy, and
older immigrants’ primary language: an observation-oriented exploration. Basic and
Applied Social Psychology. 38(4). pp.185-199.
Marks, G. N., 2015. School sector differences in student achievement in Australian primary and
secondary schools: A longitudinal analysis. Journal of School Choice. 9(2). pp.219-238.
Online
Wind speed of Nottingham. 2019. [Online]. Available through:
<https://www.timeanddate.com/weather/uk/nottingham/historic?month=4&year=2019>
7
1 out of 11
circle_padding
hide_on_mobile
zoom_out_icon
[object Object]

Your All-in-One AI-Powered Toolkit for Academic Success.

Available 24*7 on WhatsApp / Email

[object Object]