This report explores data analysis techniques to transform raw data into a more understandable form. It focuses on analyzing humidity level data using tables, charts, and statistical tools. The report also predicts humidity levels for future days.
Contribute Materials
Your contribution can guide someone’s learning journey. Share your
documents today.
DATA ANALYSIS TECHNIQUES
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
Contents INTRODUCTION...........................................................................................................................1 MAIN BODY..................................................................................................................................1 Arranging data in a table format..................................................................................................1 Presenting data in a Chart format................................................................................................1 Calculation of descriptive statistical tools...................................................................................2 Calculation of humidity level for day 15 and day 20..................................................................4 CONCLUSION................................................................................................................................6 REFERENCES................................................................................................................................7
INTRODUCTION Data analysis techniques are the procedures which enable an individual to transform a raw data into much more understandable form. There are various data analysis techniques such as presentation, forecasting, statistics and many more (Pole, West and Harrison, 2018). This report is developed with the aim of analysing the humidity level data of ten days by predicting the humidity level of day 15 and 20. This report combines presentation of data using table format and graphs along with which analysis of that data using central tendency tools of mean, mode, median, range and standard deviation. MAIN BODY Arranging data in a table format Humidity percent data of the region Liverpool, United Kingdom is procured at a uniform time of 06:00 AM(Humidity level in Liverpool Merseyside, United Kingdom,2019). DateHumidity level 25-Dec-1990% 26-Dec-1983% 27-Dec-1995% 28-Dec-1995% 29-Dec-1993% 30-Dec-1990% 31-Dec-1992% 01-Jan-2093% 02-Jan-2092% 03-Jan-2071% Presenting data in a Chart format The data which is presented above in a table is below represented using abar graphand a scatter diagramwhich helps to understand the humidity level fluctuations better. 1
Calculation of descriptive statistical tools Mean FormulaSum of the all values / Total number of the values M = Σx/n Calculation= 894% / 10 = 89.4 or 89% 2
Paraphrase This Document
Need a fresh take? Get an instant paraphrase of this document with our AI Paraphraser
Mean is the average value of the frequencies present in a data set. In the present case, Mean is calculated by totalling the values of humidity level i.e., 894% and then dividing it with total number of days i.e., 10. The determined mean of 89% shows that the average humidity level of the region of Liverpool is 89% which is considerably high, interpreting Liverpool to be a humid region. Median Formula(n + 1)/2th position Calculation= (10+1) / 2 = 5.5th position = 92% Median refers to the middle point frequency of a data set which categorizes the whole set into two equal categories (Afshar and Bigdeli, 2011). In the present case, all the frequencies are first cleansed by converting them in an ascending order. After which the frequency of 5.5thor 6th position is considered as a median that is 92%. Mode Mode is usually the most repeated frequency value in a data set. This measure helps to understand the humid level which is most experienced in the region of Liverpool (Bisgaard and Kulahci, 2011). By observing the data set, it has been seen that 90% humid level is recurring two times, hence considered as mode. Range FormulaMaximum band value – Minimum band value Calculation95% - 71% = 24% Range is the difference between maximum and minimum frequency value which shows the range by which humid level of Liverpool is dispersed. The maximum humid level Liverpool experienced from 25 December 2019 to 3 January 2020 is 95% and 71% to be the minimum. These values have a difference of 24% which is the range of this data set. Standard deviation FormulaStandard Deviations =√(variance) Variance2= {∑ (x – mean) / N}2 = {∑ (x2/ N – (mean)2} 3
Calculation= {804% / 10 – (89%)2} = {80.4% – 80%} = 0.4 Std. Dev. =√0.4 Standard deviation = 0.63 Standard deviation is the dispersion by which frequencies of the data set disperse from its average mean. In the present case, the measure is calculated as 0.63 which implies that the dispersion rate of humid level in Liverpool is low and the fluctuations in humidity in this region is lower. Calculation of humidity level for day 15 and day 20 Calculation of m value DateDay (X) Humidity data (%) (Y)XYX^2 25-Dec-19190%90%1 26-Dec-19283%166%4 27-Dec-19395%285%9 28-Dec-19495%380%16 29-Dec-19593%465%25 30-Dec-19690%540%36 31-Dec-19792%644%49 01-Jan-20893%744%64 02-Jan-20992%828%81 03-Jan-201071%710%100 Total55894%4852%385 ParticularsDetails mNΣxy – Σx Σy / NΣ x ^ 2 – (Σx) ^ 2 (10 * 4852) - (55 * 894) / (10 * 385) - (55) ^ 2 (48520 - 49170) / (3850 - 3025) 4
-0.78 The “m” variable is the sum combination of days and humidity level which acts as a base to predict future humidity level. In the present case of Liverpool, the equation of “y = mx + c” is used by which the value of m is computed as -0.78. Calculation of c value ParticularsDetails cΣy - mΣx / N (894 - ( - 0.78 * 55)) / 10 93.69 The variable “c” is the predictive date by which the future humid level can increase or decrease (Croushore, 2011). This variable in the case of Liverpool is determined as 93.69. Forecasting humidity level for 15thand 20thday Forecast of 15th day y= mx + c y-0.78 (x) + 93.69 x15 y0.78 (15) + 93.69 81.99 Humidity level of ten consecutive days for the region of Liverpool is considered in this data analysis. Using these known frequency levels, the humidity for day 15this determined as 81.99 or 82%. Forecast of 20th day y= mx + c y-0.78 (x) + 93.69 x20 y-0.78 (20) + 93.69 78.09 The humidity level for the 20thday is calculated as 78.09 or 78%. This disperse humid level shows that there are high chances of experiencing fluctuations in the future. 5
Secure Best Marks with AI Grader
Need help grading? Try our AI Grader for instant feedback on your assignments.
CONCLUSION From the above report, it has been concluded that the procedure of forecasting is essential in studyofdataanalysisasitdoesnotonlyhelpinpresentingthedatainmuchmore understandable form but also help in predicting the future values of that data set. It is observed and found from above analysis report that effective usage of statistical and Microsoft Excel tool, the raw data cannot only be effectively presented but can also be transformed. 6
REFERENCES Books and Journals Afshar, K. and Bigdeli, N., 2011. Data analysis and short term load forecasting in Iran electricity market using singular spectral analysis (SSA). Energy. 36(5). pp.2620-2627. Bisgaard, S. and Kulahci, M., 2011. Time series analysis and forecasting by example. John Wiley & Sons. Croushore, D., 2011. Frontiers of real-time data analysis. Journal of economic literature. 49(1). pp.72-100. Pole, A., West, M. and Harrison, J., 2018. Applied Bayesian forecasting and time series analysis. Chapman and Hall/CRC. Online Humidity level in Liverpool Merseyside, United Kingdom.2019. [Online]. Available through: <https://www.worldweatheronline.com/liverpool-weather-history/merseyside/gb.aspx> 7