Data Analysis in R: Descriptive Stats, Histograms, and Relationships
VerifiedAdded on  2023/06/03
|8
|1109
|436
Practical Assignment
AI Summary
This assignment presents a comprehensive analysis of a dataset using the R programming language. The analysis begins with descriptive statistics, including measures of central tendency and dispersion, for continuous variables such as temperature, production, days, persons, and water usage. Histograms are used to examine the distributions of these variables, with observations on skewness. The analysis then delves into categorical variables, specifically supervisors, using bar charts to compare their performance. The average values for each supervisor are compared across several variables. Furthermore, the assignment explores the relationships between different variables using scatter plots. Specifically, the relationship between temperature and water usage, as well as the relationship between production cost and the number of persons on the monthly plant payroll, are investigated. The analysis concludes with a summary of the key findings, including the average production cost, the observed positive relationships between temperature and water usage, and between production cost and the number of persons on the payroll. The appendix includes the R code used for the analysis, including data loading, summary statistics, visualization, and aggregation.
1 out of 8