The University of Sydney DATA2001: Data Cleaning and Exploration
VerifiedAdded on 2022/01/21
|73
|3869
|94
Homework Assignment
AI Summary
This document presents an assignment solution for DATA2001, focusing on data cleaning and exploration using Python. It covers the Python environment, including Jupyter Notebooks, and the use of Pandas for data manipulation and analysis. The assignment delves into data types, levels of measurement (nominal, ordinal, interval, and ratio), and measures of central tendency and dispersion. It explores data acquisition, cleaning, and transformation techniques, including handling missing data and converting data types. The solution emphasizes the use of Python libraries like Pandas and the importance of exploratory analysis workflows. The document also discusses the use of csv and pandas for reading data, and the handling of missing data. It also covers the use of functions to convert values in a given column. The assignment provides a practical guide to cleaning and preparing data for analysis using Python tools, with examples and explanations to facilitate understanding. This assignment is a valuable resource for students learning data science and big data concepts.
1 out of 73












