DATA2001: Data Science, Big Data and Data Diversity Data
Added on - 21 Jan 2022
Trusted by 2+ million users,
1000+ happy students everyday
1000+ happy students everyday
Showing pages 1 to 8 of 73 pages
The University of SydneyPage1
DATA2001: Data Science,
Big Data and Data Diversity
Data Cleaning and Exploration
with Python
Presented by Alan Fekete
Material prepared by UweRoehm
School of Computer Science
DATA2001: Data Science,
Big Data and Data Diversity
Data Cleaning and Exploration
with Python
Presented by Alan Fekete
Material prepared by UweRoehm
School of Computer Science
The University of SydneyPage2
JupyterNotebooks:
The Python Environment in DATA2001
JupyterNotebooks:
The Python Environment in DATA2001
The University of SydneyPage3
JupyterNotebooks support interactive Data Science
with Python
–IPythoninteractive command shell offers:
–Introspection
–Tab completion
–Command history
–Jupyterruns in a browser and supports:
–Sharing and documenting of live code
–Data cleaning,visualisation, machine learning,...
–Jupyter’sgallery of interesting notebooks:
https://github.com/ipython/ipython/wiki/A-gallery-of-interesting-IPython-Notebooks
–We provideJupyterservers which run Python 3
https://ucpu0.ug.it.usyd.edu.au/(remember you need to be using VPN, if off campus)
JupyterNotebooks support interactive Data Science
with Python
–IPythoninteractive command shell offers:
–Introspection
–Tab completion
–Command history
–Jupyterruns in a browser and supports:
–Sharing and documenting of live code
–Data cleaning,visualisation, machine learning,...
–Jupyter’sgallery of interesting notebooks:
https://github.com/ipython/ipython/wiki/A-gallery-of-interesting-IPython-Notebooks
–We provideJupyterservers which run Python 3
https://ucpu0.ug.it.usyd.edu.au/(remember you need to be using VPN, if off campus)
The University of SydneyPage4
1.Click here for file
open dialogue
2.Click upload
next to file name
1.Click here for file
open dialogue
2.Click upload
next to file name
The University of SydneyPage5
Installing Python andJupyterusing Anaconda
–You can use ourJupyter
server
–but it can be slow
–If you wish, you
caninstall Python and
Jupyterprivately,eg
using
Anaconda Distribution,
which includes Python,
theJupyterNotebook,
and other commonly used
packages for scientific
computing and data
science.
Installing Python andJupyterusing Anaconda
–You can use ourJupyter
server
–but it can be slow
–If you wish, you
caninstall Python and
Jupyterprivately,eg
using
Anaconda Distribution,
which includes Python,
theJupyterNotebook,
and other commonly used
packages for scientific
computing and data
science.
The University of SydneyPage6
Python and Data Science Libraries
Python and Data Science Libraries
The University of SydneyPage7
Python background
–Students who did data1002: this should mostly be revision
–If you didn’t really master pandas, matplotlib before, do so now!
–Also note the following key differences
•More sophisticated ways to consider the kinds of data (not just
numerical/castegorical)
–Students who learned Python elsewhere (eginfi1110): you
need to learn how to use particular libraries (Pandas,
matplotlib etc) from the examples here, and online resources
Python background
–Students who did data1002: this should mostly be revision
–If you didn’t really master pandas, matplotlib before, do so now!
–Also note the following key differences
•More sophisticated ways to consider the kinds of data (not just
numerical/castegorical)
–Students who learned Python elsewhere (eginfi1110): you
need to learn how to use particular libraries (Pandas,
matplotlib etc) from the examples here, and online resources
The University of SydneyPage8
Python general concepts
–general program syntax
–variables and types
–integer and float numbers, string types, type conversion
–list, dictionary, tuple and set
–condition statements (if/elif/else)
–for loops, ranges
–functions
–print(),len(), lower(), upper(),...
–nesting of functions; example:print(len(str.upper() ) )
Python general concepts
–general program syntax
–variables and types
–integer and float numbers, string types, type conversion
–list, dictionary, tuple and set
–condition statements (if/elif/else)
–for loops, ranges
–functions
–print(),len(), lower(), upper(),...
–nesting of functions; example:print(len(str.upper() ) )

You’re reading a preview
To View Complete Document
Click the button to download
Subscribe to our plans