Two data files are not included in the repo, you can download them from: titles.csv
and cast.csv
and put them in the /data
folder.
To follow this tutorial you need to have the following packages installed:
pandas
version 0.18.0 or later: http://pandas.pydata.org/numpy
version 1.7 or later: http://www.numpy.org/matplotlib
version 1.3 or later: http://matplotlib.org/ipython
version 3.x with notebook support, or ipython 4.x
combined with jupyter
: http://ipython.orgseaborn
(this is used for some plotting, but not necessary to follow the tutorial): http://stanford.edu/~mwaskom/software/seaborn/If you have git installed, you can get the material in this tutorial by cloning this repo:
git clone https://github.com/jorisvandenbossche/pandas-tutorial.git
As an alternative, you can download it as a zip file:
https://github.com/jorisvandenbossche/pandas-tutorial/archive/master.zip.
I will probably make some changes until the start of the tutorial, so best to download
the latest version then (or do a git pull
if you are using git).
Two data files are not included in the repo, you can download them from: titles.csv
and cast.csv
and put them in the /data
folder.
Beginners track:
Advanced track: