Episode 2: Statistics
Is this Data Science?
Use metaflow to load the movie metadata CSV file into a Pandas Dataframe and compute some movie genre-specific statistics. These statistics are then used in later examples to improve our playlist generator. You can optionally use the Metaflow client to eyeball the results in a Notebook, and make some simple plots using the Matplotlib library.
You can find the tutorial code on GitHub
- Fan-out over a set of parameters using Metaflow foreach.
- Using external packages like Pandas.
- Plotting results in a Notebook.
Before playing this episode:
python -m pip install pandas
python -m pip install notebook
python -m pip install matplotlib
To play this episode:
python 02-statistics/stats.py show
python 02-statistics/stats.py run