Episode 2: Statistics
Is this Data Science?
Use metaflow to load the movie metadata CSV file into a dataframe and compute some movie genre-specific statistics. These statistics are then used in later examples to improve our playlist generator. You can optionally use the Metaflow client to eyeball the results in a Notebook, and make some simple plots using the Matplotlib library.
You can find the tutorial code on GitHub
Showcasing:
- Fan-out over a set of parameters using Metaflow foreach.
- Plotting results in a Notebook.
Before playing this episode:
python -m pip install notebook
python -m pip install matplotlib
To play this episode:
cd metaflow-tutorials
python 02-statistics/stats.py show
python 02-statistics/stats.py run
jupyter-notebook 02-statistics/stats.ipynb