Use Metaflow to load the movie metadata CSV file into a pandas DataFrame and compute some genre-specific movie statistics. Later examples use these statistics to improve the playlist generator. Optionally, you can use the Metaflow client to inspect the results in a notebook and make some simple plots with the Matplotlib library.
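The load-and-aggregate idea described above can be sketched in plain pandas, outside of any flow. This is a minimal illustration, not the tutorial's actual code: the inline CSV data, the column names (movie_title, genres, gross), the pipe-separated genre format, and the helper name genre_statistics are all assumptions made for the example.

```python
import io

import pandas as pd

# Hypothetical stand-in for the movie metadata CSV. The column names and
# the pipe-separated genre list are assumptions for illustration only.
CSV_TEXT = """movie_title,genres,gross
Alpha,Action|Comedy,100
Bravo,Action,300
Charlie,Comedy,50
Delta,Comedy,150
"""


def genre_statistics(csv_source):
    """Return the per-genre count, mean, and median of the gross column."""
    df = pd.read_csv(csv_source)
    # Split the genre list so that each (movie, genre) pair gets its own row.
    df = df.assign(genres=df["genres"].str.split("|")).explode("genres")
    # Aggregate the gross values within each genre.
    return df.groupby("genres")["gross"].agg(["count", "mean", "median"])


stats = genre_statistics(io.StringIO(CSV_TEXT))
print(stats)
```

A movie tagged with several genres contributes to each of them here, which is one possible convention; a real dataset may call for a different one.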
You can find the tutorial code on GitHub.
Showcasing:
Fan-out over a set of parameters using Metaflow foreach.
Using external packages like Pandas.
Plotting results in a Notebook.
Before playing this episode:
python -m pip install pandas
python -m pip install notebook
python -m pip install matplotlib
To play this episode:
cd metaflow-tutorials
python 02-statistics/stats.py show
python 02-statistics/stats.py run
jupyter-notebook 02-statistics/stats.ipynb