%matplotlib inline

import pandas as pd; pd.set_option('max_columns', 6)
import seaborn as sns
import matplotlib.pyplot as plt


colleges = pd.read_csv('colleges.csv')


colleges.shape

(1946, 7)


from IPython.core.display import display, HTML
display(HTML(colleges.head(10).to_html()))


colleges.groupby('_____').agg(['mean', 'std']).shape

(55, 2)


colleges.groupby('_____').agg({'cases':sum})


#!conda install geoplot -c conda-forge
#!conda install geopandas


import geopandas as gpd
import geoplot as gplt


usa_cities = gpd.read_file(gplt.datasets.get_path('usa_cities'))
usa_cities.head()


continental_usa_cities = usa_cities.query('STATE not in ["HI", "AK", "PR"]')
gplt.pointplot(continental_usa_cities);

<AxesSubplot:>


contiguous_usa = gpd.read_file(gplt.datasets.get_path('contiguous_usa'))
ax = gplt.polyplot(contiguous_usa)
gplt.pointplot(continental_usa_cities, ax=ax);

<AxesSubplot:>


import geoplot.crs as gcrs
ax = gplt.webmap(contiguous_usa, projection=gcrs.WebMercator())
gplt.pointplot(continental_usa_cities, ax=ax)

<GeoAxesSubplot:>

	date	state	county	city	ipeds_id	college	cases
0	2021-05-26	Alabama	Madison	Huntsville	100654	Alabama A&M University	41
1	2021-05-26	Alabama	Montgomery	Montgomery	100724	Alabama State University	2
2	2021-05-26	Alabama	Limestone	Athens	100812	Athens State University	45
3	2021-05-26	Alabama	Lee	Auburn	100858	Auburn University	2742
4	2021-05-26	Alabama	Montgomery	Montgomery	100830	Auburn University at Montgomery	220
5	2021-05-26	Alabama	Walker	Jasper	102429	Bevill State Community College	4
6	2021-05-26	Alabama	Jefferson	Birmingham	100937	Birmingham-Southern College	263
7	2021-05-26	Alabama	Limestone	Tanner	101514	Calhoun Community College	137
8	2021-05-26	Alabama	Tallapoosa	Alexander City	100760	Central Alabama Community College	49
9	2021-05-26	Alabama	Coffee	Enterprise	101143	Enterprise State Community College	76

	cases
county
Abbeville	0
Acadia	149
Ada	1642
Adair	749
Adams	948
...	...
Yellow Medicine	93
Yellowstone	246
Yolo	678
York	804
Yuma	41

	id	POP_2010	ELEV_IN_FT	STATE	geometry
0	53	40888.0	1611.0	ND	POINT (-101.29627 48.23251)
1	101	52838.0	830.0	ND	POINT (-97.03285 47.92526)
2	153	15427.0	1407.0	ND	POINT (-98.70844 46.91054)
3	177	105549.0	902.0	ND	POINT (-96.78980 46.87719)
4	192	17787.0	2411.0	ND	POINT (-102.78962 46.87918)

Tracking Covid-19 at U.S. Colleges and Universities¶

This is an exercise meant to make you think of ways of grouping, aggregating, plotting, and presenting this data in a way that an audience would appreciate.¶

Data¶

Optional: add geospatial data and maps to your plots¶