This dataset, contagious_diseases_df
, is a data frame containing yearly counts
for Hepatitis A, Measles, Mumps, Pertussis, Polio, Rubella, and Smallpox for US states.
The original data is courtesy of the Tycho Project.
Usage
data(contagious_diseases_df)
Format
A data frame with 16,065 observations and 6 variables:
- disease
Factor with 7 levels indicating the disease type
- state
Factor with 51 levels indicating the US state
- year
Numeric vector indicating the year of observation
- weeks_reporting
Numeric vector indicating the number of weeks reported
- count
Numeric vector indicating the number of cases reported
- population
Numeric vector indicating the population of the state in that year
Source
Data taken from the dslabs package version 0.8.0. Original data courtesy of the Tycho Project (http://www.tycho.pitt.edu/).
Details
The dataset name has been kept as contagious_diseases_df
to avoid confusion with other datasets
in the R ecosystem. This naming convention helps distinguish this dataset as part of the
infectiousR package and assists users in identifying its specific characteristics.
The suffix _df
indicates that the dataset is a data frame. The original content has not been modified
in any way.