Home

Awesome

sars2pack

<!-- badges: start -->

codecov test-coverage

<!-- badges: end -->

Overview

The sars2pack R package provides one-line access to over 40 COVID-related datasets. Datasets are accessed in real time directly from their sources and then transformed to tidy-data form where possible and applicable. The result of each dataset accessor is a ready-to-use R dataset, often a dataframe. Documentation includes dataset descriptions, sources and references, and examples. Online documentation is available in two locations:

Questions addressed by sars2pack

Installation

# If you do not have BiocManager installed:
install.packages('BiocManager')

# Then, if sars2pack is not already installed:
BiocManager::install('seandavi/sars2pack')

After the one-time installation, load the packge to get started.

library(sars2pack)

Available datasets

<table class="table table-striped" style="margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:left;"> name </th> <th style="text-align:left;"> accessor </th> <th style="text-align:left;"> data\_type </th> <th style="text-align:left;"> geographical </th> <th style="text-align:left;"> geospatial </th> <th style="text-align:left;"> region </th> <th style="text-align:left;"> resolution </th> <th style="text-align:left;"> url </th> </tr> </thead> <tbody> <tr> <td style="text-align:left;"> United States county-level geographic details </td> <td style="text-align:left;"> us\_county\_geo\_details </td> <td style="text-align:left;"> c(“demographics”, “geographic”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin2 </td> <td style="text-align:left;"> [LINK](https://github.com/josh-byster/fips_lat_long) </td> </tr> <tr> <td style="text-align:left;"> OECD International Unemployment Data </td> <td style="text-align:left;"> oecd\_unemployment\_data </td> <td style="text-align:left;"> c(“economics”, “time series”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> admin0 </td> <td style="text-align:left;"> [LINK](https://oecd.org) </td> </tr> <tr> <td style="text-align:left;"> healthdata.org COVID-19 Mobility Observations and Projections </td> <td style="text-align:left;"> healthdata\_mobility\_data </td> <td style="text-align:left;"> c(“mobility”, “time series”, “projections”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> International </td> <td style="text-align:left;"> c(“admin0”, “admin1”) </td> <td style="text-align:left;"> [LINK](https://covid19.healthdata.org/projections) </td> </tr> <tr> <td style="text-align:left;"> healthdata.org COVID-19 Testing Observations and Projections </td> <td style="text-align:left;"> healthdata\_testing\_data </td> <td style="text-align:left;"> c(“testing”, “time series”, “projections”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> International </td> <td style="text-align:left;"> c(“admin0”, “admin1”) </td> <td style="text-align:left;"> [LINK](https://covid19.healthdata.org/projections) </td> </tr> <tr> <td style="text-align:left;"> Our World In Data testing and cases reporting </td> <td style="text-align:left;"> owid\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”, “testing”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> admin0 </td> <td style="text-align:left;"> [LINK](https://ourworldindata.org/coronavirus) </td> </tr> <tr> <td style="text-align:left;"> CovidTracker data </td> <td style="text-align:left;"> covidtracker\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”, “testing”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin1 </td> <td style="text-align:left;"> [LINK](https://covidtracking.com/) </td> </tr> <tr> <td style="text-align:left;"> European CDC world tracking </td> <td style="text-align:left;"> ecdc\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> admin0 </td> <td style="text-align:left;"> [LINK](https://www.ecdc.europa.eu/en/covid-19) </td> </tr> <tr> <td style="text-align:left;"> EU data Github aggregator </td> <td style="text-align:left;"> eu\_data\_cache\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> Europe </td> <td style="text-align:left;"> c(“admin0”, “admin1”) </td> <td style="text-align:left;"> [LINK](https://github.com/covid19-eu-zh/covid19-eu-data) </td> </tr> <tr> <td style="text-align:left;"> USA Facts </td> <td style="text-align:left;"> usa\_facts\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin1 </td> <td style="text-align:left;"> [LINK](https://usafacts.org/visualizations/coronavirus-covid-19-spread-map/) </td> </tr> <tr> <td style="text-align:left;"> Johns Hopkins dataset </td> <td style="text-align:left;"> jhu\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> admin0 </td> <td style="text-align:left;"> [LINK](https://github.com/CSSEGISandData/COVID-19) </td> </tr> <tr> <td style="text-align:left;"> Johns Hopkins US-centric data </td> <td style="text-align:left;"> jhu\_us\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> c(“admin1”, “admin2”) </td> <td style="text-align:left;"> [LINK](https://github.com/CSSEGISandData/COVID-19) </td> </tr> <tr> <td style="text-align:left;"> New York Times county level data </td> <td style="text-align:left;"> nytimes\_county\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin2 </td> <td style="text-align:left;"> [LINK](https://raw.githubusercontent.com/nytimes/covid-19-data) </td> </tr> <tr> <td style="text-align:left;"> New York Times state level data </td> <td style="text-align:left;"> nytimes\_state\_data </td> <td style="text-align:left;"> c(“time series”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin1 </td> <td style="text-align:left;"> [LINK](https://raw.githubusercontent.com/nytimes/covid-19-data) </td> </tr> <tr> <td style="text-align:left;"> The Economist: Excess deaths during COVID pandemic </td> <td style="text-align:left;"> economist\_excess\_deaths </td> <td style="text-align:left;"> c(“time series”, “deaths”, “excess deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> International </td> <td style="text-align:left;"> c(“admin0”, “admin1”) </td> <td style="text-align:left;"> [LINK](https://github.com/TheEconomist/covid-19-excess-deaths-tracker) </td> </tr> <tr> <td style="text-align:left;"> The : Excess deaths during COVID pandemic </td> <td style="text-align:left;"> financial\_times\_excess\_deaths </td> <td style="text-align:left;"> c(“time series”, “deaths”, “excess deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> International </td> <td style="text-align:left;"> c(“admin0”, “admin1”) </td> <td style="text-align:left;"> [LINK](https://github.com/Financial-Times/coronavirus-excess-mortality-data) </td> </tr> <tr> <td style="text-align:left;"> US CDC excess deaths dataset </td> <td style="text-align:left;"> cdc\_excess\_deaths </td> <td style="text-align:left;"> c(“time series”, “deaths”, “excess deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin1 </td> <td style="text-align:left;"> [LINK](https://www.cdc.gov/nchs/nvss/vsrr/covid19/excess_deaths.html) </td> </tr> <tr> <td style="text-align:left;"> Descartes Labs Mobility Data </td> <td style="text-align:left;"> descartes\_mobility\_data </td> <td style="text-align:left;"> c(“time series”, “mobility”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin1 </td> <td style="text-align:left;"> [LINK](https://raw.githubusercontent.com/descarteslabs/DL-COVID-19) </td> </tr> <tr> <td style="text-align:left;"> Apple mobility data from maps </td> <td style="text-align:left;"> apple\_mobility\_data </td> <td style="text-align:left;"> c(“time series”, “mobility”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> c(“admin0”, “admin1”, “admin2”, “admin3”) </td> <td style="text-align:left;"> [LINK](https://www.apple.com/covid19/mobility) </td> </tr> <tr> <td style="text-align:left;"> Healthdata.org projections of hospital utilization and deaths </td> <td style="text-align:left;"> healthdata\_projections\_data </td> <td style="text-align:left;"> c(“time series”, “projections”, “cases”, “deaths”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> c(“United States”, “World”) </td> <td style="text-align:left;"> c(“admin1”, “admin2”) </td> <td style="text-align:left;"> [LINK](http://www.healthdata.org/covid) </td> </tr> <tr> <td style="text-align:left;"> Healthdata.org mobility data </td> <td style="text-align:left;"> healthdata\_mobility\_data </td> <td style="text-align:left;"> c(“time series”, “projections”, “mobility”) </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> c(“United States”, “World”) </td> <td style="text-align:left;"> c(“admin1”, “admin2”) </td> <td style="text-align:left;"> [LINK](http://www.healthdata.org/covid) </td> </tr> <tr> <td style="text-align:left;"> United States CDC Social Vulnerability Index </td> <td style="text-align:left;"> cdc\_social\_vulnerability\_index </td> <td style="text-align:left;"> demographics </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin2 </td> <td style="text-align:left;"> [LINK](https://svi.cdc.gov/) </td> </tr> <tr> <td style="text-align:left;"> US county health rankings from ‘<https://www.countyhealthrankings.org>’ </td> <td style="text-align:left;"> us\_county\_health\_rankings </td> <td style="text-align:left;"> demographics </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> c(“admin0”, “admin1”, “admin2”) </td> <td style="text-align:left;"> [LINK](https://www.countyhealthrankings.org) </td> </tr> <tr> <td style="text-align:left;"> Country metadata from restcountries.eu </td> <td style="text-align:left;"> country\_metadata </td> <td style="text-align:left;"> demographics </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> admin0 </td> <td style="text-align:left;"> [LINK](https://restcountries.eu) </td> </tr> <tr> <td style="text-align:left;"> Extensive United States hospital capabilities </td> <td style="text-align:left;"> us\_hospital\_details </td> <td style="text-align:left;"> healthcare capacity </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> individual hospital </td> <td style="text-align:left;"> [LINK](https://hub.arcgis.com/datasets/geoplatform::hospitals) </td> </tr> <tr> <td style="text-align:left;"> Kaiser Family Foundation ICU bed data </td> <td style="text-align:left;"> kff\_icu\_beds </td> <td style="text-align:left;"> healthcare capacity </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> Individual hospital </td> <td style="text-align:left;"> [LINK](https://khn.org/news/as-coronavirus-spreads-widely-millions-of-older-americans-live-in-counties-with-no-icu-beds) </td> </tr> <tr> <td style="text-align:left;"> CovidCare United States Healthcare Capacity </td> <td style="text-align:left;"> us\_healthcare\_capacity </td> <td style="text-align:left;"> healthcare capacity </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> Individual hospital </td> <td style="text-align:left;"> [LINK](https://github.com/covidcaremap/covid19-healthsystemcapacity) </td> </tr> <tr> <td style="text-align:left;"> GISAID metadata from thousands of SARS-CoV-2 sequences </td> <td style="text-align:left;"> cov\_glue\_lineage\_data </td> <td style="text-align:left;"> line list </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> multiple </td> <td style="text-align:left;"> [LINK](https://github.com/hCoV-2019/lineages) </td> </tr> <tr> <td style="text-align:left;"> beoutbreakprepared </td> <td style="text-align:left;"> beoutbreakprepared\_data </td> <td style="text-align:left;"> line list </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> patient </td> <td style="text-align:left;"> [LINK](https://github.com/beoutbreakprepared/nCoV2019) </td> </tr> <tr> <td style="text-align:left;"> Published epidemic parameters for COVID-19 </td> <td style="text-align:left;"> param\_estimates\_published </td> <td style="text-align:left;"> miscellaneous </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> list() </td> <td style="text-align:left;"> list() </td> <td style="text-align:left;"> [LINK](https://github.com/midas-network/COVID-19/blob/master/parameter_estimates/2019_novel_coronavirus/estimates.csv) </td> </tr> <tr> <td style="text-align:left;"> Google mobility data </td> <td style="text-align:left;"> google\_mobility\_data </td> <td style="text-align:left;"> mobility </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> c(“admin0”, “admin1”, “admin2”) </td> <td style="text-align:left;"> [LINK](https://www.google.com/covid19/mobility/) </td> </tr> <tr> <td style="text-align:left;"> Newick tree from thousands of SARS-CoV-2 sequences </td> <td style="text-align:left;"> cov\_glue\_newick\_data </td> <td style="text-align:left;"> phylogenetic </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> multiple </td> <td style="text-align:left;"> [LINK](https://github.com/hCoV-2019/lineages) </td> </tr> <tr> <td style="text-align:left;"> Aggregated projections from US CDC </td> <td style="text-align:left;"> cdc\_aggregated\_projections </td> <td style="text-align:left;"> projections </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> list() </td> <td style="text-align:left;"> c(“admin0”, “admin1”) </td> <td style="text-align:left;"> [LINK](https://www.cdc.gov/coronavirus/2019-ncov/covid-data/forecasting-us.html) </td> </tr> <tr> <td style="text-align:left;"> CoronaNet government response database </td> <td style="text-align:left;"> coronanet\_government\_response\_data </td> <td style="text-align:left;"> public policy </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> c(“admin0”, “admin1”) </td> <td style="text-align:left;"> [LINK](https://coronanet-project.org/index.html) </td> </tr> <tr> <td style="text-align:left;"> Oxford Government Policy Intervention time series </td> <td style="text-align:left;"> government\_policy\_timeline </td> <td style="text-align:left;"> public policy </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> World </td> <td style="text-align:left;"> admin0 </td> <td style="text-align:left;"> [LINK](https://www.bsg.ox.ac.uk/research/research-projects/oxford-covid-19-government-response-tracker) </td> </tr> <tr> <td style="text-align:left;"> United States social distancing policies </td> <td style="text-align:left;"> us\_state\_distancing\_policy </td> <td style="text-align:left;"> public policy </td> <td style="text-align:left;"> TRUE </td> <td style="text-align:left;"> FALSE </td> <td style="text-align:left;"> United States </td> <td style="text-align:left;"> admin1 </td> <td style="text-align:left;"> [LINK](https://github.com/COVID19StatePolicy/SocialDistancing/) </td> </tr> </tbody> </table> Case tracking -------------

Updated tracking of city, county, state, national, and international confirmed cases, deaths, and testing is critical to driving policy, implementing interventions, and measuring their effectiveness. Case tracking datasets include date, a count of cases, and usually numerous other pieces of information related to location of reporting, etc.

Accessing case-tracking datasets is typically done with one function per dataset. The example here is data from the European Centers for Disease Control, or ECDC.

ecdc = ecdc_data()

Get a quick overview of the dataset.

head(ecdc)

## # A tibble: 6 x 8
## # Groups:   location_name, subset [6]
##   date       location_name iso2c iso3c population_2019 continent subset    count
##   <date>     <chr>         <chr> <chr>           <dbl> <chr>     <chr>     <dbl>
## 1 2019-12-31 Afghanistan   AF    AFG          38041757 Asia      confirmed     0
## 2 2019-12-31 Afghanistan   AF    AFG          38041757 Asia      deaths        0
## 3 2019-12-31 Algeria       DZ    DZA          43053054 Africa    confirmed     0
## 4 2019-12-31 Algeria       DZ    DZA          43053054 Africa    deaths        0
## 5 2019-12-31 Armenia       AM    ARM           2957728 Europe    confirmed     0
## 6 2019-12-31 Armenia       AM    ARM           2957728 Europe    deaths        0

The ecdc dataset is just a data.frame (actually, a tibble), so applying standard R or tidyverse functionality can get answers to basic questions with little code. The next code block generates a top10 of countries with the most deaths recorded to date. Note that if you do this on your own computer, the data will be updated to today’s data values.

library(dplyr)
top10 = ecdc %>% filter(subset=='deaths') %>% 
    group_by(location_name) %>%
    filter(count==max(count)) %>%
    arrange(desc(count)) %>%
    head(10) %>% select(-starts_with('iso'),-continent,-subset) %>%
    mutate(rate_per_100k = 1e5*count/population_2019)

Finally, present a nice table of those countries:

knitr::kable(
    top10,
    caption = "Reported COVID-19-related deaths in ten most affected countries.",
    format = 'pandoc')
<table> <caption>Reported COVID-19-related deaths in ten most affected countries.</caption> <thead> <tr class="header"> <th style="text-align: left;">date</th> <th style="text-align: left;">location_name</th> <th style="text-align: right;">population_2019</th> <th style="text-align: right;">count</th> <th style="text-align: right;">rate_per_100k</th> </tr> </thead> <tbody> <tr class="odd"> <td style="text-align: left;">2020-07-06</td> <td style="text-align: left;">United_States_of_America</td> <td style="text-align: right;">329064917</td> <td style="text-align: right;">129947</td> <td style="text-align: right;">39.489776</td> </tr> <tr class="even"> <td style="text-align: left;">2020-07-06</td> <td style="text-align: left;">Brazil</td> <td style="text-align: right;">211049519</td> <td style="text-align: right;">64867</td> <td style="text-align: right;">30.735441</td> </tr> <tr class="odd"> <td style="text-align: left;">2020-07-06</td> <td style="text-align: left;">United_Kingdom</td> <td style="text-align: right;">66647112</td> <td style="text-align: right;">44220</td> <td style="text-align: right;">66.349462</td> </tr> <tr class="even"> <td style="text-align: left;">2020-07-06</td> <td style="text-align: left;">Italy</td> <td style="text-align: right;">60359546</td> <td style="text-align: right;">34861</td> <td style="text-align: right;">57.755570</td> </tr> <tr class="odd"> <td style="text-align: left;">2020-07-06</td> <td style="text-align: left;">Mexico</td> <td style="text-align: right;">127575529</td> <td style="text-align: right;">30639</td> <td style="text-align: right;">24.016361</td> </tr> <tr class="even"> <td style="text-align: left;">2020-07-04</td> <td style="text-align: left;">France</td> <td style="text-align: right;">67012883</td> <td style="text-align: right;">29893</td> <td style="text-align: right;">44.607841</td> </tr> <tr class="odd"> <td style="text-align: left;">2020-07-05</td> <td style="text-align: left;">France</td> <td style="text-align: right;">67012883</td> <td style="text-align: right;">29893</td> <td style="text-align: right;">44.607841</td> </tr> <tr class="even"> <td style="text-align: left;">2020-07-06</td> <td style="text-align: left;">France</td> <td style="text-align: right;">67012883</td> <td style="text-align: right;">29893</td> <td style="text-align: right;">44.607841</td> </tr> <tr class="odd"> <td style="text-align: left;">2020-05-24</td> <td style="text-align: left;">Spain</td> <td style="text-align: right;">46937060</td> <td style="text-align: right;">28752</td> <td style="text-align: right;">61.256500</td> </tr> <tr class="even"> <td style="text-align: left;">2020-07-06</td> <td style="text-align: left;">India</td> <td style="text-align: right;">1366417756</td> <td style="text-align: right;">19693</td> <td style="text-align: right;">1.441214</td> </tr> </tbody> </table>

Examine the spread of the pandemic throughout the world by examining cumulative deaths reported for the top 10 countries above.

ecdc_top10 = ecdc %>% filter(location_name %in% top10$location_name & subset=='deaths')
plot_epicurve(ecdc_top10,
              filter_expression = count > 10, 
              color='location_name')
<img src="https://i.imgur.com/KilVPg5.png" width="100%" />

Comparing the features of disease spread is easiest if all curves are shifted to “start” at the same absolute level of infection. In this case, shift the origin for all countries to start at the first time point when more than 100 cumulative cases had been observed. Note how some curves cross others which is evidence of less infection control at the same relative time in the pandemic for that country (eg., Brazil).

ecdc_top10 %>% align_to_baseline(count>100,group_vars=c('location_name')) %>%
    plot_epicurve(date_column = 'index',color='location_name')
<img src="https://i.imgur.com/sDHPp2t.png" width="100%" />

Contributions

Pull requests are gladly accepted on Github.

Adding new datasets

See the Adding new datasets vignette.

Similar work