Home

Awesome

<!-- * @Author: your name * @Date: 2020-05-20 13:39:28 * @LastEditTime: 2020-12-07 14:27:17 * @LastEditors: Please set LastEditors * @Description: In User Settings Edit * @FilePath: \github_test\README.md -->

STC COVID-19 Dataset

License: CC BY 4.0

This data repository stores COVID-19 virus case and related natural and social factors (e.g. environmental observation, policy index) in multi-scale based on ISO standard.

Data Organization

Datasets are organized by region area ranging from global to countries as shown below. Underneath each folder, multi-scale daily reports and summary reports are provided separately.

Field Description

Daily Data

Daily data provides automatically updated information of COVID-19 cases, and related attributes daily.

Attribute NameDescriptionFormatExample
dateThe date representing the current day in which the data represents. UTC time is used for this dataset, all values will calculated before the end of UTC time of the date.Date (YYYY/MM/DD) in UTC2020/04/09
country_nameName of the country.stringUnited States
iso33 digit ISO country codes.varchar(3)USA
admin1_nameThe name for admin 1 level.stringVirginia
hasc1This will represent the Hierarchical administrative subdivision codes (HASC) for admin 1 level.stringUS.VA (for Virginia, United States)
local_id1This will represent the ID for specific admin 1 level. ID that represents the country's admin 1 levelstringVA (for Virginia, United States)
admin2_nameThe name for admin 2 level.stringFairfax County
hasc2This will represent the Hierarchical administrative subdivision codes (HASC) for admin 2 level.stringUS.VA.FX (for Fairfax, Virginia, United States)
local_id2This will represent the ID for specific admin 2 level. ID that represents the country's admin 2 level.string51059 (for Fairfax, Virginia, United States)
confirmedThe number of confirmed cases.integer777
deathThe number of death cases.integer19
recoveredThe number of recovered cases. (might be null for admin 2 level)integernull
MiscellaneousOther data attributed to our dataset.TBDTBD

Summary Data

Summary data records the COVID-19 cases, and related attributes, to show the timeline of cases.

Attribute NameDescriptionFormatExample
country_nameName of the country.string"US"
iso33 digit ISO country codes.varchar(3)USA
admin1_nameThe name for admin 1 level.stringState for USA
dateThe date representing the current day in which the data represents. UTC time is used for this dataset, all values will calculated before the end of UTC time of the date.UTCYYYY/MM/DD

Tutorial - Visualize Virus Cases on Map using QGIS

<img src="https://dl.dropboxusercontent.com/s/fjursbp8dwjpnkp/qgis_join_tutorial.jpg" width="60%">

Overall Data Sources by Country

Legend for data source and operation status

Country / RegionContinentAdmin levelData SourceTemporal CoverageOperation Status
GlobalGlobal0 2020/1/22 to current
United StatesNorth America1 , 2 admin0: 2020/1/22 to current, admin1: 2020/1/27 to current
ChinaAsia1 , 2 admin0: 2020/1/22 to current, admin1: 2020/1/24 to current
CanadaNorth America1 2020/1/26 to current
AustraliaOceania1 2020/1/27 to current
ItalyEurope1 , 2 2020/2/24 to current
GermanyEurope1 2020/2/29 to current
AustriaEurope1 2020/3/4 to current
BrazilSouth America1 2020/2/26 to current
ChileSouth America1 2020/3/2 to current
JapanAsia1 2020/1/15 to current
RussiaEurope1 2020/3/22 to current
South AfricaAfrica1 2020/3/5 to current
CroatiaEurope1 2020/3/21 to current
SwedenEurope1 2020/3/16 to current
IndiaAsia1 2020/3/10 to current
HungaryEurope1 2020/3/31 to current
DenmarkEurope1 2020/5/20 to current
UkraineEurope1 2020/4/5 to current
LatviaEurope1 2020/3/19 to current
AlbaniaEurope1 2020/4/22 to current
HaitiNorth America1 2020/3/19 to current
RomaniaEurope1 2020/4/2 to current
MexicoNorth America1 2020/4/25 to current
NigeriaAfrica1 2020/2/27 to current
PakistanAsia1 2020/3/10 to current
BoliviaSouth America1 2020/6/4 to 2020/7/29
GuatemalaNorth America1 2020/3/15 to 2020/8/14
El SalvadorNorth America1 2020/6/6 to 2020/7/4
SwitzerlandEurope1 2020/6/1 to 2020/8/10
BulgariaEurope1 2020/6/6 to 2020/8/10

Recommended Citation

@article{doi:10.1080/20964471.2020.1844934,
  author = { Dexuan   Sha  and  Yi   Liu  and  Qian   Liu  and  Yun   Li  and  Yifei   Tian  and  Fayez   Beaini  and  Cheng   Zhong  and  Tao   Hu  and  Zifu   Wang  and  Hai   Lan  and  You   Zhou  and  Zhiran   Zhang  and  Chaowei   Yang },
  title = {A spatiotemporal data collection of viral cases for COVID-19 rapid response},
  journal = {Big Earth Data},
  volume = {0},
  number = {0},
  pages = {1-21},
  year  = {2020},
  publisher = {Taylor & Francis},
  doi = {10.1080/20964471.2020.1844934}
  }
@article{liu2020environmental,
  title={An Environmental Data Collection for COVID-19 Pandemic Research},
  author={Liu, Qian and Liu, Wei and Sha, Dexuan and Kumar, Shubham and Chang, Emily and Arora, Vishakh and Lan, Hai and Li, Yun and Wang, Zifu and Zhang, Yadong and others},
  journal={Data},
  volume={5},
  number={3},
  pages={68},
  year={2020},
  publisher={Multidisciplinary Digital Publishing Institute}
}
@article{yang2020taking,
  title={Taking the pulse of COVID-19: A spatiotemporal perspective},
  author={Yang, Chaowei and Sha, Dexuan and Liu, Qian and Li, Yun and Lan, Hai and Guan, Weihe Wendy and Hu, Tao and Li, Zhenlong and Zhang, Zhiran and Thompson, John Hoot and others},
  journal={International Journal of Digital Earth},
  pages={1--26},
  year={2020},
  publisher={Taylor \& Francis}
}

Source Changing Log

People Contribution & Credit

Disclaimer

All data in this repository was collected/calculated/calibrated from multiple publicly available data sources that do not always agree. While we'll try our best to keep the information up to date and correct, we make no representations or warranties of any kind, express or implied, about the completeness, accuracy, reliability, with respect to the data. We do not bear any legal responsibility for any consequence caused by the usage of data provided. Reliance on the data for medical guidance or use of the data in commerce is strictly prohibited. NSF STcenter hereby disclaims any and all representations and warranties with respect to the data repository, including accuracy, fitness for use, and merchantability. For countries where there are internal disputes and sensitive region or area, we do not include that part of data in our datasets. If you are interested in this part of data, you can contact us directly.

License

The dataset is published under the Creative Commons Attribution 4.0 International License (CC BY 4.0).