Awesome
COVID-19 Epidemic Prevention - Data Science (COVID-19 Open Data)
简体中文 | English
Goals:
-
Let everyone who has a data-analysis background be able to catch up with current epedemic status.
-
We are calling for contributors and volunteers who has a data science-related background, including but not limited to Epidemiology, Geographic Information, Economy, etc. We aim at having transparent, open discussions, sharing data and research results that could help people fight against the outbreak caused by the novel Coronavirus.
-
You can leave message on Gitter for us.
Important Notice (please read first)
-
Contribute by skill set: Please fill out the SkillSet/TimeZone servey so people could coordinate easily.
-
Risk Disclosure Currently this project mainly focus on:
- Establishing professional knowledge base
- Basic database/data repo
- Data Visualization
- Rapidly testing models
Contributing Guide
Ways of contributing
- Contribute your data to
Data/
folder. - Sharing your model to
/Model/
. - Submit the references and related papers to
Reference/
. - Get involved in the development work, including data wrangling, API development, modeling, integrate with the main project, etc.
Please download Github Desktop and get yourself familiar with the guide and tutorials before you clone this repository by either using the "Clone or Download" button on this page, if you haven't used Github before.
Resources provided by Individual Contributors
Every contributor is welcome to edit this section by opening a PR.
Data Source:
- Wangxin:Collection
- Stockard:Summary based on official released text data of the outbreak
- Yilun Guan: Data of the population traveled out from Wuhan during Jan.1 to Jan.26 by Baidu
- Michael: Summary of the news and repots about the novel-Coronavirus in North America, on Telegram
- Guangchuang Yu: nCov2019: R package to query for recent and historical data
Models and Verification:
- Tongshuang Wu(Vegalite):Online tool for data wrangling
- Jialin Lu :Reproduced model of the estimation model from Imperial College London
- Yiran Jing :Estimation and prediction of the outbreak in Wuhan
- Phase1: Estimation of the infected population in Wuhan (refer to the ICL model)
- Phase2: Simulate and predict the infected population and peak after the shutdown of Wuhan. (Leverage the SIER(susceptible-exposed-infectious- recovered) method)
List of Resources
- John Hopkins Dashboard Geographical visualization of the epidemic
- Data Sources Including official data sources and 3rd-party data. Openly Editable
- Modeling Navigation Share the models and references here, Openly Editable
- Visualization and epidemic papers and references Including links to Elsevier, Wiley, Lancet, etc,Openly Editable
Other 3rd-party Resources
-
Outbreak analytics Application of data science in epidemic outbreak (provided by Eric Y.L.)
-
Feel free to join Trello to view and pick the tickets from the backlog(we are thinking to have an English version of trello board)
-
Previous Slack #team-data channel is no longer maintained.
Author: @Stockard