Awesome
- Introduction to the data and the repo
- R package
- Data format
- Time series plot
- Calendar plot
- Contributing
Introduction to the data and the repo
The U.S. Embassy and Consulates General in India maintain an air quality monitoring program with on-site measuring instruments and put the corresponding data on this website. There are csv files for 2013 and 2014, for 2015 except December which is in a pdf, and various csv/pdf for the months of the beginning of 2016. From August 2016 I chose to use OpenAQ and in particular ropenaq to get new data.
In this repository I have made a copy of these files and provide a R code for wrangling them to get a single csv with all measures for Delhi, Mumbai, Kolkata, Hyderabad and Chennai. Refer to original source for licensing questions.
Useful even for non R users: You will find the raw data in this folder and my wrangling code is here. The resulting csv is here. I will try to update the repository as new data comes in on the embassy website.
R package
I have made a R package out of the data so that my fellow R users can easily play with the data. The package imports ggTimeSeries
for doing the calendar plot. It can be installed this way:
devtools::install_github("Ather-Energy/ggTimeSeries")
devtools::install_github("masalmon/usaqmindia")
Data format
The data is in a long format:
library("usaqmindia")
library("dplyr")
data("pm25_india")
pm25_india %>% head(n = 20) %>% knitr::kable()
datetime | city | conc |
---|---|---|
2013-01-01 01:00:00 | Delhi | 324.4 |
2013-01-01 02:00:00 | Delhi | 366.8 |
2013-01-01 03:00:00 | Delhi | 290.7 |
2013-01-01 04:00:00 | Delhi | 245.4 |
2013-01-01 05:00:00 | Delhi | 220.3 |
2013-01-01 06:00:00 | Delhi | 180.2 |
2013-01-01 07:00:00 | Delhi | 140.0 |
2013-01-01 08:00:00 | Delhi | 125.0 |
2013-01-01 09:00:00 | Delhi | 111.0 |
2013-01-01 10:00:00 | Delhi | 112.0 |
2013-01-01 11:00:00 | Delhi | 129.4 |
2013-01-01 12:00:00 | Delhi | 184.1 |
2013-01-01 13:00:00 | Delhi | 284.8 |
2013-01-01 14:00:00 | Delhi | 372.3 |
2013-01-01 15:00:00 | Delhi | 456.7 |
2013-01-01 16:00:00 | Delhi | 404.8 |
2013-01-01 17:00:00 | Delhi | 328.1 |
2013-01-01 18:00:00 | Delhi | 274.4 |
2013-01-01 19:00:00 | Delhi | 285.0 |
2013-01-01 20:00:00 | Delhi | 354.9 |
Time series plot
Below is an example plot of concentrations (see the code here). Check out the Diwali peaks!
library("usaqmindia")
usaqmindia_plot()
Calendar plot
And this is a calendar plot of daily median concentrations in one city using the ggTimeSeries
package (see the code here):
usaqmindia_calendar(cityplot = "Delhi")
Contributing
Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.