Home

Awesome

aws-open-data

Colab Binder License

Introduction

The AWS Open Data program hosts a lot of publicly available datasets. This repo compiles the list of all datasets on AWS as a CSV file and as a JSON file, making it easier to find and use them programmatically. The list is updated daily.

A complete list of AWS open datasets as individual YAML files is available here.

Usage

This repo provides the list of AWS open datasets in two formats:

The TSV file can be easily read into a Pandas DataFrame using the following code:

import pandas as pd

url = 'https://github.com/giswqs/aws-open-data/raw/master/aws_open_datasets.tsv'
df = pd.read_csv(url, sep='\t')
df.head()

Related Projects