Stochastic Adversarial Video Prediction

[Project Page] [Paper]

TensorFlow implementation of stochastic adversarial video prediction (SAVP). Given a sequence of initial frames, our model predicts multiple plausible futures. In each of the two sequences below, the ground truth is shown on the left and random predictions from our model on the right. Predicted frames are indicated by the yellow bar at the bottom. For more examples, visit the project page.

<img src="https://alexlee-gk.github.io/video_prediction/index_files/images/bair_action_free_random_00066_crop.gif" height="96"> <img src="https://alexlee-gk.github.io/video_prediction/index_files/images/bair_action_free_random_00006_crop.gif" height="96">

Stochastic Adversarial Video Prediction,
Alex X. Lee, Richard Zhang, Frederik Ebert, Pieter Abbeel, Chelsea Finn, Sergey Levine.
arXiv preprint arXiv:1804.01523, 2018.

An alternative implementation of SAVP is available in the Tensor2Tensor library.

Getting Started

Prerequisites

Installation

git clone -b master --single-branch https://github.com/alexlee-gk/video_prediction.git
cd video_prediction
pip install -r requirements.txt

Miscellaneous installation considerations

Use a Pre-trained Model

bash data/download_and_preprocess_dataset.sh bair
bash pretrained_models/download_model.sh bair_action_free ours_savp
CUDA_VISIBLE_DEVICES=0 python scripts/generate.py --input_dir data/bair \
  --dataset_hparams sequence_length=30 \
  --checkpoint pretrained_models/bair_action_free/ours_savp \
  --mode test \
  --results_dir results_test_samples/bair_action_free
CUDA_VISIBLE_DEVICES=0 python scripts/evaluate.py --input_dir data/bair \
  --dataset_hparams sequence_length=30 \
  --checkpoint pretrained_models/bair_action_free/ours_savp \
  --mode test \
  --results_dir results_test/bair_action_free

Model Training

bash data/download_and_preprocess_dataset.sh bair
CUDA_VISIBLE_DEVICES=0 python scripts/train.py --input_dir data/bair --dataset bair \
  --model savp --model_hparams_dict hparams/bair_action_free/ours_savp/model_hparams.json \
  --output_dir logs/bair_action_free/ours_savp

Datasets

Download the datasets using the following script. These datasets were collected by other researchers. Please cite their papers if you use the data.

bash data/download_and_preprocess_dataset.sh dataset_name

The dataset_name should be one of the following:

To use a different dataset, preprocess it into TFRecord files and define a dataset class for it. See kth_dataset.py for an example where the original dataset is given as videos.

Note: the bair dataset is used for both the action-free and action-conditioned experiments. Set the hyperparameter use_state=True to use the action-conditioned version of the dataset.
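As a rough illustration of how an override such as use_state=True or sequence_length=30 might be interpreted, the sketch below parses a string of name=value pairs into typed Python values. The function name parse_hparams_string is hypothetical and this is not the repository's parser. Support for several comma-separated pairs is an assumption; the commands above only show a single pair.

```python
import ast


def parse_hparams_string(overrides):
    """Parse 'name=value,name=value' into a dict with typed values.

    Hypothetical helper for illustration; not part of this repository.
    """
    parsed = {}
    for item in overrides.split(","):
        item = item.strip()
        if not item:
            continue  # tolerate empty segments such as a trailing comma
        name, sep, raw_value = item.partition("=")
        if not sep or not name.strip():
            raise ValueError("expected name=value, got %r" % item)
        try:
            # Interprets Python literals such as 30, 0.5, True, or 'text'.
            value = ast.literal_eval(raw_value.strip())
        except (ValueError, SyntaxError):
            # Fall back to the raw string for unquoted values.
            value = raw_value.strip()
        parsed[name.strip()] = value
    return parsed


print(parse_hparams_string("use_state=True,sequence_length=30"))
```

Parsing the values as literals keeps True a boolean and 30 an integer, rather than leaving both as strings.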

Models

bash pretrained_models/download_model.sh dataset_name model_name

The dataset_name should be one of the following: bair_action_free, kth, or bair. The model_name should be one of the available pre-trained models:

The following are ablations of our model:

See pretrained_models/download_model.sh for a complete list of available pre-trained models.
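The naming convention above can be sketched as a small helper that checks dataset_name before building the download command. The function names are hypothetical, and the checkpoint directory layout (pretrained_models/dataset_name/model_name) is an assumption inferred from the --checkpoint path used in the pre-trained model commands. The model name is not validated here because the authoritative list lives in the download script.

```python
import posixpath

# Dataset names accepted for pre-trained models, as listed in this section.
DATASET_NAMES = ("bair_action_free", "kth", "bair")


def download_command(dataset_name, model_name):
    """Build the argument list for the model download script."""
    if dataset_name not in DATASET_NAMES:
        raise ValueError(
            "dataset_name must be one of %s, got %r" % (DATASET_NAMES, dataset_name)
        )
    return ["bash", "pretrained_models/download_model.sh", dataset_name, model_name]


def checkpoint_dir(dataset_name, model_name):
    """Return the assumed location of the downloaded checkpoint."""
    return posixpath.join("pretrained_models", dataset_name, model_name)


print(" ".join(download_command("bair_action_free", "ours_savp")))
print(checkpoint_dir("bair_action_free", "ours_savp"))
```

The argument list returned by download_command can be passed to subprocess.run from the repository root.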

Model and Training Hyperparameters

The implementation is designed so that each video prediction model defines its own architecture and training procedure and includes reasonable hyperparameters as defaults. Still, a few of the hyperparameters should be overridden for each combination of dataset and model. The hyperparameters used in our experiments are provided in hparams as JSON files, and they can be passed to the training script with the --model_hparams_dict flag.
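The override pattern described above, defaults first and then values from a JSON file, can be sketched as follows. This is a minimal illustration and not the repository's implementation: load_hparams is a hypothetical helper, and the hyperparameter names and values (batch_size, lr) are placeholders rather than the settings used in the experiments.

```python
import json
import os
import tempfile


def load_hparams(defaults, json_path=None, overrides=None):
    """Return defaults updated by a JSON file, then by explicit overrides."""
    hparams = dict(defaults)  # copy so the defaults stay untouched
    layers = []
    if json_path is not None:
        with open(json_path) as f:
            layers.append(json.load(f))
    if overrides:
        layers.append(overrides)
    for layer in layers:
        unknown = set(layer) - set(defaults)
        if unknown:
            # Reject misspelled names instead of silently ignoring them.
            raise KeyError("unknown hyperparameters: %s" % sorted(unknown))
        hparams.update(layer)
    return hparams


# Placeholder defaults and values for the demonstration only.
defaults = {"batch_size": 16, "lr": 0.001}
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as f:
    json.dump({"batch_size": 8}, f)
merged = load_hparams(defaults, f.name, overrides={"lr": 0.0002})
os.remove(f.name)
print(merged)
```

Rejecting unknown names is a design choice in this sketch: a typo in a JSON file then fails loudly instead of leaving a default silently in place.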

Citation

If you find this useful for your research, please cite the following.

@article{lee2018savp,
  title={Stochastic Adversarial Video Prediction},
  author={Alex X. Lee and Richard Zhang and Frederik Ebert and Pieter Abbeel and Chelsea Finn and Sergey Levine},
  journal={arXiv preprint arXiv:1804.01523},
  year={2018}
}