Home

Awesome

This repository corresponds to the official source code of the CVPR 2019 paper:

<a href="https://arxiv.org/pdf/1906.03631.pdf">Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction</a>

To get an overview about the method and its results, we highly recommend checking our poster and a short video at <a href="https://lmb.informatik.uni-freiburg.de/Publications/2019/MICB19/">[Page]</a>

demo

Requirements

Setup

We use the source code from WEMD[1] to compute our SEMD evaluation metric.

After compilation, you should get a library under /wemd/lib, which is linked in the wemd.py.

Data

To reproduce our results in the paper, we provide the processed testing samples from SDD [2] used in our paper. Please download them from <a href="https://lmb.informatik.uni-freiburg.de/resources/binaries/Multimodal_Future_Prediction/datasets.zip">[Link]</a>

After extracting the datasets.zip, you will get a set of folders representing the testing scenes. For each scene you have the following structure:

Additionally, we provide the processed training SDD which can be downloaded from <a href="https://lmb.informatik.uni-freiburg.de/resources/binaries/Multimodal_Future_Prediction/sdd_train.zip">[Link]</a>

Models

We provide the final trained model for our EWTAD-MDF. Please download them from <a href="https://lmb.informatik.uni-freiburg.de/resources/binaries/Multimodal_Future_Prediction/models.zip">[Link]</a>

Testing

To test our EWTAD-MDF, you can run:

python test.py --output

Training

We provide additionally the loss functions used when training our sampling-fitting network, please check the net.py file for more details.

CPI Dataset

We also provide the script to generate our CPI (Car Pedestrian Interaction) synthetic dataset. To generate the training dataset, you can run:

cd CPI/ python CPI-train.py output_folder n_scenes history n_gts dist

Similarly, the testing dataset can be generated using:

python CPI-test.py cpi_testing_dataset 54 3 1000 20

Citation

If you use our repository or find it useful in your research, please cite the following paper:

<pre class='bibtex'> @InProceedings{MICB19, author = "O. Makansi and E. Ilg and {\"O}. {\c{C}}i{\c{c}}ek and T. Brox", title = "Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction", booktitle = "IEEE International Conference on Computer Vision and Pattern Recognition (CVPR)", month = " ", year = "2019", url = "http://lmb.informatik.uni-freiburg.de/Publications/2019/MICB19" } </pre>

References

[1] S. Shirdhonkar and D. W. Jacobs. Approximate earth movers distance in linear time. In 2008 IEEE Conference on Computer Vision and Pattern Recognition, pages 1–8, June 2008.

[2] A. Robicquet, A. Sadeghian, A. Alahi, S. Savarese, Learning Social Etiquette: Human Trajectory Prediction In Crowded Scenes in European Conference on Computer Vision (ECCV), 2016.

License

logo

This source code is shared under the license CC-BY-NC-SA, please refer to the LICENSE file for more information.

This source code is only shared for R&D or evaluation of this model on user database.

Any commercial utilization is strictly forbidden.

For any utilization with a commercial goal, please contact contact_cs or bendahan