Awesome
N<sup>3</sup>: Newtonian Image Understanding: Unfolding the Dynamics of Objects in Statis Images
This is the source code for Newtonian Neural Networks N<sup>3</sup>, which predicts the dynamics of objects in scenes.
Citation
If you find N<sup>3</sup> useful in your research, please consider citing:
@inproceedings{mottaghiCVPR16N3,
Author = {Roozbeh Mottaghi and Hessam Bagherinezhad and Mohammad Rastegari and Ali Farhadi},
Title = {Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images},
Booktitle = {CVPR},
Year = {2016}
}
Requirements
This code is written in Lua, based on Torch. If you are on Ubuntu 14.04+, you can follow this instruction to install torch.
You need the VIND dataset. Extract it in the current directory, and rename it to VIND
. Or you can put it somewhere else and change the config.DataRootPath
in setting_options.lua
.
Training
To run the training:
th main.lua train
This trains the model on training data, and once in every 10 iterations, evalutates on one val_images
batch. If you want to validate on val_videos
go to setting_options.lua
and change the line valmeta = imvalmeta
to valmeta = vidvalmeta
.
Test
You need to get the weights. Extract the weights in the current directory and rename it weights
. To run the test:
th main.lua test
License
This code is released under MIT License.