MMVED: Multimodal Variational Encoder Decoder Framework for Micro Video Popularity Prediction

Note: Please refer to [here] for the latest update of MMVED from me and my colleague. The new work creates a hierarchical and multimodal version of the deep variational information bottleneck. It has been accepted by IEEE TMM.

This is our implementation of MMVED for micro-video popularity prediction associated with:

A multimodal variational encoder decoder framework for micro video popularity prediction,
Xie, Jiayi and Zhu, Yaochen, and others
Accepted as a conference paper in WWW 2020.

Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework,
Yaochen Zhu, Jiayi Xie, Zhenzhong Chen
arXiv:2003.12724

It includes two parts:

Each part contains everything required to train or test the corresponding MMVED model.

For the Xigua dataset that we collected, we release the data as well.

Architecture

Environment

Datasets

The Xigua dataset

The Xigua micro-video temporal popularity prediction dataset that we collected is available at [google drive] and [baidu] (pin: zpwb). To use it, download the archive, unzip the data folder, and put it in the xigua directory. Descriptions of the files are as follows:
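The download-and-unzip step for the Xigua data can also be scripted. The snippet below is a minimal sketch, not part of the released code: it assumes the download is a standard zip archive, and the function name `extract_dataset` and the archive path passed to it are hypothetical.

```python
import zipfile
from pathlib import Path


def extract_dataset(archive_path, target_dir="xigua"):
    """Unzip a downloaded archive into target_dir and return the extracted paths.

    Assumption: the archive is a regular .zip file; the real archive name
    is not stated in this README, so it is passed in by the caller.
    """
    target = Path(target_dir)
    # Create the target directory if it does not exist yet.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)
        names = archive.namelist()
    # Report where each archive member ended up.
    return [target / name for name in names]
```

For example, calling `extract_dataset("path/to/downloaded.zip")` from the repository root would place the archive contents under `xigua/`. The path here is a placeholder for wherever the download was saved.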

The NUS dataset

The original NUS dataset, which was released with the TMALL model in this paper, can be found here. Descriptions of the files in the data folder in the NUS directory are as follows:

Examples to run the code

The basic usage of the code for training and testing the MMVED model on both the Xigua and NUS datasets is as follows:

For more advanced arguments, run the code with the --help argument.

If you find our code and dataset helpful, please cite the following papers. Thanks!

Full-fledged version: Here; WWW 2020 paper: Here

@article{mmved-fullfledged,
  title={Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework},
  author={Zhu, Yaochen and Xie, Jiayi and Chen, Zhenzhong},
  journal={arXiv preprint arXiv:2003.12724},
  year={2020},
}

@inproceedings{mmved-www2020-preliminary,
  title={A Multimodal Variational Encoder-Decoder Framework for Micro-video Popularity Prediction},
  author={Xie, Jiayi and Zhu, Yaochen and Zhang, Zhibin and Peng, Jian and Yi, Jing and Hu, Yaosi and Liu, Hongyi and Chen, Zhenzhong},
  booktitle={The World Wide Web Conference},
  year={2020},
  pages={2542--2548},
}