Home

Awesome

Pose-based Modular Network for Human-Object Interaction Detection

<!---------------------------------------------------------------------------------------------------------------->

Official Pytorch implementation for Pose-based Modular Network for Human-Object Interaction Detection.

overview

<!---------------------------------------------------------------------------------------------------------------->

Code Overview

In this project, we implement our method based on VS-GATs. The structure of the code in this project is similar to VS-GATs. You can check it for the description of each file.

<!---------------------------------------------------------------------------------------------------------------->

Getting Started

Prerequisites

This codebase was tested with Python 3.6, Pytorch 1.1.0, torchvision 0.3, CUDA 10.0, Ubuntu 16.04.

Installation

  1. Clone this repository.

    git clone https://github.com/birlrobotics/PMN.git
    
  2. Install Python dependencies:

    pip install -r requirements.txt
    

Prepare Data

Download Original Data (Optional)

  1. Download the original HICO-DET dataset and put it into datasets/hico.
  2. Follow here to prepare the original data of V-COCO dataset in datasets/vcoco folder.
  3. (For VS-GATs) Download the pretrain word2vec model on GoogleNews and put it into ./datasets/word2vec

Download the Processed Data

Download the Pretrained Model of VS-GATs

Training

Testing

Results

Acknowledgemen

In this project, some codes which process the data and eval the model are built upon VS-GATs: Visual-Semantic Graph Attention Networks for Human-Object Interaction Detecion, ECCV2018-Learning Human-Object Interactions by Graph Parsing Neural Networks and ICCV2019-No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques. Thanks them for their great works.