Audio Event Net

This README shows how to run AENet, which classifies audio events. Paper: https://arxiv.org/abs/1701.00599

Requirements

How to use

Prerequisites

Run example

Run python run_sample.py to test the installation and see how to use the code.

Install as a package

Supported format

Currently, only WAV files with a 16 kHz sampling rate, 16-bit samples, and a single (mono) channel are supported. To extract AENet features from audio in any other format, convert it to this format first.
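Before running feature extraction, it can help to confirm that a file matches the format above. The following is a minimal sketch that uses only Python's standard wave module; the helper name check_wav_format is hypothetical and is not part of the aenet package.

```python
import wave

# Format stated in this README: 16 kHz, 16-bit, mono WAV.
EXPECTED_RATE = 16000
EXPECTED_SAMPLE_WIDTH = 2  # bytes per sample, i.e. 16 bit
EXPECTED_CHANNELS = 1


def check_wav_format(path):
    """Return a list of format problems; an empty list means the file matches.

    Hypothetical helper for illustration, not part of the aenet package.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()
        channels = wav.getnchannels()
    if rate != EXPECTED_RATE:
        problems.append("sampling rate is %d Hz, expected %d Hz" % (rate, EXPECTED_RATE))
    if width != EXPECTED_SAMPLE_WIDTH:
        problems.append("sample width is %d bit, expected %d bit" % (width * 8, EXPECTED_SAMPLE_WIDTH * 8))
    if channels != EXPECTED_CHANNELS:
        problems.append("file has %d channels, expected %d" % (channels, EXPECTED_CHANNELS))
    return problems
```

A file that returns an empty list matches the supported format; anything else lists what has to be converted. Files that are not WAV at all make wave.open raise wave.Error.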

For convenience, the class aenet.AENet provides the function write_wav, which uses moviepy to write the audio stream of a video in the correct format.

Known problems

The function write_wav sometimes fails because of a moviepy error. See https://github.com/Zulko/moviepy/pull/377
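If write_wav fails, one possible workaround is to extract the audio by calling ffmpeg directly. The sketch below is not part of this repository: it assumes the ffmpeg executable is installed and on the PATH, and the function names are hypothetical. It only reproduces the supported format (16 kHz, 16 bit, mono WAV) with standard ffmpeg options.

```python
import subprocess


def build_ffmpeg_command(src_path, dst_path):
    """Build an ffmpeg command that writes 16 kHz, 16-bit, mono WAV audio.

    Hypothetical helper for illustration, not part of the aenet package.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", src_path,          # input video or audio file
        "-vn",                   # drop any video stream
        "-ac", "1",              # one channel (mono)
        "-ar", "16000",          # 16 kHz sampling rate
        "-acodec", "pcm_s16le",  # 16-bit signed little-endian PCM
        dst_path,
    ]


def convert_to_supported_wav(src_path, dst_path):
    """Run ffmpeg; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_ffmpeg_command(src_path, dst_path), check=True)
    return dst_path
```

For example, convert_to_supported_wav("video.mp4", "audio.wav") would write audio.wav next to the script, where both file names are placeholders.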

Reference

If you end up using this code or the pre-trained network, we ask you to cite the following paper:

Naoya Takahashi, Michael Gygli, and Luc Van Gool, "AENet: Learning Deep Audio Features for Video Analysis", arXiv preprint arXiv:1701.00599, 2017.