Home

Awesome

Clotho data loader


NOTICE: This code is transferred to a new repository

Link to new repository: https://github.com/audio-captioning/clotho-dataloader

The current repository will not be updated any more. Any issues will be taken into account, but the updated code will be pushed at the new repository.

Sorry for the inconvenience!


Welcome to Clotho data loader repository. This repository has the necessary code for using the DataLoader class from PyTorch package (torch.utils.data.dataloader.DataLoader) with the Clotho dataset.

You can use the present data loader of Clotho directly with the examples created by the Clotho baseline dataset repository.

If you are looking at this README file, then I suppose that you already know what is a DataLoader from PyTorch. Nevertheless, the Clotho dataset has sequences as inputs and outputs, and each sequence is of arbitrary length (15 to 30 seconds for the input and 8 to 20 words for the output). For that reason, this data loader already provides a collate function.

Collate function

To be able to use the sequences of Clotho in a batch, you most likely will need some kind of padding policy. This repository already offers a collate function to be used with the Clotho data.

With the provided collate function, you can choose to either:

Enjoy and if you have any issues, please let me know in the issue section.