DJ Neural Net - Artificial Music Composition
This code can be run from the command line on any plain-text or RTF file: $ python lstm_class.py --data "my_file.txt"
Intro
The inspiration for this project came from my experience as a classical pianist and my interest in natural language processing and deep learning. When I learned of the power of recurrent neural networks from Andrej Karpathy's blog post The Unreasonable Effectiveness of Recurrent Neural Networks, it seemed fitting to try producing music with such a model.
Data
The training corpus consists of traditional Irish folk songs, dances, reels, and jigs from the Nottingham Music Database, translated into ABC format by the ABC Music Project. I found clean versions of these songs here. I also trained the network on Bach, the Backstreet Boys, Enya, and Michael Jackson, likewise translated into ABC format.
About ABC
ABC is a textual representation of music notation. Because relatively little music is available in this simple format, it limited the amount of training data I could use for the model.
Though ABC is a rather robust representation of music, there is a bit of nuance lost in translation. It is also somewhat difficult to translate staff notation into ABC format by hand, so I used a program called EasyABC to do most of the work for me; after EasyABC took care of the translation, I checked the pieces for accuracy.
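For readers unfamiliar with the format, an ABC tune is just a handful of header fields (reference number, title, meter, default note length, key) followed by the notes as plain text. A made-up fragment, not one of the training tunes, looks roughly like this:

```
X:1
T:Example Reel
M:4/4
L:1/8
K:D
|: d2fd Adfd | defd e2de :|
```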
The Model
The model is a Keras LSTM with a single LSTM layer and the following hyperparameters:
- Memory Units: 100
- Dropout Rate: 0.3
- Optimizer: RMSprop, lr=0.01
- Batch Size: 100
- Sequence Length: 25
An LSTM RNN seemed a fitting choice, as context within music is important and must be maintained over long stretches of time. An LSTM retains this context better than a vanilla RNN and thus, in theory, should produce better music.
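For concreteness, here is a minimal sketch of how a single-layer character-level LSTM with these hyperparameters might be assembled in Keras; VOCAB_SIZE and the variable names are placeholders rather than values taken from lstm_class.py.

```python
# Minimal sketch of the single-layer LSTM described above.
from keras.models import Sequential
from keras.layers import LSTM, Dropout, Dense
from keras.optimizers import RMSprop

SEQ_LENGTH = 25   # characters of context fed to the network
VOCAB_SIZE = 60   # placeholder: number of distinct characters in the ABC corpus

model = Sequential()
model.add(LSTM(100, input_shape=(SEQ_LENGTH, VOCAB_SIZE)))   # 100 memory units
model.add(Dropout(0.3))                                      # dropout rate 0.3
model.add(Dense(VOCAB_SIZE, activation='softmax'))           # next-character probabilities
model.compile(loss='categorical_crossentropy', optimizer=RMSprop(lr=0.01))
```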
Model Performance
Loss over epochs
Evaluating the model was an interesting task, as I did not have test data. Since the goal was to produce new music, there was no meaningful way to split the data into training and test sets. The loss is therefore calculated from the difference between predicted and actual values over the entire dataset. With no loss computed on held-out data, the final evaluation was left to my ears.
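Concretely, that loss is just the per-epoch training loss Keras reports while fitting. A minimal sketch of producing the curve above, assuming the model from the earlier sketch and with X and y as placeholder names for the one-hot input sequences and next-character targets:

```python
# X: one-hot input sequences, shape (n_samples, SEQ_LENGTH, VOCAB_SIZE)
# y: one-hot next-character targets, shape (n_samples, VOCAB_SIZE)
history = model.fit(X, y, batch_size=100, epochs=20)
print(history.history['loss'])  # per-epoch training loss, as plotted above
```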
Results
After 1 epoch:
After 10 epochs:
After 20 epochs: You can see that the network has learned some of the structure of ABC format, and is even beginning to write titles for its tunes.
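The samples come from seeding the trained network with a short ABC fragment and repeatedly sampling the next character from its output distribution. Below is a minimal sketch of that generation loop, assuming the model and character-index mappings from the earlier sketch; sample_tune and its arguments are illustrative names, not taken from lstm_class.py.

```python
import numpy as np

def sample_tune(model, seed, char_to_idx, idx_to_char, length=400, seq_length=25):
    """Generate ABC text one character at a time from a seed string (sketch).

    Assumes the seed is at least seq_length characters long.
    """
    text = seed
    for _ in range(length):
        # One-hot encode the most recent seq_length characters as model input.
        window = text[-seq_length:]
        x = np.zeros((1, seq_length, len(char_to_idx)))
        for t, ch in enumerate(window):
            x[0, t, char_to_idx[ch]] = 1.0
        probs = model.predict(x, verbose=0)[0]
        probs = probs / probs.sum()   # guard against floating-point drift
        # Sample from the distribution rather than taking the argmax, so the
        # output does not collapse onto the most common character.
        next_idx = np.random.choice(len(probs), p=probs)
        text += idx_to_char[next_idx]
    return text
```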
Postscript
I was able to successfully convert the ABC files to staff notation and play some of the tunes using a program called MuseScore. Here are some of my favorite titles:
- Cone Blcen Cherronatee
- Slio Keleoso
- 9D Millillihe Mo's Boy
- Dewenr Bcend Batlis
- The Lassie's Fatre
- The orant sit
And my absolute favorite:
References
- The Unreasonable Effectiveness of Recurrent Neural Networks
- Training a Recurrent Neural Network to Compose Music
- Text Generation With LSTM Recurrent Neural Networks in Python with Keras
- Creating A Text Generator Using Recurrent Neural Network
- Team Keras
- About ABC Notation
In Conclusion
"Never get one of those cheap tin whistles. It leads to much harder drugs like pipes and flutes." -Irish Proverb