Home

Awesome

<div align="center"><img src="images/header.png"></div>

Gist - An Audio Analysis Library

<!-- Version and License Badges -->

Version License Language

Gist is a C++ based audio analysis library

Author

Gist is written and maintained by Adam Stark.

http://www.adamstark.co.uk

Usage

Firstly, import the Gist header file:

#include "Gist.h"
Instantiation

Gist is a template class, so instantiate it with floating point precision:

int frameSize = 512;
int sampleRate = 44100;

Gist<float> gist (frameSize, sampleRate);

Or with double precision:

Gist<double> gist (frameSize, sampleRate);

We proceed with the documentation as if we were using floating point precision.

Process Audio Frames

Once you have an audio frame, pass it to the Gist object. You can do this either as a STL vector:

std::vector<float> audioFrame;

// !
// fill audio frame with samples here
// !

gist.processAudioFrame (audioFrame);

Or, as an array:

float audioFrame[512];

// !
// fill audio frame with samples here
// !

gist.processAudioFrame (audioFrame, 512);

Now we can retrieve some audio features.

Core Time Domain Features
// Root Mean Square (RMS)
float rms = gist.rootMeanSquare();

// Peak Energy
float peakEnergy = gist.peakEnergy();

// Zero Crossing rate
float zcr = gist.zeroCrossingRate();
Core Frequency Domain Features
// Spectral Centroid
float specCent = gist.spectralCentroid();

// Spectral Crest
float specCrest = gist.spectralCrest();

// Spectral Flatness
float specFlat = gist.spectralFlatness();

// Spectral Rolloff
float specRolloff = gist.spectralRolloff();

// Spectral Kurtosis
float specKurtosis = gist.spectralKurtosis();
Onset Detection Functions
// Energy difference
float ed = gist.energyDifference();

// Spectral difference
float sd = gist.spectralDifference();

// Spectral difference (half-wave rectified)
float sd_hwr = gist.spectralDifferenceHWR();

// Complex Spectral Difference
float csd = gist.complexSpectralDifference();

// High Frequency Content
float hfc = gist.highFrequencyContent();
FFT Magnitude Spectrum
// FFT Magnitude Spectrum
const std::vector<float>& magSpec = gist.getMagnitudeSpectrum();
Pitch
// Pitch Estimation
float pitch = gist.pitch();
Mel-frequency Representations
// Mel-frequency Spectrum
const std::vector<float>& melSpec = gist.getMelFrequencySpectrum();

// MFCCs
const std::vector<float>& mfcc = gist.getMelFrequencyCepstralCoefficients();

	

Version History

=== 1.0.7 === (18th September 2021)

=== 1.0.6 === (21st October 2020)

=== 1.0.5 === (29th February 2020)

=== 1.0.4 === (22nd January 2017)

=== 1.0.3 === (17th June 2016)

=== 1.0.2 === (24th April 2016)

=== 1.0.1 === (26th June 2014)

=== 1.0.0 === (22nd June 2014)

Dependencies

The Gist library depends on one of the following FFT libraries:

You will need to install this yourself, link projects using -lfftw3 and use the flag -DUSE_FFTW

This is included with the project. To use Kiss FFT, add the flag -DUSE_KISS_FFT

To use Accelerate FFT, add the flag -DUSE_ACCELERATE_FFT

License

Copyright (c) 2014 Adam Stark

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.