CATbox: Computer Audition Toolbox in Matlab

V0.1

 

 

http://music.ucsd.edu/~sdubnov/ComputerAudition.htm

Copyright (c) Shlomo Dubnov 2006

sdubnov@ucsd.edu

 

Contents:

-----------

AudioBasis: MPEG-7 Audio Basis (AB) and several distance measures (using GMM and Itakura-Saito)

Auditory: ERB auditory filter bank

Ceps: Cepstral envelope and pitch analysis

Chroma: Chroma representation of spectral magnitudes

ConstQ: Constant-Q filter bank

FactorOracle: Factor Oracle for MIDI improvisation

ISDist: Itakura-Saito Distance between AR spectra

InstFreq: Instantaneous Frequency using Miller's hop-one method

InfoRate: Information Rate analysis, including signal recurrence analysis by spectral clustering

LPC: Linear Prediction envelope and residual, including simple Talkbox

LSEE: Griffin and Lim's least-squares error estimation from spectral magnitudes (phaseless reconstruction)

MFCC: Mel-Frequency Cepstral Coefficients

MIDI Tools: Various MIDI representations needed for alignment and improvisation

NoteSigSim: Note-Signal similarity using harmonic projection

Onset: Various methods for onset detection

SFM: Spectral Flatness Measure, including the non-Gaussian version

STFT: Short-time Fourier transform, including phase-vocoder time stretching and pitch shifting

SeqAlign: Different sequence alignment and dynamic time warping methods

Utils: Miscellaneous utilities

Welch: Welch spectral estimate

YASA: Sinusoidal and Noise analysis and resynthesis based on Linear Prediction and MVDR

 

Thanks to: Benjamin Blankertz, Mike Brookes, Michael Casey, Dan Ellis, Malcolm Slaney

 

Other programs needed to run different parts of this toolbox:

Eerola, T. & Toiviainen, P. (2004). MIDI Toolbox

Dan Ellis's Mp3read for Matlab (with my small modification)

 

License:

----------

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You can obtain a copy of the GNU General Public License from ftp://prep.ai.mit.edu/pub/gnu/COPYING-2.0 or by writing to Free Software Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA.