Calculating MFCC Parameters (Mel-Frequency Cepstral Coefficients)
- Login to Download
- 1 Credits
Resource Overview
Detailed Documentation
In the MATLAB environment, various signal processing techniques can be employed to compute different parameters from audio files. Commonly used techniques include calculating MFCC parameters (Mel-Frequency Cepstral Coefficients), LPCC parameters (Linear Predictive Cepstral Coefficients), and pitch (fundamental frequency). These techniques help us better understand audio signal characteristics and provide valuable information for subsequent audio processing and analysis tasks.
For MFCC computation, MATLAB implementations typically involve: pre-emphasis to enhance high frequencies, framing the signal into short overlapping segments, applying Hamming windowing to reduce spectral leakage, computing FFT to obtain magnitude spectrum, mapping to Mel-scale using triangular filter banks, taking logarithm of filter bank energies, and finally applying Discrete Cosine Transform (DCT) to decorrelate the coefficients. Key functions often used include melSpectrogram, mfcc, and custom filter bank implementations.
LPCC parameters are derived from Linear Predictive Coding (LPC) analysis, where the audio signal is modeled as a linear combination of past samples. The implementation involves calculating LPC coefficients using methods like Levinson-Durbin recursion, then converting them to cepstral coefficients through recursive relations. MATLAB's lpc function is commonly used for this purpose.
Pitch detection algorithms in MATLAB may employ autocorrelation methods, cepstrum analysis, or spectral peak tracking to identify the fundamental frequency. Functions like pitch from the Audio Toolbox or custom implementations using xcorr for autocorrelation are frequently utilized.
- Login to Download
- 1 Credits