abstract |
A system performs local feature extraction. The system includes a processing device that performs a Short Time Fourier Transform to obtain a spectrogram for a discrete-time speech signal sample. The spectrogram is subdivided based on natural divisions of frequency to humans. Time-frequency-energy is then quantized using information obtained from the spectrogram. And, feature vectors are determined based on the quantized time-frequency-energy information. |