What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering

Author

Summary, in English

Mel-frequency cepstral coefficients (MFCCs) are usually derived from a Hamming-windowed DFT spectrum. In this paper, we advocate using a so-called multitaper method instead. Multitaper methods form a spectrum estimate by applying multiple window functions to the same frame and averaging the resulting spectra in the frequency domain. Multitapers provide a robust spectrum estimate but have not received much attention in speech processing. Our speaker recognition experiment on NIST 2002 yields equal error rates (EERs) of 9.66 % (clean data) and 16.41 % (-10 dB SNR) for the conventional Hamming method, compared with 8.13 % (clean data) and 14.63 % (-10 dB SNR) using multitapers. Multitapering is a simple and robust alternative to the Hamming window method.
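
The idea of averaging several tapered spectra can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not say which taper family or weights were used, so the sine tapers, the uniform weighting, the default of six tapers, and the function names `sine_tapers`, `multitaper_spectrum` and `hamming_spectrum` are all assumptions made for illustration.

```python
import numpy as np


def sine_tapers(frame_len, num_tapers):
    """Return orthonormal sine tapers, shape (num_tapers, frame_len).

    Assumption: sine tapers are used here only because they have a
    closed form; the abstract does not name a taper family.
    """
    n = np.arange(1, frame_len + 1)
    k = np.arange(1, num_tapers + 1)[:, None]
    scale = np.sqrt(2.0 / (frame_len + 1))
    return scale * np.sin(np.pi * k * n / (frame_len + 1))


def multitaper_spectrum(frame, num_tapers=6, nfft=None):
    """Average the power spectra of one frame under several tapers.

    Assumption: uniform weights and num_tapers=6 are illustrative defaults.
    """
    frame = np.asarray(frame, dtype=float)
    tapers = sine_tapers(len(frame), num_tapers)
    # One windowed copy of the frame per taper, one DFT per row.
    spectra = np.abs(np.fft.rfft(tapers * frame, n=nfft, axis=1)) ** 2
    # Frequency-domain averaging across tapers.
    return spectra.mean(axis=0)


def hamming_spectrum(frame, nfft=None):
    """Conventional single-window estimate, for comparison."""
    frame = np.asarray(frame, dtype=float)
    windowed = np.hamming(len(frame)) * frame
    return np.abs(np.fft.rfft(windowed, n=nfft)) ** 2
```

In an MFCC front end, either estimate would then pass through the usual mel filterbank, logarithm and DCT; only the spectrum estimation step differs between the two methods.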

Department/s

Publishing year

2010

Language

English

Pages

2734-2737

Publication/Series

Interspeech 2010

Document type

Conference paper

Topic

  • Probability Theory and Statistics

Keywords

  • speaker verification
  • multiple window method

Conference name

Interspeech 2010

Conference date

0001-01-02

Conference place

Makuhari, Japan

Status

Published

Research group

  • Statistical Signal Processing Group