Canonical correlation analysis in speech enhancement
1st ed.
- Author
- Additional Author(s)
-
- Publisher
- Cham, Switzerland : Springer International Publishing, 2018
- Language
- English
- ISBN
- 9783319670201
- Series
- SpringerBriefs in electrical and computer engineering
- Subject(s)
-
- ENGINEERING
- SPEECH PROCESSING SYSTEMS
- Notes
-
. .
- Abstract
- This book focuses on the application of canonical correlation analysis (CCA) to speech enhancement using the filtering approach. The authors explain how to derive different classes of time-domain and time-frequency-domain noise reduction filters, which are optimal from the CCA perspective for both single-channel and multichannel speech enhancement. Enhancement of noisy speech has been a challenging problem for many researchers over the past few decades and remains an active research area. Typically, speech enhancement algorithms operate in the short-time Fourier transform (STFT) domain, where the clean speech spectral coefficients are estimated using a multiplicative gain function. A filtering approach, which can be performed in the time domain or in the subband domain, obtains an estimate of the clean speech sample at every time instant or time-frequency bin by applying a filtering vector to the noisy speech vector. Compared to the multiplicative gain approach, the filtering approach more naturally takes into account the correlation of the speech signal in adjacent time frames. In this study, the authors pursue the filtering approach and show how to apply CCA to the speech enhancement problem. They also address the problem of adaptive beamforming from the CCA perspective, and show that the well-known Wiener and minimum variance distortionless response (MVDR) beamformers are particular cases of a general class of CCA-based adaptive beamformers.
Physical Dimension
- Number of Page(s)
- 1 online resource (ix, 121 p.)
- Dimension
- -
- Other Desc.
- -
Summary / Review / Table of Content
Introduction --
Canonical Correlation Analysis --
Single-Channel Speech Enhancement in the Time Domain --
Single-Channel Speech Enhancement in the STFT Domain --
Multichannel Speech Enhancement in the Time Domain --
Multichannel Speech Enhancement in the Time Domain --
Adaptive Beamforming.
Exemplar(s)
# |
Accession No. |
Call Number |
Location |
Status |
1. | 01555/20 | 006.454 Ben | Online ! | Available |