Nonlinear adaptive speech enhancement inspired by early auditory processing

Research Output

This paper presents non-linear adaptive speech enhancement schemes inspired by features of early auditory processing. A generic multi-microphone sub-band adaptive (MMSBA) framework is described which allows for the manipulation of several factors that may influence the intelligibility and perceived quality of the processed speech. The proposed framework supports inclusion of: non-linear distribution of sub-bands (as in humans), cross-band effects such as lateral inhibition, and robust adaptive metrics for selecting an appropriate coherent or incoherent noise canceller for each sub-band, based on identified features of the band-limited signals from multiple-sensors during silence periods. An efficient higher order statistics (HOS) based speech/non-speech detector is proposed for enabling effective adaptive control of MMSBA filtering against the environment. New hybrid extensions of the MMSBA scheme incorporating neural networks and post-Weiner filtering are also described and their comparative performance assessed in real reverberant environments. Finally, some future research directions for MMSBA based speech enhancement are proposed including possible alternative strategies based on stochastic resonance.

Date:

31 December 2005
Publication Status:

Published
DOI:

10.1007/11520153_13
Funders:

Historic Funder (pre-Worktribe)

http://researchrepository.napier.ac.uk/output/1793684 <p>Hussain, A., Durrani, T. S., Alkulaibi, A., & Mtetwa, N. (2005). Nonlinear adaptive speech enhancement inspired by early auditory processing. In <i>Nonlinear Speech Modeling and Applications: Advanced Lectures and Revised Selected Papers</i>, (291-316). https://doi.org/10.1007/11520153_13</p>

Citation

Hussain, A., Durrani, T. S., Alkulaibi, A., & Mtetwa, N. (2005). Nonlinear adaptive speech enhancement inspired by early auditory processing. In Nonlinear Speech Modeling and Applications: Advanced Lectures and Revised Selected Papers, (291-316). https://doi.org/10.1007/11520153_13

Authors

Prof Amir Hussain

Professor
School of Computing Engineering and the Built Environment

0131 455 2239

A.Hussain@napier.ac.uk

Keywords

Speech Signal; Finite Impulse Response; Stochastic Resonance; Interaural Time Difference; Speech Enhancement

Monthly Views:

Available Documents

Files currently unavailable for download , please contact A.Hussain@napier.ac.uk to request a copy
Downloadable citations
HTML BIB RTF

Date:

Publication Status:

DOI:

Funders:

Citation

Authors

Prof Amir Hussain

Keywords

Monthly Views:

Files currently unavailable for download , please contact A.Hussain@napier.ac.uk to request a copy

Downloadable citations