Novel Two-Stage Audiovisual Speech Filtering in Noisy Environments

Research Output

In recent years, the established link between the various human communication production domains has become more widely utilised in the field of speech processing. In this work, we build on previous work by the authors and present a novel two-stage audiovisual speech enhancement system, making use of audio-only beamforming, automatic lip tracking, and pre-processing with visually derived Wiener speech filtering. Initial results have demonstrated that this two-stage multimodal speech enhancement approach can produce positive results with noisy speech mixtures that conventional audio-only beamforming would struggle to cope with, such as in very noisy environments with a very low signal to noise ratio, and when the type of noise is difficult for audio-only beamforming to process.

Type:

Article
Date:

20 October 2013
Publication Status:

Published
DOI:

10.1007/s12559-013-9231-2
ISSN:

1866-9956
Library of Congress:

QA75 Electronic computers. Computer science
Dewey Decimal Classification:

004 Data processing & computer science
Funders:

Historic Funder (pre-Worktribe)

http://researchrepository.napier.ac.uk/output/1793059 Abel, A., & Hussain, A. (2014). Novel Two-Stage Audiovisual Speech Filtering in Noisy Environments. Cognitive Computation, 6(2), 200-217. https://doi.org/10.1007/s12559-013-9231-2

Citation

Abel, A., & Hussain, A. (2014). Novel Two-Stage Audiovisual Speech Filtering in Noisy Environments. Cognitive Computation, 6(2), 200-217. https://doi.org/10.1007/s12559-013-9231-2

Authors

Prof Amir Hussain

Professor
School of Computing Engineering and the Built Environment

0131 455 2239

A.Hussain@napier.ac.uk

Keywords

Speech enhancement; Multimodal speech filtering; Audiovisual speech processing

Monthly Views:

Available Documents

Files currently unavailable for download , please contact A.Hussain@napier.ac.uk to request a copy
Downloadable citations
HTML BIB RTF

Type:

Date:

Publication Status:

DOI:

ISSN:

Library of Congress:

Dewey Decimal Classification:

Funders:

Citation

Authors

Prof Amir Hussain

Keywords

Monthly Views:

Files currently unavailable for download , please contact A.Hussain@napier.ac.uk to request a copy

Downloadable citations