Research Output
ASPIRE - Real noisy audio-visual speech enhancement corpus
  ASPIRE is a a first of its kind, audiovisual speech corpus recorded in real noisy environment (such as cafe, restaurants) which can be used to support reliable evaluation of multi-modal Speech Filtering technologies. This dataset follows the same sentence format as the audio-visual Grid corpus. The recorded audiovisual speech corpus can be used for reliable evaluation of next generation multi-modal Speech Filtering technologies.

  • Date:

    01 November 2020

  • Publication Status:


  • DOI:


  • Funders:

    Engineering and Physical Sciences Research Council


Gogate, M., Dashtipour, K., Adeel, A., & Hussain, A. (2020). ASPIRE - Real noisy audio-visual speech enhancement corpus. [Dataset].



speech enhancement, speech separation, audio-visual, deep learning

Monthly Views:

Available Documents