ASPIRE - Real noisy audio-visual speech enhancement corpus
Dataset
Gogate, M., Dashtipour, K., Adeel, A., & Hussain, A. (2020)
ASPIRE - Real noisy audio-visual speech enhancement corpus. [Dataset]. https://doi.org/10.5281/zenodo.4585619
ASPIRE is a first-of-its-kind audiovisual speech corpus recorded in real noisy environments (such as cafes and restaurants) which can be used to support reliable evaluation of m...
Offline Arabic Handwriting Recognition Using Deep Machine Learning: A Review of Recent Advances
Conference Proceeding
Ahmed, R., Dashtipour, K., Gogate, M., Raza, A., Zhang, R., Huang, K., …Hussain, A. (2020)
Offline Arabic Handwriting Recognition Using Deep Machine Learning: A Review of Recent Advances. In Advances in Brain Inspired Cognitive Systems: 10th International Conference, BICS 2019, Guangzhou, China, July 13–14, 2019, Proceedings (457-468). https://doi.org/10.1007/978-3-030-39431-8_44
In pattern recognition, automatic handwriting recognition (AHWR) is an area of research that has developed rapidly in the last few years. It can play a significant role in bro...
Detecting Alzheimer’s Disease Using Machine Learning Methods
Conference Proceeding
Dashtipour, K., Taylor, W., Ansari, S., Zahid, A., Gogate, M., Ahmad, J., …Abbasi, Q. (2022)
Detecting Alzheimer’s Disease Using Machine Learning Methods. In Body Area Networks. Smart IoT and Big Data for Intelligent Health Management: 16th EAI International Conference, BODYNETS 2021, Virtual Event, October 25-26, 2021, Proceedings. https://doi.org/10.1007/978-3-030-95593-9_8
As the world is experiencing population growth, the proportion of older people, aged 65 and above, is also growing at a faster rate. As a result, the dementia with Alzheimer’...
Towards Individualised Speech Enhancement: An SNR Preference Learning System for Multi-Modal Hearing Aids
Conference Proceeding
Kirton-Wingate, J., Ahmed, S., Gogate, M., Tsao, Y., & Hussain, A. (2023)
Towards Individualised Speech Enhancement: An SNR Preference Learning System for Multi-Modal Hearing Aids. In K. Dashtipour (Ed.), Proceedings of the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). https://doi.org/10.1109/icasspw59220.2023.10193122
Since the advent of deep learning (DL), speech enhancement (SE) models have performed well under a variety of noise conditions. However, such systems may still introduce sonic...
Deep Neural Network Driven Binaural Audio Visual Speech Separation
Conference Proceeding
Gogate, M., Dashtipour, K., Bell, P., & Hussain, A. (2020)
Deep Neural Network Driven Binaural Audio Visual Speech Separation. In 2020 International Joint Conference on Neural Networks (IJCNN). https://doi.org/10.1109/ijcnn48605.2020.9207517
The central auditory pathway exploits the auditory signals and visual information sent by both ears and eyes to segregate speech from multiple competing noise sources and help...
A Survey on the Role of Wireless Sensor Networks and IoT in Disaster Management
Book Chapter
Adeel, A., Gogate, M., Farooq, S., Ieracitano, C., Dashtipour, K., Larijani, H., & Hussain, A. (2019)
A Survey on the Role of Wireless Sensor Networks and IoT in Disaster Management. In T. S. Durrani, W. Wang, & S. M. Forbes (Eds.), Geological Disaster Monitoring Based on Sensor Networks (57-66). Singapore: Springer. https://doi.org/10.1007/978-981-13-0992-2_5
Extreme events and disasters resulting from climate change or other ecological factors are difficult to predict and manage. Current limitations of state-of-the-art approaches ...
Visual Speech In Real Noisy Environments (VISION): A Novel Benchmark Dataset and Deep Learning-Based Baseline System
Conference Proceeding
Gogate, M., Dashtipour, K., & Hussain, A. (2020)
Visual Speech In Real Noisy Environments (VISION): A Novel Benchmark Dataset and Deep Learning-Based Baseline System. In Proc. Interspeech 2020 (4521-4525). https://doi.org/10.21437/interspeech.2020-2935
In this paper, we present VIsual Speech In real nOisy eNvironments (VISION), a first of its kind audio-visual (AV) corpus comprising 2500 utterances from 209 speakers, recorde...
Comparing the Performance of Different Classifiers for Posture Detection
Conference Proceeding
Suresh Kumar, S., Dashtipour, K., Gogate, M., Ahmad, J., Assaleh, K., Arshad, K., …Ahmad, W. (2022)
Comparing the Performance of Different Classifiers for Posture Detection. In Body Area Networks. Smart IoT and Big Data for Intelligent Health Management. BODYNETS 2021 (210-218). https://doi.org/10.1007/978-3-030-95593-9_17
Human Posture Classification (HPC) is used in many fields such as human computer interfacing, security surveillance, rehabilitation, remote monitoring, and so on. This paper c...
Towards real-time privacy-preserving audio-visual speech enhancement
Presentation / Conference
Gogate, M., Dashtipour, K., & Hussain, A. (2022, September)
Towards real-time privacy-preserving audio-visual speech enhancement. Paper presented at 2nd Symposium on Security and Privacy in Speech Communication, Incheon, Korea
Human auditory cortex in everyday noisy situations is known to exploit aural and visual cues that are contextually combined by the brain’s multi-level integration strategies t...
CochleaNet: A robust language-independent audio-visual model for real-time speech enhancement
Journal Article
Gogate, M., Dashtipour, K., Adeel, A., & Hussain, A. (2020)
CochleaNet: A robust language-independent audio-visual model for real-time speech enhancement. Information Fusion, 63, 273-285. https://doi.org/10.1016/j.inffus.2020.04.001
Noisy situations cause huge problems for the hearing-impaired, as hearing aids often make speech more audible but do not always restore intelligibility. In noisy settings, hum...