Skip to main content
Passa alla visualizzazione normale.

SABATO MARCO SINISCALCHI

Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine

  • Authors: Hussain, Tassadaq; Tsao, Yu; Wang, Hsin-Min; Wang, Jia-Ching; Siniscalchi, Sabato Marco; Liao, Wen-Hung
  • Publication year: 2019
  • Type: Contributo in atti di convegno pubblicato in volume
  • OA Link: http://hdl.handle.net/10447/636655

Abstract

Recently, the hierarchical extreme learning machine (HELM) model has been utilized for speech enhancement (SE) and demonstrated promising performance, especially when the amount of training data is limited and the system does not support heavy computations. Based on the success of audio-onlybased systems, termed AHELM, we propose a novel audio-visual HELM-based SE system, termed AVHELM that integrates the audio and visual information to confrontate the unseen nonstationery noise problem at low SNR levels to attain improved SE performance. The experimental results demonstrate that AVHELM can yield satisfactory enhancement performance with a limited amount of training data and outperforms AHELM in terms of three standardized objective measures under matched and mismatched testing conditions, confirming the effectiveness of incorporating visual information into the HELM-based SE system.