Pubblicazione | SABATO MARCO SINISCALCHI | Università degli Studi di Palermo

Federated learning for privacy-preserving speech recognition

Authors: Yang C.-H.H.; Siniscalchi S.M.
Publication year: 2024
Type: Capitolo o Saggio
OA Link: http://hdl.handle.net/10447/637516

Abstract

Speech signal contains rich information encompassing gender, accent, speaking environment, and other speaker characteristics. Meanwhile, deploying high-performance speech applications often requires a large amount of training speech data, which are often collected from end-users. Therefore, protecting data privacy becomes a rising concern when speech data are employed to deploy commercial speech applications. That motivates the rising interest in designing “federated learning” for voice assistants and mobile applications. This chapter will introduce recent advances in federated learning foundation algorithms and applications for speech recognition, and general acoustic processing. Furthermore, it will introduce how federated learning-based speech processing techniques (e.g., average gradient and teacher–student learning) would connect to some critical data protection guidelines and public regulations, such as European Union's General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA).