2024 |
How word semantics and phonology affect handwriting of Alzheimer’s patients: A machine learning based analysis |
Articolo in rivista |
Vai |
2024 |
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction |
Contributo in atti di convegno pubblicato in volume |
Vai |
2024 |
Federated learning for privacy-preserving speech recognition |
Capitolo o Saggio |
Vai |
2024 |
Boosting End-to-End Multilingual Phoneme Recognition Through Exploiting Universal Speech Attributes Constraints |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
Description and analysis of the KPT system for NIST Language Recognition Evaluation 2022 |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
Inference and Denoise: Causal Inference-Based Neural Speech Enhancement |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
Differentially Private Adapters for Parameter Efficient Acoustic Modeling |
Contributo in atti di convegno pubblicato in volume |
Vai |
2023 |
A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity |
Articolo in rivista |
Vai |
2023 |
Cumulative Sum Analysis of Learning Curve Process for Vaginal Natural Orifice Transluminal Endoscopic Surgery Hysterectomy |
Articolo in rivista |
Vai |
2022 |
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition |
Contributo in atti di convegno pubblicato in volume |
Vai |
2022 |
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis |
Contributo in atti di convegno pubblicato in volume |
Vai |
2022 |
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer |
Contributo in atti di convegno pubblicato in volume |
Vai |
2022 |
AN EXPERIMENTAL STUDY ON PRIVATE AGGREGATION OF TEACHER ENSEMBLE LEARNING FOR END-TO-END SPEECH RECOGNITION |
Contributo in atti di convegno pubblicato in volume |
Vai |
2022 |
The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results |
Contributo in atti di convegno pubblicato in volume |
Vai |
2022 |
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models |
Articolo in rivista |
Vai |
2022 |
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis |
Contributo in atti di convegno pubblicato in volume |
Vai |
2022 |
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification |
Contributo in atti di convegno pubblicato in volume |
Vai |
2022 |
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation |
Contributo in atti di convegno pubblicato in volume |
Vai |
2021 |
Vector-to-Vector Regression via Distributional Loss for Speech Enhancement |
Articolo in rivista |
Vai |
2021 |
Bone-Conducted Speech Enhancement Using Hierarchical Extreme Learning Machine |
Capitolo o Saggio |
Vai |
2021 |
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification |
Contributo in atti di convegno pubblicato in volume |
Vai |
2021 |
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition |
Contributo in atti di convegno pubblicato in volume |
Vai |
2021 |
A Two-Stage Approach to Device-Robust Acoustic Scene Classification |
Contributo in atti di convegno pubblicato in volume |
Vai |
2021 |
Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation |
Contributo in atti di convegno pubblicato in volume |
Vai |
2021 |
A Two-Stage Deep Modeling Approach to Articulatory Inversion |
Contributo in atti di convegno pubblicato in volume |
Vai |
2021 |
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
Maximal Figure-of-Merit Framework to Detect Multi-label Phonetic Features for Spoken Language Recognition |
Articolo in rivista |
Vai |
2020 |
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression |
Articolo in rivista |
Vai |
2020 |
Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation |
Articolo in rivista |
Vai |
2020 |
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
Transfer Learning of Articulatory Information Through Phone Information |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
Performance Analysis for Tensor-Train Decomposition to Deep Neural Network Based Vector-to-Vector Regression |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
A Cross-Task Transfer Learning Approach to Adapting Deep Speech Enhancement Models to Unseen Background Noise Using Paired Senone Classifiers |
Contributo in atti di convegno pubblicato in volume |
Vai |
2020 |
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression |
Articolo in rivista |
Vai |
2019 |
Improving Audio-visual Speech Recognition Performance with Cross-modal Student-teacher Training |
Contributo in atti di convegno pubblicato in volume |
Vai |
2019 |
Compressed multimodal hierarchical extreme learning machine for speech enhancement |
Contributo in atti di convegno pubblicato in volume |
Vai |
2019 |
A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement |
Articolo in rivista |
Vai |
2019 |
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models |
Articolo in rivista |
Vai |
2019 |
A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion |
Contributo in atti di convegno pubblicato in volume |
Vai |
2019 |
Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine |
Contributo in atti di convegno pubblicato in volume |
Vai |
2017 |
An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition |
Articolo in rivista |
Vai |
2012 |
Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition |
Contributo in atti di convegno pubblicato in volume |
Vai |
2006 |
A study on lattice rescoring with knowledge scores for automatic speech recognition |
Contributo in atti di convegno pubblicato in volume |
Vai |
2006 |
Application of EalphaNets to Feature Recognition of Articulation Manner in Knowledge-Based Automatic Speech Recognition |
Capitolo o Saggio |
Vai |
2006 |
A Study of Perceptron Mapping Capability to Design Speech Event Detectors |
eedings |
Vai |
2006 |
Embedded Knowledge-based Speech Detectors for Real-Time Recognition Tasks |
eedings |
Vai |
2005 |
Neural Classification of HEP Experimental Data |
eedings |
Vai |
2005 |
Application of Enets to Feature Recognition of Articulation Manner in Knowledge-based Automatic Speech Recognition |
Capitolo o Saggio |
Vai |
2005 |
Efficient FPGA Implementation of a Knowledge-based Automatic Speech Classifier |
Capitolo o Saggio |
Vai |
2004 |
Efficient Rapid Prototyping of Image and Video Processing Algorithms |
eedings |
Vai |