Publications

Inference and Denoise: Causal Inference-based Neural Speech Enhancement

Published in MLSP, 2023

This study introduces a causal inference-based speech enhancement (CISE) framework that models noise presence as an intervention, using a noise detector and mask-based enhancement modules to perform noise-conditional speech enhancement, demonstrating improved performance and efficiency compared to non-causal and more complex SE models.

Download here

Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing

Published in APSIPA ASC, 2020

TIn this study, we apply a modified Transformer architecture to speech enhancement by replacing positional encoding with convolutional layers and fine-tuning the model using a MetricGAN framework to boost perceptual quality (PESQ) scores, achieving significant improvements over the baseline in both subjective and objective evaluations on the DNS challenge datasets.

Download here

WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement

Published in IEEE Signal Processing Letters, 2020

In this letter, we propose an efficient end-to-end speech enhancement model, WaveCRN, which combines a CNN module for capturing speech locality features with a stacked SRU module for modeling sequential properties, using a novel restricted feature masking approach to achieve state-of-the-art performance with reduced complexity and faster inference.

Download here