Advanced deep neural networks for speech separation and enhancement

Xian, Yang

Please use this identifier to cite or link to this item: http://theses.ncl.ac.uk/jspui/handle/10443/5449

Full metadata record

DC Field	Value	Language
dc.contributor.author	Xian, Yang	-
dc.date.accessioned	2022-06-15T10:44:35Z	-
dc.date.available	2022-06-15T10:44:35Z	-
dc.date.issued	2021	-
dc.identifier.uri	http://hdl.handle.net/10443/5449	-
dc.description	Ph. D. Thesis.	en_US
dc.description.abstract	Monaural speech separation and enhancement aim to remove noise interference from the noisy speech mixture recorded by a single microphone, which causes a lack of spatial information. Deep neural network (DNN) dominates speech separation and enhancement. However, there are still challenges in DNN-based methods, including choosing proper training targets and network structures, refining generalization ability and model capacity for unseen speakers and noises, and mitigating the reverberations in room environments. This thesis focuses on improving separation and enhancement performance in the real-world environment. The first contribution in this thesis is to address monaural speech separation and enhancement within reverberant room environment by designing new training targets and advanced network structures. The second contribution to this thesis is on improving the enhancement performance by proposing a multi-scale feature recalibration convolutional bidirectional gate recurrent unit (GRU) network (MCGN). The third contribution is to improve the model capacity of the network and retain the robustness in the enhancement performance. A convolutional fusion network (CFN) is proposed, which exploits the group convolutional fusion unit (GCFU). The proposed speech enhancement methods are evaluated with various challenging datasets. The proposed methods are assessed with the stateof-the-art techniques and performance measures to confirm that this thesis contributes novel solutions	en_US
dc.language.iso	en	en_US
dc.publisher	Newcastle University	en_US
dc.title	Advanced deep neural networks for speech separation and enhancement	en_US
dc.type	Thesis	en_US
Appears in Collections:	School of Engineering

Files in This Item:

File	Description	Size	Format
Xian Yang e-copy.pdf	Thesis	8.92 MB	Adobe PDF	View/Open
dspacelicence.pdf	Licence	43.82 kB	Adobe PDF	View/Open

Show simple item record