Please use this identifier to cite or link to this item:
http://theses.ncl.ac.uk/jspui/handle/10443/5449
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Xian, Yang | - |
dc.date.accessioned | 2022-06-15T10:44:35Z | - |
dc.date.available | 2022-06-15T10:44:35Z | - |
dc.date.issued | 2021 | - |
dc.identifier.uri | http://hdl.handle.net/10443/5449 | - |
dc.description | Ph.D. Thesis. | en_US |
dc.description.abstract | Monaural speech separation and enhancement aim to remove noise interference from a noisy speech mixture recorded by a single microphone, a setting in which no spatial information is available. Deep neural networks (DNNs) dominate speech separation and enhancement. However, DNN-based methods still face challenges, including choosing proper training targets and network structures, improving generalization ability and model capacity for unseen speakers and noises, and mitigating reverberation in room environments. This thesis focuses on improving separation and enhancement performance in real-world environments. The first contribution of this thesis addresses monaural speech separation and enhancement in reverberant room environments by designing new training targets and advanced network structures. The second contribution improves enhancement performance by proposing a multi-scale feature recalibration convolutional bidirectional gated recurrent unit (GRU) network (MCGN). The third contribution improves the model capacity of the network while retaining robust enhancement performance: a convolutional fusion network (CFN) is proposed, which exploits the group convolutional fusion unit (GCFU). The proposed speech enhancement methods are evaluated on various challenging datasets. They are assessed against state-of-the-art techniques using performance measures to confirm that this thesis contributes novel solutions. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Newcastle University | en_US |
dc.title | Advanced deep neural networks for speech separation and enhancement | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | School of Engineering |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Xian Yang e-copy.pdf | Thesis | 8.92 MB | Adobe PDF |
dspacelicence.pdf | Licence | 43.82 kB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.