Please use this identifier to cite or link to this item:
http://theses.ncl.ac.uk/jspui/handle/10443/6529
Title: | Characterising the suitability and limitations of metagenomic tools for the detection and discovery of plant viruses |
Authors: | Prusokiene, Alisa |
Issue Date: | 2024 |
Publisher: | Newcastle University |
Abstract: | Detecting the presence of viral genomes within a plant tissue sample is a vital task, with implications for food security, ecological networks, and biotechnological discovery. Having confidence that a negative is a true negative is necessary, but also difficult - could a novel virus or viroid pass undetected? In the last decade, metagenomic approaches have become widely used for this task, with many diverse methodologies being proposed and becoming implemented. Each is known to have advantages and disadvantages, but the exact boundaries of their limits of detection, and the factors that influence them, have only been studied within restricted subsets of tools. Successful metagenomic detection of viruses and viroids face three main barriers - the divergence of viral genomes, the presence of low-titre viral genomes, and sparsity of taxonomy within our viral reference databases. In this thesis we develop a benchmarking methodology that allows us to compare a diverse set of approaches in metagenomic viral detection and discovery, and determine the factors that influence their limitations. For this end, we inititally create a novel program for the determination of pairwise substitution distance between short, highly divergent genomes. This program, Mottle, is able to successfully quantify substitution distances further than current alternatives. We then use this as a metric, along with other controlled parameters, to determine which software are able to overcome which barriers. We find that there is a trade-off between performance at high divergence and low read depth, with reference sparsity acting as a modulator. Crucially, no approach showed success at detection when all barriers were high. Finally, we apply these tools to previously seen and novel metagenomic datasets, to compare their outputs, and to synthesise conclusions informed by multiple approaches. |
Description: | PhD Thesis |
URI: | http://hdl.handle.net/10443/6529 |
Appears in Collections: | School of Natural and Environmental Sciences |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Prusokiene A 2024.pdf | Thesis | 5.53 MB | Adobe PDF | View/Open |
dspacelicence.pdf | Licence | 43.82 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.