<?xml version="1.0"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-05-25T07:10:00Z</responseDate><request verb="GetRecord" metadataPrefix="oai_dc">https://keep.lib.asu.edu/oai/request</request><GetRecord><record><header><identifier>oai:keep.lib.asu.edu:node-158849</identifier><datestamp>2024-12-23T18:01:48Z</datestamp><setSpec>oai_pmh:all</setSpec><setSpec>oai_pmh:repo_items</setSpec></header><metadata><oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><dc:identifier>158849</dc:identifier>
          <dc:identifier>https://hdl.handle.net/2286/R.I.63039</dc:identifier>
                  <dc:rights>http://rightsstatements.org/vocab/InC/1.0/</dc:rights>
                  <dc:date>2020</dc:date>
                  <dc:format>270 pages</dc:format>
                  <dc:type>Doctoral Dissertation</dc:type>
          <dc:type>Academic theses</dc:type>
          <dc:type>Text</dc:type>
                  <dc:language>eng</dc:language>
                  <dc:contributor>Orr, Adam James</dc:contributor>
          <dc:contributor>Cartwright, Reed</dc:contributor>
          <dc:contributor>Wilson, Melissa</dc:contributor>
          <dc:contributor>Kusumi, Kenro</dc:contributor>
          <dc:contributor>Taylor, Jesse</dc:contributor>
          <dc:contributor>Pfeifer, Susanne</dc:contributor>
          <dc:contributor>Arizona State University</dc:contributor>
                  <dc:description>Doctoral Dissertation Molecular and Cellular Biology 2020</dc:description>
          <dc:description>Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.&lt;br/&gt;This can make mutation detection difficult; and while increasing sequencing depth&lt;br/&gt;can often help, sequence-specific errors and other non-random biases cannot be de-&lt;br/&gt;tected by increased depth. The problem of accurate genotyping is exacerbated when&lt;br/&gt;there is not a reference genome or other auxiliary information available.&lt;br/&gt;I explore several methods for sensitively detecting mutations in non-model or-&lt;br/&gt;ganisms using an example Eucalyptus melliodora individual. I use the structure of&lt;br/&gt;the tree to find bounds on its somatic mutation rate and evaluate several algorithms&lt;br/&gt;for variant calling. I find that conventional methods are suitable if the genome of a&lt;br/&gt;close relative can be adapted to the study organism. However, with structured data,&lt;br/&gt;a likelihood framework that is aware of this structure is more accurate. I use the&lt;br/&gt;techniques developed here to evaluate a reference-free variant calling algorithm.&lt;br/&gt;I also use this data to evaluate a k-mer based base quality score recalibrator&lt;br/&gt;(KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing&lt;br/&gt;data. Base quality scores can help detect errors in sequencing reads, but are often&lt;br/&gt;inaccurate. The most popular method for correcting this issue requires a known&lt;br/&gt;set of variant sites, which is unavailable in most cases. I simulate data and show&lt;br/&gt;that errors in this set of variant sites can cause calibration errors. I then show that&lt;br/&gt;KBBQ accurately recalibrates base quality scores while requiring no reference or other&lt;br/&gt;information and performs as well as other methods.&lt;br/&gt;Finally, I use the Eucalyptus data to investigate the impact of quality score calibra-&lt;br/&gt;tion on the quality of output variant calls and show that improved base quality score&lt;br/&gt;calibration increases the sensitivity and reduces the false positive rate of a variant&lt;br/&gt;calling algorithm.</dc:description>
                  <dc:subject>Bioinformatics</dc:subject>
          <dc:subject>Computer Science</dc:subject>
          <dc:subject>Biology</dc:subject>
          <dc:subject>DNA Sequencing</dc:subject>
          <dc:subject>Mutation</dc:subject>
          <dc:subject>Quality Scores</dc:subject>
          <dc:subject>Sequencing Error</dc:subject>
          <dc:subject>Variant Calling</dc:subject>
                  <dc:title>Methods for Detecting Mutations in Non-model Organisms</dc:title></oai_dc:dc></metadata></record></GetRecord></OAI-PMH>
