Matching Items (64)
158849-Thumbnail Image.png
Description
Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The

Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The problem of accurate genotyping is exacerbated when
there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model or-
ganisms using an example Eucalyptus melliodora individual. I use the structure of
the tree to find bounds on its somatic mutation rate and evaluate several algorithms
for variant calling. I find that conventional methods are suitable if the genome of a
close relative can be adapted to the study organism. However, with structured data,
a likelihood framework that is aware of this structure is more accurate. I use the
techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator
(KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing
data. Base quality scores can help detect errors in sequencing reads, but are often
inaccurate. The most popular method for correcting this issue requires a known
set of variant sites, which is unavailable in most cases. I simulate data and show
that errors in this set of variant sites can cause calibration errors. I then show that
KBBQ accurately recalibrates base quality scores while requiring no reference or other
information and performs as well as other methods.
Finally, I use the Eucalyptus data to investigate the impact of quality score calibra-
tion on the quality of output variant calls and show that improved base quality score
calibration increases the sensitivity and reduces the false positive rate of a variant
calling algorithm.
ContributorsOrr, Adam James (Author) / Cartwright, Reed (Thesis advisor) / Wilson, Melissa (Committee member) / Kusumi, Kenro (Committee member) / Taylor, Jesse (Committee member) / Pfeifer, Susanne (Committee member) / Arizona State University (Publisher)
Created2020
161497-Thumbnail Image.png
Description
The Pathways of Distinction Analysis (PoDA) program calculates relationships between a given group of genes contained within a pathway, and a disease state. It was used here to investigate liver cancer, and to explore how genetic variability may contribute to the different rates of development of the disease in males

The Pathways of Distinction Analysis (PoDA) program calculates relationships between a given group of genes contained within a pathway, and a disease state. It was used here to investigate liver cancer, and to explore how genetic variability may contribute to the different rates of development of the disease in males and females. The goal of the study was to identify germline variation that differs by sex in hepatocellular carcinoma. Using the program, multiple pathways and genes were identified to have significant differences in their relationship to liver cancer in males and females. In animal studies, the genes which were identified using the PoDA analysis have been shown to impact liver cancer, often with different results for males and females. While these genes are often the focus in animal models, they are absent from current Genome Wide Association Studies (GWAS) catalogs for humans. By working to bridge the results of animal studies and human studies, the results help to identify the causes of liver cancer, and more specifically, the reason the disease affects males at much higher rates. The differences in pathways identified to be significant for the two sexes indicate the germline variance may play sex-specific roles in the development of hepatocellular carcinoma. Additionally, these results reinforce the capacity of the PoDA analysis to identify genes that may be missed by more traditional GWAS methods. This study lays the groundwork for further investigations into the identified genes and pathways, and how they behave differently within males and females.
ContributorsOlson, Erik Jon (Author) / Buetow, Kenneth (Thesis advisor) / Wilson, Melissa (Committee member) / Cartwright, Reed (Committee member) / Arizona State University (Publisher)
Created2021
129567-Thumbnail Image.png
Description

Human protein diversity arises as a result of alternative splicing, single nucleotide polymorphisms (SNPs) and posttranslational modifications. Because of these processes, each protein can exists as multiple variants in vivo. Tailored strategies are needed to study these protein variants and understand their role in health and disease. In this work

Human protein diversity arises as a result of alternative splicing, single nucleotide polymorphisms (SNPs) and posttranslational modifications. Because of these processes, each protein can exists as multiple variants in vivo. Tailored strategies are needed to study these protein variants and understand their role in health and disease. In this work we utilized quantitative mass spectrometric immunoassays to determine the protein variants concentration of beta-2-microglobulin, cystatin C, retinol binding protein, and transthyretin, in a population of 500 healthy individuals. Additionally, we determined the longitudinal concentration changes for the protein variants from four individuals over a 6 month period. Along with the native forms of the four proteins, 13 posttranslationally modified variants and 7 SNP-derived variants were detected and their concentration determined. Correlations of the variants concentration with geographical origin, gender, and age of the individuals were also examined. This work represents an important step toward building a catalog of protein variants concentrations and examining their longitudinal changes.

ContributorsTrenchevska, Olgica (Author) / Phillips, David A. (Author) / Nelson, Randall (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2014-06-23
129370-Thumbnail Image.png
Description

Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements

Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species.

ContributorsSchrader, Lukas (Author) / Kim, Jay W. (Author) / Ence, Daniel (Author) / Zimin, Aleksey (Author) / Klein, Antonia (Author) / Wyschetzki, Katharina (Author) / Weichselgartner, Tobias (Author) / Kemena, Carsten (Author) / Stoekl, Johannes (Author) / Schultner, Eva (Author) / Wurm, Yannick (Author) / Smith, Christopher D. (Author) / Yandell, Mark (Author) / Heinze, Juergen (Author) / Gadau, Juergen (Author) / Oettler, Jan (Author) / College of Liberal Arts and Sciences (Contributor)
Created2014-12-01
129287-Thumbnail Image.png
Description

The phenomenon of Fano resonance is ubiquitous in a large variety of wave scattering systems, where the resonance profile is typically asymmetric. Whether the parameter characterizing the asymmetry should be complex or real is an issue of great experimental interest. Using coherent quantum transport as a paradigm and taking into

The phenomenon of Fano resonance is ubiquitous in a large variety of wave scattering systems, where the resonance profile is typically asymmetric. Whether the parameter characterizing the asymmetry should be complex or real is an issue of great experimental interest. Using coherent quantum transport as a paradigm and taking into account of the collective contribution from all available scattering channels, we derive a universal formula for the Fano-resonance profile. We show that our formula bridges naturally the traditional Fano formulas with complex and real asymmetry parameters, indicating that the two types of formulas are fundamentally equivalent (except for an offset). The connection also reveals a clear footprint for the conductance resonance during a dephasing process. Therefore, the emergence of complex asymmetric parameter when fitting with experimental data needs to be properly interpreted. Furthermore, we have provided a theory for the width of the resonance, which relates explicitly the width to the degree of localization of the close-by eigenstates and the corresponding coupling matrices or the self-energies caused by the leads. Our work not only resolves the issue about the nature of the asymmetry parameter, but also provides deeper physical insights into the origin of Fano resonance. Since the only assumption in our treatment is that the transport can be described by the Green’s function formalism, our results are also valid for broad disciplines including scattering problems of electromagnetic waves, acoustics, and seismology.

ContributorsHuang, Liang (Author) / Lai, Ying-Cheng (Author) / Luo, Hong-Gang (Author) / Grebogi, Celso (Author) / Ira A. Fulton Schools of Engineering (Contributor)
Created2015-01-01
129298-Thumbnail Image.png
Description

Persistent currents (PCs), one of the most intriguing manifestations of the Aharonov-Bohm (AB) effect, are known to vanish for Schrödinger particles in the presence of random scatterings, e.g., due to classical chaos. But would this still be the case for Dirac fermions? Addressing this question is of significant value due

Persistent currents (PCs), one of the most intriguing manifestations of the Aharonov-Bohm (AB) effect, are known to vanish for Schrödinger particles in the presence of random scatterings, e.g., due to classical chaos. But would this still be the case for Dirac fermions? Addressing this question is of significant value due to the tremendous recent interest in two-dimensional Dirac materials. We investigate relativistic quantum AB rings threaded by a magnetic flux and find that PCs are extremely robust. Even for highly asymmetric rings that host fully developed classical chaos, the amplitudes of PCs are of the same order of magnitude as those for integrable rings, henceforth the term superpersistent currents (SPCs). A striking finding is that the SPCs can be attributed to a robust type of relativistic quantum states, i.e., Dirac whispering gallery modes (WGMs) that carry large angular momenta and travel along the boundaries. We propose an experimental scheme using topological insulators to observe and characterize Dirac WGMs and SPCs, and speculate that these features can potentially be the base for a new class of relativistic qubit systems. Our discovery of WGMs in relativistic quantum systems is remarkable because, although WGMs are common in photonic systems, they are relatively rare in electronic systems.

ContributorsXu, Hongya (Author) / Huang, Liang (Author) / Lai, Ying-Cheng (Author) / Grebogi, Celso (Author) / Ira A. Fulton Schools of Engineering (Contributor)
Created2015-03-11
128687-Thumbnail Image.png
Description

Proteins can exist as multiple proteoforms in vivo, as a result of alternative splicing and single-nucleotide polymorphisms (SNPs), as well as posttranslational processing. To address their clinical significance in a context of diagnostic information, proteoforms require a more in-depth analysis. Mass spectrometric immunoassays (MSIA) have been devised for studying structural

Proteins can exist as multiple proteoforms in vivo, as a result of alternative splicing and single-nucleotide polymorphisms (SNPs), as well as posttranslational processing. To address their clinical significance in a context of diagnostic information, proteoforms require a more in-depth analysis. Mass spectrometric immunoassays (MSIA) have been devised for studying structural diversity in human proteins. MSIA enables protein profiling in a simple and high-throughput manner, by combining the selectivity of targeted immunoassays, with the specificity of mass spectrometric detection. MSIA has been used for qualitative and quantitative analysis of single and multiple proteoforms, distinguishing between normal fluctuations and changes related to clinical conditions. This mini review offers an overview of the development and application of mass spectrometric immunoassays for clinical and population proteomics studies. Provided are examples of some recent developments, and also discussed are the trends and challenges in mass spectrometry-based immunoassays for the next-phase of clinical applications.

ContributorsTrenchevska, Olgica (Author) / Nelson, Randall (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2016-03-17
128933-Thumbnail Image.png
Description

Introduction: Apolipoprotein C-III (apoC-III) regulates triglyceride (TG) metabolism. In plasma, apoC-III exists in non-sialylated (apoC-III0a without glycosylation and apoC-III[subscript 0b] with glycosylation), monosialylated (apoC-III1) or disialylated (apoC-III2) proteoforms. Our aim was to clarify the relationship between apoC-III sialylation proteoforms with fasting plasma TG concentrations.

Methods: In 204 non-diabetic adolescent participants, the

Introduction: Apolipoprotein C-III (apoC-III) regulates triglyceride (TG) metabolism. In plasma, apoC-III exists in non-sialylated (apoC-III0a without glycosylation and apoC-III[subscript 0b] with glycosylation), monosialylated (apoC-III1) or disialylated (apoC-III2) proteoforms. Our aim was to clarify the relationship between apoC-III sialylation proteoforms with fasting plasma TG concentrations.

Methods: In 204 non-diabetic adolescent participants, the relative abundance of apoC-III plasma proteoforms was measured using mass spectrometric immunoassay.

Results: Compared with the healthy weight subgroup (n = 16), the ratios of apoC-III0a, apoC-III0b, and apoC-III1 to apoC-III2 were significantly greater in overweight (n = 33) and obese participants (n = 155). These ratios were positively correlated with BMI z-scores and negatively correlated with measures of insulin sensitivity (S[subscript i]). The relationship of apoC-III1 / apoC-III2 with Si persisted after adjusting for BMI (p = 0.02). Fasting TG was correlated with the ratio of apoC-III0a / apoC-III2 (r = 0.47, p<0.001), apoC-III0b / apoC-III2 (r = 0.41, p<0.001), apoC-III1 / apoC-III2 (r = 0.43, p<0.001). By examining apoC-III concentrations, the association of apoC-III proteoforms with TG was driven by apoC-III0a (r = 0.57, p<0.001), apoC-III0b (r = 0.56. p<0.001) and apoC-III1 (r = 0.67, p<0.001), but not apoC-III2 (r = 0.006, p = 0.9) concentrations, indicating that apoC-III relationship with plasma TG differed in apoC-III2 compared with the other proteoforms.

Conclusion: We conclude that apoC-III0a, apoC-III0b, and apoC-III1, but not apoC-III2 appear to be under metabolic control and associate with fasting plasma TG. Measurement of apoC-III proteoforms can offer insights into the biology of TG metabolism in obesity.

ContributorsYassine, Hussein N. (Author) / Trenchevska, Olgica (Author) / Ramrakhiani, Ambika (Author) / Parekh, Aarushi (Author) / Koska, Juraj (Author) / Walker, Ryan W. (Author) / Billheimer, Dean (Author) / Reaven, Peter D. (Author) / Yen, Frances T. (Author) / Nelson, Randall (Author) / Goran, Michael I. (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2015-12-03
Description

We present a phylogeographic study of at least six reproductively isolated lineages of new world harvester ants within the Pogonomyrmex barbatus and P. rugosus species group. The genetic and geographic relationships within this clade are complex: Four of the identified lineages show genetic caste determination (GCD) and are divided into

We present a phylogeographic study of at least six reproductively isolated lineages of new world harvester ants within the Pogonomyrmex barbatus and P. rugosus species group. The genetic and geographic relationships within this clade are complex: Four of the identified lineages show genetic caste determination (GCD) and are divided into two pairs. Each pair has evolved under a mutualistic system that necessitates sympatry. These paired lineages are dependent upon one another because their GCD requires interlineage matings for the production of F1 hybrid workers, and intralineage matings are required to produce queens. This GCD system maintains genetic isolation among these interdependent lineages, while simultaneously requiring co-expansion and emigration as their distributions have changed over time. It has also been demonstrated that three of these four GCD lineages have undergone historical hybridization, but the narrower sampling range of previous studies has left questions on the hybrid parentage, breadth, and age of these groups. Thus, reconstructing the phylogenetic and geographic history of this group allows us to evaluate past insights and hypotheses and to plan future inquiries in a more complete historical biogeographic context. Using mitochondrial DNA sequences sampled across most of the morphospecies’ ranges in the U.S.A. and Mexico, we conducted a detailed phylogeographic study. Remarkably, our results indicate that one of the GCD lineage pairs has experienced a dramatic range expansion, despite the genetic load and fitness costs of the GCD system. Our analyses also reveal a complex pattern of vicariance and dispersal in Pogonomyrmex harvester ants that is largely concordant with models of late Miocene, Pliocene, and Pleistocene range shifts among various arid-adapted taxa in North America.

ContributorsMott, Brendon (Author) / Gadau, Juergen (Author) / Anderson, Kirk E. (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-07-01
129155-Thumbnail Image.png
Description

The impetus for discovery and evaluation of protein biomarkers has been accelerated by recent development of advanced technologies for rapid and broad proteome analyses. Mass spectrometry (MS)-based protein assays hold great potential for in vitro biomarker studies. Described here is the development of a multiplex mass spectrometric immunoassay (MSIA) for

The impetus for discovery and evaluation of protein biomarkers has been accelerated by recent development of advanced technologies for rapid and broad proteome analyses. Mass spectrometry (MS)-based protein assays hold great potential for in vitro biomarker studies. Described here is the development of a multiplex mass spectrometric immunoassay (MSIA) for quantification of apolipoprotein C-I (apoC-I), apolipoprotein C-II (apoC-II), apolipoprotein C-III (apoC-III) and their proteoforms. The multiplex MSIA assay was fast (∼40 min) and high-throughput (96 samples at a time). The assay was applied to a small cohort of human plasma samples, revealing the existence of multiple proteoforms for each apolipoprotein C. The quantitative aspect of the assay enabled determination of the concentration for each proteoform individually. Low-abundance proteoforms, such as fucosylated apoC-III, were detected in less than 20% of the samples. The distribution of apoC-III proteoforms varied among samples with similar total apoC-III concentrations. The multiplex analysis of the three apolipoproteins C and their proteoforms using quantitative MSIA represents a significant step forward toward better understanding of their physiological roles in health and disease.

ContributorsTrenchevska, Olgica (Author) / Schaab, Matthew (Author) / Nelson, Randall (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2015-06-15