Description
Next-generation sequencing is a powerful tool for detecting genetic variation. However, it is also error-prone, with error rates that are much larger than mutation rates. This can make mutation detection difficult; and while increasing sequencing depth can often help, sequence-specific errors and other non-random biases cannot be detected by increased depth. The problem of accurate genotyping is exacerbated when there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model organisms using an example Eucalyptus melliodora individual. I use the structure of the tree to find bounds on its somatic mutation rate and evaluate several algorithms for variant calling. I find that conventional methods are suitable if the genome of a close relative can be adapted to the study organism. However, with structured data, a likelihood framework that is aware of this structure is more accurate. I use the techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator (KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing data. Base quality scores can help detect errors in sequencing reads, but are often inaccurate. The most popular method for correcting this issue requires a known set of variant sites, which is unavailable in most cases. I simulate data and show that errors in this set of variant sites can cause calibration errors. I then show that KBBQ accurately recalibrates base quality scores while requiring no reference or other information and performs as well as other methods.
Finally, I use the Eucalyptus data to investigate the impact of quality score calibration on the quality of output variant calls and show that improved base quality score calibration increases the sensitivity and reduces the false positive rate of a variant calling algorithm.
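As a brief illustration of what "calibration" means for base quality scores: a Phred-scaled score Q encodes an error probability p = 10^(-Q/10). The sketch below is not KBBQ and does not reproduce the thesis's method; it only compares a reported score with the empirical score implied by observed error counts. The function names, the pseudocount smoothing, and the example counts are assumptions made for illustration.

```python
import math


def phred_to_error_prob(q: float) -> float:
    """Convert a Phred-scaled quality score to an error probability."""
    return 10 ** (-q / 10)


def error_prob_to_phred(p: float) -> float:
    """Convert an error probability (0 < p <= 1) to a Phred-scaled score."""
    return -10 * math.log10(p)


def empirical_quality(n_errors: int, n_bases: int) -> float:
    """Empirical Phred score for a bin of bases sharing one reported score.

    Assumption: one pseudo-error and two pseudo-bases are added so that a
    bin with zero observed errors still yields a finite score.
    """
    p_hat = (n_errors + 1) / (n_bases + 2)
    return error_prob_to_phred(p_hat)


def calibration_gap(reported_q: float, n_errors: int, n_bases: int) -> float:
    """Reported minus empirical quality; positive means overconfident scores."""
    return reported_q - empirical_quality(n_errors, n_bases)


if __name__ == "__main__":
    # Hypothetical bin: bases reported as Q30, with 9 errors among 998 bases.
    print(calibration_gap(30, 9, 998))
```

A well-calibrated bin has a gap near zero; a recalibrator aims to replace the reported score with one closer to the empirical value.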
Contributors: Orr, Adam James (Author) / Cartwright, Reed (Thesis advisor) / Wilson, Melissa (Committee member) / Kusumi, Kenro (Committee member) / Taylor, Jesse (Committee member) / Pfeifer, Susanne (Committee member) / Arizona State University (Publisher)
Created: 2020
Description
The Pathways of Distinction Analysis (PoDA) program calculates relationships between a group of genes within a pathway and a disease state. It was used here to investigate liver cancer and to explore how genetic variability may contribute to the different rates at which the disease develops in males and females. The goal of the study was to identify germline variation that differs by sex in hepatocellular carcinoma. Using the program, multiple pathways and genes were found to differ significantly between males and females in their relationship to liver cancer. In animal studies, the genes identified by PoDA have been shown to affect liver cancer, often with different results for males and females. Although these genes are often the focus of animal models, they are absent from current genome-wide association study (GWAS) catalogs for humans. By bridging the results of animal and human studies, the results help to identify the causes of liver cancer and, more specifically, the reason the disease affects males at much higher rates. The differences between the pathways identified as significant for the two sexes indicate that germline variation may play sex-specific roles in the development of hepatocellular carcinoma. Additionally, these results reinforce the capacity of PoDA to identify genes that may be missed by more traditional GWAS methods. This study lays the groundwork for further investigation of the identified genes and pathways and of how they behave differently in males and females.
Contributors: Olson, Erik Jon (Author) / Buetow, Kenneth (Thesis advisor) / Wilson, Melissa (Committee member) / Cartwright, Reed (Committee member) / Arizona State University (Publisher)
Created: 2021
Description
Interactions between proteins form the basis of almost all biological mechanisms. The majority of proteins perform their functions as a part of an assembled complex, rather than as an isolated species. Understanding the functional pathways of these protein complexes helps in uncovering the molecular mechanisms involved in the interactions. In this thesis, this has been explored in two fundamental ways. First, a biohybrid complex was assembled using the photosystem I (PSI) protein complex to translate the biochemical pathways into a non-cellular environment. This involved incorporating PSI on a porous antimony-doped tin oxide electrode using cytochrome c. Photocurrent was generated upon illumination of the PSI/electrode system alone at microamp/cm^2 levels, with reduced oxygen apparently as the primary carrier. When the PSI/electrode system was coupled with ferredoxin, ferredoxin-NADP+ reductase (FNR), and NADP+, the resulting light-powered NADPH production was coupled to a dehydrogenase system for enzymatic carbon reduction. The results demonstrated that light-dependent reduction readily takes place. However, the pathways do not always match the biological pathways of PSI in nature. To create a complex self-assembled system such as the one involving PSI that is structurally well defined, there is a need to develop ways to guide the molecular interactions. In the second part of the thesis, this problem was approached by studying a well-defined system involving monoclonal antibodies (mAbs) binding their cognate epitope sequences to understand the molecular recognition properties associated with protein-protein interactions. This approach used a neural network model to derive a comprehensive and quantitative relationship between an amino acid sequence and its function by using sparse measurements of mAb binding to peptides on a high-density peptide microarray.
The resulting model can be used to predict the function of any peptide in the possible combinatorial sequence space. The results demonstrated that by training the model on just ~10^5 peptides out of the total combinatorial space of ~10^10, the target sequences of the mAbs (cognate epitopes) can be predicted with high statistical accuracy. Furthermore, the biological relevance of the algorithm’s predictive ability has also been demonstrated.
Contributors: Singh, Akanksha (Author) / Woodbury, Neal (Thesis advisor) / Liu, Yan (Committee member) / Gould, Ian (Committee member) / Arizona State University (Publisher)
Created: 2021
Description
Swinging arms are a key functional component of multistep catalytic transformations in many naturally occurring multi-enzyme complexes. This arm is typically a prosthetic chemical group that is covalently attached to the enzyme complex via a flexible linker, allowing the direct transfer of substrate molecules between multiple active sites within the complex. Mimicking this method of substrate channelling outside the cellular environment requires precise control over the spatial parameters of the individual components within the assembled complex. DNA nanostructures can be used to organize functional molecules with nanoscale precision and can also provide nanomechanical control. Until now, protein–DNA assemblies have been used to organize cascades of enzymatic reactions by controlling the relative distance and orientation of enzymatic components or by facilitating the interface between enzymes/cofactors and electrode surfaces. Here, we show that a DNA nanostructure can be used to create a multi-enzyme complex in which an artificial swinging arm facilitates hydride transfer between two coupled dehydrogenases. By exploiting the programmability of DNA nanostructures, key parameters including position, stoichiometry and inter-enzyme distance can be manipulated for optimal activity.

Contributors: Fu, Jinglin (Author) / Yang, Yuhe (Author) / Johnson-Buck, Alexander (Author) / Liu, Minghui (Author) / Liu, Yan (Author) / Walter, Nils G. (Author) / Woodbury, Neal (Author) / Yan, Hao (Author) / Biodesign Institute (Contributor)
Created: 2014-07-01
Description
A structurally and compositionally well-defined and spectrally tunable artificial light-harvesting system has been constructed in which multiple organic dyes attached to a three-arm-DNA nanostructure serve as an antenna conjugated to a photosynthetic reaction center isolated from Rhodobacter sphaeroides 2.4.1. The light energy absorbed by the dye molecules is transferred to the reaction center, where charge separation takes place. The average number of DNA three-arm junctions per reaction center was tuned from 0.75 to 2.35. This DNA-templated multichromophore system serves as a modular light-harvesting antenna that is capable of being optimized for its spectral properties, energy transfer efficiency, and photostability, allowing one to adjust both the size and spectrum of the resulting structures. This may serve as a useful test bed for developing nanostructured photonic systems.

Contributors: Dutta, Palash (Author) / Levenberg, Symon (Author) / Loskutov, Andrey (Author) / Jun, Daniel (Author) / Saer, Rafael (Author) / Beatty, J. Thomas (Author) / Lin, Su (Author) / Liu, Yan (Author) / Woodbury, Neal (Author) / Yan, Hao (Author) / Department of Chemistry and Biochemistry (Contributor)
Created: 2014-11-26
Description
Background: While prior studies have quantified the mortality burden of the 1957 H2N2 influenza pandemic at broad geographic regions in the United States, little is known about the pandemic impact at a local level. Here we focus on analyzing the transmissibility and mortality burden of this pandemic in Arizona, a setting where the dry climate was promoted as reducing respiratory illness transmission yet tuberculosis prevalence was high.

Methods: Using archival death certificates from 1954 to 1961, we quantified the age-specific seasonal patterns, excess-mortality rates, and transmissibility patterns of the 1957 H2N2 pandemic in Maricopa County, Arizona. By applying cyclical Serfling linear regression models to weekly mortality rates, excess-mortality rates for respiratory causes and all causes were estimated for each age group during the pandemic period. The reproduction number was quantified from weekly data using a simple growth rate method and assumed generation intervals of 3 and 4 days. Local newspaper articles published during 1957–1958 were also examined.

Results: Excess-mortality rates varied between waves, age groups, and causes of death, but overall remained low. From October 1959 to June 1960, the most severe wave of the pandemic, the absolute excess-mortality rate based on respiratory deaths per 10,000 population was 16.59 in the elderly (≥65 years). All other age groups exhibited very low excess mortality, and the typical U-shaped age pattern was absent. However, the standardized mortality ratio was greatest (4.06) among children and young adolescents (5–14 years) from October 1957 to March 1958, based on mortality rates of respiratory deaths. Transmissibility was greatest during the same 1957–1958 period, when the mean reproduction number was estimated at 1.08–1.11, assuming 3- or 4-day generation intervals with exponential or fixed distributions.

Conclusions: Maricopa County exhibited very low mortality impact associated with the 1957 influenza pandemic. Understanding the relatively low excess-mortality rates and transmissibility in Maricopa County during this historic pandemic may help public health officials prepare for and mitigate future outbreaks of influenza.
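
The growth rate method mentioned in the Methods can be sketched as follows. The abstract does not give the exact procedure, so this is an illustration under stated assumptions: the growth rate r is taken as the least-squares slope of log weekly counts, and it is converted to a reproduction number with the standard relations R = 1 + r·Tg for an exponentially distributed generation interval and R = exp(r·Tg) for a fixed one. The function names and the weekly counts are hypothetical.

```python
import math


def growth_rate_per_day(weekly_counts, days_per_step=7.0):
    """Least-squares slope of log(count) against time, in units of 1/day."""
    times = [i * days_per_step for i in range(len(weekly_counts))]
    logs = [math.log(c) for c in weekly_counts]
    t_mean = sum(times) / len(times)
    y_mean = sum(logs) / len(logs)
    num = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, logs))
    den = sum((t - t_mean) ** 2 for t in times)
    return num / den


def r_exponential(r, generation_interval):
    """Reproduction number for an exponentially distributed generation interval."""
    return 1.0 + r * generation_interval


def r_fixed(r, generation_interval):
    """Reproduction number for a fixed (constant) generation interval."""
    return math.exp(r * generation_interval)


if __name__ == "__main__":
    # Hypothetical weekly death counts during the early growth phase.
    counts = [12, 14, 17, 19, 23]
    r = growth_rate_per_day(counts)
    for tg in (3.0, 4.0):
        print(tg, r_exponential(r, tg), r_fixed(r, tg))
```

For a positive growth rate the fixed-interval estimate is always the larger of the two, which is one reason a range such as the one reported in the Results arises.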

Contributors: Cobos, April (Author) / Nelson, Clinton (Author) / Jehn, Megan (Author) / Viboud, Cecile (Author) / Chowell-Puente, Gerardo (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2016-08-11
Description
Background: On 31 March 2013, the first human infections with the novel influenza A/H7N9 virus were reported in Eastern China. The outbreak expanded rapidly in geographic scope and size, with a total of 132 laboratory-confirmed cases reported by 3 June 2013, in 10 Chinese provinces and Taiwan. The incidence of A/H7N9 cases has stalled in recent weeks, presumably as a consequence of live bird market closures in the most heavily affected areas. Here we compare the transmission potential of influenza A/H7N9 with that of other emerging pathogens and evaluate the impact of intervention measures in an effort to guide pandemic preparedness.

Methods: We used a Bayesian approach combined with a SEIR (Susceptible-Exposed-Infectious-Removed) transmission model fitted to daily case data to assess the reproduction number (R) of A/H7N9 by province and to evaluate the impact of live bird market closures in April and May 2013. Simulation studies helped quantify the performance of our approach in the context of an emerging pathogen, where human-to-human transmission is limited and most cases arise from spillover events. We also used alternative approaches to estimate R based on individual-level information on prior exposure and compared the transmission potential of influenza A/H7N9 with that of other recent zoonoses.

Results: Estimates of R for the A/H7N9 outbreak were below the epidemic threshold required for sustained human-to-human transmission and remained near 0.1 throughout the study period, with broad 95% credible intervals by the Bayesian method (0.01 to 0.49). The Bayesian estimation approach was dominated by the prior distribution, however, due to relatively little information contained in the case data. We observe a statistically significant deceleration in growth rate after 6 April 2013, which is consistent with a reduction in A/H7N9 transmission associated with the preemptive closure of live bird markets. Although confidence intervals are broad, the estimated transmission potential of A/H7N9 appears lower than that of recent zoonotic threats, including avian influenza A/H5N1, swine influenza H3N2sw and Nipah virus.

Conclusion: Although uncertainty remains high in R estimates for H7N9 due to limited epidemiological information, all available evidence points to a low transmission potential. Continued monitoring of the transmission potential of A/H7N9 is critical in the coming months as intervention measures may be relaxed and seasonal factors could promote disease transmission in colder months.
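
To make the role of the reproduction number R in an SEIR model concrete, here is a minimal deterministic sketch. It is not the Bayesian model fitted in the study and it omits spillover from animal sources, which the abstract identifies as the origin of most cases. The function name, the Euler time stepping, and all parameter values are assumptions chosen for illustration.

```python
def simulate_seir(population, initial_infectious, r_number,
                  latent_days, infectious_days, days, dt=0.1):
    """Euler-step a closed SEIR model; returns a list of (t, S, E, I, R) tuples.

    r_number is the reproduction number, so the transmission rate is
    beta = r_number / infectious_days.
    """
    sigma = 1.0 / latent_days        # rate of leaving the exposed class
    gamma = 1.0 / infectious_days    # rate of leaving the infectious class
    beta = r_number * gamma          # transmission rate implied by R

    s = float(population - initial_infectious)
    e = 0.0
    i = float(initial_infectious)
    r = 0.0
    history = [(0.0, s, e, i, r)]

    steps = int(round(days / dt))
    for step in range(1, steps + 1):
        new_exposed = beta * s * i / population * dt
        new_infectious = sigma * e * dt
        new_removed = gamma * i * dt
        s -= new_exposed
        e += new_exposed - new_infectious
        i += new_infectious - new_removed
        r += new_removed
        history.append((step * dt, s, e, i, r))
    return history


if __name__ == "__main__":
    # Hypothetical values: R = 0.1 (below the epidemic threshold of 1).
    final = simulate_seir(1_000_000, 10, 0.1, 3.0, 5.0, 60)[-1]
    print("infectious after 60 days:", final[3])
```

With R below 1, each chain of transmission dies out quickly, so case counts are driven by new introductions rather than by person-to-person spread.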

Created: 2013-10-02
Description
Background: The impact of socio-demographic factors and baseline health on the mortality burden of seasonal and pandemic influenza remains debated. Here we analyzed the spatial-temporal mortality patterns of the 1918 influenza pandemic in Spain, one of the countries of Europe that experienced the highest mortality burden.

Methods: We analyzed monthly death rates from respiratory diseases and all causes across 49 provinces of Spain, including the Canary and Balearic Islands, during the period January 1915 to June 1919. We estimated the influenza-related excess death rates and risk of death relative to baseline mortality by pandemic wave and province. We then explored the association between pandemic excess mortality rates and health and socio-demographic factors, which included population size and age structure, population density, infant mortality rates, baseline death rates, and urbanization.

Results: Our analysis revealed high geographic heterogeneity in pandemic mortality impact. We identified 3 pandemic waves of varying timing and intensity covering the period from January 1918 to June 1919, with the highest pandemic-related excess mortality rates occurring during October–November 1918 across all Spanish provinces. Cumulative excess mortality rates followed a south–north gradient after controlling for demographic factors, with the north experiencing the highest excess mortality rates. A model that included latitude, population density, and the proportion of children living in provinces explained about 40% of the geographic variability in cumulative excess death rates during 1918–19, but different factors explained mortality variation in each wave.

Conclusions: A substantial fraction of the variability in excess mortality rates across Spanish provinces remained unexplained, which suggests that other unidentified factors such as comorbidities, climate and background immunity may have affected the 1918–19 pandemic mortality rates. Further archeo-epidemiological research should concentrate on identifying settings with combined availability of local historical mortality records and information on the prevalence of underlying risk factors, or patient-level clinical data, to further clarify the drivers of 1918 pandemic influenza mortality.

Created: 2014-07-05
Description
Background: Elucidating the role of the underlying risk factors for severe outcomes of the 2009 A/H1N1 influenza pandemic could be crucial to define priority risk groups in resource-limited settings in future pandemics.

Methods: We use individual-level clinical data on a large series of ARI (acute respiratory infection) hospitalizations from a prospective surveillance system of the Mexican Social Security medical system to analyze the effects of clinical features at presentation, admission delays, selected comorbidities and receipt of seasonal vaccine on the risk of A/H1N1-related death. We considered ARI hospitalizations and inpatient deaths, and recorded demographic, geographic, and medical information on individual patients during August–December 2009.

Results: Seasonal influenza vaccination was associated with a reduced risk of death among A/H1N1 inpatients (OR = 0.43 (95% CI: 0.25, 0.74)) after adjustment for age, gender, geography, antiviral treatment, admission delays, comorbidities and medical conditions. However, this result should be interpreted with caution as it could have been affected by factors not directly measured in our study. The effect of antiviral treatment against A/H1N1 inpatient death did not reach statistical significance (OR = 0.56 (95% CI: 0.29, 1.10)), probably because only 8.9% of A/H1N1 inpatients received antiviral treatment. In addition, diabetes (OR = 1.6) and immune suppression (OR = 2.3) were statistically significant risk factors for death, whereas asthmatic persons (OR = 0.3) and pregnant women (OR = 0.4) experienced a reduced fatality rate among A/H1N1 inpatients. We also observed an increased risk of death among A/H1N1 inpatients with admission delays >2 days after symptom onset (OR = 2.7). Similar associations were also observed for A/H1N1-negative inpatients.

Conclusions: Geographical variation in identified medical risk factors, including prevalence of diabetes and immune suppression, may in part explain between-country differences in pandemic mortality burden. Furthermore, access to care, including hospitalization without delay and antiviral treatment, is also an important factor, as is vaccination coverage with the 2008–09 trivalent inactivated influenza vaccine.
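
For readers unfamiliar with the odds ratios (OR) and confidence intervals quoted in the Results, the sketch below computes an unadjusted OR with a Wald (Woolf) 95% interval from a 2×2 table. The ORs in the abstract are adjusted for several covariates, so this simpler calculation would not reproduce them; the function name and the counts are hypothetical.

```python
import math


def odds_ratio_with_ci(exposed_deaths, exposed_survivors,
                       unexposed_deaths, unexposed_survivors, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval from a 2x2 table.

    All four cell counts must be positive. z = 1.96 gives a 95% interval.
    """
    a, b = exposed_deaths, exposed_survivors
    c, d = unexposed_deaths, unexposed_survivors
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Hypothetical counts: vaccinated vs. unvaccinated inpatients.
    print(odds_ratio_with_ci(10, 90, 20, 80))
```

An interval that excludes 1 corresponds to a statistically significant association at the chosen level, which is how the vaccination and antiviral results above differ.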

Created: 2012-07-16
Description
Background: The historical Japanese influenza vaccination program targeted at schoolchildren provides a unique opportunity to evaluate the indirect benefits of vaccinating high-transmitter groups to mitigate disease burden among seniors. Here we characterize the indirect mortality benefits of vaccinating schoolchildren based on data from Japan and the US.

Methods: We compared age-specific influenza-related excess mortality rates in Japanese seniors aged ≥65 years during the schoolchildren vaccination program (1978–1994) and after the program was discontinued (1995–2006). Indirect vaccine benefits were adjusted for demographic changes, socioeconomics and dominant influenza subtype; US mortality data were used as a control.

Results: We estimate that the schoolchildren vaccination program conferred a 36% adjusted mortality reduction among Japanese seniors (95% CI: 17–51%), corresponding to ∼1,000 senior deaths averted by vaccination annually (95% CI: 400–1,800). In contrast, influenza-related mortality did not change among US seniors, despite increasing vaccine coverage in this population.

Conclusions: The Japanese schoolchildren vaccination program was associated with substantial indirect mortality benefits in seniors.

Contributors: Charu, Vivek (Author) / Viboud, Cecile (Author) / Simonsen, Lone (Author) / Sturm-Ramirez, Katharine (Author) / Shinjoh, Masayoshi (Author) / Chowell-Puente, Gerardo (Author) / Miller, Mark (Author) / Sugaya, Norio (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2011-11-07