This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 17
Filtering by

Clear all filters

129022-Thumbnail Image.png
Description

Background: Blindness has evolved repeatedly in cave-dwelling organisms, and many hypotheses have been proposed to explain this observation, including both accumulation of neutral loss-of-function mutations and adaptation to darkness. Investigating the loss of sight in cave dwellers presents an opportunity to understand the operation of fundamental evolutionary processes, including drift, selection,

Background: Blindness has evolved repeatedly in cave-dwelling organisms, and many hypotheses have been proposed to explain this observation, including both accumulation of neutral loss-of-function mutations and adaptation to darkness. Investigating the loss of sight in cave dwellers presents an opportunity to understand the operation of fundamental evolutionary processes, including drift, selection, mutation, and migration.

Results: Here we model the evolution of blindness in caves. This model captures the interaction of three forces: (1) selection favoring alleles causing blindness, (2) immigration of sightedness alleles from a surface population, and (3) mutations creating blindness alleles. We investigated the dynamics of this model and determined selection-strength thresholds that result in blindness evolving in caves despite immigration of sightedness alleles from the surface. We estimate that the selection coefficient for blindness would need to be at least 0.005 (and maybe as high as 0.5) for blindness to evolve in the model cave-organism, Astyanax mexicanus.

Conclusions: Our results indicate that strong selection is required for the evolution of blindness in cave-dwelling organisms, which is consistent with recent work suggesting a high metabolic cost of eye development.

ContributorsCartwright, Reed (Author) / Schwartz, Rachel (Author) / Merry, Alexandra (Author) / Howell, Megan (Author) / Biodesign Institute (Contributor)
Created2017-02-07
128347-Thumbnail Image.png
Description

Bismuth drugs, despite being clinically used for decades, surprisingly remain in use and effective for the treatment of Helicobacter pylori infection, even for resistant strains when co-administrated with antibiotics. However, the molecular mechanisms underlying the clinically sustained susceptibility of H. pylori to bismuth drugs remain elusive. Herein, we report that

Bismuth drugs, despite being clinically used for decades, surprisingly remain in use and effective for the treatment of Helicobacter pylori infection, even for resistant strains when co-administrated with antibiotics. However, the molecular mechanisms underlying the clinically sustained susceptibility of H. pylori to bismuth drugs remain elusive. Herein, we report that integration of in-house metalloproteomics and quantitative proteomics allows comprehensive uncovering of the bismuth-associated proteomes, including 63 bismuth-binding and 119 bismuth-regulated proteins from Helicobacter pylori, with over 60% being annotated with catalytic functions. Through bioinformatics analysis in combination with bioassays, we demonstrated that bismuth drugs disrupted multiple essential pathways in the pathogen, including ROS defence and pH buffering, by binding and functional perturbation of a number of key enzymes. Moreover, we discovered that HpDnaK may serve as a new target of bismuth drugs to inhibit bacterium-host cell adhesion. The integrative approach we report, herein, provides a novel strategy to unveil the molecular mechanisms of antimicrobial metals against pathogens in general. This study sheds light on the design of new types of antimicrobial agents with multiple targets to tackle the current crisis of antimicrobial resistance.

ContributorsWang, Yuchuan (Author) / Hu, Ligang (Author) / Xu, Feng (Author) / Quan, Quan (Author) / Lai, Yau-Tsz (Author) / Xia, Wei (Author) / Yang, Ya (Author) / Chang, Yuen-Yan (Author) / Yang, Xinming (Author) / Chai, Zhifang (Author) / Wang, Junwen (Author) / Chu, Ivan K. (Author) / Li, Hongyan (Author) / Sun, Hongzhe (Author) / College of Health Solutions (Contributor)
Created2017-04-19
128348-Thumbnail Image.png
Description

The most common evolutionary events at the molecular level are single-base substitutions, as well as insertions and deletions (indels) of short DNA segments. A large body of research has been devoted to develop probabilistic substitution models and to infer their parameters using likelihood and Bayesian approaches. In contrast, relatively little

The most common evolutionary events at the molecular level are single-base substitutions, as well as insertions and deletions (indels) of short DNA segments. A large body of research has been devoted to develop probabilistic substitution models and to infer their parameters using likelihood and Bayesian approaches. In contrast, relatively little has been done to model indel dynamics, probably due to the difficulty in writing explicit likelihood functions. Here, we contribute to the effort of modeling indel dynamics by presenting SpartaABC, an approximate Bayesian computation (ABC) approach to infer indel parameters from sequence data (either aligned or unaligned). SpartaABC circumvents the need to use an explicit likelihood function by extracting summary statistics from simulated sequences. First, summary statistics are extracted from the input sequence data. Second, SpartaABC samples indel parameters from a prior distribution and uses them to simulate sequences. Third, it computes summary statistics from the simulated sets of sequences. By computing a distance between the summary statistics extracted from the input and each simulation, SpartaABC can provide an approximation to the posterior distribution of indel parameters as well as point estimates. We study the performance of our methodology and show that it provides accurate estimates of indel parameters in simulations. We next demonstrate the utility of SpartaABC by studying the impact of alignment errors on the inference of positive selection. A C ++ program implementing SpartaABC is freely available in http://spartaabc.tau.ac.il.

ContributorsLevy Karin, Eli (Author) / Shkedy, Dafna (Author) / Ashkenazy, Haim (Author) / Cartwright, Reed (Author) / Pupko, Tal (Author) / Biodesign Institute (Contributor)
Created2017-05-01
128353-Thumbnail Image.png
Description

Competing endogenous RNAs (ceRNAs) are RNA molecules that sequester shared microRNAs (miRNAs) thereby affecting the expression of other targets of the miRNAs. Whether genetic variants in ceRNA can affect its biological function and disease development is still an open question. Here we identified a large number of genetic variants that

Competing endogenous RNAs (ceRNAs) are RNA molecules that sequester shared microRNAs (miRNAs) thereby affecting the expression of other targets of the miRNAs. Whether genetic variants in ceRNA can affect its biological function and disease development is still an open question. Here we identified a large number of genetic variants that are associated with ceRNA's function using Geuvaids RNA-seq data for 462 individuals from the 1000 Genomes Project. We call these loci competing endogenous RNA expression quantitative trait loci or ‘cerQTL’, and found that a large number of them were unexplored in conventional eQTL mapping. We identified many cerQTLs that have undergone recent positive selection in different human populations, and showed that single nucleotide polymorphisms in gene 3΄UTRs at the miRNA seed binding regions can simultaneously regulate gene expression changes in both cis and trans by the ceRNA mechanism. We also discovered that cerQTLs are significantly enriched in traits/diseases associated variants reported from genome-wide association studies in the miRNA binding sites, suggesting that disease susceptibilities could be attributed to ceRNA regulation. Further in vitro functional experiments demonstrated that a cerQTL rs11540855 can regulate ceRNA function. These results provide a comprehensive catalog of functional non-coding regulatory variants that may be responsible for ceRNA crosstalk at the post-transcriptional level.

ContributorsLi, Mulin Jun (Author) / Zhang, Jian (Author) / Liang, Qian (Author) / Xuan, Chenghao (Author) / Wu, Jiexing (Author) / Jiang, Peng (Author) / Li, Wei (Author) / Zhu, Yun (Author) / Wang, Panwen (Author) / Fernandez, Daniel (Author) / Shen, Yujun (Author) / Chen, Yiwen (Author) / Kocher, Jean-Pierre A. (Author) / Yu, Ying (Author) / Sham, Pak Chung (Author) / Wang, Junwen (Author) / Liu, Jun S. (Author) / Liu, X. Shirley (Author) / College of Health Solutions (Contributor)
Created2017-05-02
128355-Thumbnail Image.png
Description

Infection after renal transplantation remains a major cause of morbidity and death, especially infection from the extensively drug-resistant bacteria, A. baumannii. A total of fourteen A. baumannii isolates were isolated from the donors’ preserved fluid from DCD (donation after cardiac death) renal transplantation and four isolates in the recipients’ draining

Infection after renal transplantation remains a major cause of morbidity and death, especially infection from the extensively drug-resistant bacteria, A. baumannii. A total of fourteen A. baumannii isolates were isolated from the donors’ preserved fluid from DCD (donation after cardiac death) renal transplantation and four isolates in the recipients’ draining liquid at the Kidney Disease Center, The First Affiliated Hospital, College of Medicine, Zhejiang University, from March 2013 to November 2014. An outbreak of A. baumannii emerging after DCD renal transplantation was tracked to understand the transmission of the pathogen. PFGE displayed similar DNA patterns between isolates from the same hospital. Antimicrobial susceptibility tests against thirteen antimicrobial agents were determined using the K-B diffusion method and eTest. Whole-genome sequencing was applied to investigate the genetic relationship of the isolates. With the clinical data and research results, we concluded that the A. baumannii isolates 3R1 and 3R2 was probably transmitted from the donor who acquired the bacteria during his stay in the ICU, while isolate 4R1 was transmitted from 3R1 and 3R2 via medical manipulation. This study demonstrated the value of integration of clinical profiles with molecular methods in outbreak investigation and their importance in controlling infection and preventing serious complications after DCD transplantation.

ContributorsJiang, Hong (Author) / Cao, Luxi (Author) / Qu, Lihui (Author) / Qu, Tingting (Author) / Liu, Guangjun (Author) / Wang, Rending (Author) / Li, Bingjue (Author) / Wang, Yuchen (Author) / Ying, Chaoqun (Author) / Chen, Miao (Author) / Lu, Yingying (Author) / Feng, Shi (Author) / Xiao, Yonghong (Author) / Wang, Junwen (Author) / Wu, Jianyong (Author) / Chen, Jianghua (Author) / College of Health Solutions (Contributor)
Created2017-05-16
128403-Thumbnail Image.png
Description

Under models of isolation-by-distance, population structure is determined by the probability of identity-by-descent between pairs of genes according to the geographic distance between them. Well established analytical results indicate that the relationship between geographical and genetic distance depends mostly on the neighborhood size of the population which represents a standardized

Under models of isolation-by-distance, population structure is determined by the probability of identity-by-descent between pairs of genes according to the geographic distance between them. Well established analytical results indicate that the relationship between geographical and genetic distance depends mostly on the neighborhood size of the population which represents a standardized measure of gene flow. To test this prediction, we model local dispersal of haploid individuals on a two-dimensional landscape using seven dispersal kernels: Rayleigh, exponential, half-normal, triangular, gamma, Lomax and Pareto. When neighborhood size is held constant, the distributions produce similar patterns of isolation-by-distance, confirming predictions. Considering this, we propose that the triangular distribution is the appropriate null distribution for isolation-by-distance studies. Under the triangular distribution, dispersal is uniform over the neighborhood area which suggests that the common description of neighborhood size as a measure of an effective, local panmictic population is valid for popular families of dispersal distributions. We further show how to draw random variables from the triangular distribution efficiently and argue that it should be utilized in other studies in which computational efficiency is important.

ContributorsFurstenau, Tara (Author) / Cartwright, Reed (Author) / College of Liberal Arts and Sciences (Contributor)
Created2016-03-29
128340-Thumbnail Image.png
Description

Whole genome sequencing (WGS) is a promising strategy to unravel variants or genes responsible for human diseases and traits. However, there is a lack of robust platforms for a comprehensive downstream analysis. In the present study, we first proposed three novel algorithms, sequence gap-filled gene feature annotation, bit-block encoded genotypes

Whole genome sequencing (WGS) is a promising strategy to unravel variants or genes responsible for human diseases and traits. However, there is a lack of robust platforms for a comprehensive downstream analysis. In the present study, we first proposed three novel algorithms, sequence gap-filled gene feature annotation, bit-block encoded genotypes and sectional fast access to text lines to address three fundamental problems. The three algorithms then formed the infrastructure of a robust parallel computing framework, KGGSeq, for integrating downstream analysis functions for whole genome sequencing data. KGGSeq has been equipped with a comprehensive set of analysis functions for quality control, filtration, annotation, pathogenic prediction and statistical tests. In the tests with whole genome sequencing data from 1000 Genomes Project, KGGSeq annotated several thousand more reliable non-synonymous variants than other widely used tools (e.g. ANNOVAR and SNPEff). It took only around half an hour on a small server with 10 CPUs to access genotypes of ∼60 million variants of 2504 subjects, while a popular alternative tool required around one day. KGGSeq's bit-block genotype format used 1.5% or less space to flexibly represent phased or unphased genotypes with multiple alleles and achieved a speed of over 1000 times faster to calculate genotypic correlation.

ContributorsLi, Miaoxin (Author) / Li, Jiang (Author) / Li, Mulin Jun (Author) / Pan, Zhicheng (Author) / Hsu, Jacob Shujui (Author) / Liu, Dajiang J. (Author) / Zhan, Xiaowei (Author) / Wang, Junwen (Author) / Song, Youqiang (Author) / Sham, Pak Chung (Author) / College of Health Solutions (Contributor)
Created2017-01-23
127983-Thumbnail Image.png
Description

Mutation is the ultimate source of all genetic variation and is, therefore, central to evolutionary change. Previous work on Paramecium tetraurelia found an unusually low germline base-substitution mutation rate in this ciliate. Here, we tested the generality of this result among ciliates using Tetrahymena thermophila. We sequenced the genomes of

Mutation is the ultimate source of all genetic variation and is, therefore, central to evolutionary change. Previous work on Paramecium tetraurelia found an unusually low germline base-substitution mutation rate in this ciliate. Here, we tested the generality of this result among ciliates using Tetrahymena thermophila. We sequenced the genomes of 10 lines of T. thermophila that had each undergone approximately 1,000 generations of mutation accumulation (MA). We applied an existing mutation-calling pipeline and developed a new probabilistic mutation detection approach that directly models the design of an MA experiment and accommodates the noise introduced by mismapped reads. Our probabilistic mutation-calling method provides a straightforward way of estimating the number of sites at which a mutation could have been called if one was present, providing the denominator for our mutation rate calculations. From these methods, we find that T. thermophila has a germline base-substitution mutation rate of 7.61 × 10 -12 per-site, per cell division, which is consistent with the low base-substitution mutation rate in P. tetraurelia. Over the course of the evolution experiment, genomic exclusion lines derived from the MA lines experienced a fitness decline that cannot be accounted for by germline base-substitution mutations alone, suggesting that other genetic or epigenetic factors must be involved. Because selection can only operate to reduce mutation rates based upon the "visible" mutational load, asexual reproduction with a transcriptionally silent germline may allow ciliates to evolve extremely low germline mutation rates.

ContributorsLong, Hongan (Author) / Winter, David (Author) / Chang, Allan Y.-C. (Author) / Sung, Way (Author) / Wu, Steven (Author) / Balboa, Mariel (Author) / Azevedo, Ricardo B. R. (Author) / Cartwright, Reed (Author) / Lynch, Michael (Author) / Zufall, Rebecca A. (Author) / Biodesign Institute (Contributor)
Created2016-09-15
127886-Thumbnail Image.png
Description

Modeling of transcriptional regulatory networks (TRNs) has been increasingly used to dissect the nature of gene regulation. Inference of regulatory relationships among transcription factors (TFs) and genes, especially among multiple TFs, is still challenging. In this study, we introduced an integrative method, LogicTRN, to decode TF–TF interactions that form TF

Modeling of transcriptional regulatory networks (TRNs) has been increasingly used to dissect the nature of gene regulation. Inference of regulatory relationships among transcription factors (TFs) and genes, especially among multiple TFs, is still challenging. In this study, we introduced an integrative method, LogicTRN, to decode TF–TF interactions that form TF logics in regulating target genes. By combining cis-regulatory logics and transcriptional kinetics into one single model framework, LogicTRN can naturally integrate dynamic gene expression data and TF-DNA-binding signals in order to identify the TF logics and to reconstruct the underlying TRNs. We evaluated the newly developed methodology using simulation, comparison and application studies, and the results not only show their consistence with existing knowledge, but also demonstrate its ability to accurately reconstruct TRNs in biological complex systems.

ContributorsYan, Bin (Author) / Guan, Daogang (Author) / Wang, Chao (Author) / Wang, Junwen (Author) / He, Bing (Author) / Qin, Jing (Author) / Boheler, Kenneth R. (Author) / Lu, Aiping (Author) / Zhang, Ge (Author) / Zhu, Hailong (Author) / College of Health Solutions (Contributor)
Created2017-10-19
127891-Thumbnail Image.png
Description

Inbreeding in hermaphroditic plants can occur through two different mechanisms: biparental inbreeding, when a plant mates with a related individual, or self-fertilization, when a plant mates with itself. To avoid inbreeding, many hermaphroditic plants have evolved self-incompatibility (SI) systems which prevent or limit self-fertilization. One particular SI system—homomorphic SI—can also

Inbreeding in hermaphroditic plants can occur through two different mechanisms: biparental inbreeding, when a plant mates with a related individual, or self-fertilization, when a plant mates with itself. To avoid inbreeding, many hermaphroditic plants have evolved self-incompatibility (SI) systems which prevent or limit self-fertilization. One particular SI system—homomorphic SI—can also reduce biparental inbreeding. Homomorphic SI is found in many angiosperm species, and it is often assumed that the additional benefit of reduced biparental inbreeding may be a factor in the success of this SI system. To test this assumption, we developed a spatially-explicit, individual-based simulation of plant populations that displayed three different types of homomorphic SI. We measured the total level of inbreeding avoidance by comparing each population to a self-compatible population (NSI), and we measured biparental inbreeding avoidance by comparing to a population of self-incompatible plants that were free to mate with any other individual (PSI).

Because biparental inbreeding is more common when offspring dispersal is limited, we examined the levels of biparental inbreeding over a range of dispersal distances. We also tested whether the introduction of inbreeding depression affected the level of biparental inbreeding avoidance. We found that there was a statistically significant decrease in autozygosity in each of the homomorphic SI populations compared to the PSI population and, as expected, this was more pronounced when seed and pollen dispersal was limited. However, levels of homozygosity and inbreeding depression were not reduced. At low dispersal, homomorphic SI populations also suffered reduced female fecundity and had smaller census population sizes. Overall, our simulations showed that the homomorphic SI systems had little impact on the amount of biparental inbreeding in the population especially when compared to the overall reduction in inbreeding compared to the NSI population. With further study, this observation may have important consequences for research into the origin and evolution of homomorphic self-incompatibility systems.

ContributorsFurstenau, Tara (Author) / Cartwright, Reed (Author) / Biodesign Institute (Contributor)
Created2017-11-24