This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 16
Filtering by

Clear all filters

129022-Thumbnail Image.png
Description

Background: Blindness has evolved repeatedly in cave-dwelling organisms, and many hypotheses have been proposed to explain this observation, including both accumulation of neutral loss-of-function mutations and adaptation to darkness. Investigating the loss of sight in cave dwellers presents an opportunity to understand the operation of fundamental evolutionary processes, including drift, selection,

Background: Blindness has evolved repeatedly in cave-dwelling organisms, and many hypotheses have been proposed to explain this observation, including both accumulation of neutral loss-of-function mutations and adaptation to darkness. Investigating the loss of sight in cave dwellers presents an opportunity to understand the operation of fundamental evolutionary processes, including drift, selection, mutation, and migration.

Results: Here we model the evolution of blindness in caves. This model captures the interaction of three forces: (1) selection favoring alleles causing blindness, (2) immigration of sightedness alleles from a surface population, and (3) mutations creating blindness alleles. We investigated the dynamics of this model and determined selection-strength thresholds that result in blindness evolving in caves despite immigration of sightedness alleles from the surface. We estimate that the selection coefficient for blindness would need to be at least 0.005 (and maybe as high as 0.5) for blindness to evolve in the model cave-organism, Astyanax mexicanus.

Conclusions: Our results indicate that strong selection is required for the evolution of blindness in cave-dwelling organisms, which is consistent with recent work suggesting a high metabolic cost of eye development.

ContributorsCartwright, Reed (Author) / Schwartz, Rachel (Author) / Merry, Alexandra (Author) / Howell, Megan (Author) / Biodesign Institute (Contributor)
Created2017-02-07
128130-Thumbnail Image.png
Description

Background: In Africa and Asia, sugarcane is the host of at least seven different virus species in the genus Mastrevirus of the family Geminiviridae. However, with the exception of Sugarcane white streak virus in Barbados, no other sugarcane-infecting mastrevirus has been reported in the New World. Conservation and exchange of sugarcane

Background: In Africa and Asia, sugarcane is the host of at least seven different virus species in the genus Mastrevirus of the family Geminiviridae. However, with the exception of Sugarcane white streak virus in Barbados, no other sugarcane-infecting mastrevirus has been reported in the New World. Conservation and exchange of sugarcane germplasm using stalk cuttings facilitates the spread of sugarcane-infecting viruses.

Methods: A virion-associated nucleic acids (VANA)-based metagenomics approach was used to detect mastrevirus sequences in 717 sugarcane samples from Florida (USA), Guadeloupe (French West Indies), and Réunion (Mascarene Islands). Contig assembly was performed using CAP3 and sequence searches using BLASTn and BLASTx. Mastrevirus full genomes were enriched from total DNA by rolling circle amplification, cloned and sequenced. Nucleotide and amino acid sequence identities were determined using SDT v1.2. Phylogenetic analyses were conducted using MEGA6 and PHYML3.

Results: We identified a new sugarcane-infecting mastrevirus in six plants sampled from germplasm collections in Florida and Guadeloupe. Full genome sequences were determined and analyzed for three virus isolates from Florida, and three from Guadeloupe. These six genomes share >88% genome-wide pairwise identity with one another and between 89 and 97% identity with a recently identified mastrevirus (KR150789) from a sugarcane plant sampled in China. Sequences similar to these were also identified in sugarcane plants in Réunion.

Conclusions: As these virus isolates share <64% genome-wide identity with all other known mastreviruses, we propose classifying them within a new mastrevirus species named Sugarcane striate virus. This is the first report of sugarcane striate virus (SCStV) in the Western Hemisphere, a virus that most likely originated in Asia. The distribution, vector, and impact of SCStV on sugarcane production remains to be determined.

ContributorsBoukari, Wardatou (Author) / Alcala-Briseno, Ricardo I. (Author) / Kraberger, Simona Joop (Author) / Fernandez, Emmanuel (Author) / Filloux, Denis (Author) / Daugrois, Jean-Heinrich (Author) / Comstock, Jack C. (Author) / Lett, Jean-Michel (Author) / Martin, Darren P. (Author) / Varsani, Arvind (Author) / Roumagnac, Philippe (Author) / Polston, Jane E. (Author) / Rott, Philippe C. (Author) / Biodesign Institute (Contributor)
Created2017-07-28
128136-Thumbnail Image.png
Description

Bacteriophages are ideal candidates for pathogen biocontrol to mitigate outbreaks of prevalent foodborne pathogens, such as Escherichia coli. We identified a bacteriophage (AAPEc6) from wastewater that infects E. coli O45:H10. The AAPEc6 genome sequence shares 93% identity (with 92% coverage) to enterobacterial phage K1E (Sp6likevirus) in the Autographivirinae subfamily (Podoviridae).

ContributorsNonis, Judith (Author) / Premaratne, Aruni (Author) / Billington, Craig (Author) / Varsani, Arvind (Author) / Biodesign Institute (Contributor)
Created2017-08-03
128348-Thumbnail Image.png
Description

The most common evolutionary events at the molecular level are single-base substitutions, as well as insertions and deletions (indels) of short DNA segments. A large body of research has been devoted to develop probabilistic substitution models and to infer their parameters using likelihood and Bayesian approaches. In contrast, relatively little

The most common evolutionary events at the molecular level are single-base substitutions, as well as insertions and deletions (indels) of short DNA segments. A large body of research has been devoted to develop probabilistic substitution models and to infer their parameters using likelihood and Bayesian approaches. In contrast, relatively little has been done to model indel dynamics, probably due to the difficulty in writing explicit likelihood functions. Here, we contribute to the effort of modeling indel dynamics by presenting SpartaABC, an approximate Bayesian computation (ABC) approach to infer indel parameters from sequence data (either aligned or unaligned). SpartaABC circumvents the need to use an explicit likelihood function by extracting summary statistics from simulated sequences. First, summary statistics are extracted from the input sequence data. Second, SpartaABC samples indel parameters from a prior distribution and uses them to simulate sequences. Third, it computes summary statistics from the simulated sets of sequences. By computing a distance between the summary statistics extracted from the input and each simulation, SpartaABC can provide an approximation to the posterior distribution of indel parameters as well as point estimates. We study the performance of our methodology and show that it provides accurate estimates of indel parameters in simulations. We next demonstrate the utility of SpartaABC by studying the impact of alignment errors on the inference of positive selection. A C ++ program implementing SpartaABC is freely available in http://spartaabc.tau.ac.il.

ContributorsLevy Karin, Eli (Author) / Shkedy, Dafna (Author) / Ashkenazy, Haim (Author) / Cartwright, Reed (Author) / Pupko, Tal (Author) / Biodesign Institute (Contributor)
Created2017-05-01
128352-Thumbnail Image.png
Description

Four genomovirus genomes were recovered from thrips (Echinothrips americanus) collected in Florida, USA. These represent four new species which are members of the Gemycircularvirus (n = 2), Gemyduguivirus (n = 1), and Gemykibivirus (n = 1) genera. This is the first record, to our knowledge, of genomoviruses associated with a

Four genomovirus genomes were recovered from thrips (Echinothrips americanus) collected in Florida, USA. These represent four new species which are members of the Gemycircularvirus (n = 2), Gemyduguivirus (n = 1), and Gemykibivirus (n = 1) genera. This is the first record, to our knowledge, of genomoviruses associated with a phytophagous insect.

ContributorsKraberger, Simona Joop (Author) / Polston, Jane E. (Author) / Capobianco, Heather M. (Author) / Alcala-Briseno, Ricardo I. (Author) / Fontenele, Rafaela Salgado (Author) / Varsani, Arvind (Author) / Biodesign Institute (Contributor)
Created2017-05-25
128403-Thumbnail Image.png
Description

Under models of isolation-by-distance, population structure is determined by the probability of identity-by-descent between pairs of genes according to the geographic distance between them. Well established analytical results indicate that the relationship between geographical and genetic distance depends mostly on the neighborhood size of the population which represents a standardized

Under models of isolation-by-distance, population structure is determined by the probability of identity-by-descent between pairs of genes according to the geographic distance between them. Well established analytical results indicate that the relationship between geographical and genetic distance depends mostly on the neighborhood size of the population which represents a standardized measure of gene flow. To test this prediction, we model local dispersal of haploid individuals on a two-dimensional landscape using seven dispersal kernels: Rayleigh, exponential, half-normal, triangular, gamma, Lomax and Pareto. When neighborhood size is held constant, the distributions produce similar patterns of isolation-by-distance, confirming predictions. Considering this, we propose that the triangular distribution is the appropriate null distribution for isolation-by-distance studies. Under the triangular distribution, dispersal is uniform over the neighborhood area which suggests that the common description of neighborhood size as a measure of an effective, local panmictic population is valid for popular families of dispersal distributions. We further show how to draw random variables from the triangular distribution efficiently and argue that it should be utilized in other studies in which computational efficiency is important.

ContributorsFurstenau, Tara (Author) / Cartwright, Reed (Author) / College of Liberal Arts and Sciences (Contributor)
Created2016-03-29
128339-Thumbnail Image.png
Description

With the advent of metagenomics approaches, a large diversity of known and unknown viruses has been identified in various types of environmental, plant, and animal samples. One such widespread virus group is the recently established family Genomoviridae which includes viruses with small (∼2–2.4 kb), circular ssDNA genomes encoding rolling-circle replication initiation

With the advent of metagenomics approaches, a large diversity of known and unknown viruses has been identified in various types of environmental, plant, and animal samples. One such widespread virus group is the recently established family Genomoviridae which includes viruses with small (∼2–2.4 kb), circular ssDNA genomes encoding rolling-circle replication initiation proteins (Rep) and unique capsid proteins. Here, we propose a sequence-based taxonomic framework for classification of 121 new virus genomes within this family. Genomoviruses display ∼47% sequence diversity, which is very similar to that within the well-established and extensively studied family Geminiviridae (46% diversity). Based on our analysis, we establish a 78% genome-wide pairwise identity as a species demarcation threshold. Furthermore, using a Rep sequence phylogeny-based analysis coupled with the current knowledge on the classification of geminiviruses, we establish nine genera within the Genomoviridae family. These are Gemycircularvirus (n = 73), Gemyduguivirus (n = 1), Gemygorvirus (n = 9), Gemykibivirus (n = 29), Gemykolovirus (n = 3), Gemykrogvirus (n = 3), Gemykroznavirus (n = 1), Gemytondvirus (n = 1), Gemyvongvirus (n = 1). The presented taxonomic framework offers rational classification of genomoviruses based on the sequence information alone and sets an example for future classification of other groups of uncultured viruses discovered using metagenomics approaches.

ContributorsVarsani, Arvind (Author) / Krupovic, Mart (Author) / Biodesign Institute (Contributor)
Created2017-02-02
127983-Thumbnail Image.png
Description

Mutation is the ultimate source of all genetic variation and is, therefore, central to evolutionary change. Previous work on Paramecium tetraurelia found an unusually low germline base-substitution mutation rate in this ciliate. Here, we tested the generality of this result among ciliates using Tetrahymena thermophila. We sequenced the genomes of

Mutation is the ultimate source of all genetic variation and is, therefore, central to evolutionary change. Previous work on Paramecium tetraurelia found an unusually low germline base-substitution mutation rate in this ciliate. Here, we tested the generality of this result among ciliates using Tetrahymena thermophila. We sequenced the genomes of 10 lines of T. thermophila that had each undergone approximately 1,000 generations of mutation accumulation (MA). We applied an existing mutation-calling pipeline and developed a new probabilistic mutation detection approach that directly models the design of an MA experiment and accommodates the noise introduced by mismapped reads. Our probabilistic mutation-calling method provides a straightforward way of estimating the number of sites at which a mutation could have been called if one was present, providing the denominator for our mutation rate calculations. From these methods, we find that T. thermophila has a germline base-substitution mutation rate of 7.61 × 10 -12 per-site, per cell division, which is consistent with the low base-substitution mutation rate in P. tetraurelia. Over the course of the evolution experiment, genomic exclusion lines derived from the MA lines experienced a fitness decline that cannot be accounted for by germline base-substitution mutations alone, suggesting that other genetic or epigenetic factors must be involved. Because selection can only operate to reduce mutation rates based upon the "visible" mutational load, asexual reproduction with a transcriptionally silent germline may allow ciliates to evolve extremely low germline mutation rates.

ContributorsLong, Hongan (Author) / Winter, David (Author) / Chang, Allan Y.-C. (Author) / Sung, Way (Author) / Wu, Steven (Author) / Balboa, Mariel (Author) / Azevedo, Ricardo B. R. (Author) / Cartwright, Reed (Author) / Lynch, Michael (Author) / Zufall, Rebecca A. (Author) / Biodesign Institute (Contributor)
Created2016-09-15
127891-Thumbnail Image.png
Description

Inbreeding in hermaphroditic plants can occur through two different mechanisms: biparental inbreeding, when a plant mates with a related individual, or self-fertilization, when a plant mates with itself. To avoid inbreeding, many hermaphroditic plants have evolved self-incompatibility (SI) systems which prevent or limit self-fertilization. One particular SI system—homomorphic SI—can also

Inbreeding in hermaphroditic plants can occur through two different mechanisms: biparental inbreeding, when a plant mates with a related individual, or self-fertilization, when a plant mates with itself. To avoid inbreeding, many hermaphroditic plants have evolved self-incompatibility (SI) systems which prevent or limit self-fertilization. One particular SI system—homomorphic SI—can also reduce biparental inbreeding. Homomorphic SI is found in many angiosperm species, and it is often assumed that the additional benefit of reduced biparental inbreeding may be a factor in the success of this SI system. To test this assumption, we developed a spatially-explicit, individual-based simulation of plant populations that displayed three different types of homomorphic SI. We measured the total level of inbreeding avoidance by comparing each population to a self-compatible population (NSI), and we measured biparental inbreeding avoidance by comparing to a population of self-incompatible plants that were free to mate with any other individual (PSI).

Because biparental inbreeding is more common when offspring dispersal is limited, we examined the levels of biparental inbreeding over a range of dispersal distances. We also tested whether the introduction of inbreeding depression affected the level of biparental inbreeding avoidance. We found that there was a statistically significant decrease in autozygosity in each of the homomorphic SI populations compared to the PSI population and, as expected, this was more pronounced when seed and pollen dispersal was limited. However, levels of homozygosity and inbreeding depression were not reduced. At low dispersal, homomorphic SI populations also suffered reduced female fecundity and had smaller census population sizes. Overall, our simulations showed that the homomorphic SI systems had little impact on the amount of biparental inbreeding in the population especially when compared to the overall reduction in inbreeding compared to the NSI population. With further study, this observation may have important consequences for research into the origin and evolution of homomorphic self-incompatibility systems.

ContributorsFurstenau, Tara (Author) / Cartwright, Reed (Author) / Biodesign Institute (Contributor)
Created2017-11-24
128617-Thumbnail Image.png
Description

Plasmodium vivax is the most prevalent malarial species in South America and exerts a substantial burden on the populations it affects. The control and eventual elimination of P. vivax are global health priorities. Genomic research contributes to this objective by improving our understanding of the biology of P. vivax and

Plasmodium vivax is the most prevalent malarial species in South America and exerts a substantial burden on the populations it affects. The control and eventual elimination of P. vivax are global health priorities. Genomic research contributes to this objective by improving our understanding of the biology of P. vivax and through the development of new genetic markers that can be used to monitor efforts to reduce malaria transmission. Here we analyze whole-genome data from eight field samples from a region in Cordóba, Colombia where malaria is endemic. We find considerable genetic diversity within this population, a result that contrasts with earlier studies suggesting that P. vivax had limited diversity in the Americas. We also identify a selective sweep around a substitution known to confer resistance to sulphadoxine-pyrimethamine (SP). This is the first observation of a selective sweep for SP resistance in this species. These results indicate that P. vivax has been exposed to SP pressure even when the drug is not in use as a first line treatment for patients afflicted by this parasite. We identify multiple non-synonymous substitutions in three other genes known to be involved with drug resistance in Plasmodium species. Finally, we found extensive microsatellite polymorphisms. Using this information we developed 18 polymorphic and easy to score microsatellite loci that can be used in epidemiological investigations in South America.

ContributorsWinter, David (Author) / Pacheco, Maria Andreina (Author) / Vallejo, Andres F. (Author) / Schwartz, Rachel (Author) / Arevalo-Herrera, Myriam (Author) / Herrera, Socrates (Author) / Cartwright, Reed (Author) / Escalante, Ananias (Author) / Biodesign Institute (Contributor)
Created2015-12-28