Search Content

Comparative genomics and novel bioinformatics methodology applied to the green anole reveal unique sex chromosome evolution

Description

In species with highly heteromorphic sex chromosomes, the degradation of one of the sex chromosomes can result in unequal gene expression between the sexes (e.g., between XX females and XY males) and between the sex chromosomes and the autosomes. Dosage compensation is a process whereby genes on the sex chromosomes…

In species with highly heteromorphic sex chromosomes, the degradation of one of the sex chromosomes can result in unequal gene expression between the sexes (e.g., between XX females and XY males) and between the sex chromosomes and the autosomes. Dosage compensation is a process whereby genes on the sex chromosomes achieve equal gene expression which prevents deleterious side effects from having too much or too little expression of genes on sex chromsomes. The green anole is part of a group of species that recently underwent an adaptive radiation. The green anole has XX/XY sex determination, but the content of the X chromosome and its evolution have not been described. Given its status as a model species, better understanding the green anole genome could reveal insights into other species. Genomic analyses are crucial for a comprehensive picture of sex chromosome differentiation and dosage compensation, in addition to understanding speciation.

In order to address this, multiple comparative genomics and bioinformatics analyses were conducted to elucidate patterns of evolution in the green anole and across multiple anole species. Comparative genomics analyses were used to infer additional X-linked loci in the green anole, RNAseq data from male and female samples were anayzed to quantify patterns of sex-biased gene expression across the genome, and the extent of dosage compensation on the anole X chromosome was characterized, providing evidence that the sex chromosomes in the green anole are dosage compensated.

In addition, X-linked genes have a lower ratio of nonsynonymous to synonymous substitution rates than the autosomes when compared to other Anolis species, and pairwise rates of evolution in genes across the anole genome were analyzed. To conduct this analysis a new pipeline was created for filtering alignments and performing batch calculations for whole genome coding sequences. This pipeline has been made publicly available.

ContributorsRupp, Shawn Michael (Author) / Wilson Sayres, Melissa A (Thesis advisor) / Kusumi, Kenro (Committee member) / DeNardo, Dale (Committee member) / Arizona State University (Publisher)

Created2016

Using Molecular, Cellular and Bioengineering Approaches Towards Understanding Muscle Stem Cell Biology

Description

Satellite cells are adult muscle stem cells that activate, proliferate, and differentiate into myofibers upon muscle damage. Satellite cells can be cultured and manipulated in vitro, and thus represent an accessible model for studying skeletal muscle biology, and a potential source of autologous stem cells for regenerative medicine. This work…

Satellite cells are adult muscle stem cells that activate, proliferate, and differentiate into myofibers upon muscle damage. Satellite cells can be cultured and manipulated in vitro, and thus represent an accessible model for studying skeletal muscle biology, and a potential source of autologous stem cells for regenerative medicine. This work summarizes efforts to further understanding of satellite cell biology, using novel model organisms, bioengineering, and molecular and cellular approaches. Lizards are evolutionarily the closest vertebrates to humans that regenerate entire appendages. An analysis of lizard myoprogenitor cell transcriptome determined they were most transcriptionally similar to mammalian satellite cells. Further examination showed that among genes with the highest level of expression in lizard satellite cells were an increased number of regulators of chondrogenesis. In micromass culture, lizard satellite cells formed nodules that expressed chondrogenic regulatory genes, thus demonstrating increased musculoskeletal plasticity. However, to exploit satellite cells for therapeutics, development of an ex vivo culture is necessary. This work investigates whether substrates composed of extracellular matrix (ECM) proteins, as either coatings or hydrogels, can support expansion of this population whilst maintaining their myogenic potency. Stiffer substrates are necessary for in vitro proliferation and differentiation of satellite cells, while the ECM composition was not significantly important. Additionally, satellite cells on hydrogels entered a quiescent state that could be reversed when the cells were subsequently cultured on Matrigel. Proliferation and gene expression data further indicated that C2C12 cells are not a good proxy for satellite cells. To further understand how different signaling pathways control satellite cell behavior, an investigation of the Notch inhibitor protein Numb was carried out. Numb deficient satellite cells fail to activate, proliferate and participate in muscle repair. Examination of Numb isoform expression in satellite cells and embryonic tissues revealed that while developing limb bud, neural tube, and heart express the long and short isoforms of NUMB, satellite cells predominantly express the short isoforms. A preliminary immunoprecipitation- proteomics experiment suggested that the roles of NUMB in satellite cells are related to cell cycle modulation, cytoskeleton dynamics, and regulation of transcription factors necessary for satellite cell function.

ContributorsPalade, Joanna (Author) / Wilson-Rawls, Norma (Thesis advisor) / Rawls, Jeffrey (Committee member) / Kusumi, Kenro (Committee member) / Newbern, Jason (Committee member) / Stabenfeldt, Sarah (Committee member) / Arizona State University (Publisher)

Created2020

Methods for Detecting Mutations in Non-model Organisms

Description

Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The…

Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The problem of accurate genotyping is exacerbated when
there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model or-
ganisms using an example Eucalyptus melliodora individual. I use the structure of
the tree to find bounds on its somatic mutation rate and evaluate several algorithms
for variant calling. I find that conventional methods are suitable if the genome of a
close relative can be adapted to the study organism. However, with structured data,
a likelihood framework that is aware of this structure is more accurate. I use the
techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator
(KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing
data. Base quality scores can help detect errors in sequencing reads, but are often
inaccurate. The most popular method for correcting this issue requires a known
set of variant sites, which is unavailable in most cases. I simulate data and show
that errors in this set of variant sites can cause calibration errors. I then show that
KBBQ accurately recalibrates base quality scores while requiring no reference or other
information and performs as well as other methods.
Finally, I use the Eucalyptus data to investigate the impact of quality score calibra-
tion on the quality of output variant calls and show that improved base quality score
calibration increases the sensitivity and reduces the false positive rate of a variant
calling algorithm.

ContributorsOrr, Adam James (Author) / Cartwright, Reed (Thesis advisor) / Wilson, Melissa (Committee member) / Kusumi, Kenro (Committee member) / Taylor, Jesse (Committee member) / Pfeifer, Susanne (Committee member) / Arizona State University (Publisher)

Created2020

The Agassiz’s Desert Tortoise Genome Provides a Resource for the Conservation of a Threatened Species

Description

Agassiz’s desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on dee…

Agassiz’s desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on deep transcriptome sequences of adult skeletal muscle, lung, brain, and blood. The draft genome assembly for G. agassizii has a scaffold N50 length of 252 kbp and a total length of 2.4 Gbp. Genome annotation reveals 20,172 protein-coding genes in the G. agassizii assembly, and that gene structure is more similar to chicken than other turtles. We provide a series of comparative analyses demonstrating (1) that turtles are among the slowest-evolving genome-enabled reptiles, (2) amino acid changes in genes controlling desert tortoise traits such as shell development, longevity and osmoregulation, and (3) fixed variants across the Gopherus species complex in genes related to desert adaptations, including circadian rhythm and innate immune response. This G. agassizii genome reference and annotation is the first such resource for any tortoise, and will serve as a foundation for future analysis of the genetic basis of adaptations to the desert environment, allow for investigation into genomic factors affecting tortoise health, disease and longevity, and serve as a valuable resource for additional studies in this species complex.

Data Availability: All genomic and transcriptomic sequence files are available from the NIH-NCBI BioProject database (accession numbers PRJNA352725, PRJNA352726, and PRJNA281763). All genome assembly, transcriptome assembly, predicted protein, transcript, genome annotation, repeatmasker, phylogenetic trees, .vcf and GO enrichment files are available on Harvard Dataverse (doi:10.7910/DVN/EH2S9K).

ContributorsTollis, Marc (Author) / DeNardo, Dale F (Author) / Cornelius, John A (Author) / Dolby, Greer A (Author) / Edwards, Taylor (Author) / Henen, Brian T. (Author) / Karl, Alice E. (Author) / Murphy, Robert W. (Author) / Kusumi, Kenro (Author)

Created2017-05-31

Filtering by

Comparative genomics and novel bioinformatics methodology applied to the green anole reveal unique sex chromosome evolution

Using Molecular, Cellular and Bioengineering Approaches Towards Understanding Muscle Stem Cell Biology

Methods for Detecting Mutations in Non-model Organisms

The Agassiz’s Desert Tortoise Genome Provides a Resource for the Conservation of a Threatened Species