Search Content

From autopsy donor to stem cell to neuron (and back again): cell line cohorts, IPSC proof-of-principle studies, and transcriptome comparisons of in vitro and in vivo neural cells

Description

Induced pluripotent stem cells (iPSCs) are an intriguing approach for neurological disease modeling, because neural lineage-specific cell types that retain the donors' complex genetics can be established in vitro. The statistical power of these iPSC-based models, however, is dependent on accurate diagnoses of the somatic cell donors; unfortunately, many neurodegenerative…

Induced pluripotent stem cells (iPSCs) are an intriguing approach for neurological disease modeling, because neural lineage-specific cell types that retain the donors' complex genetics can be established in vitro. The statistical power of these iPSC-based models, however, is dependent on accurate diagnoses of the somatic cell donors; unfortunately, many neurodegenerative diseases are commonly misdiagnosed in live human subjects. Postmortem histopathological examination of a donor's brain, combined with premortem clinical criteria, is often the most robust approach to correctly classify an individual as a disease-specific case or unaffected control. We describe the establishment of primary dermal fibroblasts cells lines from 28 autopsy donors. These fibroblasts were used to examine the proliferative effects of establishment protocol, tissue amount, biopsy site, and donor age. As proof-of-principle, iPSCs were generated from fibroblasts from a 75-year-old male, whole body donor, defined as an unaffected neurological control by both clinical and histopathological criteria. To our knowledge, this is the first study describing autopsy donor-derived somatic cells being used for iPSC generation and subsequent neural differentiation. This unique approach also enables us to compare iPSC-derived cell cultures to endogenous tissues from the same donor. We utilized RNA sequencing (RNA-Seq) to evaluate the transcriptional progression of in vitro-differentiated neural cells (over a timecourse of 0, 35, 70, 105 and 140 days), and compared this with donor-identical temporal lobe tissue. We observed in vitro progression towards the reference brain tissue, supported by (i) a significant increasing monotonic correlation between the days of our timecourse and the number of actively transcribed protein-coding genes and long intergenic non-coding RNAs (lincRNAs) (P < 0.05), consistent with the transcriptional complexity of the brain, (ii) an increase in CpG methylation after neural differentiation that resembled the epigenomic signature of the endogenous tissue, and (iii) a significant decreasing monotonic correlation between the days of our timecourse and the percent of in vitro to brain-tissue differences (P < 0.05) for tissue-specific protein-coding genes and all putative lincRNAs. These studies support the utility of autopsy donors' somatic cells for iPSC-based neurological disease models, and provide evidence that in vitro neural differentiation can result in physiologically progression.

ContributorsHjelm, Brooke E (Author) / Craig, David W. (Thesis advisor) / Wilson-Rawls, Norma J. (Thesis advisor) / Huentelman, Matthew J. (Committee member) / Mason, Hugh S. (Committee member) / Kusumi, Kenro (Committee member) / Arizona State University (Publisher)

Created2013

Advancing the lizard, Anolis carolinensis, as a model system for genomic studies of evolution, development and regeneration

Description

Well-established model systems exist in four out of the seven major classes of vertebrates. These include the mouse, chicken, frog and zebrafish. Noticeably missing from this list is a reptilian model organism for comparative studies between the vertebrates and for studies of biological processes unique to reptiles. To help fill…

Well-established model systems exist in four out of the seven major classes of vertebrates. These include the mouse, chicken, frog and zebrafish. Noticeably missing from this list is a reptilian model organism for comparative studies between the vertebrates and for studies of biological processes unique to reptiles. To help fill in this gap the green anole lizard, Anolis carolinensis, is being adapted as a model organism. Despite the recent release of the complete genomic sequence of the A. carolinensis, the lizard lacks some resources to aid researchers in their studies. Particularly, the lack of transcriptomic resources for lizard has made it difficult to identify genes complete with alternative splice forms and untranslated regions (UTRs). As part of this work the genome annotation for A. carolinensis was improved through next generation sequencing and assembly of the transcriptomes from 14 different adult and embryonic tissues. This revised annotation of the lizard will improve comparative studies between vertebrates, as well as studies within A. carolinensis itself, by providing more accurate gene models, which provide the bases for molecular studies. To demonstrate the utility of the improved annotations and reptilian model organism, the developmental process of somitogenesis in the lizard was analyzed and compared with other vertebrates. This study identified several key features both divergent and convergent between the vertebrates, which was not previously known before analysis of a reptilian model organism. The improved genome annotations have also allowed for molecular studies of tail regeneration in the lizard. With the annotation of 3' UTR sequences and next generation sequencing, it is now possible to do expressional studies of miRNA and predict their mRNA target transcripts at genomic scale. Through next generation small RNA sequencing and subsequent analysis, several differentially expressed miRNAs were identified in the regenerating tail, suggesting miRNA may play a key role in regulating this process in lizards. Through miRNA target prediction several key biological pathways were identified as potentially under the regulation of miRNAs during tail regeneration. In total, this work has both helped advance A. carolinensis as model system and displayed the utility of a reptilian model system.

ContributorsEckalbar, Walter L (Author) / Kusumi, Kenro (Thesis advisor) / Huentelman, Matthew (Committee member) / Rawls, Jeffery (Committee member) / Wilson-Rawls, Norma (Committee member) / Arizona State University (Publisher)

Created2012

Genomic diversity and abundance of LINE retrotransposons in 4 anole lizards

Description

Vertebrate genomes demonstrate a remarkable range of sizes from 0.3 to 133 gigabase pairs. The proliferation of repeat elements are a major genomic expansion. In particular, long interspersed nuclear elements (LINES) are autonomous retrotransposons that have the ability to "cut and paste" themselves into a host genome through a mechanism…

Vertebrate genomes demonstrate a remarkable range of sizes from 0.3 to 133 gigabase pairs. The proliferation of repeat elements are a major genomic expansion. In particular, long interspersed nuclear elements (LINES) are autonomous retrotransposons that have the ability to "cut and paste" themselves into a host genome through a mechanism called target-primed reverse transcription. LINES have been called "junk DNA," "viral DNA," and "selfish" DNA, and were once thought to be parasitic elements. However, LINES, which diversified before the emergence of many early vertebrates, has strongly shaped the evolution of eukaryotic genomes. This thesis will evaluate LINE abundance, diversity and activity in four anole lizards. An intrageneric analysis will be conducted using comparative phylogenetics and bioinformatics. Comparisons within the Anolis genus, which derives from a single lineage of an adaptive radiation, will be conducted to explore the relationship between LINE retrotransposon activity and causal changes in genomic size and composition.

ContributorsMay, Catherine (Author) / Kusumi, Kenro (Thesis advisor) / Gadau, Juergen (Committee member) / Rawls, Jeffery A (Committee member) / Arizona State University (Publisher)

Created2013

Structure-function study of telomerase RNA from evolutionary disparate species: remarkable divergence in gross architecture with the preservation of critical universal structural elements

Description

Telomerase enzyme is a truly remarkable enzyme specialized for the addition of short, highly repetitive DNA sequences onto linear eukaryotic chromosome ends. The telomerase enzyme functions as a ribonucleoprotein, minimally composed of the highly conserved catalytic telomerase reverse transcriptase and essential telomerase RNA component containing an internalized short template…

Telomerase enzyme is a truly remarkable enzyme specialized for the addition of short, highly repetitive DNA sequences onto linear eukaryotic chromosome ends. The telomerase enzyme functions as a ribonucleoprotein, minimally composed of the highly conserved catalytic telomerase reverse transcriptase and essential telomerase RNA component containing an internalized short template region within the vastly larger non-coding RNA. Even among closely related groups of species, telomerase RNA is astonishingly divergent in sequence, length, and secondary structure. This massive disparity is highly prohibitive for telomerase RNA identification from previously unexplored groups of species, which is fundamental for secondary structure determination. Combined biochemical enrichment and computational screening methods were employed for the discovery of numerous telomerase RNAs from the poorly characterized echinoderm lineage. This resulted in the revelation that--while closely related to the vertebrate lineage and grossly resembling vertebrate telomerase RNA--the echinoderm telomerase RNA central domain varies extensively in structure and sequence, diverging even within echinoderms amongst sea urchins and brittle stars. Furthermore, the origins of telomerase RNA within the eukaryotic lineage have remained a persistent mystery. The ancient Trypanosoma telomerase RNA was previously identified, however, a functionally verified secondary structure remained elusive. Synthetic Trypanosoma telomerase was generated for molecular dissection of Trypanosoma telomerase RNA revealing two RNA domains functionally equivalent to those found in known telomerase RNAs, yet structurally distinct. This work demonstrates that telomerase RNA is uncommonly divergent in gross architecture, while retaining critical universal elements.

ContributorsPodlevsky, Joshua (Author) / Chen, Julian (Thesis advisor) / Mangone, Marco (Committee member) / Kusumi, Kenro (Committee member) / Wilson-Rawls, Norma (Committee member) / Arizona State University (Publisher)

Created2015

miRNA Targeting: In depth review of biologically significant mechanisms and a bioinformatic approach to identifying targeting sequences in C. elegans

Description

microRNAs (miRNAs) are short ~22nt non-coding RNAs that regulate gene output at the post-transcriptional level. Via targeting of degenerate elements primarily in 3'untranslated regions (3'UTR) of mRNAs, miRNAs can target thousands of varying genes and suppress their protein translation. The precise mechanistic function and bio- logical role of miRNAs is…

microRNAs (miRNAs) are short ~22nt non-coding RNAs that regulate gene output at the post-transcriptional level. Via targeting of degenerate elements primarily in 3'untranslated regions (3'UTR) of mRNAs, miRNAs can target thousands of varying genes and suppress their protein translation. The precise mechanistic function and bio- logical role of miRNAs is not fully understood and yet it is a major contributor to a pleth- ora of diseases, including neurological disorders, muscular disorders, and cancer. Cer- tain model organisms are valuable in understanding the function of miRNA and there- fore fully understanding the biological significance of miRNA targeting. Here I report a mechanistic analysis of miRNA targeting in C. elegans, and a bioinformatic approach to aid in further investigation of miRNA targeted sequences. A few of the biologically significant mechanisms discussed in this thesis include alternative polyadenylation, RNA binding proteins, components of the miRNA recognition machinery, miRNA secondary structures, and their polymorphisms. This thesis also discusses a novel bioinformatic approach to studying miRNA biology, including computational miRNA target prediction software, and sequence complementarity. This thesis allows a better understanding of miRNA biology and presents an ideal strategy for approaching future research in miRNA targeting.

ContributorsWeigele, Dustin Keith (Author) / Mangone, Marco (Thesis director) / Katchman, Benjamin (Committee member) / Barrett, The Honors College (Contributor) / Department of Chemistry and Biochemistry (Contributor) / School of Life Sciences (Contributor)

Created2014-12

Insights towards developing regenerative therapies: the lizard, Anolis carolinensis, as a genetic model for regeneration in amniotes

Description

Damage to the central nervous system due to spinal cord or traumatic brain injury, as well as degenerative musculoskeletal disorders such as arthritis, drastically impact the quality of life. Regeneration of complex structures is quite limited in mammals, though other vertebrates possess this ability. Lizards are the most closely related…

Damage to the central nervous system due to spinal cord or traumatic brain injury, as well as degenerative musculoskeletal disorders such as arthritis, drastically impact the quality of life. Regeneration of complex structures is quite limited in mammals, though other vertebrates possess this ability. Lizards are the most closely related organism to humans that can regenerate de novo skeletal muscle, hyaline cartilage, spinal cord, vasculature, and skin. Progress in studying the cellular and molecular mechanisms of lizard regeneration has previously been limited by a lack of genomic resources. Building on the release of the genome of the green anole, Anolis carolinensis, we developed a second generation, robust RNA-Seq-based genome annotation, and performed the first transcriptomic analysis of tail regeneration in this species. In order to investigate gene expression in regenerating tissue, we performed whole transcriptome and microRNA transcriptome analysis of regenerating tail tip and base and associated tissues, identifying key genetic targets in the regenerative process. These studies have identified components of a genetic program for regeneration in the lizard that includes both developmental and adult repair mechanisms shared with mammals, indicating value in the translation of these findings to future regenerative therapies.

ContributorsHutchins, Elizabeth (Author) / Kusumi, Kenro (Thesis advisor) / Rawls, Jeffrey A. (Committee member) / Denardo, Dale F. (Committee member) / Huentelman, Matthew J. (Committee member) / Arizona State University (Publisher)

Created2015

Comparative genomics and novel bioinformatics methodology applied to the green anole reveal unique sex chromosome evolution

Description

In species with highly heteromorphic sex chromosomes, the degradation of one of the sex chromosomes can result in unequal gene expression between the sexes (e.g., between XX females and XY males) and between the sex chromosomes and the autosomes. Dosage compensation is a process whereby genes on the sex chromosomes…

In species with highly heteromorphic sex chromosomes, the degradation of one of the sex chromosomes can result in unequal gene expression between the sexes (e.g., between XX females and XY males) and between the sex chromosomes and the autosomes. Dosage compensation is a process whereby genes on the sex chromosomes achieve equal gene expression which prevents deleterious side effects from having too much or too little expression of genes on sex chromsomes. The green anole is part of a group of species that recently underwent an adaptive radiation. The green anole has XX/XY sex determination, but the content of the X chromosome and its evolution have not been described. Given its status as a model species, better understanding the green anole genome could reveal insights into other species. Genomic analyses are crucial for a comprehensive picture of sex chromosome differentiation and dosage compensation, in addition to understanding speciation.

In order to address this, multiple comparative genomics and bioinformatics analyses were conducted to elucidate patterns of evolution in the green anole and across multiple anole species. Comparative genomics analyses were used to infer additional X-linked loci in the green anole, RNAseq data from male and female samples were anayzed to quantify patterns of sex-biased gene expression across the genome, and the extent of dosage compensation on the anole X chromosome was characterized, providing evidence that the sex chromosomes in the green anole are dosage compensated.

In addition, X-linked genes have a lower ratio of nonsynonymous to synonymous substitution rates than the autosomes when compared to other Anolis species, and pairwise rates of evolution in genes across the anole genome were analyzed. To conduct this analysis a new pipeline was created for filtering alignments and performing batch calculations for whole genome coding sequences. This pipeline has been made publicly available.

ContributorsRupp, Shawn Michael (Author) / Wilson Sayres, Melissa A (Thesis advisor) / Kusumi, Kenro (Committee member) / DeNardo, Dale (Committee member) / Arizona State University (Publisher)

Created2016

Transcriptome gene expression analysis of breast cancer using RNA-Seq

Description

Background: Breast cancer is the most frequently diagnosed cancer and the leading cause of cancer deaths in females worldwide, accounting for 23% of all new cancer cases and 14% of all total cancer deaths in 2008. Five tumor-normal pairs of primary breast epithelial cells were treated for infinite proliferation by…

Background: Breast cancer is the most frequently diagnosed cancer and the leading cause of cancer deaths in females worldwide, accounting for 23% of all new cancer cases and 14% of all total cancer deaths in 2008. Five tumor-normal pairs of primary breast epithelial cells were treated for infinite proliferation by using a ROCK inhibitor and mouse feeder cells. Methods: Raw paired-end, 100x coverage RNA-Seq data was aligned to the Human Reference Genome Version 19 using BWA and Tophat. Gene differential expression analysis was completed using Cufflinks and Cuffdiff. Interactive Genome Viewer was used for data visualization. Results: 15 genes were found to be down-regulated by at least one log-fold change in 4/5 of tumor samples. 75 genes were found to be down-regulated in 3/5 of our tumor samples by at least one log-fold change. 11 genes were found to be up-regulated in 4/5 of our tumor samples, and 68 genes were identified to be up-regulated in 3/5 of the tumor samples by at least one-fold change. Conclusion: Expression changes in genes such as AZGP1, AGER, ALG11, and S1007 suggest a disruption in the glycosylation pathway. No correlation was found between Cufflink's Her2 gene-expression and DAKO score classification.

ContributorsHernandez, Fernando (Author) / Anderson, Karen (Thesis director) / Mangone, Marco (Committee member) / Park, Jin (Committee member) / Barrett, The Honors College (Contributor) / Department of Information Systems (Contributor)

Created2013-05

The Agassiz’s Desert Tortoise Genome Provides a Resource for the Conservation of a Threatened Species

Description

Agassiz’s desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on dee…

Agassiz’s desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on deep transcriptome sequences of adult skeletal muscle, lung, brain, and blood. The draft genome assembly for G. agassizii has a scaffold N50 length of 252 kbp and a total length of 2.4 Gbp. Genome annotation reveals 20,172 protein-coding genes in the G. agassizii assembly, and that gene structure is more similar to chicken than other turtles. We provide a series of comparative analyses demonstrating (1) that turtles are among the slowest-evolving genome-enabled reptiles, (2) amino acid changes in genes controlling desert tortoise traits such as shell development, longevity and osmoregulation, and (3) fixed variants across the Gopherus species complex in genes related to desert adaptations, including circadian rhythm and innate immune response. This G. agassizii genome reference and annotation is the first such resource for any tortoise, and will serve as a foundation for future analysis of the genetic basis of adaptations to the desert environment, allow for investigation into genomic factors affecting tortoise health, disease and longevity, and serve as a valuable resource for additional studies in this species complex.

Data Availability: All genomic and transcriptomic sequence files are available from the NIH-NCBI BioProject database (accession numbers PRJNA352725, PRJNA352726, and PRJNA281763). All genome assembly, transcriptome assembly, predicted protein, transcript, genome annotation, repeatmasker, phylogenetic trees, .vcf and GO enrichment files are available on Harvard Dataverse (doi:10.7910/DVN/EH2S9K).

ContributorsTollis, Marc (Author) / DeNardo, Dale F (Author) / Cornelius, John A (Author) / Dolby, Greer A (Author) / Edwards, Taylor (Author) / Henen, Brian T. (Author) / Karl, Alice E. (Author) / Murphy, Robert W. (Author) / Kusumi, Kenro (Author)

Created2017-05-31

Profiling of Indel Phases in Coding Regions

Description

Advances in sequencing technology have generated an enormous amount of data over the past decade. Equally advanced computational methods are needed to conduct comparative and functional genomic studies on these datasets, in particular tools that appropriately interpret indels within an evolutionary framework. The evolutionary history of indels is complex and…

Advances in sequencing technology have generated an enormous amount of data over the past decade. Equally advanced computational methods are needed to conduct comparative and functional genomic studies on these datasets, in particular tools that appropriately interpret indels within an evolutionary framework. The evolutionary history of indels is complex and often involves repetitive genomic regions, which makes identification, alignment, and annotation difficult. While previous studies have found that indel lengths in both deoxyribonucleic acid and proteins obey a power law, probabilistic models for indel evolution have rarely been explored due to their computational complexity. In my research, I first explore an application of an expectation-maximization algorithm for maximum-likelihood training of a codon substitution model. I demonstrate the training accuracy of the expectation-maximization on my substitution model. Then I apply this algorithm on a published 90 pairwise species dataset and find a negative correlation between the branch length and non-synonymous selection coefficient. Second, I develop a post-alignment fixation method to profile each indel event into three different phases according to its codon position. Because current codon-aware models can only identify the indels by placing the gaps between codons and lead to the misalignment of the sequences. I find that the mouse-rat species pair is under purifying selection by looking at the proportion difference of the indel phases. I also demonstrate the power of my sliding-window method by comparing the post-aligned and original gap positions. Third, I create an indel-phase moore machine including the indel rates of three phases, length distributions, and codon substitution models. Then I design a gillespie simulation that is capable of generating true sequence alignments. Next I develop an importance sampling method within the expectation-maximization algorithm that can successfully train the indel-phase model and infer accurate parameter estimates from alignments. Finally, I extend the indel phase analysis to the 90 pairwise species dataset across three alignment methods, including Mafft+sw method developed in chapter 3, coati-sampling methods applied in chapter 4, and coati-max method. Also I explore a non-linear relationship between the dN/dS and Zn/(Zn+Zs) ratio across 90 species pairs.

ContributorsZhu, Ziqi (Author) / Cartwright, Reed A (Thesis advisor) / Taylor, Jay (Committee member) / Wideman, Jeremy (Committee member) / Mangone, Marco (Committee member) / Arizona State University (Publisher)

Created2022

Filtering by