Matching Items (27)
Filtering by

Clear all filters

152029-Thumbnail Image.png
Description
Induced pluripotent stem cells (iPSCs) are an intriguing approach for neurological disease modeling, because neural lineage-specific cell types that retain the donors' complex genetics can be established in vitro. The statistical power of these iPSC-based models, however, is dependent on accurate diagnoses of the somatic cell donors; unfortunately, many neurodegenerative

Induced pluripotent stem cells (iPSCs) are an intriguing approach for neurological disease modeling, because neural lineage-specific cell types that retain the donors' complex genetics can be established in vitro. The statistical power of these iPSC-based models, however, is dependent on accurate diagnoses of the somatic cell donors; unfortunately, many neurodegenerative diseases are commonly misdiagnosed in live human subjects. Postmortem histopathological examination of a donor's brain, combined with premortem clinical criteria, is often the most robust approach to correctly classify an individual as a disease-specific case or unaffected control. We describe the establishment of primary dermal fibroblasts cells lines from 28 autopsy donors. These fibroblasts were used to examine the proliferative effects of establishment protocol, tissue amount, biopsy site, and donor age. As proof-of-principle, iPSCs were generated from fibroblasts from a 75-year-old male, whole body donor, defined as an unaffected neurological control by both clinical and histopathological criteria. To our knowledge, this is the first study describing autopsy donor-derived somatic cells being used for iPSC generation and subsequent neural differentiation. This unique approach also enables us to compare iPSC-derived cell cultures to endogenous tissues from the same donor. We utilized RNA sequencing (RNA-Seq) to evaluate the transcriptional progression of in vitro-differentiated neural cells (over a timecourse of 0, 35, 70, 105 and 140 days), and compared this with donor-identical temporal lobe tissue. We observed in vitro progression towards the reference brain tissue, supported by (i) a significant increasing monotonic correlation between the days of our timecourse and the number of actively transcribed protein-coding genes and long intergenic non-coding RNAs (lincRNAs) (P < 0.05), consistent with the transcriptional complexity of the brain, (ii) an increase in CpG methylation after neural differentiation that resembled the epigenomic signature of the endogenous tissue, and (iii) a significant decreasing monotonic correlation between the days of our timecourse and the percent of in vitro to brain-tissue differences (P < 0.05) for tissue-specific protein-coding genes and all putative lincRNAs. These studies support the utility of autopsy donors' somatic cells for iPSC-based neurological disease models, and provide evidence that in vitro neural differentiation can result in physiologically progression.
ContributorsHjelm, Brooke E (Author) / Craig, David W. (Thesis advisor) / Wilson-Rawls, Norma J. (Thesis advisor) / Huentelman, Matthew J. (Committee member) / Mason, Hugh S. (Committee member) / Kusumi, Kenro (Committee member) / Arizona State University (Publisher)
Created2013
151689-Thumbnail Image.png
Description
Sparsity has become an important modeling tool in areas such as genetics, signal and audio processing, medical image processing, etc. Via the penalization of l-1 norm based regularization, the structured sparse learning algorithms can produce highly accurate models while imposing various predefined structures on the data, such as feature groups

Sparsity has become an important modeling tool in areas such as genetics, signal and audio processing, medical image processing, etc. Via the penalization of l-1 norm based regularization, the structured sparse learning algorithms can produce highly accurate models while imposing various predefined structures on the data, such as feature groups or graphs. In this thesis, I first propose to solve a sparse learning model with a general group structure, where the predefined groups may overlap with each other. Then, I present three real world applications which can benefit from the group structured sparse learning technique. In the first application, I study the Alzheimer's Disease diagnosis problem using multi-modality neuroimaging data. In this dataset, not every subject has all data sources available, exhibiting an unique and challenging block-wise missing pattern. In the second application, I study the automatic annotation and retrieval of fruit-fly gene expression pattern images. Combined with the spatial information, sparse learning techniques can be used to construct effective representation of the expression images. In the third application, I present a new computational approach to annotate developmental stage for Drosophila embryos in the gene expression images. In addition, it provides a stage score that enables one to more finely annotate each embryo so that they are divided into early and late periods of development within standard stage demarcations. Stage scores help us to illuminate global gene activities and changes much better, and more refined stage annotations improve our ability to better interpret results when expression pattern matches are discovered between genes.
ContributorsYuan, Lei (Author) / Ye, Jieping (Thesis advisor) / Wang, Yalin (Committee member) / Xue, Guoliang (Committee member) / Kumar, Sudhir (Committee member) / Arizona State University (Publisher)
Created2013
152297-Thumbnail Image.png
Description
This thesis research focuses on developing a single-cell gene expression analysis method for marine diatom Thalassiosira pseudonana and constructing a chip level tool to realize the single cell RT-qPCR analysis. This chip will serve as a conceptual foundation for future deployable ocean monitoring systems. T. pseudonana, which is a common

This thesis research focuses on developing a single-cell gene expression analysis method for marine diatom Thalassiosira pseudonana and constructing a chip level tool to realize the single cell RT-qPCR analysis. This chip will serve as a conceptual foundation for future deployable ocean monitoring systems. T. pseudonana, which is a common surface water microorganism, was detected in the deep ocean as confirmed by phylogenetic and microbial community functional studies. Six-fold copy number differences between 23S rRNA and 23S rDNA were observed by RT-qPCR, demonstrating the moderate functional activity of detected photosynthetic microbes in the deep ocean including T. pseudonana. Because of the ubiquity of T. pseudonana, it is a good candidate for an early warning system for ocean environmental perturbation monitoring. This early warning system will depend on identifying outlier gene expression at the single-cell level. An early warning system based on single-cell analysis is expected to detect environmental perturbations earlier than population level analysis which can only be observed after a whole community has reacted. Preliminary work using tube-based, two-step RT-qPCR revealed for the first time, gene expression heterogeneity of T. pseudonana under different nutrient conditions. Heterogeneity was revealed by different gene expression activity for individual cells under the same conditions. This single cell analysis showed a skewed, lognormal distribution and helped to find outlier cells. The results indicate that the geometric average becomes more important and representative of the whole population than the arithmetic average. This is in contrast with population level analysis which is limited to arithmetic averages only and highlights the value of single cell analysis. In order to develop a deployable sensor in the ocean, a chip level device was constructed. The chip contains surface-adhering droplets, defined by hydrophilic patterning, that serve as real-time PCR reaction chambers when they are immersed in oil. The chip had demonstrated sensitivities at the single cell level for both DNA and RNA. The successful rate of these chip-based reactions was around 85%. The sensitivity of the chip was equivalent to published microfluidic devices with complicated designs and protocols, but the production process of the chip was simple and the materials were all easily accessible in conventional environmental and/or biology laboratories. On-chip tests provided heterogeneity information about the whole population and were validated by comparing with conventional tube based methods and by p-values analysis. The power of chip-based single-cell analyses were mainly between 65-90% which were acceptable and can be further increased by higher throughput devices. With this chip and single-cell analysis approaches, a new paradigm for robust early warning systems of ocean environmental perturbation is possible.
ContributorsShi, Xu (Author) / Meldrum, Deirdre R. (Thesis advisor) / Zhang, Weiwen (Committee member) / Chao, Shih-hui (Committee member) / Westerhoff, Paul (Committee member) / Arizona State University (Publisher)
Created2013
152847-Thumbnail Image.png
Description
The processes of a human somatic cell are very complex with various genetic mechanisms governing its fate. Such cells undergo various genetic mutations, which translate to the genetic aberrations that we see in cancer. There are more than 100 types of cancer, each having many more subtypes with aberrations being

The processes of a human somatic cell are very complex with various genetic mechanisms governing its fate. Such cells undergo various genetic mutations, which translate to the genetic aberrations that we see in cancer. There are more than 100 types of cancer, each having many more subtypes with aberrations being unique to each. In the past two decades, the widespread application of high-throughput genomic technologies, such as micro-arrays and next-generation sequencing, has led to the revelation of many such aberrations. Known types and subtypes can be readily identified using gene-expression profiling and more importantly, high-throughput genomic datasets have helped identify novel sub-types with distinct signatures. Recent studies showing usage of gene-expression profiling in clinical decision making in breast cancer patients underscore the utility of high-throughput datasets. Beyond prognosis, understanding the underlying cellular processes is essential for effective cancer treatment. Various high-throughput techniques are now available to look at a particular aspect of a genetic mechanism in cancer tissue. To look at these mechanisms individually is akin to looking at a broken watch; taking apart each of its parts, looking at them individually and finally making a list of all the faulty ones. Integrative approaches are needed to transform one-dimensional cancer signatures into multi-dimensional interaction and regulatory networks, consequently bettering our understanding of cellular processes in cancer. Here, I attempt to (i) address ways to effectively identify high quality variants when multiple assays on the same sample samples are available through two novel tools, snpSniffer and NGSPE; (ii) glean new biological insight into multiple myeloma through two novel integrative analysis approaches making use of disparate high-throughput datasets. While these methods focus on multiple myeloma datasets, the informatics approaches are applicable to all cancer datasets and will thus help advance cancer genomics.
ContributorsYellapantula, Venkata (Author) / Dinu, Valentin (Thesis advisor) / Scotch, Matthew (Committee member) / Wallstrom, Garrick (Committee member) / Keats, Jonathan (Committee member) / Arizona State University (Publisher)
Created2014
150126-Thumbnail Image.png
Description
Given the process of tumorigenesis, biological signaling pathways have become of interest in the field of oncology. Many of the regulatory mechanisms that are altered in cancer are directly related to signal transduction and cellular communication. Thus, identifying signaling pathways that have become deregulated may provide useful information

Given the process of tumorigenesis, biological signaling pathways have become of interest in the field of oncology. Many of the regulatory mechanisms that are altered in cancer are directly related to signal transduction and cellular communication. Thus, identifying signaling pathways that have become deregulated may provide useful information to better understanding altered regulatory mechanisms within cancer. Many methods that have been created to measure the distinct activity of signaling pathways have relied strictly upon transcription profiles. With advancements in comparative genomic hybridization techniques, copy number data has become extremely useful in providing valuable information pertaining to the genomic landscape of cancer. The purpose of this thesis is to develop a methodology that incorporates both gene expression and copy number data to identify signaling pathways that have become deregulated in cancer. The central idea is that copy number data may significantly assist in identifying signaling pathway deregulation by justifying the aberrant activity being measured in gene expression profiles. This method was then applied to four different subtypes of breast cancer resulting in the identification of signaling pathways associated with distinct functionalities for each of the breast cancer subtypes.
ContributorsTrevino, Robert (Author) / Kim, Seungchan (Thesis advisor) / Ringner, Markus (Committee member) / Liu, Huan (Committee member) / Arizona State University (Publisher)
Created2011
149928-Thumbnail Image.png
Description
The technology expansion seen in the last decade for genomics research has permitted the generation of large-scale data sources pertaining to molecular biological assays, genomics, proteomics, transcriptomics and other modern omics catalogs. New methods to analyze, integrate and visualize these data types are essential to unveil relevant disease mechanisms. Towards

The technology expansion seen in the last decade for genomics research has permitted the generation of large-scale data sources pertaining to molecular biological assays, genomics, proteomics, transcriptomics and other modern omics catalogs. New methods to analyze, integrate and visualize these data types are essential to unveil relevant disease mechanisms. Towards these objectives, this research focuses on data integration within two scenarios: (1) transcriptomic, proteomic and functional information and (2) real-time sensor-based measurements motivated by single-cell technology. To assess relationships between protein abundance, transcriptomic and functional data, a nonlinear model was explored at static and temporal levels. The successful integration of these heterogeneous data sources through the stochastic gradient boosted tree approach and its improved predictability are some highlights of this work. Through the development of an innovative validation subroutine based on a permutation approach and the use of external information (i.e., operons), lack of a priori knowledge for undetected proteins was overcome. The integrative methodologies allowed for the identification of undetected proteins for Desulfovibrio vulgaris and Shewanella oneidensis for further biological exploration in laboratories towards finding functional relationships. In an effort to better understand diseases such as cancer at different developmental stages, the Microscale Life Science Center headquartered at the Arizona State University is pursuing single-cell studies by developing novel technologies. This research arranged and applied a statistical framework that tackled the following challenges: random noise, heterogeneous dynamic systems with multiple states, and understanding cell behavior within and across different Barrett's esophageal epithelial cell lines using oxygen consumption curves. These curves were characterized with good empirical fit using nonlinear models with simple structures which allowed extraction of a large number of features. Application of a supervised classification model to these features and the integration of experimental factors allowed for identification of subtle patterns among different cell types visualized through multidimensional scaling. Motivated by the challenges of analyzing real-time measurements, we further explored a unique two-dimensional representation of multiple time series using a wavelet approach which showcased promising results towards less complex approximations. Also, the benefits of external information were explored to improve the image representation.
ContributorsTorres Garcia, Wandaliz (Author) / Meldrum, Deirdre R. (Thesis advisor) / Runger, George C. (Thesis advisor) / Gel, Esma S. (Committee member) / Li, Jing (Committee member) / Zhang, Weiwen (Committee member) / Arizona State University (Publisher)
Created2011
156067-Thumbnail Image.png
Description
Plants are a promising upcoming platform for production of vaccine components and other desirable pharmaceutical proteins that can only, at present, be made in living systems. The unique soil microbe Agrobacterium tumefaciens can transfer DNA to plants very efficiently, essentially turning plants into factories capable of producing virtually any gene.

Plants are a promising upcoming platform for production of vaccine components and other desirable pharmaceutical proteins that can only, at present, be made in living systems. The unique soil microbe Agrobacterium tumefaciens can transfer DNA to plants very efficiently, essentially turning plants into factories capable of producing virtually any gene. While genetically modified bacteria have historically been used for producing useful biopharmaceuticals like human insulin, plants can assemble much more complicated proteins, like human antibodies, that bacterial systems cannot. As plants do not harbor human pathogens, they are also safer alternatives than animal cell cultures. Additionally, plants can be grown very cheaply, in massive quantities.

In my research, I have studied the genetic mechanisms that underlie gene expression, in order to improve plant-based biopharmaceutical production. To do this, inspiration was drawn from naturally-occurring gene regulatory mechanisms, especially those from plant viruses, which have evolved mechanisms to co-opt the plant cellular machinery to produce high levels of viral proteins. By testing, modifying, and combining genetic elements from diverse sources, an optimized expression system has been developed that allows very rapid production of vaccine components, monoclonal antibodies, and other biopharmaceuticals. To improve target gene expression while maintaining the health and function of the plants, I identified, studied, and modified 5’ untranslated regions, combined gene terminators, and a nuclear matrix attachment region. The replication mechanisms of a plant geminivirus were also studied, which lead to additional strategies to produce more toxic biopharmaceutical proteins. Finally, the mechanisms employed by a geminivirus to spread between cells were investigated. It was demonstrated that these movement mechanisms can be functionally transplanted into a separate genus of geminivirus, allowing modified virus-based gene expression vectors to be spread between neighboring plant cells. Additionally, my work helps shed light on the basic genetic mechanisms employed by all living organisms to control gene expression.
ContributorsDiamos, Andy (Author) / Mason, Hugh S (Thesis advisor) / Mor, Tsafrir (Committee member) / Hogue, Brenda (Committee member) / Stout, Valerie (Committee member) / Arizona State University (Publisher)
Created2017
156404-Thumbnail Image.png
Description
Recombinases are powerful tools for genome engineering and synthetic biology, however recombinases are limited by a lack of user-programmability and often require complex directed-evolution experiments to retarget specificity. Conversely, CRISPR systems have extreme versatility yet can induce off-target mutations and karyotypic destabilization. To address these constraints we developed an RNA-guided

Recombinases are powerful tools for genome engineering and synthetic biology, however recombinases are limited by a lack of user-programmability and often require complex directed-evolution experiments to retarget specificity. Conversely, CRISPR systems have extreme versatility yet can induce off-target mutations and karyotypic destabilization. To address these constraints we developed an RNA-guided recombinase protein by fusing a hyperactive mutant resolvase from transposon TN3 to catalytically inactive Cas9. We validated recombinase-Cas9 (rCas9) function in model eukaryote Saccharomyces cerevisiae using a chromosomally integrated fluorescent reporter. Moreover, we demonstrated cooperative targeting by CRISPR RNAs at spacings of 22 or 40bps is necessary for directing recombination. Using PCR and Sanger sequencing, we confirmed rCas9 targets DNA recombination. With further development we envision rCas9 becoming useful in the development of RNA-programmed genetic circuitry as well as high-specificity genome engineering.
ContributorsStandage-Beier, Kylie S (Author) / Wang, Xiao (Thesis advisor) / Brafman, David A (Committee member) / Tian, Xiao-jun (Committee member) / Arizona State University (Publisher)
Created2018
156030-Thumbnail Image.png
Description
Cancer is a heterogeneous disease with discrete oncogenic mechanisms. P53 mutation is the most common oncogenic mutation in many cancers including breast cancer. This dissertation focuses on fundamental genetic alterations enforced by p53 mutation as an indirect target. p53 mutation upregulates the mevalonate pathway genes altering cholesterol biosynthesis and prenylation.

Cancer is a heterogeneous disease with discrete oncogenic mechanisms. P53 mutation is the most common oncogenic mutation in many cancers including breast cancer. This dissertation focuses on fundamental genetic alterations enforced by p53 mutation as an indirect target. p53 mutation upregulates the mevalonate pathway genes altering cholesterol biosynthesis and prenylation. Prenylation, a lipid modification, is required for small GTPases signaling cascades. Project 1 demonstrates that prenylation inhibition can specifically target cells harboring p53 mutation resulting in reduced tumor proliferation and migration. Mutating p53 is associated with Ras and RhoA activation and statin prevents this activity by inhibiting prenylation. Ras-related pathway genes were selected from the transcriptomic analysis for evaluating correlation to statin sensitivity. A gene signature of seventeen genes and TP53 genotype (referred to as MPR signature) is generated to predict response to statins. MPR signature is validated through two datasets of drug screening in cell lines. As advancements in targeted gene modification are rising, the CRISPR-Cas9 technology has emerged as a new cancer therapeutic strategy. One of the important risk factors in gene therapy is the immune recognition of the exogenous therapeutic tool, resulting in obstruction of treatment and possibly serious health consequences. Project 2 describes a method development that can potentially improve the safety and efficacy of gene-targeting proteins. A cohort of 155 healthy individuals was screened for pre-existing B cell and T cell immune response to the S. pyogenes Cas9 protein. We detected antibodies against Cas9 in more than 10% of the healthy population and identified two immunodominant T cell epitopes of this protein. A de-immunized Cas9 that maintains the wild-type functionality was engineered by mutating the identified T cell epitopes. The gene signature and method described here have the potential to improve strategies for genome-driven tumor targeting.
ContributorsRoshdi Ferdosi, Shayesteh (Author) / Anderson, Karen S (Thesis advisor) / LaBaer, Joshua (Thesis advisor) / Woodbury, Neel (Committee member) / Borges, Chad (Committee member) / Arizona State University (Publisher)
Created2017
157282-Thumbnail Image.png
Description
Parkinson’s disease (PD) is a progressive neurodegenerative disorder, diagnosed late in

the disease by a series of motor deficits that manifest over years or decades. It is characterized by degeneration of mid-brain dopaminergic neurons with a high prevalence of dementia associated with the spread of pathology to cortical regions. Patients exhibiting

Parkinson’s disease (PD) is a progressive neurodegenerative disorder, diagnosed late in

the disease by a series of motor deficits that manifest over years or decades. It is characterized by degeneration of mid-brain dopaminergic neurons with a high prevalence of dementia associated with the spread of pathology to cortical regions. Patients exhibiting symptoms have already undergone significant neuronal loss without chance for recovery. Analysis of disease specific changes in gene expression directly from human patients can uncover invaluable clues about a still unknown etiology, the potential of which grows exponentially as additional gene regulatory measures are questioned. Epigenetic mechanisms are emerging as important components of neurodegeneration, including PD; the extent to which methylation changes correlate with disease progression has not yet been reported. This collection of work aims to define multiple layers of PD that will work toward developing biomarkers that not only could improve diagnostic accuracy, but also push the boundaries of the disease detection timeline. I examined changes in gene expression, alternative splicing of those gene products, and the regulatory mechanism of DNA methylation in the Parkinson’s disease system, as well as the pathologically related Alzheimer’s disease (AD). I first used RNA sequencing (RNAseq) to evaluate differential gene expression and alternative splicing in the posterior cingulate cortex of patients with PD and PD with dementia (PDD). Next, I performed a longitudinal genome-wide methylation study surveying ~850K CpG methylation sites in whole blood from 189 PD patients and 191 control individuals obtained at both a baseline and at a follow-up visit after 2 years. I also considered how symptom management medications could affect the regulatory mechanism of DNA methylation. In the last chapter of this work, I intersected RNAseq and DNA methylation array datasets from whole blood patient samples for integrated differential analyses of both PD and AD. Changes in gene expression and DNA methylation reveal clear patterns of pathway dysregulation that can be seen across brain and blood, from one study to the next. I present a thorough survey of molecular changes occurring within the idiopathic Parkinson’s disease patient and propose candidate targets for potential molecular biomarkers.
ContributorsHenderson, Adrienne Rose (Author) / Huentelman, Matthew J (Thesis advisor) / Newbern, Jason (Thesis advisor) / Dunckley, Travis L (Committee member) / Jensen, Kendall (Committee member) / Wilson, Melissa (Committee member) / Arizona State University (Publisher)
Created2019