Matching Items (16)
Filtering by

Clear all filters

153977-Thumbnail Image.png
Description
Rapid advancements in genomic technologies have increased our understanding of rare human disease. Generation of multiple types of biological data including genetic variation from genome or exome, expression from transcriptome, methylation patterns from epigenome, protein complexity from proteome and metabolite information from metabolome is feasible. "Omics" tools provide comprehensive view

Rapid advancements in genomic technologies have increased our understanding of rare human disease. Generation of multiple types of biological data including genetic variation from genome or exome, expression from transcriptome, methylation patterns from epigenome, protein complexity from proteome and metabolite information from metabolome is feasible. "Omics" tools provide comprehensive view into biological mechanisms that impact disease trait and risk. In spite of available data types and ability to collect them simultaneously from patients, researchers still rely on their independent analysis. Combining information from multiple biological data can reduce missing information, increase confidence in single data findings, and provide a more complete view of genotype-phenotype correlations. Although rare disease genetics has been greatly improved by exome sequencing, a substantial portion of clinical patients remain undiagnosed. Multiple frameworks for integrative analysis of genomic and transcriptomic data are presented with focus on identifying functional genetic variations in patients with undiagnosed, rare childhood conditions. Direct quantitation of X inactivation ratio was developed from genomic and transcriptomic data using allele specific expression and segregation analysis to determine magnitude and inheritance mode of X inactivation. This approach was applied in two families revealing non-random X inactivation in female patients. Expression based analysis of X inactivation showed high correlation with standard clinical assay. These findings improved understanding of molecular mechanisms underlying X-linked disorders. In addition multivariate outlier analysis of gene and exon level data from RNA-seq using Mahalanobis distance, and its integration of distance scores with genomic data found genotype-phenotype correlations in variant prioritization process in 25 families. Mahalanobis distance scores revealed variants with large transcriptional impact in patients. In this dataset, frameshift variants were more likely result in outlier expression signatures than other types of functional variants. Integration of outlier estimates with genetic variants corroborated previously identified, presumed causal variants and highlighted new candidate in previously un-diagnosed case. Integrative genomic approaches in easily attainable tissue will facilitate the search for biomarkers that impact disease trait, uncover pharmacogenomics targets, provide novel insight into molecular underpinnings of un-characterized conditions, and help improve analytical approaches that use large datasets.
ContributorsSzelinger, Szabolcs (Author) / Craig, David W. (Thesis advisor) / Kusumi, Kenro (Thesis advisor) / Narayan, Vinodh (Committee member) / Rosenberg, Michael S. (Committee member) / Huentelman, Matthew J (Committee member) / Arizona State University (Publisher)
Created2015
157806-Thumbnail Image.png
Description
The WNT signaling pathway plays numerous roles in development and maintenance of adult homeostasis. In concordance with it’s numerous roles, dysfunction of WNT signaling leads to a variety of human diseases ranging from developmental disorders to cancer. WNT signaling is composed of a family of 19 WNT soluble secreted glycoproteins,

The WNT signaling pathway plays numerous roles in development and maintenance of adult homeostasis. In concordance with it’s numerous roles, dysfunction of WNT signaling leads to a variety of human diseases ranging from developmental disorders to cancer. WNT signaling is composed of a family of 19 WNT soluble secreted glycoproteins, which are evolutionarily conserved across all phyla of the animal kingdom. WNT ligands interact most commonly with a family of receptors known as frizzled (FZ) receptors, composed of 10 independent genes. Specific interactions between WNT proteins and FZ receptors are not well characterized and are known to be promiscuous, Traditionally canonical WNT signaling is described as a binary system in which WNT signaling is either off or on. In the ‘off’ state, in the absence of a WNT ligand, cytoplasmic β-catenin is continuously degraded by the action of the APC/Axin/GSK-3β destruction complex. In the ‘on’ state, when WNT binds to its Frizzled (Fz) receptor and LRP coreceptor, this protein destruction complex is disrupted, allowing β-catenin to translocate into the nucleus where it interacts with the DNA-bound T cell factor/lymphoid factor (TCF/LEF) family of proteins to regulate target gene expression. However in a variety of systems in development and disease canonical WNT signaling acts in a gradient fashion, suggesting more complex regulation of β-catenin transcriptional activity. As such, the traditional ‘binary’ view of WNT signaling does not clearly explain how this graded signal is transmitted intracellularly to control concentration-dependent changes in gene expression and cell identity. I have developed an in vitro human pluripotent stem cell (hPSC)-based model that recapitulates the same in vivo developmental effects of the WNT signaling gradient on the anterior-posterior (A/P) patterning of the neural tube observed during early development. Using RNA-seq and ChIP-seq I have characterized β-catenin binding at different levels of WNT signaling and identified different classes of β-catenin peaks that bind cis-regulatory elements to influence neural cell fate. This work expands the traditional binary view of canonical WNT signaling and illuminates WNT/β-catenin activity in other developmental and diseased contexts.
ContributorsCutts, Joshua Patrick (Author) / Brafman, David A (Thesis advisor) / Stabenfeldt, Sarah (Committee member) / Nikkhah, Mehdi (Committee member) / Wang, Xiao (Committee member) / Plaisier, Christopher (Committee member) / Arizona State University (Publisher)
Created2019
Description
Traumatic injury to the central nervous or musculoskeletal system in traditional amniote models, such as mouse and chicken, is permanent with long-term physiological and functional effects. However, among amniotes, the ability to regrow complex, multi-tissue structures is unique to non-avian reptiles. Structural regeneration is extensively studied in lizards, with most

Traumatic injury to the central nervous or musculoskeletal system in traditional amniote models, such as mouse and chicken, is permanent with long-term physiological and functional effects. However, among amniotes, the ability to regrow complex, multi-tissue structures is unique to non-avian reptiles. Structural regeneration is extensively studied in lizards, with most species able to regrow a functional tail. The lizard regenerated tail includes the spinal cord, cartilage, de novo muscle, vasculature, and skin, and unlike mammals, these tissues can be replaced in lizards as adults. These studies focus on the events that occur before and after the tail regrowth phase, identifying conserved mechanisms that enable functional tail regeneration in the green anole lizard, Anolis carolinensis. An examination of coordinated interactions between peripheral nerves, Schwann cells, and skeletal muscle reveal that reformation of the lizard neuromuscular system is dependent upon developmental programs as well as those unique to the adult during late stages of regeneration. On the other hand, transcriptomic analysis of the early injury response identified many immunoregulatory genes that may be essential for inhibiting fibrosis and initiating regenerative programs. Lastly, an anatomical and histological study of regrown alligator tails reveal that regenerative capacity varies between different reptile groups, providing comparative opportunities within amniotes and across vertebrates. In order to identify mechanisms that limit regeneration, these cross-species analyses will be critical. Taken together, these studies serve as a foundation for future experimental work that will reveal the interplay between reparative and regenerative mechanisms in adult amniotes with translational implications for medical therapies.
ContributorsXu, Cindy (Author) / Kusumi, Kenro (Thesis advisor) / Newbern, Jason M (Thesis advisor) / Wilson-Rawls, Jeanne (Committee member) / Fisher, Rebecca E (Committee member) / Arizona State University (Publisher)
Created2020
157920-Thumbnail Image.png
Description
Fusion proteins that specifically interact with biochemical marks on chromosomes represent a new class of synthetic transcriptional regulators that decode cell state information rather than deoxyribose nucleic acid (DNA) sequences. In multicellular organisms, information relevant to cell state, tissue identity, and oncogenesis is often encoded as biochemical modifications of histones,

Fusion proteins that specifically interact with biochemical marks on chromosomes represent a new class of synthetic transcriptional regulators that decode cell state information rather than deoxyribose nucleic acid (DNA) sequences. In multicellular organisms, information relevant to cell state, tissue identity, and oncogenesis is often encoded as biochemical modifications of histones, which are bound to DNA in eukaryotic nuclei and regulate gene expression states. In 2011, Haynes et al. showed that a synthetic regulator called the Polycomb chromatin Transcription Factor (PcTF), a fusion protein that binds methylated histones, reactivated an artificially-silenced luciferase reporter gene. These synthetic transcription activators are derived from the polycomb repressive complex (PRC) and associate with the epigenetic silencing mark H3K27me3 to reactivate the expression of silenced genes. It is demonstrated here that the duration of epigenetic silencing does not perturb reactivation via PcTF fusion proteins. After 96 hours PcTF shows the strongest reactivation activity. A variant called Pc2TF, which has roughly double the affinity for H3K27me3 in vitro, reactivated the silenced luciferase gene by at least 2-fold in living cells.
ContributorsVargas, Daniel A. (Author) / Haynes, Karmella (Thesis advisor) / Wang, Xiao (Committee member) / Mills, Jeremy (Committee member) / Arizona State University (Publisher)
Created2019
158849-Thumbnail Image.png
Description
Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The

Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The problem of accurate genotyping is exacerbated when
there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model or-
ganisms using an example Eucalyptus melliodora individual. I use the structure of
the tree to find bounds on its somatic mutation rate and evaluate several algorithms
for variant calling. I find that conventional methods are suitable if the genome of a
close relative can be adapted to the study organism. However, with structured data,
a likelihood framework that is aware of this structure is more accurate. I use the
techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator
(KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing
data. Base quality scores can help detect errors in sequencing reads, but are often
inaccurate. The most popular method for correcting this issue requires a known
set of variant sites, which is unavailable in most cases. I simulate data and show
that errors in this set of variant sites can cause calibration errors. I then show that
KBBQ accurately recalibrates base quality scores while requiring no reference or other
information and performs as well as other methods.
Finally, I use the Eucalyptus data to investigate the impact of quality score calibra-
tion on the quality of output variant calls and show that improved base quality score
calibration increases the sensitivity and reduces the false positive rate of a variant
calling algorithm.
ContributorsOrr, Adam James (Author) / Cartwright, Reed (Thesis advisor) / Wilson, Melissa (Committee member) / Kusumi, Kenro (Committee member) / Taylor, Jesse (Committee member) / Pfeifer, Susanne (Committee member) / Arizona State University (Publisher)
Created2020
158301-Thumbnail Image.png
Description
The human transcriptional regulatory machine utilizes hundreds of transcription factors which bind to specific genic sites resulting in either activation or repression of targeted genes. Networks comprised of nodes and edges can be constructed to model the relationships of regulators and their targets. Within these biological networks small enriched structural

The human transcriptional regulatory machine utilizes hundreds of transcription factors which bind to specific genic sites resulting in either activation or repression of targeted genes. Networks comprised of nodes and edges can be constructed to model the relationships of regulators and their targets. Within these biological networks small enriched structural patterns containing at least three nodes can be identified as potential building blocks from which a network is organized. A first iteration computational pipeline was designed to generate a disease specific gene regulatory network for motif detection using established computational tools. The first goal was to identify motifs that can express themselves in a state that results in differential patient survival in one of the 32 different cancer types studied. This study identified issues for detecting strongly correlated motifs that also effect patient survival, yielding preliminary results for possible driving cancer etiology. Second, a comparison was performed for the topology of network motifs across multiple different data types to identify possible divergence from a conserved enrichment pattern in network perturbing diseases. The topology of enriched motifs across all the datasets converged upon a single conserved pattern reported in a previous study which did not appear to diverge dependent upon the type of disease. This report highlights possible methods to improve detection of disease driving motifs that can aid in identifying possible treatment targets in cancer. Finally, networks where only minimally perturbed, suggesting that regulatory programs were run from evolved circuits into a cancer context.
ContributorsStriker, Shawn Scott (Author) / Plaisier, Christopher (Thesis advisor) / Brafman, David (Committee member) / Wang, Xiao (Committee member) / Arizona State University (Publisher)
Created2020