I hypothesize that duplication events grant miRNA families enhanced regulatory capabilities, specifically through distinct targeting preferences among family members. This has relevance for our understanding of vertebrate evolution, as well as disease detection and personalized medicine. To test this hypothesis, I apply a combination of bioinformatic and experimental approaches, and design a novel high-throughput screening platform to identify human miRNA targets. Combined with conventional approaches, this tool allows systematic testing for functional targets of human miRNAs, and the identification of novel target genes on an unprecedented scale.
In this dissertation, I explore evolutionary signatures of 62 deeply conserved metazoan miRNA families, as well as the targeting preferences of several human miRNAs. I find that constraints on miRNA processing impact sequence evolution, creating evolutionary hotspots within families that guide distinct target preferences. I apply this novel screening platform to two cancer-relevant miRNAs, and identify hundreds of previously undescribed targets. I also analyze critical features of functional miRNA target sites, finding that each miRNA recognizes surprisingly distinct features of targets. To further explore the functional distinction between family members, I analyze miRNA expression patterns in multiple contexts, including mouse embryogenesis, RNA-seq data from human tissues, and cancer cell lines. Together, my results inform a model that describes the evolution of metazoan miRNAs, and suggest that highly similar miRNA family members possess distinct functions. These findings broaden our understanding of miRNA function in vertebrate evolution and development, and of how miRNA misexpression contributes to human disease.
The conservation of the dystrophin gene across metazoans suggests that both vertebrate and invertebrate model systems can provide valuable contributions to the understanding of DMD initiation and progression. Specifically, the invertebrate C. elegans possesses a dystrophin protein ortholog, dys-1, and a mild inflammatory response that is inactive in the muscle, allowing for the characterization of transcriptome rearrangements affecting disease progression independently of inflammation. Furthermore, C. elegans do not possess a satellite cell equivalent, meaning muscle regeneration does not occur. This makes C. elegans unique in that they allow for the study of dystrophin deficiencies without muscle regeneration that may obscure detection of subtle but consequential changes in gene expression.
I hypothesize that gaining a comprehensive definition of both the structural and signaling roles of dystrophin in C. elegans will improve the community’s understanding of the progression of DMD as a whole. To address this hypothesis, I have performed a phylogenetic analysis on the conservation of each member of the dystrophin-associated protein complex (DAPC) across 10 species, established an in vivo system to identify muscle-specific changes in gene expression in dystrophin-deficient C. elegans, and performed a functional analysis to test the biological significance of the changes in gene expression identified in my sequencing results. The results from this study indicate that in C. elegans, dystrophin may have a signaling role early in development, and its absence may activate compensatory mechanisms that counteract disease progression. Furthermore, these findings allow for the identification of transcriptome changes that may serve as both independent drivers of disease and therapeutic targets for the treatment of DMD.
We present results from experiments at the Linac Coherent Light Source (LCLS) demonstrating that serial femtosecond crystallography (SFX) can be performed to high resolution (~2.5 Å) using protein microcrystals deposited on an ultra-thin silicon nitride membrane and embedded in a preservation medium at room temperature. Data can be acquired at a high rate using x-ray free-electron laser sources to overcome radiation damage, while sample consumption is dramatically reduced compared to flowing jet methods. We achieved a peak data acquisition rate of 10 Hz with a hit rate of ~38%, indicating that a complete data set could be acquired in about one 12-hour LCLS shift using the setup described here, or in even less time using hardware optimized for fixed-target SFX. This demonstration opens the door to ultra-low sample consumption SFX using the technique of diffraction-before-destruction on proteins that exist in only small quantities and/or do not produce the copious quantities of microcrystals required for flowing jet methods.
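The throughput figures above can be sanity-checked with a rough back-of-the-envelope calculation. The sketch below assumes, optimistically, that the peak 10 Hz rate and the ~38% hit rate are sustained for a full 12-hour shift with no dead time. The function name is hypothetical, and the number of hits required for a complete data set is not stated here, so it is deliberately left out.

```python
def shift_yield(rate_hz, hit_rate, shift_hours):
    """Upper-bound estimate of shots and hits collected in one shift.

    Assumption (not from the abstract): the peak rate and hit rate hold
    for the entire shift, with no time lost to alignment or sample changes.
    """
    shots = rate_hz * shift_hours * 3600  # exposures recorded in the shift
    hits = shots * hit_rate               # exposures containing diffraction
    return shots, hits


shots, hits = shift_yield(rate_hz=10, hit_rate=0.38, shift_hours=12)
print(f"{shots} shots, about {round(hits)} hits")
# prints "432000 shots, about 164160 hits"
```

Because real shifts include dead time, this number is best read as a ceiling rather than an expected yield.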
MicroRNAs (miRNAs) are short non-coding RNAs that regulate gene output at the post-transcriptional level by targeting degenerate elements primarily in 3′ untranslated regions (3′UTRs) of mRNAs. Individual miRNAs can regulate networks of hundreds of genes, yet for the majority of miRNAs few, if any, targets are known. Misexpression of miRNAs is also a major contributor to cancer progression; thus, there is a critical need to validate miRNA targets in high throughput to understand miRNAs' contribution to tumorigenesis. Here we introduce a novel high-throughput assay to detect miRNA targets in 3′UTRs, called Luminescent Identification of Functional Elements in 3′UTRs (3′LIFE). We demonstrate the feasibility of 3′LIFE using a data set of 275 human 3′UTRs and two cancer-relevant miRNAs, let-7c and miR-10b, and compare our results to alternative methods to detect miRNA targets throughout the genome. We identify a large number of novel gene targets for these miRNAs, with only 32% of hits being bioinformatically predicted and 27% directed by non-canonical interactions. Functional analysis of target genes reveals consistent roles for each miRNA as either a tumor suppressor (let-7c) or oncogenic miRNA (miR-10b), and shows that each miRNA preferentially targets multiple genes within regulatory networks, suggesting that 3′LIFE is a rapid and sensitive method to detect miRNA targets in high throughput.
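To make the distinction between canonical and non-canonical interactions concrete, the sketch below scans a 3′UTR for the commonly used canonical seed-match classes (8mer, 7mer-m8, 7mer-A1, 6mer). It is a minimal illustration, not the prediction software or the analysis pipeline used in this work. The function names are hypothetical, the UTR is a made-up toy sequence, and the let-7c sequence is quoted from memory and should be checked against miRBase before any real use.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA string (Watson-Crick only)."""
    return "".join(COMPLEMENT[nt] for nt in reversed(rna))


def find_seed_sites(mirna, utr):
    """List (position, site_type) for canonical seed matches in a UTR.

    Both sequences are read 5' to 3'. Positions are 0-based indices of the
    first UTR nucleotide of the 6-nt core that pairs with miRNA nt 2-7.
    """
    mirna = mirna.upper().replace("T", "U")
    utr = utr.upper().replace("T", "U")
    core = reverse_complement(mirna[1:7])  # pairs with miRNA nt 2-7
    m8 = COMPLEMENT[mirna[7]]              # pairs with miRNA nt 8
    sites = []
    start = utr.find(core)
    while start != -1:
        end = start + len(core)
        has_m8 = start > 0 and utr[start - 1] == m8   # match at miRNA nt 8
        has_a1 = end < len(utr) and utr[end] == "A"   # A opposite miRNA nt 1
        if has_m8 and has_a1:
            kind = "8mer"
        elif has_m8:
            kind = "7mer-m8"
        elif has_a1:
            kind = "7mer-A1"
        else:
            kind = "6mer"
        sites.append((start, kind))
        start = utr.find(core, start + 1)
    return sites


# Assumed let-7c sequence (verify against miRBase) and a toy UTR
# constructed by hand to contain one site of each class.
LET_7C = "UGAGGUAGUAGGUUGUAUGGUU"
TOY_UTR = "GGCUACCUCAGGCUACCUCGGGGUACCUCAGGGUACCUCGGG"

print(find_seed_sites(LET_7C, TOY_UTR))
# prints [(3, '8mer'), (13, '7mer-m8'), (23, '7mer-A1'), (33, '6mer')]
```

A functional target with no site of any of these classes would fall into the non-canonical category under this simple definition; other definitions of "canonical" exist and would shift that boundary.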
Photosynthesis, a process catalysed by plants, algae and cyanobacteria, converts sunlight to energy, thus sustaining all higher life on Earth. Two large membrane protein complexes, photosystem I and II (PSI and PSII), act in series to catalyse the light-driven reactions in photosynthesis. PSII catalyses the light-driven water splitting process, which maintains the Earth’s oxygenic atmosphere. In this process, the oxygen-evolving complex (OEC) of PSII cycles through five states, S0 to S4, in which four electrons are sequentially extracted from the OEC in four light-driven charge-separation events. Here we describe time-resolved experiments on PSII nano/microcrystals from Thermosynechococcus elongatus performed with the recently developed technique of serial femtosecond crystallography. Structures have been determined from PSII in the dark S1 state and after double laser excitation (putative S3 state) at 5 and 5.5 Å resolution, respectively. The results provide evidence that PSII undergoes significant conformational changes at the electron acceptor side and at the Mn4CaO5 core of the OEC. These include an elongation of the metal cluster, accompanied by changes in the protein environment, which could allow for binding of the second substrate water molecule between the more distant protruding Mn (referred to as the ‘dangler’ Mn) and the Mn3CaOx cubane in the S2 to S3 transition, as predicted by spectroscopic and computational studies. This work shows the great potential of time-resolved serial femtosecond crystallography for the investigation of catalytic processes in biomolecules.
It has been suggested that the extended intensity profiles surrounding Bragg reflections that arise when a series of finite crystals of varying size and shape are illuminated by the intense, coherent pulses of an x-ray free-electron laser may enable the crystal’s unit-cell electron density to be obtained ab initio via well-established iterative phasing algorithms. Such a technique could have a significant impact on the field of biological structure determination since it avoids the need for a priori information from similar known structures, multiple measurements near resonant atomic absorption energies, isomorphic derivative crystals, or atomic-resolution data. Here, we demonstrate this phasing technique on diffraction patterns recorded from artificial two-dimensional microcrystals using the seeded soft x-ray free-electron laser FERMI. We show that the technique is effective when the illuminating wavefront has nonuniform phase and amplitude, and when the diffraction intensities cannot be measured uniformly throughout reciprocal space because of a limited signal-to-noise ratio.
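For readers unfamiliar with iterative phasing, the sketch below shows one of the simplest such schemes, error reduction, on a one-dimensional toy object. It is an illustration under stated assumptions and not the reconstruction procedure used in this work: the choice of error reduction, the known-support and non-negativity constraints, the toy object, and all function names are assumptions made for the example. A naive DFT is used so that the block needs only the standard library.

```python
import cmath
import random


def dft(x):
    """Naive discrete Fourier transform (adequate for tiny toy inputs)."""
    n = len(x)
    return [sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
            for k in range(n)]


def idft(X):
    """Naive inverse discrete Fourier transform."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * cmath.pi * j * k / n) for k in range(n)) / n
            for j in range(n)]


def error_reduction(magnitudes, support, n_iter=50, seed=0):
    """Alternate between Fourier-magnitude and object-space constraints.

    magnitudes: measured Fourier magnitudes (square roots of intensities).
    support:    booleans marking where the object may be non-zero.
    Returns the final estimate and the Fourier-space error per iteration.
    """
    rng = random.Random(seed)
    # Random non-negative start that already satisfies the object constraints.
    g = [rng.random() if inside else 0.0 for inside in support]
    errors = []
    for _ in range(n_iter):
        G = dft(g)
        # Mismatch between current and measured magnitudes.
        errors.append(sum((abs(Gk) - m) ** 2 for Gk, m in zip(G, magnitudes)) ** 0.5)
        # Keep the current phases, impose the measured magnitudes.
        G = [m * cmath.exp(1j * cmath.phase(Gk)) for Gk, m in zip(G, magnitudes)]
        g_new = idft(G)
        # Object constraints: real, non-negative, zero outside the support.
        g = [max(v.real, 0.0) if inside else 0.0 for v, inside in zip(g_new, support)]
    return g, errors


# Toy object: 8 non-zero samples in a 32-sample window (4x oversampling).
truth = [1.0, 3.0, 2.0, 0.5, 4.0, 1.5, 2.5, 1.0] + [0.0] * 24
support = [j < 8 for j in range(32)]
magnitudes = [abs(v) for v in dft(truth)]

estimate, errors = error_reduction(magnitudes, support)
print(f"error: first iteration {errors[0]:.3f}, last iteration {errors[-1]:.3f}")
```

Error reduction never increases this error from one iteration to the next, but it can stagnate far from the true object, which is why practical reconstructions typically use more robust variants.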