Recent transcriptome data from yeast, worm, plants, and humans has shown that alternative polyadenylation (APA), a mechanism that enables expression of multiple 3′UTR isoforms for the same gene, is widespread in eukaryotic organisms. It is still poorly understood why metazoans require multiple 3′UTRs for the same gene, but accumulating evidence suggests that APA is largely regulated at a tissue-specific level. APA may direct combinatorial variation between cis-elements and microRNAs, perhaps to regulate gene expression in a tissue-specific manner. Apart from a few single-gene anecdotes, this idea has not been systematically explored.
This dissertation research employs a systems biology approach to study the somatic tissue dynamics of APA and its impact on microRNA targeting networks in the small nematode C. elegans. In the first aim, tools were developed and applied to isolate and sequence mRNA from worm intestine and muscle tissues, which revealed pervasive tissue-specific APA correlated with microRNA regulation. The second aim provides genetic evidence that two worm genes use APA to escape repression by microRNAs in the body muscle. Finally, in the third aim, mRNA from five additional somatic worm tissues was sequenced and its 3′ ends mapped, allowing for an integrative study of APA and microRNA targeting dynamics in worms. Together, this work provides evidence that APA is a pervasive mechanism operating in somatic tissues of C. elegans with the potential to significantly rearrange their microRNA regulatory networks and precisely dose their gene expression.
This project used computational tools to analyze large data sets and interpreted the results from historical and philosophical perspectives. Tools from scientometrics, corpus linguistics, text-based analysis, network analysis, and GIS analysis were applied to more than 9000 articles (metadata and text) on systems biology. The application of these tools to an HPS project represents a novel approach.
The dissertation shows that systems biology has transitioned from a more mathematical, computational, and engineering-oriented discipline focusing on modeling to a more biology-oriented discipline that uses modeling as a means to address real biological problems. The results also show that bioengineering and medical research have increased within systems biology. This is reflected in the increasing centrality over time of biology-related concepts such as cancer. The dissertation also compares the development of systems biology in China with that in some other parts of the world and reveals regional differences, such as a unique trajectory of systems biology in China related to a focus on traditional Chinese medicine.
This dissertation adds to the historiography of modern biology, in which few studies have focused on systems biology compared with the history of molecular biology and evolutionary biology.
The first chapter revisits some of the key experiments that contributed to the development of the repression model of genetic regulation in the lac operon and concludes that the early research on gene expression and genetic regulation depicts an iterative and integrative process, which was neither reductionist nor holist. In doing so, it challenges a common application of a conceptual framework in the history of biology and offers an alternative framework. The second chapter argues that the concept of emergence in the history and philosophy of biology is too ambiguous to account for the current research in post-genomic molecular biology and that it is often erroneously used to argue against some reductionist theses. The third chapter investigates the use of network representations of gene expression in developmental evolution research and takes up some of the conceptual and methodological problems it has generated. The concluding comments present potential avenues for future research arising from each substantive chapter.
In sum, this dissertation argues that the epistemic practices of gene expression research are an iterative and integrative process, which produces theoretical representations of the complex interactions in gene expression as networks. Moreover, conceptualizing these interactions as networks constrains empirical research strategies by the limited number of ways in which gene expression can be controlled through general rules of network interactions. Making these strategies explicit helps to clarify how they can explain the dynamic and adaptive features of genomes.
Many drugs are effective in the early stage of treatment, but patients develop drug resistance after a certain period of treatment, causing the therapy to fail. An important example is Herceptin, a popular monoclonal antibody drug that treats breast cancer by specifically targeting human epidermal growth factor receptor 2 (Her2). Here we demonstrate a quantitative binding kinetics analysis of drug-target interactions to investigate the molecular-scale origin of drug resistance. Using surface plasmon resonance imaging, we measured the in situ Herceptin-Her2 binding kinetics in single intact cancer cells for the first time, and observed significantly weakened Herceptin-Her2 interactions in Herceptin-resistant cells compared with those in Herceptin-sensitive cells. We further showed that the steric hindrance of Mucin-4, a membrane protein, was responsible for the altered drug-receptor binding. This effect of a third molecule on drug-receptor interactions cannot be studied using traditional purified protein methods, demonstrating the importance of the present intact cell-based binding kinetics analysis.
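As a rough illustration of what a binding kinetics analysis quantifies, the sketch below assumes a simple 1:1 (Langmuir) binding model, which is a common first approximation for surface plasmon resonance data; the abstract does not state which model was actually fitted. The function names and all numerical values are hypothetical and chosen only for illustration. Under this assumption, weaker binding corresponds to a larger dissociation constant K_D = k_off / k_on.

```python
import math


def dissociation_constant(k_on, k_off):
    """Equilibrium dissociation constant K_D = k_off / k_on (in M).

    k_on  : association rate constant (1/(M*s))
    k_off : dissociation rate constant (1/s)
    """
    return k_off / k_on


def association_response(t, conc, k_on, k_off, r_max):
    """Signal during the association phase of an assumed 1:1 binding model.

    R(t) = R_eq * (1 - exp(-(k_on * C + k_off) * t)),
    where R_eq = R_max * C / (C + K_D).
    """
    k_obs = k_on * conc + k_off
    r_eq = r_max * conc / (conc + dissociation_constant(k_on, k_off))
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation_response(t, r0, k_off):
    """Signal during the dissociation phase: R(t) = R0 * exp(-k_off * t)."""
    return r0 * math.exp(-k_off * t)


if __name__ == "__main__":
    # Hypothetical rate constants, not values from the study.
    k_on, k_off = 1e5, 1e-4
    print("K_D (M):", dissociation_constant(k_on, k_off))
    print("R at t=300 s:", association_response(300.0, 1e-8, k_on, k_off, 100.0))
```

In this picture, a comparison between two cell populations would come down to fitting k_on and k_off for each and comparing the resulting K_D values.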
Lineage-committed cells of many tissues exhibit substantial plasticity in contexts such as wound healing and tumorigenesis, but the regulation of this process is not well understood. We identified the Hippo transducer WWTR1/TAZ in a screen of transcription factors that are able to prompt lineage switching of mammary epithelial cells. Forced expression of TAZ in luminal cells induces them to adopt basal characteristics, and depletion of TAZ in basal and/or myoepithelial cells leads to luminal differentiation. In human and mouse tissues, TAZ is active only in basal cells and is critical for basal cell maintenance during homeostasis. Accordingly, loss of TAZ affects mammary gland development, leading to an imbalance of luminal and basal populations as well as branching defects. Mechanistically, TAZ interacts with components of the SWI/SNF complex to modulate lineage-specific gene expression. Collectively, these findings uncover a new role for Hippo signaling in the determination of lineage identity through recruitment of chromatin-remodeling complexes.
Extraintestinal pathogenic Escherichia coli (ExPEC) strains are important pathogens that cause diverse diseases in humans and poultry. Some E. coli isolates from chicken feces contain ExPEC-associated virulence genes, so appear potentially pathogenic; they conceivably could be transmitted to humans through handling and/or consumption of contaminated meat. However, the actual extraintestinal virulence potential of chicken-source fecal E. coli is poorly understood. Here, we assessed whether fecal E. coli isolates from healthy production chickens could cause diseases in a chicken model of avian colibacillosis and three rodent models of ExPEC-associated human infections. From 304 E. coli isolates from chicken fecal samples, 175 E. coli isolates were screened by PCR for virulence genes associated with human-source ExPEC or avian pathogenic E. coli (APEC), an ExPEC subset that causes extraintestinal infections in poultry. Selected isolates genetically identified as ExPEC and non-ExPEC isolates were assessed in vitro for virulence-associated phenotypes, and in vivo for disease-causing ability in animal models of colibacillosis, sepsis, meningitis, and urinary tract infection. Among the study isolates, 13% (40/304) were identified as ExPEC; the majority of these were classified as APEC and uropathogenic E. coli, but none as neonatal meningitis E. coli. Multiple chicken-source fecal ExPEC isolates resembled avian and human clinical ExPEC isolates in causing one or more ExPEC-associated illnesses in experimental animal infection models. Additionally, some isolates that were classified as non-ExPEC were able to cause ExPEC-associated illnesses in animal models, and thus future studies are needed to elucidate their mechanisms of virulence. These findings show that E. coli isolates from chicken feces contain ExPEC-associated genes, exhibit ExPEC-associated in vitro phenotypes, and can cause ExPEC-associated infections in animal models, and thus may pose a health threat to poultry and consumers.