Filtering by
- Language: English
treatments, and neo-antigens are the targets of immune system in cancer patients who
respond to the treatments. The cancer vaccine field is focused on using neo-antigens from
unique point mutations of genomic sequence in the cancer patient for making
personalized cancer vaccines. However, we choose a different path to find frameshift
neo-antigens at the mRNA level and develop broadly effective cancer vaccines based on
frameshift antigens.
In this dissertation, I have summarized and characterized all the potential frameshift
antigens from microsatellite regions in human, dog and mouse. A list of frameshift
antigens was validated by PCR in tumor samples and the mutation rate was calculated for
one candidate – SEC62. I develop a method to screen the antibody response against
frameshift antigens in human and dog cancer patients by using frameshift peptide arrays.
Frameshift antigens selected by positive antibody response in cancer patients or by MHC
predictions show protection in different mouse tumor models. A dog version of the
cancer vaccine based on frameshift antigens was developed and tested in a small safety
trial. The results demonstrate that the vaccine is safe and it can induce strong B and T cell
immune responses. Further, I built the human exon junction frameshift database which
includes all possible frameshift antigens from mis-splicing events in exon junctions, and I
develop a method to find potential frameshift antigens from large cancer
immunosignature dataset with these databases. In addition, I test the idea of ‘early cancer
diagnosis, early treatment’ in a transgenic mouse cancer model. The results show that
ii
early treatment gives significantly better protection than late treatment and the correct
time point for treatment is crucial to give the best clinical benefit. A model for early
treatment is developed with these results.
Frameshift neo-antigens from microsatellite regions and mis-splicing events are
abundant at mRNA level and they are better antigens than neo-antigens from point
mutations in the genomic sequences of cancer patients in terms of high immunogenicity,
low probability to cause autoimmune diseases and low cost to develop a broadly effective
vaccine. This dissertation demonstrates the feasibility of using frameshift antigens for
cancer vaccine development.
Although emerging evidence indicates that deep-sea water contains an untapped reservoir of high metabolic and genetic diversity, this realm has not been studied well compared with surface sea water. The study provided the first integrated meta-genomic and -transcriptomic analysis of the microbial communities in deep-sea water of North Pacific Ocean. DNA/RNA amplifications and simultaneous metagenomic and metatranscriptomic analyses were employed to discover information concerning deep-sea microbial communities from four different deep-sea sites ranging from the mesopelagic to pelagic ocean. Within the prokaryotic community, bacteria is absolutely dominant (~90%) over archaea in both metagenomic and metatranscriptomic data pools. The emergence of archaeal phyla Crenarchaeota, Euryarchaeota, Thaumarchaeota, bacterial phyla Actinobacteria, Firmicutes, sub-phyla Betaproteobacteria, Deltaproteobacteria, and Gammaproteobacteria, and the decrease of bacterial phyla Bacteroidetes and Alphaproteobacteria are the main composition changes of prokaryotic communities in the deep-sea water, when compared with the reference Global Ocean Sampling Expedition (GOS) surface water. Photosynthetic Cyanobacteria exist in all four metagenomic libraries and two metatranscriptomic libraries. In Eukaryota community, decreased abundance of fungi and algae in deep sea was observed. RNA/DNA ratio was employed as an index to show metabolic activity strength of microbes in deep sea. Functional analysis indicated that deep-sea microbes are leading a defensive lifestyle.
Background: The use of culture-independent nucleic acid techniques, such as ribosomal RNA gene cloning library analysis, has unveiled the tremendous microbial diversity that exists in natural environments. In sharp contrast to this great achievement is the current difficulty in cultivating the majority of bacterial species or phylotypes revealed by molecular approaches. Although recent new technologies such as metagenomics and metatranscriptomics can provide more functionality information about the microbial communities, it is still important to develop the capacity to isolate and cultivate individual microbial species or strains in order to gain a better understanding of microbial physiology and to apply isolates for various biotechnological applications.
Results: We have developed a new system to cultivate bacteria in an array of droplets. The key component of the system is the microbe observation and cultivation array (MOCA), which consists of a Petri dish that contains an array of droplets as cultivation chambers. MOCA exploits the dominance of surface tension in small amounts of liquid to spontaneously trap cells in well-defined droplets on hydrophilic patterns. During cultivation, the growth of the bacterial cells across the droplet array can be monitored using an automated microscope, which can produce a real-time record of the growth. When bacterial cells grow to a visible microcolony level in the system, they can be transferred using a micropipette for further cultivation or analysis.
Conclusions: MOCA is a flexible system that is easy to set up, and provides the sensitivity to monitor growth of single bacterial cells. It is a cost-efficient technical platform for bioassay screening and for cultivation and isolation of bacteria from natural environments.
The unicellular microalga Haematococcus pluvialis has emerged as a promising biomass feedstock for the ketocarotenoid astaxanthin and neutral lipid triacylglycerol. Motile flagellates, resting palmella cells, and cysts are the major life cycle stages of H. pluvialis. Fast-growing motile cells are usually used to induce astaxanthin and triacylglycerol biosynthesis under stress conditions (high light or nutrient starvation); however, productivity of biomass and bioproducts are compromised due to the susceptibility of motile cells to stress. This study revealed that the Photosystem II (PSII) reaction center D1 protein, the manganese-stabilizing protein PsbO, and several major membrane glycerolipids (particularly for chloroplast membrane lipids monogalactosyldiacylglycerol and phosphatidylglycerol), decreased dramatically in motile cells under high light (HL). In contrast, palmella cells, which are transformed from motile cells after an extended period of time under favorable growth conditions, have developed multiple protective mechanisms - including reduction in chloroplast membrane lipids content, downplay of linear photosynthetic electron transport, and activating nonphotochemical quenching mechanisms - while accumulating triacylglycerol. Consequently, the membrane lipids and PSII proteins (D1 and PsbO) remained relatively stable in palmella cells subjected to HL. Introducing palmella instead of motile cells to stress conditions may greatly increase astaxanthin and lipid production in H. pluvialis culture.
Background: Immunosignaturing is a new peptide microarray based technology for profiling of humoral immune responses. Despite new challenges, immunosignaturing gives us the opportunity to explore new and fundamentally different research questions. In addition to classifying samples based on disease status, the complex patterns and latent factors underlying immunosignatures, which we attempt to model, may have a diverse range of applications.
Methods: We investigate the utility of a number of statistical methods to determine model performance and address challenges inherent in analyzing immunosignatures. Some of these methods include exploratory and confirmatory factor analyses, classical significance testing, structural equation and mixture modeling.
Results: We demonstrate an ability to classify samples based on disease status and show that immunosignaturing is a very promising technology for screening and presymptomatic screening of disease. In addition, we are able to model complex patterns and latent factors underlying immunosignatures. These latent factors may serve as biomarkers for disease and may play a key role in a bioinformatic method for antibody discovery.
Conclusion: Based on this research, we lay out an analytic framework illustrating how immunosignatures may be useful as a general method for screening and presymptomatic screening of disease as well as antibody discovery.
Background: Microarray image analysis processes scanned digital images of hybridized arrays to produce the input spot-level data for downstream analysis, so it can have a potentially large impact on those and subsequent analysis. Signal saturation is an optical effect that occurs when some pixel values for highly expressed genes or peptides exceed the upper detection threshold of the scanner software (216 - 1 = 65, 535 for 16-bit images). In practice, spots with a sizable number of saturated pixels are often flagged and discarded. Alternatively, the saturated values are used without adjustments for estimating spot intensities. The resulting expression data tend to be biased downwards and can distort high-level analysis that relies on these data. Hence, it is crucial to effectively correct for signal saturation.
Results: We developed a flexible mixture model-based segmentation and spot intensity estimation procedure that accounts for saturated pixels by incorporating a censored component in the mixture model. As demonstrated with biological data and simulation, our method extends the dynamic range of expression data beyond the saturation threshold and is effective in correcting saturation-induced bias when the lost information is not tremendous. We further illustrate the impact of image processing on downstream classification, showing that the proposed method can increase diagnostic accuracy using data from a lymphoma cancer diagnosis study.
Conclusions: The presented method adjusts for signal saturation at the segmentation stage that identifies a pixel as part of the foreground, background or other. The cluster membership of a pixel can be altered versus treating saturated values as truly observed. Thus, the resulting spot intensity estimates may be more accurate than those obtained from existing methods that correct for saturation based on already segmented data. As a model-based segmentation method, our procedure is able to identify inner holes, fuzzy edges and blank spots that are common in microarray images. The approach is independent of microarray platform and applicable to both single- and dual-channel microarrays.
Background: Heterogeneity within cell populations is relevant to the onset and progression of disease, as well as development and maintenance of homeostasis. Analysis and understanding of the roles of heterogeneity in biological systems require methods and technologies that are capable of single cell resolution. Single cell gene expression analysis by RT-qPCR is an established technique for identifying transcriptomic heterogeneity in cellular populations, but it generally requires specialized equipment or tedious manipulations for cell isolation.
Results: We describe the optimization of a simple, inexpensive and rapid pipeline which includes isolation and culture of live single cells as well as fluorescence microscopy and gene expression analysis of the same single cells by RT-qPCR. We characterize the efficiency of single cell isolation and demonstrate our method by identifying single GFP-expressing cells from a mixed population of GFP-positive and negative cells by correlating fluorescence microscopy and RT-qPCR.
Conclusions: Single cell gene expression analysis by RT-qPCR is a convenient means for investigating cellular heterogeneity, but is most useful when correlating observations with additional measurements. We demonstrate a convenient and simple pipeline for multiplexing single cell RT-qPCR with fluorescence microscopy which is adaptable to other molecular analyses.
Background: High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and others. Typically one trains a classification system by gathering large amounts of probe-level data, selecting informative features, and classifies test samples using a small number of features. As new microarrays are invented, classification systems that worked well for other array types may not be ideal. Expression microarrays, arguably one of the most prevalent array types, have been used for years to help develop classification algorithms. Many biological assumptions are built into classifiers that were designed for these types of data. One of the more problematic is the assumption of independence, both at the probe level and again at the biological level. Probes for RNA transcripts are designed to bind single transcripts. At the biological level, many genes have dependencies across transcriptional pathways where co-regulation of transcriptional units may make many genes appear as being completely dependent. Thus, algorithms that perform well for gene expression data may not be suitable when other technologies with different binding characteristics exist. The immunosignaturing microarray is based on complex mixtures of antibodies binding to arrays of random sequence peptides. It relies on many-to-many binding of antibodies to the random sequence peptides. Each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. However, it is not clear what is the optimal classification algorithm for analyzing this new type of data.
Results: We characterized several classification algorithms to analyze immunosignaturing data. We selected several datasets that range from easy to difficult to classify, from simple monoclonal binding to complex binding patterns in asthma patients. We then classified the biological samples using 17 different classification algorithms. Using a wide variety of assessment criteria, we found ‘Naïve Bayes’ far more useful than other widely used methods due to its simplicity, robustness, speed and accuracy.
Conclusions: ‘Naïve Bayes’ algorithm appears to accommodate the complex patterns hidden within multilayered immunosignaturing microarray data due to its fundamental mathematical properties.