treatments, and neo-antigens are the targets of immune system in cancer patients who
respond to the treatments. The cancer vaccine field is focused on using neo-antigens from
unique point mutations of genomic sequence in the cancer patient for making
personalized cancer vaccines. However, we choose a different path to find frameshift
neo-antigens at the mRNA level and develop broadly effective cancer vaccines based on
frameshift antigens.
In this dissertation, I have summarized and characterized all the potential frameshift
antigens from microsatellite regions in human, dog and mouse. A list of frameshift
antigens was validated by PCR in tumor samples and the mutation rate was calculated for
one candidate – SEC62. I develop a method to screen the antibody response against
frameshift antigens in human and dog cancer patients by using frameshift peptide arrays.
Frameshift antigens selected by positive antibody response in cancer patients or by MHC
predictions show protection in different mouse tumor models. A dog version of the
cancer vaccine based on frameshift antigens was developed and tested in a small safety
trial. The results demonstrate that the vaccine is safe and it can induce strong B and T cell
immune responses. Further, I built the human exon junction frameshift database which
includes all possible frameshift antigens from mis-splicing events in exon junctions, and I
develop a method to find potential frameshift antigens from large cancer
immunosignature dataset with these databases. In addition, I test the idea of ‘early cancer
diagnosis, early treatment’ in a transgenic mouse cancer model. The results show that
ii
early treatment gives significantly better protection than late treatment and the correct
time point for treatment is crucial to give the best clinical benefit. A model for early
treatment is developed with these results.
Frameshift neo-antigens from microsatellite regions and mis-splicing events are
abundant at mRNA level and they are better antigens than neo-antigens from point
mutations in the genomic sequences of cancer patients in terms of high immunogenicity,
low probability to cause autoimmune diseases and low cost to develop a broadly effective
vaccine. This dissertation demonstrates the feasibility of using frameshift antigens for
cancer vaccine development.
Background: Immunosignaturing is a new peptide microarray based technology for profiling of humoral immune responses. Despite new challenges, immunosignaturing gives us the opportunity to explore new and fundamentally different research questions. In addition to classifying samples based on disease status, the complex patterns and latent factors underlying immunosignatures, which we attempt to model, may have a diverse range of applications.
Methods: We investigate the utility of a number of statistical methods to determine model performance and address challenges inherent in analyzing immunosignatures. Some of these methods include exploratory and confirmatory factor analyses, classical significance testing, structural equation and mixture modeling.
Results: We demonstrate an ability to classify samples based on disease status and show that immunosignaturing is a very promising technology for screening and presymptomatic screening of disease. In addition, we are able to model complex patterns and latent factors underlying immunosignatures. These latent factors may serve as biomarkers for disease and may play a key role in a bioinformatic method for antibody discovery.
Conclusion: Based on this research, we lay out an analytic framework illustrating how immunosignatures may be useful as a general method for screening and presymptomatic screening of disease as well as antibody discovery.
Background: Microarray image analysis processes scanned digital images of hybridized arrays to produce the input spot-level data for downstream analysis, so it can have a potentially large impact on those and subsequent analysis. Signal saturation is an optical effect that occurs when some pixel values for highly expressed genes or peptides exceed the upper detection threshold of the scanner software (216 - 1 = 65, 535 for 16-bit images). In practice, spots with a sizable number of saturated pixels are often flagged and discarded. Alternatively, the saturated values are used without adjustments for estimating spot intensities. The resulting expression data tend to be biased downwards and can distort high-level analysis that relies on these data. Hence, it is crucial to effectively correct for signal saturation.
Results: We developed a flexible mixture model-based segmentation and spot intensity estimation procedure that accounts for saturated pixels by incorporating a censored component in the mixture model. As demonstrated with biological data and simulation, our method extends the dynamic range of expression data beyond the saturation threshold and is effective in correcting saturation-induced bias when the lost information is not tremendous. We further illustrate the impact of image processing on downstream classification, showing that the proposed method can increase diagnostic accuracy using data from a lymphoma cancer diagnosis study.
Conclusions: The presented method adjusts for signal saturation at the segmentation stage that identifies a pixel as part of the foreground, background or other. The cluster membership of a pixel can be altered versus treating saturated values as truly observed. Thus, the resulting spot intensity estimates may be more accurate than those obtained from existing methods that correct for saturation based on already segmented data. As a model-based segmentation method, our procedure is able to identify inner holes, fuzzy edges and blank spots that are common in microarray images. The approach is independent of microarray platform and applicable to both single- and dual-channel microarrays.