Dense non-natural sequence peptide microarrays for epitope mapping and diagnostics

Description

The healthcare system in this country is currently unacceptable. New technologies may contribute to reducing cost and improving outcomes. Early diagnosis and treatment represent the least risky option for addressing this issue. Such a technology needs to be inexpensive, highly sensitive, highly specific, and amenable to adoption in a clinic. This thesis explores an immunodiagnostic technology based on highly scalable, non-natural sequence peptide microarrays designed to profile the humoral immune response and address the healthcare problem. The primary aim of this thesis is to explore the ability of these arrays to map continuous (linear) epitopes. I discovered that, using a technique termed subsequence analysis, epitopes could be decisively mapped to an eliciting protein with a high success rate. This led to the discovery of novel linear epitopes from Plasmodium falciparum (malaria) and Treponema pallidum (syphilis), as well as validation of previously discovered epitopes in Dengue and monoclonal antibodies. Next, I developed and tested a classification scheme based on Support Vector Machines for development of a Dengue Fever diagnostic, achieving higher sensitivity and specificity than current FDA-approved techniques. The software underlying this method is available for download under the BSD license. Following this, I developed a kinetic model for immunosignatures and tested it against existing data driven by previously unexplained phenomena. This model provides a framework and informs ways to optimize the platform for maximum stability and efficiency. I also explored the role of sequence composition in explaining an immunosignature binding profile, determining a strong role for charged residues that seems to have some predictive ability for disease. Finally, I developed a database, software and indexing strategy based on Apache Lucene for searching motif patterns (regular expressions) in large biological databases. These projects as a whole have advanced knowledge of how to approach high-throughput immunodiagnostics and provide an example of how technology can be fused with biology in order to affect scientific and health outcomes.
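To make the idea of searching motif patterns as regular expressions concrete, the following is a minimal sketch. It scans sequences linearly with Python's standard `re` module, so it does not reproduce the Lucene-based indexing strategy described in the thesis. The function name `find_motif`, the toy sequences, and the motif itself are all hypothetical.

```python
import re

def find_motif(pattern, sequences):
    """Return (sequence_id, start, matched_text) for each non-overlapping match."""
    compiled = re.compile(pattern)
    hits = []
    for seq_id, seq in sequences.items():
        for match in compiled.finditer(seq):
            hits.append((seq_id, match.start(), match.group()))
    return hits

# Hypothetical toy protein sequences (one-letter amino acid codes).
proteins = {
    "protA": "MKTAYIAKQRDGSGKLLE",
    "protB": "GGSRDESAKWWRDPTAK",
}

# Hypothetical motif: R, D, any residue, then S or T.
hits = find_motif(r"RD.[ST]", proteins)
for seq_id, start, text in hits:
    print(seq_id, start, text)
```

A linear scan like this is adequate for small collections; for large databases an index is needed to avoid reading every sequence per query, which is the problem the thesis addresses.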

Contributors: Richer, Joshua Amos (Author) / Johnston, Stephen A. (Thesis advisor) / Woodbury, Neal (Committee member) / Stafford, Phillip (Committee member) / Papandreou-Suppappola, Antonia (Committee member) / Arizona State University (Publisher)

Created: 2014

Use of large, immunosignature databases to pose new questions about infection and health status

Description

Immunosignature is a technology that retrieves information from the immune system. The technology is based on microarrays with peptides chosen from random sequence space. My thesis focuses on improving the Immunosignature platform and using Immunosignatures to improve the diagnosis of diseases. I first contributed to the optimization of the immunosignature platform by introducing scoring metrics to select optimal parameters, considering performance as well as practicality. Next, I worked primarily on identifying a signature shared across various pathogens that can distinguish them from the healthy population. I further retrieved consensus epitopes from the disease-common signature and, by studying the enrichment of the common signature in the pathogen proteomes, proposed that most pathogens could share the signature. Following this, I studied cancer samples from different stages and correlated the immune response with whether the epitope presented by the tumor is similar to the pathogen proteome. An effective immune response is defined as an antibody titer that increases and then decreases, suggesting elimination of the epitope. I found that an effective immune response usually correlates with epitopes that are more similar to pathogens. This suggests that the immune system might occupy a limited space and can be effective against only certain epitopes that have similarity with pathogens. I then participated in the attempt to solve the antibiotic resistance problem by developing a classification algorithm that can distinguish bacterial from viral infection. This algorithm outperforms other currently available classification methods. Finally, I worked on the concept of deriving a single number to represent all the data on the immunosignature platform. This resembles the concept of body temperature, which is an approximate measurement of whether an individual is healthy. The measure of Immune Entropy was found to work best as a single measurement to describe the immune system information derived from the immunosignature. Entropy is relatively invariant in the healthy population, but shows significant differences when comparing healthy donors with patients who are either infected with a pathogen or have cancer.
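The idea of collapsing an entire array into one number can be sketched with Shannon entropy. This is only an illustration: the abstract does not give the exact definition of Immune Entropy, so treating it as the Shannon entropy of normalized, non-negative peptide intensities is an assumption, and the intensity values below are hypothetical.

```python
import math

def shannon_entropy(intensities):
    """Shannon entropy (in bits) of intensities normalized to sum to 1.

    Assumes non-negative intensities; zero entries contribute nothing.
    """
    total = sum(intensities)
    if total <= 0:
        raise ValueError("intensities must have a positive sum")
    entropy = 0.0
    for value in intensities:
        if value > 0:
            p = value / total
            entropy -= p * math.log2(p)
    return entropy

# Hypothetical arrays of 8 peptides each.
uniform = shannon_entropy([100.0] * 8)          # signal spread evenly
skewed = shannon_entropy([793.0] + [1.0] * 7)   # one dominant peptide
print(uniform, skewed)
```

Under this assumed definition, an evenly spread binding profile gives the maximum value (log2 of the number of peptides), and a profile dominated by a few peptides gives a lower value.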

Contributors: Wang, Lu (Author) / Johnston, Stephen (Thesis advisor) / Stafford, Phillip (Committee member) / Buetow, Kenneth (Committee member) / McFadden, Grant (Committee member) / Arizona State University (Publisher)

Created: 2018

Frameshift antigens for cancer vaccine development

Description

Immunotherapy has been revitalized with the advent of immune checkpoint blockade treatments, and neo-antigens are the targets of the immune system in cancer patients who respond to the treatments. The cancer vaccine field is focused on using neo-antigens from unique point mutations in the genomic sequence of the cancer patient to make personalized cancer vaccines. However, we chose a different path: finding frameshift neo-antigens at the mRNA level and developing broadly effective cancer vaccines based on frameshift antigens.

In this dissertation, I have summarized and characterized all the potential frameshift antigens from microsatellite regions in human, dog and mouse. A list of frameshift antigens was validated by PCR in tumor samples, and the mutation rate was calculated for one candidate, SEC62. I developed a method to screen the antibody response against frameshift antigens in human and dog cancer patients using frameshift peptide arrays. Frameshift antigens selected by positive antibody response in cancer patients or by MHC predictions show protection in different mouse tumor models. A dog version of the cancer vaccine based on frameshift antigens was developed and tested in a small safety trial. The results demonstrate that the vaccine is safe and can induce strong B and T cell immune responses. Further, I built the human exon junction frameshift database, which includes all possible frameshift antigens from mis-splicing events in exon junctions, and I developed a method to find potential frameshift antigens from large cancer immunosignature datasets with these databases. In addition, I tested the idea of ‘early cancer diagnosis, early treatment’ in a transgenic mouse cancer model. The results show that early treatment gives significantly better protection than late treatment and that the correct time point for treatment is crucial to give the best clinical benefit. A model for early treatment was developed from these results.

Frameshift neo-antigens from microsatellite regions and mis-splicing events are abundant at the mRNA level, and they are better antigens than neo-antigens from point mutations in the genomic sequences of cancer patients in terms of high immunogenicity, low probability of causing autoimmune diseases and low cost of developing a broadly effective vaccine. This dissertation demonstrates the feasibility of using frameshift antigens for cancer vaccine development.

Contributors: Zhang, Jian (Author) / Johnston, Stephen Albert (Thesis advisor) / Chang, Yung (Committee member) / Stafford, Phillip (Committee member) / Chen, Qiang (Committee member) / Arizona State University (Publisher)

Created: 2018

Integrated Metagenomic and Metatranscriptomic Analyses of Microbial Communities in the Meso- and Bathypelagic Realm of North Pacific Ocean

Description

Although emerging evidence indicates that deep-sea water contains an untapped reservoir of high metabolic and genetic diversity, this realm has not been studied well compared with surface sea water. The study provided the first integrated metagenomic and metatranscriptomic analysis of the microbial communities in deep-sea water of the North Pacific Ocean. DNA/RNA amplifications and simultaneous metagenomic and metatranscriptomic analyses were employed to discover information concerning deep-sea microbial communities from four different deep-sea sites ranging from the mesopelagic to pelagic ocean. Within the prokaryotic community, bacteria are dominant (~90%) over archaea in both metagenomic and metatranscriptomic data pools. The emergence of archaeal phyla Crenarchaeota, Euryarchaeota, Thaumarchaeota, bacterial phyla Actinobacteria, Firmicutes, sub-phyla Betaproteobacteria, Deltaproteobacteria, and Gammaproteobacteria, and the decrease of bacterial phyla Bacteroidetes and Alphaproteobacteria are the main composition changes of prokaryotic communities in the deep-sea water, when compared with the reference Global Ocean Sampling Expedition (GOS) surface water. Photosynthetic Cyanobacteria exist in all four metagenomic libraries and two metatranscriptomic libraries. In the eukaryotic community, a decreased abundance of fungi and algae in the deep sea was observed. The RNA/DNA ratio was employed as an index of the metabolic activity of microbes in the deep sea. Functional analysis indicated that deep-sea microbes are leading a defensive lifestyle.
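As a rough illustration of using an RNA/DNA ratio as an activity index, the sketch below divides each taxon's share of metatranscriptomic reads by its share of metagenomic reads. The abstract does not say how the ratio was computed, so this normalization is an assumption, and the taxon names and read counts are hypothetical.

```python
def rna_dna_ratio(dna_counts, rna_counts):
    """Per-taxon ratio of relative RNA abundance to relative DNA abundance.

    Taxa with zero DNA reads are skipped because the ratio is undefined.
    """
    dna_total = sum(dna_counts.values())
    rna_total = sum(rna_counts.values())
    if dna_total <= 0 or rna_total <= 0:
        raise ValueError("both libraries must contain reads")
    ratios = {}
    for taxon, dna in dna_counts.items():
        if dna == 0:
            continue
        dna_fraction = dna / dna_total
        rna_fraction = rna_counts.get(taxon, 0) / rna_total
        ratios[taxon] = rna_fraction / dna_fraction
    return ratios

# Hypothetical read counts assigned to three placeholder taxa.
dna = {"taxon_a": 500, "taxon_b": 100, "taxon_c": 400}
rna = {"taxon_a": 300, "taxon_b": 150, "taxon_c": 50}

ratios = rna_dna_ratio(dna, rna)
for taxon, ratio in ratios.items():
    print(taxon, round(ratio, 2))
```

In this sketch a ratio above 1 means a taxon contributes more to the transcript pool than to the gene pool, which is one simple way to read relative activity.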

Contributors: Wu, Jieying (Author) / Gao, Weimin (Author) / Johnson, Roger (Author) / Zhang, Weiwen (Author) / Meldrum, Deirdre (Author) / Biodesign Institute (Contributor)

Created: 2013-10-11

Microbe Observation and Cultivation Array (MOCA) for Cultivating and Analyzing Environmental Microbiota

Description

Background: The use of culture-independent nucleic acid techniques, such as ribosomal RNA gene cloning library analysis, has unveiled the tremendous microbial diversity that exists in natural environments. In sharp contrast to this great achievement is the current difficulty in cultivating the majority of bacterial species or phylotypes revealed by molecular approaches. Although recent technologies such as metagenomics and metatranscriptomics can provide more functional information about microbial communities, it is still important to develop the capacity to isolate and cultivate individual microbial species or strains in order to gain a better understanding of microbial physiology and to apply isolates in various biotechnological applications.

Results: We have developed a new system to cultivate bacteria in an array of droplets. The key component of the system is the microbe observation and cultivation array (MOCA), which consists of a Petri dish that contains an array of droplets as cultivation chambers. MOCA exploits the dominance of surface tension in small amounts of liquid to spontaneously trap cells in well-defined droplets on hydrophilic patterns. During cultivation, the growth of the bacterial cells across the droplet array can be monitored using an automated microscope, which can produce a real-time record of the growth. When bacterial cells grow to a visible microcolony level in the system, they can be transferred using a micropipette for further cultivation or analysis.

Conclusions: MOCA is a flexible system that is easy to set up, and provides the sensitivity to monitor growth of single bacterial cells. It is a cost-efficient technical platform for bioassay screening and for cultivation and isolation of bacteria from natural environments.

Contributors: Gao, Weimin (Author) / Navarroli, Dena (Author) / Naimark, Jared (Author) / Zhang, Weiwen (Author) / Chao, Shih-hui (Author) / Meldrum, Deirdre (Author) / Biodesign Institute (Contributor)

Created: 2013-01-09

Cellular Capacities for High-Light Acclimation and Changing Lipid Profiles Across Life Cycle Stages of the Green Alga Haematococcus Pluvialis

Description

The unicellular microalga Haematococcus pluvialis has emerged as a promising biomass feedstock for the ketocarotenoid astaxanthin and neutral lipid triacylglycerol. Motile flagellates, resting palmella cells, and cysts are the major life cycle stages of H. pluvialis. Fast-growing motile cells are usually used to induce astaxanthin and triacylglycerol biosynthesis under stress conditions (high light or nutrient starvation); however, the productivity of biomass and bioproducts is compromised due to the susceptibility of motile cells to stress. This study revealed that the Photosystem II (PSII) reaction center D1 protein, the manganese-stabilizing protein PsbO, and several major membrane glycerolipids (particularly the chloroplast membrane lipids monogalactosyldiacylglycerol and phosphatidylglycerol) decreased dramatically in motile cells under high light (HL). In contrast, palmella cells, which are transformed from motile cells after an extended period of time under favorable growth conditions, have developed multiple protective mechanisms, including reduction in chloroplast membrane lipid content, downplay of linear photosynthetic electron transport, and activation of nonphotochemical quenching mechanisms, while accumulating triacylglycerol. Consequently, the membrane lipids and PSII proteins (D1 and PsbO) remained relatively stable in palmella cells subjected to HL. Introducing palmella instead of motile cells to stress conditions may greatly increase astaxanthin and lipid production in H. pluvialis culture.

Contributors: Wang, Baobei (Author) / Zhang, Zhen (Author) / Hu, Qiang (Author) / Sommerfeld, Milton (Author) / Lu, Yinghua (Author) / Han, Danxiang (Author) / College of Liberal Arts and Sciences (Contributor)

Created: 2014-09-15

Statistical Methods for Analyzing Immunosignatures

Description

Background: Immunosignaturing is a new peptide microarray based technology for profiling of humoral immune responses. Despite new challenges, immunosignaturing gives us the opportunity to explore new and fundamentally different research questions. In addition to classifying samples based on disease status, the complex patterns and latent factors underlying immunosignatures, which we attempt to model, may have a diverse range of applications.

Methods: We investigate the utility of a number of statistical methods to determine model performance and address challenges inherent in analyzing immunosignatures. Some of these methods include exploratory and confirmatory factor analyses, classical significance testing, structural equation and mixture modeling.

Results: We demonstrate an ability to classify samples based on disease status and show that immunosignaturing is a very promising technology for screening and presymptomatic screening of disease. In addition, we are able to model complex patterns and latent factors underlying immunosignatures. These latent factors may serve as biomarkers for disease and may play a key role in a bioinformatic method for antibody discovery.

Conclusion: Based on this research, we lay out an analytic framework illustrating how immunosignatures may be useful as a general method for screening and presymptomatic screening of disease as well as antibody discovery.

Contributors: Brown, Justin (Author) / Stafford, Phillip (Author) / Johnston, Stephen (Author) / Dinu, Valentin (Author) / College of Health Solutions (Contributor)

Created: 2011-08-19

Segmentation and Intensity Estimation for Microarray Images With Saturated Pixels

Description

Background: Microarray image analysis processes scanned digital images of hybridized arrays to produce the input spot-level data for downstream analysis, so it can have a potentially large impact on that and subsequent analyses. Signal saturation is an optical effect that occurs when some pixel values for highly expressed genes or peptides exceed the upper detection threshold of the scanner software (2¹⁶ - 1 = 65,535 for 16-bit images). In practice, spots with a sizable number of saturated pixels are often flagged and discarded. Alternatively, the saturated values are used without adjustments for estimating spot intensities. The resulting expression data tend to be biased downwards and can distort high-level analysis that relies on these data. Hence, it is crucial to effectively correct for signal saturation.

Results: We developed a flexible mixture model-based segmentation and spot intensity estimation procedure that accounts for saturated pixels by incorporating a censored component in the mixture model. As demonstrated with biological data and simulation, our method extends the dynamic range of expression data beyond the saturation threshold and is effective in correcting saturation-induced bias when the lost information is not tremendous. We further illustrate the impact of image processing on downstream classification, showing that the proposed method can increase diagnostic accuracy using data from a lymphoma cancer diagnosis study.

Conclusions: The presented method adjusts for signal saturation at the segmentation stage, which identifies a pixel as part of the foreground, background or other. Compared with treating saturated values as truly observed, this can change the cluster membership of a pixel. Thus, the resulting spot intensity estimates may be more accurate than those obtained from existing methods that correct for saturation based on already segmented data. As a model-based segmentation method, our procedure is able to identify inner holes, fuzzy edges and blank spots that are common in microarray images. The approach is independent of microarray platform and applicable to both single- and dual-channel microarrays.
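The downward bias caused by saturation can be shown with a few lines of arithmetic. The sketch below only clips hypothetical pixel values at the 16-bit ceiling and compares the naive spot mean with the true mean; it does not implement the censored mixture model described above, and the helper names and pixel values are assumptions made for illustration.

```python
SATURATION = 2**16 - 1  # 65,535, the ceiling for 16-bit images

def clip_to_scanner(true_values):
    """Simulate the scanner: values above the ceiling are recorded as the ceiling."""
    return [min(value, SATURATION) for value in true_values]

def saturated_fraction(pixels):
    """Fraction of pixels recorded at the saturation ceiling."""
    return sum(1 for p in pixels if p >= SATURATION) / len(pixels)

# Hypothetical true intensities for the pixels of one bright spot.
true_spot = [60000, 64000, 70000, 80000]
observed = clip_to_scanner(true_spot)

frac = saturated_fraction(observed)
naive_mean = sum(observed) / len(observed)
true_mean = sum(true_spot) / len(true_spot)
print(frac, naive_mean, true_mean)
```

Here half the pixels are censored, and the naive mean falls below the true mean, which is the bias a saturation-aware estimator tries to correct.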

Contributors: Yang, Yan (Author) / Stafford, Phillip (Author) / Kim, YoonJoo (Author) / College of Liberal Arts and Sciences (Contributor)

Created: 2011-11-30

A Convenient, Optimized Pipeline for Isolation, Fluorescence Microscopy and Molecular Analysis of Live Single Cells

Description

Background: Heterogeneity within cell populations is relevant to the onset and progression of disease, as well as development and maintenance of homeostasis. Analysis and understanding of the roles of heterogeneity in biological systems require methods and technologies that are capable of single cell resolution. Single cell gene expression analysis by RT-qPCR is an established technique for identifying transcriptomic heterogeneity in cellular populations, but it generally requires specialized equipment or tedious manipulations for cell isolation.

Results: We describe the optimization of a simple, inexpensive and rapid pipeline which includes isolation and culture of live single cells as well as fluorescence microscopy and gene expression analysis of the same single cells by RT-qPCR. We characterize the efficiency of single cell isolation and demonstrate our method by identifying single GFP-expressing cells from a mixed population of GFP-positive and negative cells by correlating fluorescence microscopy and RT-qPCR.

Conclusions: Single cell gene expression analysis by RT-qPCR is a convenient means for investigating cellular heterogeneity, but is most useful when correlating observations with additional measurements. We demonstrate a convenient and simple pipeline for multiplexing single cell RT-qPCR with fluorescence microscopy which is adaptable to other molecular analyses.

Contributors: Yaron, Jordan (Author) / Ziegler, Colleen (Author) / Tran, Thai (Author) / Glenn, Honor (Author) / Meldrum, Deirdre (Author) / Biodesign Institute (Contributor)

Created: 2014-05-08

Comparative Study of Classification Algorithms for Immunosignaturing Data

Description

Background: High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and others. Typically one trains a classification system by gathering large amounts of probe-level data, selecting informative features, and classifying test samples using a small number of features. As new microarrays are invented, classification systems that worked well for other array types may not be ideal. Expression microarrays, arguably one of the most prevalent array types, have been used for years to help develop classification algorithms. Many biological assumptions are built into classifiers that were designed for these types of data. One of the more problematic is the assumption of independence, both at the probe level and again at the biological level. Probes for RNA transcripts are designed to bind single transcripts. At the biological level, many genes have dependencies across transcriptional pathways, where co-regulation of transcriptional units may make many genes appear to be completely dependent. Thus, algorithms that perform well for gene expression data may not be suitable for other technologies with different binding characteristics. The immunosignaturing microarray is based on complex mixtures of antibodies binding to arrays of random sequence peptides. It relies on many-to-many binding of antibodies to the random sequence peptides: each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. However, it is not clear which classification algorithm is optimal for analyzing this new type of data.

Results: We characterized several classification algorithms to analyze immunosignaturing data. We selected several datasets that range from easy to difficult to classify, from simple monoclonal binding to complex binding patterns in asthma patients. We then classified the biological samples using 17 different classification algorithms. Using a wide variety of assessment criteria, we found ‘Naïve Bayes’ far more useful than other widely used methods due to its simplicity, robustness, speed and accuracy.

Conclusions: The ‘Naïve Bayes’ algorithm appears to accommodate the complex patterns hidden within multilayered immunosignaturing microarray data due to its fundamental mathematical properties.
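For readers unfamiliar with the classifier, a minimal Naïve Bayes sketch follows. It assumes Gaussian per-feature likelihoods, which the abstract does not specify, and the function names, the variance floor, and the two-peptide toy data are hypothetical rather than taken from the study.

```python
import math
from collections import defaultdict

def fit_gaussian_nb(X, y, var_floor=1e-6):
    """Estimate a log prior plus per-feature mean and variance for each class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    for label, rows in by_class.items():
        columns = list(zip(*rows))
        means = [sum(col) / len(col) for col in columns]
        # The small floor keeps the variance positive when a feature is constant.
        variances = [
            sum((v - m) ** 2 for v in col) / len(col) + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(len(rows) / len(X)), means, variances)
    return model

def predict(model, row):
    """Return the class with the highest log posterior, treating features as independent."""
    best_label, best_score = None, -math.inf
    for label, (log_prior, means, variances) in model.items():
        score = log_prior
        for x, m, v in zip(row, means, variances):
            score += -0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical log-intensities for two peptides across six samples.
X = [[1.0, 5.0], [1.2, 4.8], [0.9, 5.1], [3.0, 2.0], [3.2, 1.9], [2.9, 2.2]]
y = ["healthy"] * 3 + ["disease"] * 3

model = fit_gaussian_nb(X, y)
print(predict(model, [1.1, 5.0]), predict(model, [3.1, 2.0]))
```

The per-feature sum in `predict` is where the independence assumption enters: each peptide contributes to the score separately, which keeps training and prediction fast even with many features.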

Contributors: Kukreja, Muskan (Author) / Johnston, Stephen (Author) / Stafford, Phillip (Author) / Biodesign Institute (Contributor)

Created: 2012-06-21
