Search Content

Diversity and Distribution of the Desert Stink Beetles: Systematics of the Amphidorini LeConte, 1862 (Coleoptera: Tenebrionidae)

Description

Understanding the diversity, evolutionary relationships, and geographic distribution of species is foundational knowledge in biology. However, this knowledge is lacking for many diverse lineages of the tree of life. This is the case for the desert stink beetles in the tribe Amphidorini LeConte, 1862 (Coleoptera: Tenebrionidae) – a lineage of…

Understanding the diversity, evolutionary relationships, and geographic distribution of species is foundational knowledge in biology. However, this knowledge is lacking for many diverse lineages of the tree of life. This is the case for the desert stink beetles in the tribe Amphidorini LeConte, 1862 (Coleoptera: Tenebrionidae) – a lineage of arid-adapted flightless beetles found throughout western North America. Four interconnected studies that jointly increase our knowledge of this group are presented. First, the darkling beetle fauna of the Algodones sand dunes in southern California is examined as a case study to explore the scientific practice of checklist creation. An updated list of the species known from this region is presented, with a critical focus on material now made available through digitization and global aggregation. This part concludes with recommendations for future biodiversity checklist authors. Second, the psammophilic genus Trogloderus LeConte, 1879 is revised. Six new species are described, and the first, multi-gene phylogeny for the genus is inferred. In addition, historical biogeographic reconstructions along with novel hypotheses of speciation patterns within the Intermountain Region are given. In particular, the Kaibab Plateau and Kaiparowitz Formation are found to have promoted speciation on the Colorado Plateau. The Owens Valley and prehistoric Bouse Embayment are similarly hypothesized to drive species diversification in southern California. Third, a novel phylogenomic analysis for the tribe Amphidorini is presented, based on 29 de novo partial transcriptomes. Three putative ortholog sets were discovered and analyzed to infer the relationships between species groups and genera. The existing classification of the tribe is found to be highly inadequate, though the earliest-diverging relationships within the tribe are still in question. Finally, the new phylogenetic framework is used to provide a genus-level revision for the Amphidorini, which previously contained six valid genera and 253 valid species. This updated classification includes more than 100 taxonomic changes and results in the revised tribe consisting of 16 genera, with three being described as new to science.

ContributorsJohnston, Murray Andrew (Author) / Franz, Nico M (Thesis advisor) / Cartwright, Reed (Committee member) / Taylor, Jesse (Committee member) / Pigg, Kathleen (Committee member) / Arizona State University (Publisher)

Created2018

Using antibodies to characterize healthy, disease, and age states

Description

The advent of new high throughput technology allows for increasingly detailed characterization of the immune system in healthy, disease, and age states. The immune system is composed of two main branches: the innate and adaptive immune system, though the border between these two states is appearing less distinct. The adaptive…

The advent of new high throughput technology allows for increasingly detailed characterization of the immune system in healthy, disease, and age states. The immune system is composed of two main branches: the innate and adaptive immune system, though the border between these two states is appearing less distinct. The adaptive immune system is further split into two main categories: humoral and cellular immunity. The humoral immune response produces antibodies against specific targets, and these antibodies can be used to learn about disease and normal states. In this document, I use antibodies to characterize the immune system in two ways: 1. I determine the Antibody Status (AbStat) from the data collected from applying sera to an array of non-natural sequence peptides, and demonstrate that this AbStat measure can distinguish between disease, normal, and aged samples as well as produce a single AbStat number for each sample; 2. I search for antigens for use in a cancer vaccine, and this search results in several candidates as well as a new hypothesis. Antibodies provide us with a powerful tool for characterizing the immune system, and this natural tool combined with emerging technologies allows us to learn more about healthy and disease states.

ContributorsWhittemore, Kurt (Author) / Sykes, Kathryn (Thesis advisor) / Johnston, Stephen A. (Committee member) / Jacobs, Bertram (Committee member) / Stafford, Phillip (Committee member) / Stout, Valerie (Committee member) / Arizona State University (Publisher)

Created2014

Antibody based strategies for multiplexed diagnostics

Description

Peptide microarrays are to proteomics as sequencing is to genomics. As microarrays become more content-rich, higher resolution proteomic studies will parallel deep sequencing of nucleic acids. Antigen-antibody interactions can be studied at a much higher resolution using microarrays than was possible only a decade ago. My dissertation focuses on testing…

Peptide microarrays are to proteomics as sequencing is to genomics. As microarrays become more content-rich, higher resolution proteomic studies will parallel deep sequencing of nucleic acids. Antigen-antibody interactions can be studied at a much higher resolution using microarrays than was possible only a decade ago. My dissertation focuses on testing the feasibility of using either the Immunosignature platform, based on non-natural peptide sequences, or a pathogen peptide microarray, which uses bioinformatically-selected peptides from pathogens for creating sensitive diagnostics. Both diagnostic applications use relatively little serum from infected individuals, but each approaches diagnosis of disease differently. The first project compares pathogen epitope peptide (life-space) and non-natural (random-space) peptide microarrays while using them for the early detection of Coccidioidomycosis (Valley Fever). The second project uses NIAID category A, B and C priority pathogen epitope peptides in a multiplexed microarray platform to assess the feasibility of using epitope peptides to simultaneously diagnose multiple exposures using a single assay. Cross-reactivity is a consistent feature of several antigen-antibody based immunodiagnostics. This work utilizes microarray optimization and bioinformatic approaches to distill the underlying disease specific antibody signature pattern. Circumventing inherent cross-reactivity observed in antibody binding to peptides was crucial to achieve the goal of this work to accurately distinguishing multiple exposures simultaneously.

ContributorsNavalkar, Krupa Arun (Author) / Johnston, Stephen A. (Thesis advisor) / Stafford, Phillip (Thesis advisor) / Sykes, Kathryn (Committee member) / Jacobs, Bertram (Committee member) / Arizona State University (Publisher)

Created2014

Dense non-natural sequence peptide microarrays for epitope mapping and diagnostics

Description

The healthcare system in this country is currently unacceptable. New technologies may contribute to reducing cost and improving outcomes. Early diagnosis and treatment represents the least risky option for addressing this issue. Such a technology needs to be inexpensive, highly sensitive, highly specific, and amenable to adoption in a clinic.…

The healthcare system in this country is currently unacceptable. New technologies may contribute to reducing cost and improving outcomes. Early diagnosis and treatment represents the least risky option for addressing this issue. Such a technology needs to be inexpensive, highly sensitive, highly specific, and amenable to adoption in a clinic. This thesis explores an immunodiagnostic technology based on highly scalable, non-natural sequence peptide microarrays designed to profile the humoral immune response and address the healthcare problem. The primary aim of this thesis is to explore the ability of these arrays to map continuous (linear) epitopes. I discovered that using a technique termed subsequence analysis where epitopes could be decisively mapped to an eliciting protein with high success rate. This led to the discovery of novel linear epitopes from Plasmodium falciparum (Malaria) and Treponema palladium (Syphilis), as well as validation of previously discovered epitopes in Dengue and monoclonal antibodies. Next, I developed and tested a classification scheme based on Support Vector Machines for development of a Dengue Fever diagnostic, achieving higher sensitivity and specificity than current FDA approved techniques. The software underlying this method is available for download under the BSD license. Following this, I developed a kinetic model for immunosignatures and tested it against existing data driven by previously unexplained phenomena. This model provides a framework and informs ways to optimize the platform for maximum stability and efficiency. I also explored the role of sequence composition in explaining an immunosignature binding profile, determining a strong role for charged residues that seems to have some predictive ability for disease. Finally, I developed a database, software and indexing strategy based on Apache Lucene for searching motif patterns (regular expressions) in large biological databases. These projects as a whole have advanced knowledge of how to approach high throughput immunodiagnostics and provide an example of how technology can be fused with biology in order to affect scientific and health outcomes.

ContributorsRicher, Joshua Amos (Author) / Johnston, Stephen A. (Thesis advisor) / Woodbury, Neal (Committee member) / Stafford, Phillip (Committee member) / Papandreou-Suppappola, Antonia (Committee member) / Arizona State University (Publisher)

Created2014

Use of large, immunosignature databases to pose new questions about infection and health status

Description

Immunosignature is a technology that retrieves information from the immune system. The technology is based on microarrays with peptides chosen from random sequence space. My thesis focuses on improving the Immunosignature platform and using Immunosignatures to improve diagnosis for diseases. I first contributed to the optimization of the immunosignature platform…

Immunosignature is a technology that retrieves information from the immune system. The technology is based on microarrays with peptides chosen from random sequence space. My thesis focuses on improving the Immunosignature platform and using Immunosignatures to improve diagnosis for diseases. I first contributed to the optimization of the immunosignature platform by introducing scoring metrics to select optimal parameters, considering performance as well as practicality. Next, I primarily worked on identifying a signature shared across various pathogens that can distinguish them from the healthy population. I further retrieved consensus epitopes from the disease common signature and proposed that most pathogens could share the signature by studying the enrichment of the common signature in the pathogen proteomes. Following this, I worked on studying cancer samples from different stages and correlated the immune response with whether the epitope presented by tumor is similar to the pathogen proteome. An effective immune response is defined as an antibody titer increasing followed by decrease, suggesting elimination of the epitope. I found that an effective immune response usually correlates with epitopes that are more similar to pathogens. This suggests that the immune system might occupy a limited space and can be effective against only certain epitopes that have similarity with pathogens. I then participated in the attempt to solve the antibiotic resistance problem by developing a classification algorithm that can distinguish bacterial versus viral infection. This algorithm outperforms other currently available classification methods. Finally, I worked on the concept of deriving a single number to represent all the data on the immunosignature platform. This is in resemblance to the concept of temperature, which is an approximate measurement of whether an individual is healthy. The measure of Immune Entropy was found to work best as a single measurement to describe the immune system information derived from the immunosignature. Entropy is relatively invariant in healthy population, but shows significant differences when comparing healthy donors with patients either infected with a pathogen or have cancer.

ContributorsWang, Lu (Author) / Johnston, Stephen (Thesis advisor) / Stafford, Phillip (Committee member) / Buetow, Kenneth (Committee member) / McFadden, Grant (Committee member) / Arizona State University (Publisher)

Created2018

Frameshift antigens for cancer vaccine development

Description

Immunotherapy has been revitalized with the advent of immune checkpoint blockade

treatments, and neo-antigens are the targets of immune system in cancer patients who

respond to the treatments. The cancer vaccine field is focused on using neo-antigens from

unique point mutations of genomic sequence in the cancer patient for making

personalized cancer vaccines. However,…

Immunotherapy has been revitalized with the advent of immune checkpoint blockade

treatments, and neo-antigens are the targets of immune system in cancer patients who

respond to the treatments. The cancer vaccine field is focused on using neo-antigens from

unique point mutations of genomic sequence in the cancer patient for making

personalized cancer vaccines. However, we choose a different path to find frameshift

neo-antigens at the mRNA level and develop broadly effective cancer vaccines based on

frameshift antigens.

In this dissertation, I have summarized and characterized all the potential frameshift

antigens from microsatellite regions in human, dog and mouse. A list of frameshift

antigens was validated by PCR in tumor samples and the mutation rate was calculated for

one candidate – SEC62. I develop a method to screen the antibody response against

frameshift antigens in human and dog cancer patients by using frameshift peptide arrays.

Frameshift antigens selected by positive antibody response in cancer patients or by MHC

predictions show protection in different mouse tumor models. A dog version of the

cancer vaccine based on frameshift antigens was developed and tested in a small safety

trial. The results demonstrate that the vaccine is safe and it can induce strong B and T cell

immune responses. Further, I built the human exon junction frameshift database which

includes all possible frameshift antigens from mis-splicing events in exon junctions, and I

develop a method to find potential frameshift antigens from large cancer

immunosignature dataset with these databases. In addition, I test the idea of ‘early cancer

diagnosis, early treatment’ in a transgenic mouse cancer model. The results show that

ii

early treatment gives significantly better protection than late treatment and the correct

time point for treatment is crucial to give the best clinical benefit. A model for early

treatment is developed with these results.

Frameshift neo-antigens from microsatellite regions and mis-splicing events are

abundant at mRNA level and they are better antigens than neo-antigens from point

mutations in the genomic sequences of cancer patients in terms of high immunogenicity,

low probability to cause autoimmune diseases and low cost to develop a broadly effective

vaccine. This dissertation demonstrates the feasibility of using frameshift antigens for

cancer vaccine development.

ContributorsZhang, Jian (Author) / Johnston, Stephen Albert (Thesis advisor) / Chang, Yung (Committee member) / Stafford, Phillip (Committee member) / Chen, Qiang (Committee member) / Arizona State University (Publisher)

Created2018

Transportation Techniques for Geometric Clustering

Description

This thesis introduces new techniques for clustering distributional data according to their geometric similarities. This work builds upon the optimal transportation (OT) problem that seeks global minimum cost for matching distributional data and leverages the connection between OT and power diagrams to solve different clustering problems. The OT formulation is…

This thesis introduces new techniques for clustering distributional data according to their geometric similarities. This work builds upon the optimal transportation (OT) problem that seeks global minimum cost for matching distributional data and leverages the connection between OT and power diagrams to solve different clustering problems. The OT formulation is based on the variational principle to differentiate hard cluster assignments, which was missing in the literature. This thesis shows multiple techniques to regularize and generalize OT to cope with various tasks including clustering, aligning, and interpolating distributional data. It also discusses the connections of the new formulation to other OT and clustering formulations to better understand their gaps and the means to close them. Finally, this thesis demonstrates the advantages of the proposed OT techniques in solving machine learning problems and their downstream applications in computer graphics, computer vision, and image processing.

ContributorsMi, Liang (Author) / Wang, Yalin (Thesis advisor) / Chen, Kewei (Committee member) / Karam, Lina (Committee member) / Li, Baoxin (Committee member) / Turaga, Pavan (Committee member) / Arizona State University (Publisher)

Created2020

Methods for Detecting Mutations in Non-model Organisms

Description

Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The…

Next-generation sequencing is a powerful tool for detecting genetic variation. How-ever, it is also error-prone, with error rates that are much larger than mutation rates.
This can make mutation detection difficult; and while increasing sequencing depth
can often help, sequence-specific errors and other non-random biases cannot be de-
tected by increased depth. The problem of accurate genotyping is exacerbated when
there is not a reference genome or other auxiliary information available.
I explore several methods for sensitively detecting mutations in non-model or-
ganisms using an example Eucalyptus melliodora individual. I use the structure of
the tree to find bounds on its somatic mutation rate and evaluate several algorithms
for variant calling. I find that conventional methods are suitable if the genome of a
close relative can be adapted to the study organism. However, with structured data,
a likelihood framework that is aware of this structure is more accurate. I use the
techniques developed here to evaluate a reference-free variant calling algorithm.
I also use this data to evaluate a k-mer based base quality score recalibrator
(KBBQ), a tool I developed to recalibrate base quality scores attached to sequencing
data. Base quality scores can help detect errors in sequencing reads, but are often
inaccurate. The most popular method for correcting this issue requires a known
set of variant sites, which is unavailable in most cases. I simulate data and show
that errors in this set of variant sites can cause calibration errors. I then show that
KBBQ accurately recalibrates base quality scores while requiring no reference or other
information and performs as well as other methods.
Finally, I use the Eucalyptus data to investigate the impact of quality score calibra-
tion on the quality of output variant calls and show that improved base quality score
calibration increases the sensitivity and reduces the false positive rate of a variant
calling algorithm.

ContributorsOrr, Adam James (Author) / Cartwright, Reed (Thesis advisor) / Wilson, Melissa (Committee member) / Kusumi, Kenro (Committee member) / Taylor, Jesse (Committee member) / Pfeifer, Susanne (Committee member) / Arizona State University (Publisher)

Created2020

Pathways of Distinction Analysis of Liver Cancer Data: Genetic Differences Between Males and Females

Description

The Pathways of Distinction Analysis (PoDA) program calculates relationships between a given group of genes contained within a pathway, and a disease state. It was used here to investigate liver cancer, and to explore how genetic variability may contribute to the different rates of development of the disease in males…

The Pathways of Distinction Analysis (PoDA) program calculates relationships between a given group of genes contained within a pathway, and a disease state. It was used here to investigate liver cancer, and to explore how genetic variability may contribute to the different rates of development of the disease in males and females. The goal of the study was to identify germline variation that differs by sex in hepatocellular carcinoma. Using the program, multiple pathways and genes were identified to have significant differences in their relationship to liver cancer in males and females. In animal studies, the genes which were identified using the PoDA analysis have been shown to impact liver cancer, often with different results for males and females. While these genes are often the focus in animal models, they are absent from current Genome Wide Association Studies (GWAS) catalogs for humans. By working to bridge the results of animal studies and human studies, the results help to identify the causes of liver cancer, and more specifically, the reason the disease affects males at much higher rates. The differences in pathways identified to be significant for the two sexes indicate the germline variance may play sex-specific roles in the development of hepatocellular carcinoma. Additionally, these results reinforce the capacity of the PoDA analysis to identify genes that may be missed by more traditional GWAS methods. This study lays the groundwork for further investigations into the identified genes and pathways, and how they behave differently within males and females.

ContributorsOlson, Erik Jon (Author) / Buetow, Kenneth (Thesis advisor) / Wilson, Melissa (Committee member) / Cartwright, Reed (Committee member) / Arizona State University (Publisher)

Created2021

Studying Ventricular Abnormalities in Mild Cognitive Impairment With Hyperbolic Ricci Flow and Tensor-Based Morphometry

Description

Mild Cognitive Impairment (MCI) is a transitional stage between normal aging and dementia and people with MCI are at high risk of progression to dementia. MCI is attracting increasing attention, as it offers an opportunity to target the disease process during an early symptomatic stage. Structural magnetic resonance imaging (MRI)…

Mild Cognitive Impairment (MCI) is a transitional stage between normal aging and dementia and people with MCI are at high risk of progression to dementia. MCI is attracting increasing attention, as it offers an opportunity to target the disease process during an early symptomatic stage. Structural magnetic resonance imaging (MRI) measures have been the mainstay of Alzheimer's disease (AD) imaging research, however, ventricular morphometry analysis remains challenging because of its complicated topological structure. Here we describe a novel ventricular morphometry system based on the hyperbolic Ricci flow method and tensor-based morphometry (TBM) statistics. Unlike prior ventricular surface parameterization methods, hyperbolic conformal parameterization is angle-preserving and does not have any singularities. Our system generates a one-to-one diffeomorphic mapping between ventricular surfaces with consistent boundary matching conditions. The TBM statistics encode a great deal of surface deformation information that could be inaccessible or overlooked by other methods. We applied our system to the baseline MRI scans of a set of MCI subjects from the Alzheimer's Disease Neuroimaging Initiative (ADNI: 71 MCI converters vs. 62 MCI stable). Although the combined ventricular area and volume features did not differ between the two groups, our fine-grained surface analysis revealed significant differences in the ventricular regions close to the temporal lobe and posterior cingulate, structures that are affected early in AD. Significant correlations were also detected between ventricular morphometry, neuropsychological measures, and a previously described imaging index based on fluorodeoxyglucose positron emission tomography (FDG-PET) scans. This novel ventricular morphometry method may offer a new and more sensitive approach to study preclinical and early symptomatic stage AD.

ContributorsShi, Jie (Author) / Stonnington, Cynthia M. (Author) / Thompson, Paul M. (Author) / Chen, Kewei (Author) / Gutman, Boris (Author) / Reschke, Cole (Author) / Baxter, Leslie C. (Author) / Reiman, Eric M. (Author) / Caselli, Richard J. (Author) / Wang, Yalin (Author) / Ira A. Fulton Schools of Engineering (Contributor)

Created2015-01-01