Search Content

Use of large, immunosignature databases to pose new questions about infection and health status

Description

Immunosignature is a technology that retrieves information from the immune system. The technology is based on microarrays with peptides chosen from random sequence space. My thesis focuses on improving the Immunosignature platform and using Immunosignatures to improve diagnosis for diseases. I first contributed to the optimization of the immunosignature platform…

Immunosignature is a technology that retrieves information from the immune system. The technology is based on microarrays with peptides chosen from random sequence space. My thesis focuses on improving the Immunosignature platform and using Immunosignatures to improve diagnosis for diseases. I first contributed to the optimization of the immunosignature platform by introducing scoring metrics to select optimal parameters, considering performance as well as practicality. Next, I primarily worked on identifying a signature shared across various pathogens that can distinguish them from the healthy population. I further retrieved consensus epitopes from the disease common signature and proposed that most pathogens could share the signature by studying the enrichment of the common signature in the pathogen proteomes. Following this, I worked on studying cancer samples from different stages and correlated the immune response with whether the epitope presented by tumor is similar to the pathogen proteome. An effective immune response is defined as an antibody titer increasing followed by decrease, suggesting elimination of the epitope. I found that an effective immune response usually correlates with epitopes that are more similar to pathogens. This suggests that the immune system might occupy a limited space and can be effective against only certain epitopes that have similarity with pathogens. I then participated in the attempt to solve the antibiotic resistance problem by developing a classification algorithm that can distinguish bacterial versus viral infection. This algorithm outperforms other currently available classification methods. Finally, I worked on the concept of deriving a single number to represent all the data on the immunosignature platform. This is in resemblance to the concept of temperature, which is an approximate measurement of whether an individual is healthy. The measure of Immune Entropy was found to work best as a single measurement to describe the immune system information derived from the immunosignature. Entropy is relatively invariant in healthy population, but shows significant differences when comparing healthy donors with patients either infected with a pathogen or have cancer.

ContributorsWang, Lu (Author) / Johnston, Stephen (Thesis advisor) / Stafford, Phillip (Committee member) / Buetow, Kenneth (Committee member) / McFadden, Grant (Committee member) / Arizona State University (Publisher)

Created2018

Systematic Analysis of the Factors Contributing to the Variation and Change of the Microbiome

Description

Understanding changes and trends in biomedical knowledge is crucial for individuals, groups, and institutions as biomedicine improves people’s lives, supports national economies, and facilitates innovation. However, as knowledge changes what evidence illustrates knowledge changes? In the case of microbiome, a multi-dimensional concept from biomedicine, there are significant increases in publications,…

Understanding changes and trends in biomedical knowledge is crucial for individuals, groups, and institutions as biomedicine improves people’s lives, supports national economies, and facilitates innovation. However, as knowledge changes what evidence illustrates knowledge changes? In the case of microbiome, a multi-dimensional concept from biomedicine, there are significant increases in publications, citations, funding, collaborations, and other explanatory variables or contextual factors. What is observed in the microbiome, or any historical evolution of a scientific field or scientific knowledge, is that these changes are related to changes in knowledge, but what is not understood is how to measure and track changes in knowledge. This investigation highlights how contextual factors from the language and social context of the microbiome are related to changes in the usage, meaning, and scientific knowledge on the microbiome. Two interconnected studies integrating qualitative and quantitative evidence examine the variation and change of the microbiome evidence are presented. First, the concepts microbiome, metagenome, and metabolome are compared to determine the boundaries of the microbiome concept in relation to other concepts where the conceptual boundaries have been cited as overlapping. A collection of publications for each concept or corpus is presented, with a focus on how to create, collect, curate, and analyze large data collections. This study concludes with suggestions on how to analyze biomedical concepts using a hybrid approach that combines results from the larger language context and individual words. Second, the results of a systematic review that describes the variation and change of microbiome research, funding, and knowledge are examined. A corpus of approximately 28,000 articles on the microbiome are characterized, and a spectrum of microbiome interpretations are suggested based on differences related to context. The collective results suggest the microbiome is a separate concept from the metagenome and metabolome, and the variation and change to the microbiome concept was influenced by contextual factors. These results provide insight into how concepts with extensive resources behave within biomedicine and suggest the microbiome is possibly representative of conceptual change or a preview of new dynamics within science that are expected in the future.

ContributorsAiello, Kenneth (Author) / Laubichler, Manfred D (Thesis advisor) / Simeone, Michael (Committee member) / Buetow, Kenneth (Committee member) / Walker, Sara I (Committee member) / Arizona State University (Publisher)

Created2018

A search for parent-of-origin effects in the parasitoid jewel wasp Nasonia vitripennis

Description

In most diploid cells, autosomal genes are equally expressed from the paternal and maternal alleles resulting in biallelic expression. However, as an exception, there exists a small number of genes that show a pattern of monoallelic or biased-allele expression based on the allele’s parent-of-origin. This phenomenon is termed genomic imprinting…

In most diploid cells, autosomal genes are equally expressed from the paternal and maternal alleles resulting in biallelic expression. However, as an exception, there exists a small number of genes that show a pattern of monoallelic or biased-allele expression based on the allele’s parent-of-origin. This phenomenon is termed genomic imprinting and is an evolutionary paradox. The best explanation for imprinting is David Haig's kinship theory, which hypothesizes that monoallelic gene expression is largely the result of evolutionary conflict between males and females over maternal involvement in their offspring. One previous RNAseq study has investigated the presence of parent-of-origin effects, or imprinting, in the parasitic jewel wasp Nasonia vitripennis (N. vitripennis) and its sister species Nasonia giraulti (N. giraulti) to test the predictions of kinship theory in a non-eusocial species for comparison to a eusocial one. In order to continue to tease apart the connection between social and eusocial Hymenoptera, this study proposed a similar RNAseq study that attempted to reproduce these results in unique samples of reciprocal F1 Nasonia hybrids. Building a pseudo N. giraulti reference genome, differences were observed when aligning RNAseq reads to a N. vitripennis reference genome compared to aligning reads to a pseudo N. giraulti reference. As well, no evidence for parent-of-origin or imprinting patterns in adult Nasonia were found. These results demonstrated a species-of-origin effect. Importantly, the study continued to build a repository of support with the aim to elucidate the mechanisms behind imprinting in an excellent epigenetic model species, as it can also help with understanding the phenomenon of imprinting in complex human diseases.

ContributorsUnderwood, Avery Elizabeth (Author) / Wilson, Melissa (Thesis advisor) / Buetow, Kenneth (Committee member) / Gile, Gillian (Committee member) / Arizona State University (Publisher)

Created2019

Patterns of Sex-Biased Gene Expression in the Human Brain

Description

Schizophrenia is a disease that affects 15.2/100,000 US citizens, with about 0.6-1.9% of the total population being afflicted with some range of severity of the disease. A lot of research has been done on the progression of the disease and its differences between males and females; however, the true underlying…

Schizophrenia is a disease that affects 15.2/100,000 US citizens, with about 0.6-1.9% of the total population being afflicted with some range of severity of the disease. A lot of research has been done on the progression of the disease and its differences between males and females; however, the true underlying cause of the disease remains unknown. In the literature, however, there is a lot of indication that a genetic cause for schizophrenia is the primary origin for the disorder. In order to establish a foundation in differential gene expression and isoform expression between males and females, we utilized the Genotype-Tissue Expression Project data set (which contains samples from healthy individuals at their time of death) for the amygdala, anterior cingulate cortex, and frontal cortex. We performed quality control on the data with Trimmomatic and visualized it with FastQC and MultiQC. We then aligned to a sex-specific reference genome with Hisat2. Finally, we performed a differential expression analysis dthrough the limma/voom package with inputs from featureCounts. An isoform level analysis was run on the anterior cingulate cortex with the IsoformSwitchAnalyzeR package. We were able to identify a few differentially expressed genes in the three tissue sites, which included XIST and other highly conserved, Y-linked genes. As for the isoform level analysis, we were able to identify 13 genes with significant levels of differential isoform usage and expression, two of which have clinical relevance (DAB1 and PACRG). These findings will allow for a comparison to be made by future studies on gene expression in brain tissue samples from patients that had been diagnosed with schizophrenia in their life. By identifying any unique genes in these patients, gene therapies can be developed to target and correct any misexpression that may be occurring.

ContributorsEvanovich, Austin Phillip (Author) / Wilson, Melissa (Thesis director) / Buetow, Kenneth (Committee member) / Natri, Heini Maaret (Committee member) / School of Life Sciences (Contributor, Contributor) / Barrett, The Honors College (Contributor)

Created2019-05

A Robust scRNA-seq Data Analysis Pipeline for Measuring Gene Expression Noise

Description

The past decade has seen a drastic increase in collaboration between Computer Science (CS) and Molecular Biology (MB). Current foci in CS such as deep learning require very large amounts of data, and MB research can often be rapidly advanced by analysis and models from CS. One of the places…

The past decade has seen a drastic increase in collaboration between Computer Science (CS) and Molecular Biology (MB). Current foci in CS such as deep learning require very large amounts of data, and MB research can often be rapidly advanced by analysis and models from CS. One of the places where CS could aid MB is during analysis of sequences to find binding sites, prediction of folding patterns of proteins. Maintenance and replication of stem-like cells is possible for long terms as well as differentiation of these cells into various tissue types. These behaviors are possible by controlling the expression of specific genes. These genes then cascade into a network effect by either promoting or repressing downstream gene expression. The expression level of all gene transcripts within a single cell can be analyzed using single cell RNA sequencing (scRNA-seq). A significant portion of noise in scRNA-seq data are results of extrinsic factors and could only be removed by customized scRNA-seq analysis pipeline. scRNA-seq experiments utilize next-gen sequencing to measure genome scale gene expression levels with single cell resolution.

Almost every step during analysis and quantification requires the use of an often empirically determined threshold, which makes quantification of noise less accurate. In addition, each research group often develops their own data analysis pipeline making it impossible to compare data from different groups. To remedy this problem a streamlined and standardized scRNA-seq data analysis and normalization protocol was designed and developed. After analyzing multiple experiments we identified the possible pipeline stages, and tools needed. Our pipeline is capable of handling data with adapters and barcodes, which was not the case with pipelines from some experiments. Our pipeline can be used to analyze single experiment scRNA-seq data and also to compare scRNA-seq data across experiments. Various processes like data gathering, file conversion, and data merging were automated in the pipeline. The main focus was to standardize and normalize single-cell RNA-seq data to minimize technical noise introduced by disparate platforms.

ContributorsBalachandran, Parithi (Author) / Wang, Xiao (Thesis advisor) / Brafman, David (Committee member) / Lockhart, Thurmon (Committee member) / Arizona State University (Publisher)

Created2017

Towards an Asynchronous Course-based Undergraduate Research Experience (CURE) Framework: A Pilot Case Study in Remote Genomics Research

Description

Course-based undergraduate research experiences (CUREs) are strategically designed to advance novel research and integrate future professionals into the scientific community by making relevant discoveries through iteration, communication, and collaboration. With Universities also expanding online undergraduate degree programs that incorporate students who are otherwise unable to attend college, there is a…

Course-based undergraduate research experiences (CUREs) are strategically designed to advance novel research and integrate future professionals into the scientific community by making relevant discoveries through iteration, communication, and collaboration. With Universities also expanding online undergraduate degree programs that incorporate students who are otherwise unable to attend college, there is a demand for online asynchronous courses to train online students in authentic research, thereby leading to a more skilled, diverse, and inclusive workforce. In this case-study, a pilot CURE leveraging the data-intensive field of genomics was presented as an inclusive opportunity for asynchronous, online students to increase their research experience without having to commit to in person or extra-curricular assignments. This online CURE was designed to investigate the effects of trimming software on high-throughput sequencing data when analyzing sex differential gene expression. Project-based objectives were developed to asynchronously teach (1) the biology behind the research, (2) the coding needed to conduct the research, and (3) professional development tools to communicate research findings. Course effectiveness was evaluated qualitatively and quantitatively using weekly, open-response progress reports and an assessment administered before and after term completion. This pilot study exhibited that students can be successful in remote research experiences that incorporate channels for communication, bespoke and accessible learning materials, and open-response reports to monitor challenges and coping strategies. In this iteration, remote students demonstrated improved learning outcomes and self-reported improved confidence as researchers. In addition, students gained more realistic expectations to self-assess computational research skill-levels and self-identified adaptive coping strategies that are transferrable to future research projects. Overall, this framework for an online asynchronous CURE effectively taught students computational skills to conduct genomics research in addition to professional skills to transition to and thrive in the workforce.

ContributorsAlarid, Danielle Olga (Author) / Wilson, Melissa A (Thesis advisor) / Buetow, Kenneth (Committee member) / Cooper, Katelyn (Committee member) / Arizona State University (Publisher)

Created2023

Pathway Analysis Reveals Sex Differences in Human Hepatocellular Carcinoma

Description

Hepatocellular carcinoma (HCC) is the third leading cause of cancer death worldwide and exhibits a male-bias in occurrence and mortality. Previous studies have provided insight into the role of inherited genetic regulation of transcription in modulating sex-differences in HCC etiology and mortality. This study uses pathway analysis to add insight…

Hepatocellular carcinoma (HCC) is the third leading cause of cancer death worldwide and exhibits a male-bias in occurrence and mortality. Previous studies have provided insight into the role of inherited genetic regulation of transcription in modulating sex-differences in HCC etiology and mortality. This study uses pathway analysis to add insight into the biological processes that drive sex-differences in HCC etiology as well as a provide additional framework for future studies on sex-biased cancers. Gene expression data from normal, tumor adjacent, and HCC liver tissue were used to calculate pathway scores using a tool called PathOlogist that not only takes into consideration the molecules in a biological pathway, but also the interaction type and directionality of the signaling pathways. Analysis of the pathway scores uncovered etiologically relevant pathways differentiating male and female HCC. In normal and tumor adjacent liver tissue, males showed higher activity of pathways related to translation factors and signaling. Females did not show higher activity of any pathways compared to males in normal and tumor adjacent liver tissue. Work suggest biologic processes that underlie sex-biases in HCC occurrence and mortality. Both males and females differed in the activation of pathways related apoptosis, cell cycle, signaling, and metabolism in HCC. These results identify clinically relevant pathways for future research and therapeutic targeting.

ContributorsRehling, Thomas E (Author) / Buetow, Kenneth (Thesis advisor) / Wilson, Melissa (Committee member) / Maley, Carlo (Committee member) / Arizona State University (Publisher)

Created2021

Pathways of Distinction Analysis of Liver Cancer Data: Genetic Differences Between Males and Females

Description

The Pathways of Distinction Analysis (PoDA) program calculates relationships between a given group of genes contained within a pathway, and a disease state. It was used here to investigate liver cancer, and to explore how genetic variability may contribute to the different rates of development of the disease in males…

The Pathways of Distinction Analysis (PoDA) program calculates relationships between a given group of genes contained within a pathway, and a disease state. It was used here to investigate liver cancer, and to explore how genetic variability may contribute to the different rates of development of the disease in males and females. The goal of the study was to identify germline variation that differs by sex in hepatocellular carcinoma. Using the program, multiple pathways and genes were identified to have significant differences in their relationship to liver cancer in males and females. In animal studies, the genes which were identified using the PoDA analysis have been shown to impact liver cancer, often with different results for males and females. While these genes are often the focus in animal models, they are absent from current Genome Wide Association Studies (GWAS) catalogs for humans. By working to bridge the results of animal studies and human studies, the results help to identify the causes of liver cancer, and more specifically, the reason the disease affects males at much higher rates. The differences in pathways identified to be significant for the two sexes indicate the germline variance may play sex-specific roles in the development of hepatocellular carcinoma. Additionally, these results reinforce the capacity of the PoDA analysis to identify genes that may be missed by more traditional GWAS methods. This study lays the groundwork for further investigations into the identified genes and pathways, and how they behave differently within males and females.

ContributorsOlson, Erik Jon (Author) / Buetow, Kenneth (Thesis advisor) / Wilson, Melissa (Committee member) / Cartwright, Reed (Committee member) / Arizona State University (Publisher)

Created2021

Isoform Variation across Triple Negative Breast Cancer and Prostate Cancer

Description

Cancer is a disease in which abnormal cells divide uncontrollably and destroy body tissue, and currently plagues today’s world. Carcinomas are cancers derived from epithelial cells and include breast and prostate cancer. Breast cancer is a type of carcinoma that forms in breast tissue cells. The tumor cells can be…

Cancer is a disease in which abnormal cells divide uncontrollably and destroy body tissue, and currently plagues today’s world. Carcinomas are cancers derived from epithelial cells and include breast and prostate cancer. Breast cancer is a type of carcinoma that forms in breast tissue cells. The tumor cells can be further categorized after testing the cells for the presence of certain molecules. Hormone receptor positive breast cancer includes the tumor cells with receptors that respond to the steroid hormones, estrogen and progesterone, or the peptide hormone, HER2. These forms of cancer respond well to chemotherapy and endocrine therapy. On the other hand, triple negative breast cancer (TNBC) is characterized by the lack of hormone receptor expression and tends to have a worse prognosis in women. Prostate cancer forms in the cells of the prostate gland and has been attributed to mutations in androgen receptor ligand specificity. In a subset of triple negative breast cancer, genetic expression profiling has found a luminal androgen receptor that is dependent on androgen signaling. TNBC has also been found to respond well to enzalutamide, a an androgen receptor inhibitor. As the gene of the androgen receptor, AR, is located on the X chromosome and expressed in a variety of tissues, the responsiveness of TNBC to androgen receptor inhibition could be due to the differential usage of isoforms - different gene mRNA transcripts that produce different proteins. Thus, this study analyzed differential gene expression and differential isoform usage between TNBC cancers – that do and do not express the androgen receptor – and prostate cancer in order to better understand the underlying mechanism behind the effectiveness of androgen receptor inhibition in TNBC. Through the analysis of differential gene expression between the TNBC AR+ and AR- conditions, it was found that seven genes are significantly differentially expressed between the two types of tissues. Genes of significance are AR and EN1, which was found to be a potential prognostic marker in a subtype of TNBC. While some genes are differentially expressed between the TNBC AR+ and AR- tissues, the differences in isoform expression between the two tissues do not reflect the difference in gene expression. We discovered 11 genes that exhibited significant isoform switching between AR+ and AR- TNBC and have been found to contribute to cancer characteristics. The genes CLIC1 and RGS5 have been found to help the rapid, uncontrolled growth of cancer cells. HSD11B2, IRAK1, and COL1Al have been found to contribute to general cancer characteristics and metastasis in breast cancer. PSMA7 has been found to play a role in androgen receptor activation. Finally, SIDT1 and GLYATL1 are both associated with breast and prostate cancers. Overall, through the analysis of differential isoform usage between AR+ and AR- samples, we uncovered differences that were not detected by a gene level differential expression analysis. Thus, future work will focus on analyzing differential gene and isoform expression across all types of breast cancer and prostate cancer to better understand the responsiveness of TNBC to androgen receptor inhibition.

ContributorsDeshpande, Anagha J (Author) / Wilson-Sayres, Melissa (Thesis director) / Buetow, Kenneth (Committee member) / Natri, Heini (Committee member) / School of Human Evolution & Social Change (Contributor) / School of Life Sciences (Contributor) / Barrett, The Honors College (Contributor)

Created2019-05

A review of pathway-based visualization and quantification analysis tools using microarray data

Description

Pathway analysis helps researchers gain insight into the biology behind gene expression-based data. By applying this data to known biological pathways, we can learn about mutations or other changes in cellular function, such as those seen in cancer. There are many tools that can be used to analyze pathways; however,…

Pathway analysis helps researchers gain insight into the biology behind gene expression-based data. By applying this data to known biological pathways, we can learn about mutations or other changes in cellular function, such as those seen in cancer. There are many tools that can be used to analyze pathways; however, it can be difficult to find and learn about the which tool is optimal for use in a certain experiment. This thesis aims to comprehensively review four tools, Cytoscape, PaxtoolsR, PathOlogist, and Reactome, and their role in pathway analysis. This is done by applying a known microarray data set to each tool and testing their different functions. The functions of these programs will then be analyzed to determine their roles in learning about biology and assisting new researchers with their experiments. It was found that each tools holds a very unique and important role in pathway analysis. Visualization pathways have the role of exploring individual pathways and interpreting genomic results. Quantification pathways use statistical tests to determine pathway significance. Together one can find pathways of interest and then explore areas of interest.

ContributorsRehling, Thomas Evan (Author) / Buetow, Kenneth (Thesis director) / Wilson, Melissa (Committee member) / School of Life Sciences (Contributor, Contributor) / Barrett, The Honors College (Contributor)

Created2020-05

Filtering by