Identification of Tumor Associated Antigens using Nucleic Acid Programmable Protein Arrays

Description

Identifying disease biomarkers may aid in the early detection of breast cancer and improve patient outcomes. Recent evidence suggests that tumors are immunogenic and that patients may therefore mount an autoantibody response to tumor-associated antigens. Single-chain variable fragments of autoantibodies derived from regional lymph node B cells of breast cancer patients were used to discover these tumor-associated biomarkers on protein microarrays. Six candidate biomarkers were discovered from 22 heavy chain-only variable region antibody fragments screened. Validation tests are necessary to confirm the tumorigenicity of these antigens. However, the use of single-chain variable autoantibody fragments presents a novel platform for diagnostics and cancer therapeutics.

Contributors: Sharman, M. Camila (Author) / Magee, Dewey (Mitch) (Thesis director) / Wallstrom, Garrick (Committee member) / Petritis, Brianne (Committee member) / Barrett, The Honors College (Contributor) / College of Liberal Arts and Sciences (Contributor) / Virginia G. Piper Center for Personalized Diagnostics (Contributor) / Biodesign Institute (Contributor)

Created: 2012-12

Transcriptome gene expression analysis of breast cancer using RNA-Seq

Description

Background: Breast cancer is the most frequently diagnosed cancer and the leading cause of cancer deaths in females worldwide, accounting for 23% of all new cancer cases and 14% of all cancer deaths in 2008. Five tumor-normal pairs of primary breast epithelial cells were treated for infinite proliferation by using a ROCK inhibitor and mouse feeder cells. Methods: Raw paired-end, 100x coverage RNA-Seq data was aligned to the Human Reference Genome Version 19 using BWA and TopHat. Gene differential expression analysis was completed using Cufflinks and Cuffdiff. Integrative Genomics Viewer was used for data visualization. Results: 15 genes were found to be down-regulated by at least one log-fold change in 4/5 of tumor samples. 75 genes were found to be down-regulated in 3/5 of our tumor samples by at least one log-fold change. 11 genes were found to be up-regulated in 4/5 of our tumor samples, and 68 genes were identified to be up-regulated in 3/5 of the tumor samples by at least one-fold change. Conclusion: Expression changes in genes such as AZGP1, AGER, ALG11, and S1007 suggest a disruption in the glycosylation pathway. No correlation was found between Cufflinks' Her2 gene-expression and DAKO score classification.
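The recurrence counts in the Results (genes changed by at least one log-fold in 3/5 or 4/5 of the tumor-normal pairs) can be illustrated with a minimal sketch. This is not the thesis's Cufflinks/Cuffdiff pipeline: the gene names, expression values, pseudocount, and the reading of "4/5" as "at least 4 of 5 pairs" are all assumptions made for illustration.

```python
import math

# Hypothetical expression values (e.g. FPKM) as (tumor, normal) for five pairs.
# Gene names and numbers are invented for illustration only.
EXAMPLE = {
    "GENE_A": [(1, 7), (0, 3), (3, 15), (1, 9), (10, 10)],
    "GENE_B": [(7, 1), (3, 0), (15, 3), (5, 5), (4, 4)],
    "GENE_C": [(5, 5), (5, 5), (5, 5), (5, 5), (5, 5)],
}


def log2_fold_change(tumor, normal, pseudocount=1.0):
    """log2(tumor / normal); the pseudocount (an assumption) avoids division by zero."""
    return math.log2((tumor + pseudocount) / (normal + pseudocount))


def recurrent_genes(expression, min_pairs, threshold=1.0):
    """Return (down, up): genes whose |log2 fold change| reaches `threshold`
    in the same direction in at least `min_pairs` tumor-normal pairs."""
    down, up = [], []
    for gene, pairs in expression.items():
        changes = [log2_fold_change(t, n) for t, n in pairs]
        if sum(c <= -threshold for c in changes) >= min_pairs:
            down.append(gene)
        if sum(c >= threshold for c in changes) >= min_pairs:
            up.append(gene)
    return down, up


if __name__ == "__main__":
    print(recurrent_genes(EXAMPLE, min_pairs=4))
    print(recurrent_genes(EXAMPLE, min_pairs=3))
```

Lowering `min_pairs` from 4 to 3 can only enlarge the gene lists, which matches the pattern in the abstract (15 versus 75 down-regulated genes, 11 versus 68 up-regulated genes).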

Contributors: Hernandez, Fernando (Author) / Anderson, Karen (Thesis director) / Mangone, Marco (Committee member) / Park, Jin (Committee member) / Barrett, The Honors College (Contributor) / Department of Information Systems (Contributor)

Created: 2013-05

Therapeutic Target Exploration in Triple Negative Breast Cancer

Description

This study investigates the impact of ARAF knockdown on the invasion capabilities of breast epithelial cells carrying the TP53 R273C mutation, a prevalent genetic alteration in triple-negative breast cancer (TNBC). Using invasion assays, the study uncovers an unexpected increase in invasion following ARAF knockdown in mutant cell lines. Further analysis hints at the presence of a novel truncated ARAF protein, challenging traditional notions of ARAF's role in cancer. These findings offer insights into potential therapeutic targets for TNBC and underscore the significance of exploring the functional implications of genetic mutations in cancer progression.

Contributors: Leaver, Jory (Author) / Park, Jin (Thesis director) / Grief, Dustin (Committee member) / Barrett, The Honors College (Contributor) / School of Life Sciences (Contributor)

Created: 2024-05

Integrative analysis of genomic aberrations in cancer and xenograft Models

Description

No two cancers are alike. Cancer is a dynamic and heterogeneous disease; such heterogeneity arises among patients with the same cancer type, among cancer cells within the same individual’s tumor, and even among cells within the same sub-clone over time. The recent application of next-generation sequencing and precision medicine techniques is the driving force in uncovering the complexity of cancer and the best clinical practice. The core concept of precision medicine is to move away from crowd-based, best-for-most treatment and take individual variability into account when optimizing prevention and treatment strategies. Next-generation sequencing is the method used to sift through the entire 3 billion letters of each patient’s genetic code in a massively parallel fashion.

The deluge of next-generation sequencing data has shifted the bottleneck of cancer research from multiple “-omics” data collection to integrative analysis and data interpretation. In this dissertation, I attempt to address two distinct, but dependent, challenges. The first is to design specific computational algorithms and tools that can process and extract useful information from the raw data in an efficient, robust, and reproducible manner. The second challenge is to develop high-level computational methods and data frameworks for integrating and interpreting these data. Specifically, Chapter 2 presents a tool called Snipea (SNv Integration, Prioritization, Ensemble, and Annotation) to further identify, prioritize and annotate somatic single nucleotide variants (SNVs) called from multiple variant callers. Chapter 3 describes a novel alignment-based algorithm to accurately and losslessly classify sequencing reads from xenograft models. Chapter 4 describes a direct and biologically motivated framework and associated methods for identifying putative aberrations that cause survival differences in GBM patients by integrating whole-genome sequencing, exome sequencing, RNA-Sequencing, methylation array and clinical data. Lastly, Chapter 5 explores longitudinal and intratumor heterogeneity studies to reveal the temporal and spatial context of tumor evolution. The long-term goal is to help patients with cancer, particularly those who are in front of us today. Genome-based analysis of the patient tumor can identify genomic alterations unique to each patient’s tumor that are candidate therapeutic targets to decrease therapy resistance and improve clinical outcome.

Contributors: Peng, Sen (Author) / Dinu, Valentin (Thesis advisor) / Scotch, Matthew (Committee member) / Wallstrom, Garrick (Committee member) / Arizona State University (Publisher)

Created: 2015

Informatics approaches for integrative analysis of disparate high-throughput genomic datasets in cancer

Description

The processes of a human somatic cell are very complex, with various genetic mechanisms governing its fate. Such cells undergo various genetic mutations, which translate to the genetic aberrations that we see in cancer. There are more than 100 types of cancer, each having many more subtypes with aberrations being unique to each. In the past two decades, the widespread application of high-throughput genomic technologies, such as micro-arrays and next-generation sequencing, has led to the revelation of many such aberrations. Known types and subtypes can be readily identified using gene-expression profiling and, more importantly, high-throughput genomic datasets have helped identify novel sub-types with distinct signatures. Recent studies showing usage of gene-expression profiling in clinical decision making in breast cancer patients underscore the utility of high-throughput datasets. Beyond prognosis, understanding the underlying cellular processes is essential for effective cancer treatment. Various high-throughput techniques are now available to look at a particular aspect of a genetic mechanism in cancer tissue. To look at these mechanisms individually is akin to looking at a broken watch: taking apart each of its parts, looking at them individually and finally making a list of all the faulty ones. Integrative approaches are needed to transform one-dimensional cancer signatures into multi-dimensional interaction and regulatory networks, consequently bettering our understanding of cellular processes in cancer. Here, I attempt to (i) address ways to effectively identify high-quality variants when multiple assays on the same sample are available through two novel tools, snpSniffer and NGSPE; (ii) glean new biological insight into multiple myeloma through two novel integrative analysis approaches making use of disparate high-throughput datasets. While these methods focus on multiple myeloma datasets, the informatics approaches are applicable to all cancer datasets and will thus help advance cancer genomics.

Contributors: Yellapantula, Venkata (Author) / Dinu, Valentin (Thesis advisor) / Scotch, Matthew (Committee member) / Wallstrom, Garrick (Committee member) / Keats, Jonathan (Committee member) / Arizona State University (Publisher)

Created: 2014

Structural variant detection: a novel approach

Description

Genomic structural variation (SV) is defined as gross alterations in the genome, broadly classified as insertions/duplications, deletions, inversions and translocations. DNA sequencing ushered structural variant discovery beyond laboratory detection techniques to high-resolution informatics approaches. Bioinformatics tools for computational discovery of SVs, however, still miss variants in the complex cancer genome. This study aimed to define the genomic context leading to tool failure and to design a novel algorithm addressing this context. Methods: The study tested the widely held but unproven hypothesis that tools fail to detect variants that lie in repeat regions. The publicly available 1000 Genomes dataset with experimentally validated variants was tested with the SVDetect tool for the presence of true positive (TP) SVs versus false negative (FN) SVs, expecting that FNs would be overrepresented in repeat regions. Further, the novel algorithm, designed to informatically capture the biological etiology of translocations (non-allelic homologous recombination and the 3-D placement of chromosomes in cells, i.e., the context), was tested using a simulated dataset. Translocations were created in known translocation hotspots, and the novel-algorithm tool was compared with SVDetect and BreakDancer. Results: 53% of false negative (FN) deletions were within repeat structure compared to 81% of true positive (TP) deletions. Similarly, 33% of FN insertions versus 42% of TP, 26% of FN duplications versus 57% of TP, and 54% of FN novel sequences versus 62% of TP were within repeats. Repeat structure was not driving the tool's inability to detect variants and could not be used as context. The novel algorithm with a redefined context, when tested against SVDetect and BreakDancer, was able to detect 10/10 simulated translocations with the 30X coverage dataset and 100% allele frequency, while SVDetect captured 4/10 and BreakDancer detected 6/10. For the 15X coverage dataset with 100% allele frequency, the novel algorithm was able to detect all ten translocations, albeit with fewer supporting reads. BreakDancer detected 4/10 and SVDetect detected 2/10. Conclusion: This study showed that the presence of repetitive elements in general within a structural variant did not influence the tool's ability to capture it. This context-based algorithm proved better than current tools even with half the genome coverage of the accepted protocol and provides an important first step for novel translocation discovery in the cancer genome.

Contributors: Shetty, Sheetal (Author) / Dinu, Valentin (Thesis advisor) / Bussey, Kimberly (Committee member) / Scotch, Matthew (Committee member) / Wallstrom, Garrick (Committee member) / Arizona State University (Publisher)

Created: 2014

Evaluating Biomarkers for Heterogeneous Diseases: from Receiver Operating Characteristics Curves to Jittered Dot Plot and Averaged Above Mean Difference Analysis

Description

Early detection of disease is essential for alleviating disease burden, increasing treatment success rates and decreasing mortality, especially for cancer. To improve disease diagnostics, many candidate biomarkers have been suggested using molecular biology or image analysis techniques over the past decade. The receiver operating characteristics (ROC) curve is a standard technique for evaluating the diagnostic accuracy of biomarkers, but it has some limitations, especially for heterogeneous diseases. As an alternative to ROC curve analysis, we suggest a jittered dot plot (JDP) and JDP-based evaluation measures, above mean difference (AMD) and averaged above mean difference (AAMD). We demonstrate how JDP and AMD or AAMD together evaluate biomarkers better than the standard ROC curve. We analyze real and heterogeneous basal-like breast cancer data.
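The limitation of ROC analysis for heterogeneous diseases can be illustrated with a small sketch. It computes the standard area under the ROC curve (the probability that a random case scores above a random control) for an invented marker that is strongly elevated in only a subset of cases. The data are hypothetical, and the sketch does not implement JDP, AMD, or AAMD, whose definitions are not given in the abstract.

```python
def auc(case_values, control_values):
    """Area under the ROC curve via pairwise comparison:
    the fraction of (case, control) pairs where the case scores higher,
    counting ties as one half."""
    wins = 0.0
    for case in case_values:
        for control in control_values:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_values) * len(control_values))


# Hypothetical marker levels (invented for illustration).
CONTROLS = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
# Three of ten cases are clearly elevated; the other seven look like controls.
CASES = [20, 21, 22, 1.5, 2.5, 3.5, 4.5, 5.5, 6.5, 7.5]

if __name__ == "__main__":
    print(f"AUC = {auc(CASES, CONTROLS):.2f}")
```

In this toy data set the overall AUC is modest even though 30% of cases are perfectly separated from every control, which is the kind of subgroup signal a single summary of the whole curve can obscure.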

Contributors: Brister, Danielle (Author) / Chung, Yunro (Thesis director) / Park, Jin (Committee member) / Barrett, The Honors College (Contributor) / School of Life Sciences (Contributor) / School of Molecular Sciences (Contributor) / School of International Letters and Cultures (Contributor) / School of Human Evolution & Social Change (Contributor)

Created: 2021-12

Exploration of Panviral Proteome: High-Throughput Cloning and Functional Implications in Virus-Host Interactions

Description

Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and the replication of their genomes, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information for understanding viral invasion/evasion, the diagnosis and therapeutic treatment of viral infections, and mechanisms of host biology. Although more than 2,000 viral genomes have been sequenced, only a small percentage of them have been well investigated. Access to these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are making the initial release of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection, including highly efficient production of viral proteins using a human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.

Contributors: Yu, Xiaobo (Author) / Bian, Xiaofang (Author) / Throop, Andrea (Author) / Song, Lusheng (Author) / del Moral, Lerys (Author) / Park, Jin (Author) / Seiler, Catherine (Author) / Fiacco, Michael (Author) / Steel, Jason (Author) / Hunter, Preston (Author) / Saul, Justin (Author) / Wang, Jie (Author) / Qiu, Ji (Author) / Pipas, James M. (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)

Created: 2013-11-30

Autoantibody Signature for the Serologic Detection of Ovarian Cancer

Description

Sera from patients with ovarian cancer contain autoantibodies (AAb) to tumor-derived proteins that are potential biomarkers for early detection. To detect AAb, we probed high-density programmable protein microarrays (NAPPA) expressing 5177 candidate tumor antigens with sera from patients with serous ovarian cancer (n = 34 cases/30 controls) and measured bound IgG. Of these, 741 antigens were selected and probed with an independent set of ovarian cancer sera (n = 60 cases/60 controls). Twelve potential autoantigens were identified with sensitivities ranging from 13 to 22% at >93% specificity. These were retested using a Luminex bead array using 60 cases and 60 controls, with sensitivities ranging from 0 to 31.7% at 95% specificity. Three AAb (p53, PTPRA, and PTGFR) had area under the curve (AUC) levels >60% (p < 0.01), with the partial AUC (SPAUC) over 5 times greater than for a nondiscriminating test (p < 0.01). Using a panel of the top three AAb (p53, PTPRA, and PTGFR), if at least two AAb were positive, then the sensitivity was 23.3% at 98.3% specificity. AAb to at least one of these top three antigens were also detected in 7/20 sera (35%) of patients with low CA 125 levels and 0/15 controls. AAb to p53, PTPRA, and PTGFR are potential biomarkers for the early detection of ovarian cancer.
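The panel rule in this abstract (call a serum positive when at least two of the three AAb are positive) can be sketched as follows. The per-marker calls below are synthetic. The only figures taken from the abstract are the group sizes (60 cases, 60 controls) and the reported 23.3% sensitivity and 98.3% specificity. The counts of 14 panel-positive cases and 1 panel-positive control are inferred from those percentages and should be read as an assumption.

```python
def panel_positive(calls, min_positive=2):
    """True when at least `min_positive` markers in `calls` are positive."""
    return sum(calls) >= min_positive


def sensitivity_specificity(case_calls, control_calls, min_positive=2):
    """Sensitivity = panel-positive cases / all cases;
    specificity = panel-negative controls / all controls."""
    true_pos = sum(panel_positive(c, min_positive) for c in case_calls)
    true_neg = sum(not panel_positive(c, min_positive) for c in control_calls)
    return true_pos / len(case_calls), true_neg / len(control_calls)


# Synthetic per-serum calls in the order (p53, PTPRA, PTGFR).
# The split across markers is invented; only the totals mirror the abstract.
CASES = (
    [(True, True, False)] * 14       # panel-positive (assumed count)
    + [(True, False, False)] * 10    # one marker only: panel-negative
    + [(False, False, False)] * 36
)
CONTROLS = (
    [(True, True, False)] * 1        # panel-positive control (assumed count)
    + [(False, True, False)] * 4
    + [(False, False, False)] * 55
)

if __name__ == "__main__":
    sens, spec = sensitivity_specificity(CASES, CONTROLS)
    print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
```

Requiring two positive markers rather than one trades sensitivity for specificity: in the synthetic data, setting `min_positive=1` would flag 24 of 60 cases but also 5 of 60 controls.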

Contributors: Anderson, Karen (Author) / Cramer, Daniel W. (Author) / Sibani, Sahar (Author) / Wallstrom, Garrick (Author) / Wong, Jessica (Author) / Park, Jin (Author) / Qiu, Ji (Author) / Vitonis, Allison (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)

Created: 2015-01-01

Comparative RNA-Seq Analysis Reveals Pervasive Tissue-Specific Alternative Polyadenylation in Caenorhabditis Elegans Intestine and Muscles

Description

Background: Tissue-specific RNA plasticity broadly impacts the development, tissue identity and adaptability of all organisms, but changes in composition, expression levels and its impact on gene regulation in different somatic tissues are largely unknown. Here we developed a new method, polyA-tagging and sequencing (PAT-Seq) to isolate high-quality tissue-specific mRNA from Caenorhabditis elegans intestine, pharynx and body muscle tissues and study changes in their tissue-specific transcriptomes and 3’UTRomes.

Results: We have identified thousands of novel genes and isoforms differentially expressed between these three tissues. The intestine transcriptome is expansive, expressing over 30% of C. elegans mRNAs, while muscle transcriptomes are smaller but contain characteristic unique gene signatures. Active promoter regions in all three tissues reveal both known and novel enriched tissue-specific elements, along with putative transcription factors, suggesting novel tissue-specific modes of transcription initiation. We have precisely mapped approximately 20,000 tissue-specific polyadenylation sites and discovered that about 30% of transcripts in somatic cells use alternative polyadenylation in a tissue-specific manner, with their 3’UTR isoforms significantly enriched with microRNA targets.

Conclusions: For the first time, PAT-Seq allowed us to directly study tissue-specific gene expression changes in an in vivo setting and compare these changes between three somatic tissues from the same organism at single-base resolution within the same experiment. We pinpoint precise tissue-specific transcriptome rearrangements and, for the first time, link tissue-specific alternative polyadenylation to miRNA regulation, suggesting novel and unexplored tissue-specific post-transcriptional regulatory networks in somatic cells.

Contributors: Blazie, Stephen (Author) / Babb, Cody (Author) / Wilky, Henry (Author) / Rawls, Alan (Author) / Park, Jin (Author) / Mangone, Marco (Author) / College of Liberal Arts and Sciences (Contributor)

Created: 2015-01-20