Search Content

Expansion and Application of Pathways of Topological Rank Analysis (PoTRA) to Various Cancers

Description

Cancer is the second leading cause of death in the United States. Cancer is a serious, complex disease which causes cells to grow uncontrollably, causing millions of deaths per year [1]. Cancer is usually caused by a combination of environmental variables and biological pathways. The pathways have a very robust…

Cancer is the second leading cause of death in the United States. Cancer is a serious, complex disease which causes cells to grow uncontrollably, causing millions of deaths per year [1]. Cancer is usually caused by a combination of environmental variables and biological pathways. The pathways have a very robust structure normally, but are altered because of cancer, resulting in a loss of connectivity between pathways. In order detect these pathways, a PageRank-based method called Pathways of Topological Rank Analysis (PoTRA) was created, which measures the relative rankings of the genes in each pathway. Applying this algorithm will allow us to figure out what pathways differed significantly in areas with cancer and areas without cancer. This would allow scientists to focus on specific pathways in order to learn more about the cancer and find more effective ways to treat it. So far, analysis using PoTRA has been successfully conducted on hepatocellular carcinoma (HCC) and its subtypes, resulting in all significant pathways found being cancer-associated. Now, using the TCGA data stored in Google Cloud's BigQuery, we created a pipeline to apply PoTRA to other cancer data sets and see how well it cross-applies to other cancers. The results show that even though some modification may need to be made to adapt to other datasets, many significant pathways were found for both HCC and breast cancer.

ContributorsMahesh, Sunny Nishant (Author) / Valentin, Dinu (Thesis director) / Liu, Li (Committee member) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)

Created2018-05

Naïve Bayes Classification for Analyzing Prostate Cancer Treatment Outcomes

Description

Prostate cancer is the second most common kind of cancer in men. Fortunately, it has a 99% survival rate. To achieve such a survival rate, a variety of aggressive therapies are used to treat prostate cancers that are caught early. Androgen deprivation therapy (ADT) is a therapy that is given…

Prostate cancer is the second most common kind of cancer in men. Fortunately, it has a 99% survival rate. To achieve such a survival rate, a variety of aggressive therapies are used to treat prostate cancers that are caught early. Androgen deprivation therapy (ADT) is a therapy that is given in cycles to patients. This study attempted to analyze what factors in a group of 79 patients caused them to stick with or discontinue the treatment. This was done using naïve Bayes classification, a machine-learning algorithm. The usage of this algorithm identified high testosterone as an indicator of a patient persevering with the treatment, but failed to produce statistically significant high rates of prediction.

ContributorsMillea, Timothy Michael (Author) / Kostelich, Eric (Thesis director) / Kuang, Yang (Committee member) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)

Created2016-12

Big Data Network Analysis of Genetic Variation and Gene Expression in Individuals with Breast Cancer

Description

The advent of big data analytics tools and frameworks has allowed for a plethora of new approaches to research and analysis, making data sets that were previously too large or complex more accessible and providing methods to collect, store, and investigate non-traditional data. These tools are starting to be applied…

The advent of big data analytics tools and frameworks has allowed for a plethora of new approaches to research and analysis, making data sets that were previously too large or complex more accessible and providing methods to collect, store, and investigate non-traditional data. These tools are starting to be applied in more creative ways, and are being used to improve upon traditional computation methods through distributed computing. Statistical analysis of expression quantitative trait loci (eQTL) data has classically been performed using the open source tool PLINK - which runs on high performance computing (HPC) systems. However, progress has been made in running the statistical analysis in the ecosystem of the big data framework Hadoop, resulting in decreased run time, reduced storage footprint, reduced job micromanagement and increased data accessibility. Now that the data can be more readily manipulated, analyzed and accessed, there are opportunities to use the modularity and power of Hadoop to further process the data. This project focuses on adding a component to the data pipeline that will perform graph analysis on the data. This will provide more insight into the relation between various genetic differences in individuals with breast cancer, and the resulting variation - if any - in gene expression. Further, the investigation will look to see if there is anything to be garnered from a perspective shift; applying tools used in classical networking contexts (such as the Internet) to genetically derived networks.

ContributorsRandall, Jacob Christopher (Author) / Buetow, Kenneth (Thesis director) / Meuth, Ryan (Committee member) / Almalih, Sara (Committee member) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)

Created2016-12

Accuracy in Spotting Misinformation about COVID-19: A Pilot Intervention and the Role of Political Affiliation

Description

In the past year, considerable misinformation about the COVID-19 pandemic has circulated on social media platforms. Faced with this pervasive issue, it is important to identify the extent to which people are able to spot misinformation on social media and ways to improve people’s accuracy in spotting misinformation. Therefore, the…

In the past year, considerable misinformation about the COVID-19 pandemic has circulated on social media platforms. Faced with this pervasive issue, it is important to identify the extent to which people are able to spot misinformation on social media and ways to improve people’s accuracy in spotting misinformation. Therefore, the current study aims to investigate people’s accuracy in spotting misinformation, the effectiveness of a game-based intervention, and the role of political affiliation in spotting misinformation. In this study, 235 participants played a misinformation game in which they evaluated COVID-19-related tweets and indicated whether or not they thought each of the tweets contained misinformation. Misinformation accuracy was measured using game scores, which were based on the correct identification of misinformation. Findings revealed that participants’ beliefs about how accurate they are at spotting misinformation about COVID-19 did not predict their actual accuracy. Participants’ accuracy improved after playing the game, but democrats were more likely to improve than republicans.

ContributorsKang, Rachael (Author) / Kwan, Virginia (Thesis director) / Corbin, William (Committee member) / Cohen, Adam (Committee member) / Bunker, Cameron (Committee member) / Department of Psychology (Contributor) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)

Created2021-05

The Making of ASU Biodesign Clinical Testing Laboratory (ABCTL): Information Technology

Description

As much as SARS-CoV-2 has altered the way humans live since the beginning of 2020, this virus's deadly nature has required clinical testing to meet 2020's demands of higher throughput, higher accuracy and higher efficiency. Information technology has allowed institutions, like Arizona State University (ASU), to make strategic and operational changes to combat the…

As much as SARS-CoV-2 has altered the way humans live since the beginning of 2020, this virus's deadly nature has required clinical testing to meet 2020's demands of higher throughput, higher accuracy and higher efficiency. Information technology has allowed institutions, like Arizona State University (ASU), to make strategic and operational changes to combat the SARS-CoV-2 pandemic. At ASU, information technology was one of the six facets identified in the ongoing review of the ASU Biodesign Clinical Testing Laboratory (ABCTL) among business, communications, management/training, law, and clinical analysis. The first chapter of this manuscript covers the background of clinical laboratory automation and details the automated laboratory workflow to perform ABCTL’s COVID-19 diagnostic testing. The second chapter discusses the usability and efficiency of key information technology systems of the ABCTL. The third chapter explains the role of quality control and data management within ABCTL’s use of information technology. The fourth chapter highlights the importance of data modeling and 10 best practices when responding to future public health emergencies.

ContributorsKandan, Mani (Co-author) / Leung, Michael (Co-author) / Woo, Sabrina (Co-author) / Knox, Garrett (Co-author) / Compton, Carolyn (Thesis director) / Dudley, Sean (Committee member) / Computer Science and Engineering Program (Contributor) / Department of Information Systems (Contributor) / Barrett, The Honors College (Contributor)

Created2021-05

Prediction of Binding Affinity of T cell Receptor and Antigens using Deep Neural Networks

Description

Immunotherapy is an effective treatment for cancer which enables the patient's immune system to recognize tumor cells as pathogens. In order to design an individualized treatment, the t cell receptors (TCR) which bind to a tumor's unique antigens need to be determined. We created a convolutional neural network to predict…

Immunotherapy is an effective treatment for cancer which enables the patient's immune system to recognize tumor cells as pathogens. In order to design an individualized treatment, the t cell receptors (TCR) which bind to a tumor's unique antigens need to be determined. We created a convolutional neural network to predict the binding affinity between a given TCR and antigen to enable this.

ContributorsCai, Michael Ray (Author) / Lee, Heewook (Thesis director) / Meuth, Ryan (Committee member) / Computer Science and Engineering Program (Contributor, Contributor) / Barrett, The Honors College (Contributor)

Created2020-12

Exploring Prompt-Based Methods for COVID-19 Misinformation Classification

Description

Increasing misinformation in social media channels has become more prevalent since the beginning of the COVID-19 pandemic as countless myths and rumors have circulated over the internet. This misinformation has potentially lethal consequences as many people make important health decisions based on what they read online, thus creating an urgent…

Increasing misinformation in social media channels has become more prevalent since the beginning of the COVID-19 pandemic as countless myths and rumors have circulated over the internet. This misinformation has potentially lethal consequences as many people make important health decisions based on what they read online, thus creating an urgent need to combat it. Although many Natural Language Processing (NLP) techniques have been used to identify misinformation in text, prompt-based methods are under-studied for this task. This work explores prompt learning to classify COVID-19 related misinformation. To this extent, I analyze the effectiveness of this proposed approach on four datasets. Experimental results show that prompt-based classification achieves on average ~13% and ~6% improvement compared to a single-task and multi-task model, respectively. Moreover, analysis shows that prompt-based models can achieve competitive results compared to baselines in a few-shot learning scenario.

ContributorsBrown, Clinton (Author) / Baral, Chitta (Thesis director) / Walker, Shawn (Committee member) / Barrett, The Honors College (Contributor) / School of International Letters and Cultures (Contributor) / Computer Science and Engineering Program (Contributor)

Created2022-05

Machine Learning Approaches to Tumor Estimation of Whole Slide Images

Description

Molecular pathology makes use of estimates of tumor content (tumor percentage) for pre-analytic and analytic purposes, such as molecular oncology testing, massive parallel sequencing, or next-generation sequencing (NGS), assessment of sample acceptability, accurate quantitation of variants, assessment of copy number changes (among other applications), determination of specimen viability for testing…

Molecular pathology makes use of estimates of tumor content (tumor percentage) for pre-analytic and analytic purposes, such as molecular oncology testing, massive parallel sequencing, or next-generation sequencing (NGS), assessment of sample acceptability, accurate quantitation of variants, assessment of copy number changes (among other applications), determination of specimen viability for testing (since many assays require a minimum tumor content to report variants at the limit of detection) may all be improved with more accurate and reproducible estimates of tumor content. Currently, tumor percentages of samples submitted for molecular testing are estimated by visual examination of Hematoxylin and Eosin (H&E) stained tissue slides under the microscope by pathologists. These estimations can be automated, expedited, and rendered more accurate by applying machine learning methods on digital whole slide images (WSI).

ContributorsCirelli, Claire (Author) / Yang, Yezhou (Thesis director) / Yalim, Jason (Committee member) / Velu, Priya (Committee member) / Barrett, The Honors College (Contributor) / Computer Science and Engineering Program (Contributor)

Created2022-05

panCanSYGNAL

Description

panCanSYGNAL is a web-application designed to allow cancer researchers to search the relationships between somatic mutations, regulators, and biclusters corresponding to many cancers using a Google-like searchable database.

ContributorsWatson, Jacob (Author) / Plaisier, Christopher (Thesis director) / Clough, Michael (Committee member) / Barrett, The Honors College (Contributor) / Computer Science and Engineering Program (Contributor)

Created2022-05

Filtering by