Matching Items (6)
Description
Cancer is the second leading cause of death in the United States. It is a serious, complex disease in which cells grow uncontrollably, causing millions of deaths per year [1]. Cancer is usually caused by a combination of environmental variables and biological pathways. These pathways normally have a very robust structure, but cancer alters them, resulting in a loss of connectivity between pathways. To detect the altered pathways, a PageRank-based method called Pathways of Topological Rank Analysis (PoTRA) was created, which measures the relative rankings of the genes in each pathway. Applying this algorithm shows which pathways differ significantly between areas with cancer and areas without cancer. This would allow scientists to focus on specific pathways in order to learn more about the cancer and find more effective ways to treat it. So far, analysis using PoTRA has been successfully conducted on hepatocellular carcinoma (HCC) and its subtypes, and all of the significant pathways found were cancer-associated. Using the TCGA data stored in Google Cloud's BigQuery, we created a pipeline to apply PoTRA to other cancer data sets and see how well it carries over to other cancers. The results show that, although some modifications may be needed to adapt to other datasets, many significant pathways were found for both HCC and breast cancer.
Contributors: Mahesh, Sunny Nishant (Author) / Valentin, Dinu (Thesis director) / Liu, Li (Committee member) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)
Created: 2018-05
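
As a rough illustration of the PageRank-style gene ranking mentioned in the PoTRA abstract above, the following is a minimal sketch of power-iteration PageRank on a small directed graph. It is not the PoTRA implementation: the gene names, the edges, the damping factor of 0.85, and the idea of comparing a "normal" and a "tumor" network side by side are all assumptions made for illustration.

```python
def pagerank(edges, damping=0.85, tol=1e-10, max_iter=200):
    """Power-iteration PageRank on a directed graph given as (source, target) pairs."""
    nodes = sorted({node for edge in edges for node in edge})
    out_links = {node: [] for node in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(max_iter):
        # Rank held by nodes without outgoing edges is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n for node in nodes
        }
        # Each node passes an equal share of its rank along every outgoing edge.
        for src in nodes:
            for dst in out_links[src]:
                new_rank[dst] += damping * rank[src] / len(out_links[src])
        converged = sum(abs(new_rank[v] - rank[v]) for v in nodes) < tol
        rank = new_rank
        if converged:
            break
    return rank


# Hypothetical toy pathway: gene names and edges are invented for illustration.
normal_edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"), ("D", "A")]
tumor_edges = [("A", "B"), ("C", "D"), ("D", "A")]  # same genes, two edges lost

normal_rank = pagerank(normal_edges)
tumor_rank = pagerank(tumor_edges)

for gene in sorted(normal_rank):
    print(gene, round(normal_rank[gene], 3), round(tumor_rank[gene], 3))
```

Comparing the two rank columns shows how losing edges shifts the relative ranking of genes in a pathway. How PoTRA turns such differences into a significance call is not covered by this sketch.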
Description
Prostate cancer is the second most common kind of cancer in men. Fortunately, it has a 99% survival rate. To achieve such a survival rate, a variety of aggressive therapies are used to treat prostate cancers that are caught early. Androgen deprivation therapy (ADT) is a therapy that is given to patients in cycles. This study attempted to analyze which factors in a group of 79 patients led them to continue or discontinue the treatment. This was done using naïve Bayes classification, a machine-learning algorithm. The algorithm identified high testosterone as an indicator that a patient would persevere with the treatment, but it did not achieve statistically significant prediction rates.
Contributors: Millea, Timothy Michael (Author) / Kostelich, Eric (Thesis director) / Kuang, Yang (Committee member) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)
Created: 2016-12
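
To make the naïve Bayes classification named in the abstract above more concrete, here is a minimal sketch of a categorical naïve Bayes classifier with Laplace smoothing. It is not the study's code or data: the two features (testosterone level and age group), the six toy records, and the function names `train_naive_bayes` and `predict` are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(rows, labels):
    """Count class frequencies and per-class feature-value frequencies."""
    class_counts = Counter(labels)
    # feature_counts[label][feature_index][value] -> number of occurrences
    feature_counts = defaultdict(lambda: defaultdict(Counter))
    values = defaultdict(set)  # distinct values seen for each feature
    for row, label in zip(rows, labels):
        for index, value in enumerate(row):
            feature_counts[label][index][value] += 1
            values[index].add(value)
    return class_counts, feature_counts, values


def predict(model, row):
    """Return the class with the highest log-posterior under the naive assumption."""
    class_counts, feature_counts, values = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in class_counts.items():
        score = math.log(count / total)  # log prior
        for index, value in enumerate(row):
            # Laplace (add-one) smoothing avoids zero probabilities.
            numerator = feature_counts[label][index][value] + 1
            denominator = count + len(values[index])
            score += math.log(numerator / denominator)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented toy records: (testosterone level, age group). Not real patient data.
rows = [
    ("high", "young"), ("high", "old"), ("low", "old"),
    ("low", "young"), ("high", "old"), ("low", "old"),
]
labels = ["continue", "continue", "discontinue",
          "discontinue", "continue", "discontinue"]

model = train_naive_bayes(rows, labels)
prediction = predict(model, ("high", "young"))
print(prediction)
```

The classifier treats each feature as independent given the class, which is the "naïve" assumption. With a cohort as small as the 79 patients described, predictions from such a model should be read with caution.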
Description
The advent of big data analytics tools and frameworks has allowed for a plethora of new approaches to research and analysis, making data sets that were previously too large or complex more accessible and providing methods to collect, store, and investigate non-traditional data. These tools are starting to be applied in more creative ways, and are being used to improve upon traditional computation methods through distributed computing. Statistical analysis of expression quantitative trait loci (eQTL) data has classically been performed using the open-source tool PLINK, which runs on high-performance computing (HPC) systems. However, progress has been made in running the statistical analysis in the ecosystem of the big data framework Hadoop, resulting in decreased run time, a reduced storage footprint, less job micromanagement, and increased data accessibility. Now that the data can be more readily manipulated, analyzed, and accessed, there are opportunities to use the modularity and power of Hadoop to further process the data. This project focuses on adding a component to the data pipeline that will perform graph analysis on the data. This will provide more insight into the relation between various genetic differences in individuals with breast cancer and the resulting variation, if any, in gene expression. Further, the investigation will examine whether anything can be gained from a shift in perspective: applying tools used in classical networking contexts (such as the Internet) to genetically derived networks.
Contributors: Randall, Jacob Christopher (Author) / Buetow, Kenneth (Thesis director) / Meuth, Ryan (Committee member) / Almalih, Sara (Committee member) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)
Created: 2016-12
Description
Immunotherapy is an effective treatment for cancer which enables the patient's immune system to recognize tumor cells as pathogens. In order to design an individualized treatment, the T cell receptors (TCRs) which bind to a tumor's unique antigens need to be determined. To enable this, we created a convolutional neural network to predict the binding affinity between a given TCR and antigen.
Contributors: Cai, Michael Ray (Author) / Lee, Heewook (Thesis director) / Meuth, Ryan (Committee member) / Computer Science and Engineering Program (Contributor) / Barrett, The Honors College (Contributor)
Created: 2020-12
Description

Molecular pathology makes use of estimates of tumor content (tumor percentage) for pre-analytic and analytic purposes, such as molecular oncology testing and massively parallel sequencing, or next-generation sequencing (NGS). Assessment of sample acceptability, accurate quantitation of variants, assessment of copy number changes (among other applications), and determination of specimen viability for testing (since many assays require a minimum tumor content to report variants at the limit of detection) may all be improved with more accurate and reproducible estimates of tumor content. Currently, tumor percentages of samples submitted for molecular testing are estimated by pathologists through visual examination of hematoxylin and eosin (H&E) stained tissue slides under the microscope. These estimations can be automated, expedited, and rendered more accurate by applying machine learning methods to digital whole slide images (WSI).

Contributors: Cirelli, Claire (Author) / Yang, Yezhou (Thesis director) / Yalim, Jason (Committee member) / Velu, Priya (Committee member) / Barrett, The Honors College (Contributor) / Computer Science and Engineering Program (Contributor)
Created: 2022-05
Description

panCanSYGNAL is a web application that lets cancer researchers search the relationships between somatic mutations, regulators, and biclusters for many cancers through a Google-like searchable database.

Contributors: Watson, Jacob (Author) / Plaisier, Christopher (Thesis director) / Clough, Michael (Committee member) / Barrett, The Honors College (Contributor) / Computer Science and Engineering Program (Contributor)
Created: 2022-05