
BioEve: user interface framework bridging IE and IR

Description

Continuous advancements in biomedical research have resulted in the production of vast amounts of scientific data and literature discussing them. The ultimate goal of computational biology is to translate these large amounts of data into actual knowledge of the complex biological processes and accurate life science models. The ability to rapidly and effectively survey the literature is necessary for the creation of large scale models of the relationships among biomedical entities as well as hypothesis generation to guide biomedical research. To reduce the effort and time spent in performing these activities, an intelligent search system is required. Even though many systems aid in navigating through this wide collection of documents, the vastness and depth of this information overload can be overwhelming. An automated extraction system coupled with a cognitive search and navigation service over these document collections would not only save time and effort, but also facilitate discovery of the unknown information implicitly conveyed in the texts. This thesis presents the different approaches used for large scale biomedical named entity recognition, and the challenges faced in each. It also proposes BioEve: an integrative framework to fuse a faceted search with information extraction to provide a search service that addresses the user's desire for "completeness" of the query results, not just the top-ranked ones. This information extraction system enables discovery of important semantic relationships between entities such as genes, diseases, drugs, and cell lines and events from biomedical text on MEDLINE, which is the largest publicly available database of the world's biomedical journal literature. It is an innovative search and discovery service that makes it easier to search, navigate, and discover knowledge hidden in life sciences literature. To demonstrate the utility of this system, this thesis also details a prototype enterprise quality search and discovery service that helps researchers with a guided step-by-step query refinement, by suggesting concepts enriched in intermediate results, and thereby facilitating the "discover more as you search" paradigm.

Contributors: Kanwar, Pradeep (Author) / Davulcu, Hasan (Thesis advisor) / Dinu, Valentin (Committee member) / Li, Baoxin (Committee member) / Arizona State University (Publisher)

Created: 2010

Genetic Variants in GC, CYP2R1, and VDR Genes and Associations of Serum 25-Hydroxyvitamin D Concentrations in a Population of Hispanic and Non-Hispanic Adults Residing in San Diego County, California

Description

Vitamin D is a nutrient that is obtained through the diet and vitamin D supplementation and created from exposure to Ultraviolet B (UVB) radiation. While there are many factors that determine how much serum 25-hydroxyvitamin D (25(OH)D) concentration is in the body, little is known about how genetic variation in vitamin D-related genes influences serum 25(OH)D concentrations resulting from daily vitamin D intake and exposure to direct sunlight. Previous studies show that common genetic variants rs10741657 (CYP2R1), rs4588 (GC), rs228678 (GC), and rs4516035 (VDR) act as moderators and alter the effect of outdoor time and vitamin D intake on serum 25(OH)D concentrations. The objective of this study is to analyze the associations between serum 25(OH)D concentrations resulting from outdoor time and vitamin D intake, and genetic risk scores (GRS) established from previous studies involving single nucleotide polymorphisms (SNPs) located on or near genes involving vitamin D synthesis, transport, activation, and degradation in 102 Hispanic and Non-Hispanic adults in San Diego County, California. This study is a secondary analysis of data from the Community of Mine study. Global Positioning System (GPS) data collected by the Qstarz GPS device worn by each participant were used to measure outdoor time, a proxy measurement for sun exposure time. Vitamin D intake was assessed using two 24-hour dietary recalls. Blood samples were measured for serum 25(OH)D concentrations. DNA was provided to assess each participant for the various genetic variants. Adjusted analyses of the GRS and serum 25(OH)D concentrations showed that individuals with high GRS (3-4) had lower serum 25(OH)D concentrations than individuals with low GRS (0-2) for both Nissen GRS and Rivera-Paredez GRS.

Contributors: Anderson, Heather Ray (Author) / Sears, Dorothy (Thesis advisor) / Alexon, Christy (Committee member) / Dinu, Valentin (Committee member) / Jankowska, Marta (Committee member) / Arizona State University (Publisher)

Created: 2022

Engineering mutation-tolerant genes

Description

Ideas from coding theory are employed to theoretically demonstrate the engineering of mutation-tolerant genes, genes that can sustain up to some arbitrarily chosen number of mutations and still express the originally intended protein. Attention is restricted to tolerating substitution mutations. Future advances in genomic engineering will make possible the ability to synthesize entire genomes from scratch. This presents an opportunity to embed desirable capabilities like mutation-tolerance, which will be useful in preventing cell deaths in organisms intended for research or industrial applications in highly mutagenic environments. In the extreme case, mutation-tolerant genes (mutols) can make organisms resistant to retroviral infections.

An algebraic representation of the nucleotide bases is developed. This algebraic representation makes it possible to convert nucleotide sequences into algebraic sequences, apply mathematical ideas, and convert results back into nucleotide terms. Using the algebra developed, a mapping is found from the naturally-occurring codons to an alternative set of codons which makes genes constructed from them mutation-tolerant, provided no more than one substitution mutation occurs per codon. The ideas discussed naturally extend to finding codons that can tolerate an arbitrarily chosen number t of mutations per codon. Finally, random substitution events are simulated in both a wild-type green fluorescent protein (GFP) gene and its mutol variant, and the amino acid sequence expressed from each post-mutation is compared with the amino acid sequence pre-mutation.

This work assumes the existence of synthetic protein-assembling entities that function like tRNAs but can read k nucleotides at a time, with k greater than or equal to 5. The realization of this assumption is presented as a challenge to the research community.
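The abstract does not give the actual codon mapping, so the following is only a minimal sketch of one standard coding-theory construction that fits the description, not the author's method. It assumes the four bases are labelled as the elements of the finite field GF(4) (the labelling A, C, G, T as 0, 1, 2, 3 is an arbitrary choice made here) and extends each natural 3-base codon with two parity bases, which yields the [5, 3] Hamming code over GF(4). Any two of its 64 codewords differ in at least three positions, so a single substitution per 5-base codon can be undone by nearest-codeword decoding. The names `encode`, `decode`, and `CODEBOOK` are hypothetical.

```python
from itertools import product

BASES = "ACGT"  # assumed labelling of GF(4) elements 0, 1, 2, 3

# Multiplication table of GF(4); addition in GF(4) is bitwise XOR.
GF4_MUL = [
    [0, 0, 0, 0],
    [0, 1, 2, 3],
    [0, 2, 3, 1],
    [0, 3, 1, 2],
]


def encode(codon):
    """Extend a 3-base codon with two parity bases (illustrative scheme)."""
    m1, m2, m3 = (BASES.index(b) for b in codon)
    p1 = m1 ^ m2 ^ m3                              # m1 + m2 + m3 in GF(4)
    p2 = m1 ^ GF4_MUL[2][m2] ^ GF4_MUL[3][m3]      # m1 + 2*m2 + 3*m3 in GF(4)
    return codon + BASES[p1] + BASES[p2]


# Map every 5-base codeword back to the natural codon it stands for.
CODEBOOK = {encode("".join(c)): "".join(c) for c in product(BASES, repeat=3)}


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def decode(word):
    """Return the natural codon of the codeword nearest to a 5-base word."""
    nearest = min(CODEBOOK, key=lambda cw: hamming(cw, word))
    return CODEBOOK[nearest]


if __name__ == "__main__":
    original = encode("ATG")
    mutated = "G" + original[1:]  # one substitution in the first position
    print(original, mutated, decode(mutated))
```

Under these assumptions, the shortest codeword length that protects all 64 codons against one substitution is five bases, which is consistent with the stated requirement that the reading entities handle k of at least 5 nucleotides at a time.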

Contributors: Ampofo, Prince Kwame (Author) / Tian, Xiaojun (Thesis advisor) / Kiani, Samira (Committee member) / Kuang, Yang (Committee member) / Arizona State University (Publisher)

Created: 2019

Mathematical Simulation of Glioblastoma Multiform Under Treatment

Description

The analysis focuses on a two-population, three-dimensional model that attempts to accurately model the growth and diffusion of glioblastoma multiforme (GBM), a highly invasive brain cancer, throughout the brain. An analysis of the sensitivity of the model to changes in the diffusion, growth, and death parameters was performed in order to find a set of parameter values that accurately model observed tumor growth for a given patient. Additional changes were made to the diffusion parameters to account for the arrangement of nerve tracts in the brain, resulting in varying rates of diffusion. In general, small changes in the growth rates had a large impact on the outcome of the simulations, and for each patient there exists a set of parameters that allow the model to simulate a tumor that matches observed tumor growth in the patient over a period of two or three months. Furthermore, these results are more accurate with anisotropic diffusion than with isotropic diffusion. However, these parameters lead to inaccurate results for patients with tumors that undergo no observable growth over the given time interval. While it is possible to simulate long-term tumor growth, the simulation requires multiple comparisons to available MRI scans in order to find a set of parameters that provide an accurate prognosis.

Contributors: Trent, Austin Lee (Author) / Kostelich, Eric (Thesis advisor) / Gumel, Abba (Committee member) / Kuang, Yang (Committee member) / Arizona State University (Publisher)

Created: 2020

Drug Modeling Dynamics in the Treatment of Prostate Cancer

Description

Efforts to treat prostate cancer have seen an uptick, as the world’s most common cancer in men continues to have increasing global incidence. Clinically, metastatic prostate cancer is most commonly treated with hormonal therapy. The idea behind hormonal therapy is to reduce androgen production, which prostate cancer cells require for growth. Recently, the exploration of the synergistic effects of the drugs used in hormonal therapy has begun. The aim was to build off of these recent advancements and further refine the synergistic drug model. The advancements I implement come by addressing biological shortcomings and improving the model’s internal mechanistic structure. The drug families being modeled, anti-androgens and gonadotropin-releasing hormone analogs, interact with androgen production in a way that is not completely understood in the scientific community. Thus, the models representing the drugs show progress through their ability to capture their effect on serum androgen. Prostate-specific antigen is the primary biomarker for prostate cancer and is generally how population models on the subject are validated. Fitting the model to clinical data and comparing it to other clinical models through the ability to fit and forecast prostate-specific antigen and serum androgen is how this improved model achieves validation. The improved model results further suggest that the drugs’ dynamics should be considered in adaptive therapy for prostate cancer.

Contributors: Reckell, Trevor (Author) / Kostelich, Eric (Thesis advisor) / Kuang, Yang (Committee member) / Mahalov, Alex (Committee member) / Arizona State University (Publisher)

Created: 2020
