Matching Items (96)
Description
The technology expansion seen in the last decade for genomics research has permitted the generation of large-scale data sources pertaining to molecular biological assays, genomics, proteomics, transcriptomics and other modern omics catalogs. New methods to analyze, integrate and visualize these data types are essential to unveil relevant disease mechanisms. Towards these objectives, this research focuses on data integration within two scenarios: (1) transcriptomic, proteomic and functional information and (2) real-time sensor-based measurements motivated by single-cell technology. To assess relationships between protein abundance, transcriptomic and functional data, a nonlinear model was explored at static and temporal levels. The successful integration of these heterogeneous data sources through the stochastic gradient boosted tree approach and its improved predictability are some highlights of this work. Through the development of an innovative validation subroutine based on a permutation approach and the use of external information (i.e., operons), lack of a priori knowledge for undetected proteins was overcome. The integrative methodologies allowed for the identification of undetected proteins for Desulfovibrio vulgaris and Shewanella oneidensis for further biological exploration in laboratories towards finding functional relationships. In an effort to better understand diseases such as cancer at different developmental stages, the Microscale Life Science Center headquartered at the Arizona State University is pursuing single-cell studies by developing novel technologies. This research arranged and applied a statistical framework that tackled the following challenges: random noise, heterogeneous dynamic systems with multiple states, and understanding cell behavior within and across different Barrett's esophageal epithelial cell lines using oxygen consumption curves. 
These curves were characterized with good empirical fit using nonlinear models with simple structures which allowed extraction of a large number of features. Application of a supervised classification model to these features and the integration of experimental factors allowed for identification of subtle patterns among different cell types visualized through multidimensional scaling. Motivated by the challenges of analyzing real-time measurements, we further explored a unique two-dimensional representation of multiple time series using a wavelet approach which showcased promising results towards less complex approximations. Also, the benefits of external information were explored to improve the image representation.
Contributors: Torres Garcia, Wandaliz (Author) / Meldrum, Deirdre R. (Thesis advisor) / Runger, George C. (Thesis advisor) / Gel, Esma S. (Committee member) / Li, Jing (Committee member) / Zhang, Weiwen (Committee member) / Arizona State University (Publisher)
Created: 2011
Description
Hepatocellular carcinoma (HCC) is a malignant tumor and the seventh most common cancer in humans. Every year there is a significant rise in the number of patients suffering from HCC. Most clinical research has focused on early detection of HCC to improve patients' chances of survival. Emerging advancements in functional and structural imaging techniques have provided the ability to detect microscopic changes in the tumor microenvironment and microstructure. The prime focus of this thesis is to validate the applicability of an advanced imaging modality, Magnetic Resonance Elastography (MRE), for HCC diagnosis. The research was carried out on data from three HCC patients, and three sets of experiments were conducted. The main focus was on the quantitative aspect of MRE in conjunction with Texture Analysis, an advanced image processing pipeline, and a multivariate machine learning method for accurate HCC diagnosis. We analyzed techniques for handling unbalanced data and evaluated the efficacy of sampling techniques. In addition, we studied different machine learning algorithms and developed models using them. Performance metrics such as Prediction Accuracy, Sensitivity and Specificity were used to evaluate the final model. We identified the significant features in the dataset, and the selected classifier was robust in predicting the response class variable with high accuracy.
Contributors: Bansal, Gaurav (Author) / Wu, Teresa (Thesis advisor) / Mitchell, Ross (Thesis advisor) / Li, Jing (Committee member) / Arizona State University (Publisher)
Created: 2013
Description
The Cape Floral Region (CFR) in southwestern South Africa is one of the most diverse in the world, with >9,000 plant species, 70% of which are endemic, in an area of only ~90,000 km². Many have suggested that the CFR's heterogeneous environment, with respect to landscape gradients, vegetation, rainfall, elevation, and soil fertility, is responsible for the origin and maintenance of this biodiversity. While studies have struggled to link species diversity with these features, no study has attempted to associate patterns of gene flow with environmental data to determine how CFR biodiversity evolves on different scales. Here, a molecular population genetic dataset is presented for a widespread CFR plant, Leucadendron salignum, across 51 locations, with 5 kb of chloroplast (cpDNA) and 6 kb of unlinked nuclear (nuDNA) DNA sequences from 305 individuals. In the cpDNA dataset, significant genetic structure was found to vary on temporal and spatial scales, separating Western and Eastern Capes - the latter of which appears to be recently derived from the former - with the highest diversity in the heart of the CFR in a central region. A second study applied a statistical model using vegetation and soil composition and found that fine-scale genetic divergence is better explained by this landscape resistance model than by a geographic distance model. Finally, a third analysis contrasted the cpDNA and nuDNA datasets and revealed very little geographic structure in the latter, suggesting that seed and pollen dispersal can have different evolutionary genetic histories of gene flow even on small CFR scales. These three studies together caution that different genomic markers need to be considered when modeling the geographic and temporal origin of CFR groups. From a broader perspective, the results here are consistent with the hypothesis that landscape heterogeneity is one driving influence in limiting gene flow across the CFR, which can lead to species diversity on fine scales.
Nonetheless, while this pattern may be true of the widespread L. salignum, the extension of this approach is now warranted for other CFR species with varying ranges and dispersal mechanisms to determine how universal these patterns of landscape genetic diversity are.
Contributors: Tassone, Erica (Author) / Verrelli, Brian C (Thesis advisor) / Dowling, Thomas (Committee member) / Cartwright, Reed (Committee member) / Rosenberg, Michael S. (Committee member) / Wojciechowski, Martin (Committee member) / Arizona State University (Publisher)
Created: 2013
Description
A P-value based method is proposed for statistical monitoring of various types of profiles in phase II. The performance of the proposed method is evaluated by the average run length criterion under various shifts in the intercept, slope and error standard deviation of the model. In our proposed approach, P-values are computed at each level within a sample. If at least one of the P-values is less than a pre-specified significance level, the chart signals out-of-control. The primary advantage of our approach is that only one control chart is required to monitor several parameters simultaneously: the intercept, slope(s), and the error standard deviation. A comprehensive comparison of the proposed method and the existing KMW-Shewhart method for monitoring linear profiles is conducted. In addition, the effect that the number of observations within a sample has on the performance of the proposed method is investigated. The proposed method was also compared to the T^2 method discussed in Kang and Albin (2000) for multivariate, polynomial, and nonlinear profiles. A simulation study shows that overall the proposed P-value method performs satisfactorily for different profile types.
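The signaling rule described above (compute a P-value at each level within a sample and signal if any falls below the significance level) can be illustrated with a minimal sketch for a simple linear profile. This is not the thesis's implementation: the function names, the assumption of known in-control parameters, the normal error model, and the two-sided residual test are all assumptions made here for illustration only.

```python
import math


def normal_two_sided_p(z):
    """Two-sided P-value for a standard normal statistic z."""
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)).
    return math.erfc(abs(z) / math.sqrt(2.0))


def profile_signals(xs, ys, intercept, slope, sigma, alpha):
    """Return (signal, p_values) for one sampled profile.

    Hypothetical setup: the in-control model y = intercept + slope * x
    with normal errors of known standard deviation sigma is assumed.
    A P-value is computed for the standardized residual at each level x;
    the chart signals if at least one P-value is below alpha.
    """
    p_values = [
        normal_two_sided_p((y - (intercept + slope * x)) / sigma)
        for x, y in zip(xs, ys)
    ]
    return min(p_values) < alpha, p_values


if __name__ == "__main__":
    levels = [1.0, 2.0, 3.0, 4.0]
    observed = [5.2, 6.9, 9.1, 10.8]
    signal, p_values = profile_signals(levels, observed, 3.0, 2.0, 1.0, 0.005)
    print(signal, p_values)
```

Because every level is tested within the same sample, the per-level significance level would normally be chosen with the desired in-control average run length in mind; the value 0.005 above is only a placeholder.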
Contributors: Adibi, Azadeh (Author) / Montgomery, Douglas C. (Thesis advisor) / Borror, Connie (Thesis advisor) / Li, Jing (Committee member) / Zhang, Muhong (Committee member) / Arizona State University (Publisher)
Created: 2013
Description
Salmonella enterica is a gastrointestinal (GI) pathogen that can cause systemic diseases. It invades the host through the GI tract and can induce powerful immune responses in addition to disease. Thus, it is considered a promising candidate for use as an oral live vaccine vector. Scientists have long worked to obtain a properly attenuated Salmonella vaccine strain but have not achieved a balance between attenuation and immunogenicity. The regulated delayed attenuation/lysis Salmonella vaccine vectors were therefore proposed as a design to seek this balance. The research work is progressing steadily, but more improvements need to be made. As one of the possible improvements, the cyclic adenosine monophosphate (cAMP)-independent cAMP receptor protein (Crp*) is expected to protect the Crp-dependent crucial regulator, araC PBAD, in these vaccine designs from interference by glucose, which decreases synthesis of cAMP, and to enhance the colonizing ability and immunogenicity of the vaccine strains. In this study, the cAMP-independent crp gene mutation, crp-70, with or without the araC PBAD promoter cassette, was introduced into existing Salmonella vaccine strains. The plasmid stability, growth rate, resistance to catabolite repression, colonizing ability, immunogenicity and protection against challenge of these new strains were then compared with those of wild-type crp or araC PBAD crp strains using western blots, enzyme-linked immunosorbent assays (ELISA) and animal studies, so as to evaluate the effects of the crp-70 mutation on the vaccine strains. The performance of the crp-70 strains in some aspects was close to or even exceeded that of the crp+ strains, but generally they did not exhibit the expected advantages compared to their wild-type parents. Crp-70 rescued the expression of araC PBAD fur from catabolite repression.
The strain harboring araC PBAD crp-70 was severely affected by its slow growth, and its colonizing ability and immunogenicity were much weaker than those of the other strains. The Pcrp crp-70 strain showed relatively good ability in colonization and immune stimulation. Both the araC PBAD crp-70 and the Pcrp crp-70 strains could provide certain levels of protection against challenge with virulent pneumococci, although these levels were a little lower than for the crp+ strains.
Contributors: Shao, Shihuan (Author) / Curtiss, Roy (Thesis advisor) / Arizona State University (Publisher)
Created: 2012
Description
Rapid advances in sensor and information technology have resulted in environments that are data-rich both spatially and temporally, which creates a pressing need to develop novel statistical methods and the associated computational tools to extract intelligent knowledge and informative patterns from these massive datasets. The statistical challenges in addressing these massive datasets lie in their complex structures, such as high dimensionality, hierarchy, multi-modality, heterogeneity and data uncertainty. Besides the statistical challenges, the associated computational approaches are also considered essential for achieving efficiency, effectiveness, and numerical stability in practice. On the other hand, some recent developments in statistics and machine learning, such as sparse learning and transfer learning, as well as some traditional methodologies that still hold potential, such as multi-level models, all shed light on addressing these complex datasets in a statistically powerful and computationally efficient way. In this dissertation, we identify four general kinds of complex datasets, namely "high-dimensional datasets", "hierarchically-structured datasets", "multimodality datasets" and "data uncertainties", which are ubiquitous in many domains, such as biology, medicine, neuroscience, health care delivery, and manufacturing. We describe the development of novel statistical models to analyze complex datasets that fall under these four categories, and we show how these models can be applied to real-world applications such as Alzheimer's disease research, the nursing care process, and manufacturing.
Contributors: Huang, Shuai (Author) / Li, Jing (Thesis advisor) / Askin, Ronald (Committee member) / Ye, Jieping (Committee member) / Runger, George C. (Committee member) / Arizona State University (Publisher)
Created: 2012
Description
Triops (Branchiopoda: Notostraca) and Streptocephalus (Branchiopoda: Anostraca) are two crustaceans that cohabit ephemeral freshwater pools. Both lay desiccation-resistant eggs that disperse passively to new, hydrologically isolated environments. The extent of genetic distance among regions and populations is of perennial interest in animals that live in such isolated habitats. Populations in six natural ephemeral pool habitats located in two different regions of the Sonoran Desert and a transition area between the Sonoran and Chihuahuan Deserts were sampled. Sequences from GenBank were used as reference points in the determination of species as well as to further identify regional genetic distance within species. This study estimated the genetic distance within and between individuals from each region and population using a neutral marker, cytochrome oxidase I (COI). We concluded that, although the method of passive dispersal may differ between the two genera, the differences do not result in different patterns of genetic distance between regions and populations. Furthermore, only the putative species Triops longicaudatus "short" showed enough divergence to be considered distinct. Although Triops longicaudatus "long" and Triops newberryi may be in the early stages of speciation, this study does not find enough support to conclude that they have separated.
Contributors: Murphy Jr., Patrick Joseph (Author) / Rutowski, Ronald (Thesis director) / Cartwright, Reed (Committee member) / Lessios, Nikos (Committee member) / School of Life Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / Barrett, The Honors College (Contributor)
Created: 2016-05
Description
The Department of Defense (DoD) acquisition system is a complex system riddled with cost and schedule overruns. These overruns are very serious issues, as the acquisition system is responsible for aiding U.S. warfighters. Hence, a failing acquisition process could be a potential threat to our nation's security. Furthermore, the DoD acquisition system is responsible for the proper allocation of billions of taxpayer dollars and employs many civilian and military personnel. Much research has been done in the past on the acquisition system, with little impact or success. One reason for this lack of success in improving the system is the lack of accurate models to test theories. This research is a continuation of the effort on the Enterprise Requirements and Acquisition Model (ERAM), a discrete event simulation model of the DoD acquisition system. We propose to extend ERAM using agent-based simulation principles because of the many interactions among the subsystems of the acquisition system. We initially identify ten sub models needed to simulate the acquisition system. This research focuses on three sub models related to the budget of acquisition programs. In this thesis, we present the data collection, data analysis, initial implementation, and initial validation needed to facilitate these sub models and lay the groundwork for a full agent-based simulation of the DoD acquisition system.
Contributors: Bucknell, Sophia Robin (Author) / Wu, Teresa (Thesis director) / Li, Jing (Committee member) / Colombi, John (Committee member) / Industrial, Systems (Contributor) / Barrett, The Honors College (Contributor)
Created: 2016-05
Description
The evolution of blindness in cave animals has been heavily studied; however, little research has been done on the interaction of migration and drift in the development of blindness in these populations. In this study, a model is used to compare the effect that genetic drift has on the fixation of a blindness allele for varying amounts of migration and selection. For populations where the initial frequency is quite low, genetic drift plays a much larger role in the fixation of blindness than in populations where the initial frequency is high. In populations where the initial frequency is high, genetic drift plays almost no role in fixation. Our results suggest that migration plays a greater role in the fate of the blindness allele than selection.
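The abstract does not specify the model used, so the following is only a hedged sketch of how drift, migration and selection on a single allele could be simulated. It assumes a haploid Wright-Fisher population of fixed size, a relative fitness of 1 + s for the blindness allele, and a fraction m of each generation's gene pool replaced by migrants carrying the allele at frequency p_migrant. All function and parameter names are hypothetical.

```python
import random


def next_frequency(p, n, s, m, p_migrant, rng):
    """Advance the blindness-allele frequency by one generation."""
    # Selection: the allele has relative fitness 1 + s (requires s > -1).
    p_sel = p * (1.0 + s) / (p * (1.0 + s) + (1.0 - p))
    # Migration: a fraction m of the gene pool comes from a source
    # population where the allele has frequency p_migrant.
    p_mig = (1.0 - m) * p_sel + m * p_migrant
    # Drift: binomial sampling of n gene copies for the next generation.
    count = sum(1 for _ in range(n) if rng.random() < p_mig)
    return count / n


def fixation_fraction(p0, n, s, m, p_migrant, generations, replicates, seed=0):
    """Fraction of replicates in which the allele is at frequency 1.0
    after the given number of generations."""
    rng = random.Random(seed)
    fixed = 0
    for _ in range(replicates):
        p = p0
        for _ in range(generations):
            p = next_frequency(p, n, s, m, p_migrant, rng)
        if p == 1.0:
            fixed += 1
    return fixed / replicates


if __name__ == "__main__":
    # Compare a low and a high initial frequency under the same settings.
    for p0 in (0.05, 0.8):
        print(p0, fixation_fraction(p0, 100, 0.02, 0.01, 0.0, 500, 200))
```

With migration present, a frequency of 1.0 is not an absorbing state, so the function reports the state at the end of the run and not a true fixation probability.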
Contributors: Merry, Alexandra Leigh (Author) / Cartwright, Reed (Thesis director) / Rosenberg, Michael (Committee member) / Schwartz, Rachel (Committee member) / Barrett, The Honors College (Contributor) / School of Life Sciences (Contributor)
Created: 2014-05
Description
Thirty-six percent of Americans are obese and thirty-three percent are overweight; obesity has become a known killer in the U.S., yet its prevalence has maintained a firm grasp on the U.S. population and continues to spread across the globe as other countries slowly adopt the American lifestyle. A survey was compiled collecting demographic and body mass index (BMI) information, as well as Tanofsky-Kraff’s (2009) “Assess Eating in the Absence of Hunger” survey questions. The survey used for this study was emailed to Arizona State University students through the Barrett, The Honors College, and ASU School of Nutrition and Health Promotion listservs. A total of 457 participants completed the survey, 72 males and 385 females (mean age, 24.5 ± 7.7 y; average BMI, 23.4 ± 4.8 [a BMI of 25-29.9 is classified as overweight]). When comparing BMI with living situation, 71% of obese students were living at home with family versus off campus with friends or alone. For comparison, 45% of normal-weight students lived at home with family. These data could help structure prevention plans targeting college students by focusing on weight gain prevention at the family level. Results from the Tanofsky-Kraff (2009) survey revealed no significant relationship between external or physical cues and BMI in men or women, but there was a significant positive correlation between emotional cues and BMI in women only. Anger and sadness were the emotional cues in women related to initiating consumption past satiation and consumption following several hours of fasting. Although BMI was inversely related to physical activity in this sample (r = -0.132; p=0.005), controlling for physical activity did not impact the significant associations of BMI with anger or sadness (P>0.05). This information is important in targeting prevention programs to address behavioral change and cognitive awareness of the effects of emotion on over-consumption.
Contributors: Garza, Andrea Marie (Author) / Johnston, Carol (Thesis director) / Jacobs, Mark (Committee member) / Coletta, Dawn (Committee member) / Barrett, The Honors College (Contributor) / Department of Psychology (Contributor) / School of Life Sciences (Contributor)
Created: 2013-05