The title “Regents’ Professor” is the highest faculty honor awarded at Arizona State University. It is conferred on ASU faculty who have made pioneering contributions in their areas of expertise, who have achieved a sustained level of distinction, and who enjoy national and international recognition for these accomplishments. This collection contains primarily open access works by ASU Regents' Professors.

Displaying 1 - 9 of 9
Filtering by

Clear all filters

Description

Background:
Many pharmaceutical drugs are known to be ineffective or have negative side effects in a substantial proportion of patients. Genomic advances are revealing that some non-synonymous single nucleotide variants (nsSNVs) may cause differences in drug efficacy and side effects. Therefore, it is desirable to evaluate nsSNVs of interest in their

Background:
Many pharmaceutical drugs are known to be ineffective or have negative side effects in a substantial proportion of patients. Genomic advances are revealing that some non-synonymous single nucleotide variants (nsSNVs) may cause differences in drug efficacy and side effects. Therefore, it is desirable to evaluate nsSNVs of interest in their ability to modulate the drug response.

Results:
We found that the available data on the link between drug response and nsSNV is rather modest. There were only 31 distinct drug response-altering (DR-altering) and 43 distinct drug response-neutral (DR-neutral) nsSNVs in the whole Pharmacogenomics Knowledge Base (PharmGKB). However, even with this modest dataset, it was clear that existing bioinformatics tools have difficulties in correctly predicting the known DR-altering and DR-neutral nsSNVs. They exhibited an overall accuracy of less than 50%, which was not better than random diagnosis. We found that the underlying problem is the markedly different evolutionary properties between positions harboring nsSNVs linked to drug responses and those observed for inherited diseases. To solve this problem, we developed a new diagnosis method, Drug-EvoD, which was trained on the evolutionary properties of nsSNVs associated with drug responses in a sparse learning framework. Drug-EvoD achieves a TPR of 84% and a TNR of 53%, with a balanced accuracy of 69%, which improves upon other methods significantly.

Conclusions:
The new tool will enable researchers to computationally identify nsSNVs that may affect drug responses. However, much larger training and testing datasets are needed to develop more reliable and accurate tools.

ContributorsGerek, Nevin Z. (Author) / Liu, Li (Author) / Gerold, Kristyn (Author) / Biparva, Pegah (Author) / Thomas, Eric D. (Author) / Kumar, Sudhir (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor)
Created2015-01-15
130347-Thumbnail Image.png
Description
The evolution of resistance in Staphylococcus aureus occurs rapidly, and in response to all known antimicrobial treatments. Numerous studies of model species describe compensatory roles of mutations in mediating competitive fitness, and there is growing evidence that these mutation types also drive adaptation of S. aureus strains. However, few studies

The evolution of resistance in Staphylococcus aureus occurs rapidly, and in response to all known antimicrobial treatments. Numerous studies of model species describe compensatory roles of mutations in mediating competitive fitness, and there is growing evidence that these mutation types also drive adaptation of S. aureus strains. However, few studies have tracked amino acid changes during the complete evolutionary trajectory of antibiotic adaptation or been able to predict their functional relevance. Here, we have assessed the efficacy of computational methods to predict biological resistance of a collection of clinically known Resistance Associated Mutations (RAMs). We have found that >90% of known RAMs are incorrectly predicted to be functionally neutral by at least one of the prediction methods used. By tracing the evolutionary histories of all of the false negative RAMs, we have discovered that a significant number are reversion mutations to ancestral alleles also carried in the MSSA476 methicillin-sensitive isolate. These genetic reversions are most prevalent in strains following daptomycin treatment and show a tendency to accumulate in biological pathway reactions that are distinct from those accumulating non-reversion mutations. Our studies therefore show that in addition to non-reversion mutations, reversion mutations arise in isolates exposed to new antibiotic treatments. It is possible that acquisition of reversion mutations in the genome may prevent substantial fitness costs during the progression of resistance. Our findings pose an interesting question to be addressed by further clinical studies regarding whether or not these reversion mutations lead to a renewed vulnerability of a vancomycin or daptomycin resistant strain to antibiotics administered at an earlier stage of infection.
ContributorsChampion, Mia (Author) / Gray, Vanessa (Author) / Eberhard, Carl (Author) / Kumar, Sudhir (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor)
Created2013-02-12
130367-Thumbnail Image.png
Description
Background
Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS,

Background
Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS, avoids the time consuming steps of de novo whole genome assembly, multiple genome alignment, and annotation.
Results
For simulations SISRS is able to identify large numbers of loci containing variable sites with phylogenetic signal. For genomic data from apes, SISRS identified thousands of variable sites, from which we produced an accurate phylogeny. Finally, we used SISRS to identify phylogenetic markers that we used to estimate the phylogeny of placental mammals. We recovered eight phylogenies that resolved the basal relationships among mammals using datasets with different levels of missing data. The three alternate resolutions of the basal relationships are consistent with the major hypotheses for the relationships among mammals, all of which have been supported previously by different molecular datasets.
Conclusions
SISRS has the potential to transform phylogenetic research. This method eliminates the need for expensive marker development in many studies by using whole genome shotgun sequence data directly. SISRS is open source and freely available at https://github.com/rachelss/SISRS/releases.
ContributorsSchwartz, Rachel (Author) / Harkins, Kelly (Author) / Stone, Anne (Author) / Cartwright, Reed (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / School of Life Sciences (Contributor)
Created2015-06-11
130365-Thumbnail Image.png
Description
Background
“Stoichioproteomics” relates the elemental composition of proteins and proteomes to variation in the physiological and ecological environment. To help harness and explore the wealth of hypotheses made possible under this framework, we introduce GRASP (http://www.graspdb.net), a public bioinformatic knowledgebase containing information on the frequencies of 20 amino acids and atomic

Background
“Stoichioproteomics” relates the elemental composition of proteins and proteomes to variation in the physiological and ecological environment. To help harness and explore the wealth of hypotheses made possible under this framework, we introduce GRASP (http://www.graspdb.net), a public bioinformatic knowledgebase containing information on the frequencies of 20 amino acids and atomic composition of their side chains. GRASP integrates comparative protein composition data with annotation data from multiple public databases. Currently, GRASP includes information on proteins of 12 sequenced Drosophila (fruit fly) proteomes, which will be expanded to include increasingly diverse organisms over time. In this paper we illustrate the potential of GRASP for testing stoichioproteomic hypotheses by conducting an exploratory investigation into the composition of 12 Drosophila proteomes, testing the prediction that protein atomic content is associated with species ecology and with protein expression levels.
Results
Elements varied predictably along multivariate axes. Species were broadly similar, with the D. willistoni proteome a clear outlier. As expected, individual protein atomic content within proteomes was influenced by protein function and amino acid biochemistry. Evolution in elemental composition across the phylogeny followed less predictable patterns, but was associated with broad ecological variation in diet. Using expression data available for D. melanogaster, we found evidence consistent with selection for efficient usage of elements within the proteome: as expected, nitrogen content was reduced in highly expressed proteins in most tissues, most strongly in the gut, where nutrients are assimilated, and least strongly in the germline.
Conclusions
The patterns identified here using GRASP provide a foundation on which to base future research into the evolution of atomic composition in Drosophila and other taxa.
ContributorsGilbert, James D. J. (Author) / Acquisti, Claudia (Author) / Martinson, Holly M. (Author) / Elser, James (Author) / Kumar, Sudhir (Author) / Fagan, William F. (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created2013-09-04
130364-Thumbnail Image.png
Description
Background
Drosophila melanogaster has been established as a model organism for investigating the developmental gene interactions. The spatio-temporal gene expression patterns of Drosophila melanogaster can be visualized by in situ hybridization and documented as digital images. Automated and efficient tools for analyzing these expression images will provide biological insights into the

Background
Drosophila melanogaster has been established as a model organism for investigating the developmental gene interactions. The spatio-temporal gene expression patterns of Drosophila melanogaster can be visualized by in situ hybridization and documented as digital images. Automated and efficient tools for analyzing these expression images will provide biological insights into the gene functions, interactions, and networks. To facilitate pattern recognition and comparison, many web-based resources have been created to conduct comparative analysis based on the body part keywords and the associated images. With the fast accumulation of images from high-throughput techniques, manual inspection of images will impose a serious impediment on the pace of biological discovery. It is thus imperative to design an automated system for efficient image annotation and comparison.
Results
We present a computational framework to perform anatomical keywords annotation for Drosophila gene expression images. The spatial sparse coding approach is used to represent local patches of images in comparison with the well-known bag-of-words (BoW) method. Three pooling functions including max pooling, average pooling and Sqrt (square root of mean squared statistics) pooling are employed to transform the sparse codes to image features. Based on the constructed features, we develop both an image-level scheme and a group-level scheme to tackle the key challenges in annotating Drosophila gene expression pattern images automatically. To deal with the imbalanced data distribution inherent in image annotation tasks, the undersampling method is applied together with majority vote. Results on Drosophila embryonic expression pattern images verify the efficacy of our approach.
Conclusion
In our experiment, the three pooling functions perform comparably well in feature dimension reduction. The undersampling with majority vote is shown to be effective in tackling the problem of imbalanced data. Moreover, combining sparse coding and image-level scheme leads to consistent performance improvement in keywords annotation.
ContributorsSun, Qian (Author) / Muckatira, Sherin (Author) / Yuan, Lei (Author) / Ji, Shuiwang (Author) / Newfeld, Stuart (Author) / Kumar, Sudhir (Author) / Ye, Jieping (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor) / Ira A. Fulton Schools of Engineering (Contributor)
Created2013-12-03
130363-Thumbnail Image.png
Description
Background
Fruit fly embryogenesis is one of the best understood animal development systems, and the spatiotemporal gene expression dynamics in this process are captured by digital images. Analysis of these high-throughput images will provide novel insights into the functions, interactions, and networks of animal genes governing development. To facilitate comparative analysis,

Background
Fruit fly embryogenesis is one of the best understood animal development systems, and the spatiotemporal gene expression dynamics in this process are captured by digital images. Analysis of these high-throughput images will provide novel insights into the functions, interactions, and networks of animal genes governing development. To facilitate comparative analysis, web-based interfaces have been developed to conduct image retrieval based on body part keywords and images. Currently, the keyword annotation of spatiotemporal gene expression patterns is conducted manually. However, this manual practice does not scale with the continuously expanding collection of images. In addition, existing image retrieval systems based on the expression patterns may be made more accurate using keywords.
Results
In this article, we adapt advanced data mining and computer vision techniques to address the key challenges in annotating and retrieving fruit fly gene expression pattern images. To boost the performance of image annotation and retrieval, we propose representations integrating spatial information and sparse features, overcoming the limitations of prior schemes.
Conclusions
We perform systematic experimental studies to evaluate the proposed schemes in comparison with current methods. Experimental results indicate that the integration of spatial information and sparse features lead to consistent performance improvement in image annotation, while for the task of retrieval, sparse features alone yields better results.
ContributorsYuan, Lei (Author) / Woodard, Alexander (Author) / Ji, Shuiwang (Author) / Jiang, Yuan (Author) / Zhou, Zhi-Hua (Author) / Kumar, Sudhir (Author) / Ye, Jieping (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / Ira A. Fulton Schools of Engineering (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created2012-05-23
130362-Thumbnail Image.png
Description
Background
Multicellular organisms consist of cells of many different types that are established during development. Each type of cell is characterized by the unique combination of expressed gene products as a result of spatiotemporal gene regulation. Currently, a fundamental challenge in regulatory biology is to elucidate the gene expression controls that

Background
Multicellular organisms consist of cells of many different types that are established during development. Each type of cell is characterized by the unique combination of expressed gene products as a result of spatiotemporal gene regulation. Currently, a fundamental challenge in regulatory biology is to elucidate the gene expression controls that generate the complex body plans during development. Recent advances in high-throughput biotechnologies have generated spatiotemporal expression patterns for thousands of genes in the model organism fruit fly Drosophila melanogaster. Existing qualitative methods enhanced by a quantitative analysis based on computational tools we present in this paper would provide promising ways for addressing key scientific questions.
Results
We develop a set of computational methods and open source tools for identifying co-expressed embryonic domains and the associated genes simultaneously. To map the expression patterns of many genes into the same coordinate space and account for the embryonic shape variations, we develop a mesh generation method to deform a meshed generic ellipse to each individual embryo. We then develop a co-clustering formulation to cluster the genes and the mesh elements, thereby identifying co-expressed embryonic domains and the associated genes simultaneously. Experimental results indicate that the gene and mesh co-clusters can be correlated to key developmental events during the stages of embryogenesis we study. The open source software tool has been made available at http://compbio.cs.odu.edu/fly/.
Conclusions
Our mesh generation and machine learning methods and tools improve upon the flexibility, ease-of-use and accuracy of existing methods.
ContributorsZhang, Wenlu (Author) / Feng, Daming (Author) / Li, Rongjian (Author) / Chernikov, Andrey (Author) / Chrisochoides, Nikos (Author) / Osgood, Christopher (Author) / Konikoff, Charlotte (Author) / Newfeld, Stuart (Author) / Kumar, Sudhir (Author) / Ji, Shuiwang (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created2013-12-28
130361-Thumbnail Image.png
Description
Background
Neighborhood environment studies of physical activity (PA) have been mainly single-country focused. The International Prevalence Study (IPS) presented a rare opportunity to examine neighborhood features across countries. The purpose of this analysis was to: 1) detect international neighborhood typologies based on participants’ response patterns to an environment survey and 2)

Background
Neighborhood environment studies of physical activity (PA) have been mainly single-country focused. The International Prevalence Study (IPS) presented a rare opportunity to examine neighborhood features across countries. The purpose of this analysis was to: 1) detect international neighborhood typologies based on participants’ response patterns to an environment survey and 2) to estimate associations between neighborhood environment patterns and PA.
Methods
A Latent Class Analysis (LCA) was conducted on pooled IPS adults (N=11,541) aged 18 to 64 years old (mean=37.5 ±12.8 yrs; 55.6% women) from 11 countries including Belgium, Brazil, Canada, Colombia, Hong Kong, Japan, Lithuania, New Zealand, Norway, Sweden, and the U.S. This subset used the Physical Activity Neighborhood Environment Survey (PANES) that briefly assessed 7 attributes within 10–15 minutes walk of participants’ residences, including residential density, access to shops/services, recreational facilities, public transit facilities, presence of sidewalks and bike paths, and personal safety. LCA derived meaningful subgroups from participants’ response patterns to PANES items, and participants were assigned to neighborhood types. The validated short-form International Physical Activity Questionnaire (IPAQ) measured likelihood of meeting the 150 minutes/week PA guideline. To validate derived classes, meeting the guideline either by walking or total PA was regressed on neighborhood types using a weighted generalized linear regression model, adjusting for gender, age and country.
Results
A 5-subgroup solution fitted the dataset and was interpretable. Neighborhood types were labeled, “Overall Activity Supportive (52% of sample)”, “High Walkable and Unsafe with Few Recreation Facilities (16%)”, “Safe with Active Transport Facilities (12%)”, “Transit and Shops Dense with Few Amenities (15%)”, and “Safe but Activity Unsupportive (5%)”. Country representation differed by type (e.g., U.S. disproportionally represented “Safe but Activity Unsupportive”). Compared to the Safe but Activity Unsupportive, two types showed greater odds of meeting PA guideline for walking outcome (High Walkable and Unsafe with Few Recreation Facilities, OR= 2.26 (95% CI 1.18-4.31); Overall Activity Supportive, OR= 1.90 (95% CI 1.13-3.21). Significant but smaller odds ratios were also found for total PA.
Conclusions
Meaningful neighborhood patterns generalized across countries and explained practical differences in PA. These observational results support WHO/UN recommendations for programs and policies targeted to improve features of the neighborhood environment for PA.
ContributorsAdams, Marc (Author) / Ding, Ding (Author) / Sallis, James F. (Author) / Bowles, Heather R. (Author) / Ainsworth, Barbara (Author) / Bergman, Patrick (Author) / Bull, Fiona C. (Author) / Carr, Harriette (Author) / Craig, Cora L. (Author) / De Bourdeaudhuij, Ilse (Author) / Fernando Gomez, Luis (Author) / Hagstromer, Maria (Author) / Klasson-Heggebo, Lena (Author) / Inoue, Shigeru (Author) / Lefevre, Johan (Author) / Macfarlane, Duncan J. (Author) / Matsudo, Sandra (Author) / Matsudo, Victor (Author) / McLean, Grant (Author) / Murase, Norio (Author) / Sjostrom, Michael (Author) / Tomten, Heidi (Author) / Volbekiene, Vida (Author) / Bauman, Adrian (Author) / College of Health Solutions (Contributor) / School of Nutrition and Health Promotion (Contributor)
Created2013-03-07
130359-Thumbnail Image.png
Description
Background
Increasing empirical evidence supports associations between neighborhood environments and physical activity. However, since most studies were conducted in a single country, particularly western countries, the generalizability of associations in an international setting is not well understood. The current study examined whether associations between perceived attributes of neighborhood environments and physical

Background
Increasing empirical evidence supports associations between neighborhood environments and physical activity. However, since most studies were conducted in a single country, particularly western countries, the generalizability of associations in an international setting is not well understood. The current study examined whether associations between perceived attributes of neighborhood environments and physical activity differed by country.
Methods
Population representative samples from 11 countries on five continents were surveyed using comparable methodologies and measurement instruments. Neighborhood environment × country interactions were tested in logistic regression models with meeting physical activity recommendations as the outcome, adjusted for demographic characteristics. Country-specific associations were reported.
Results
Significant neighborhood environment attribute × country interactions implied some differences across countries in the association of each neighborhood attribute with meeting physical activity recommendations. Across the 11 countries, land-use mix and sidewalks had the most consistent associations with physical activity. Access to public transit, bicycle facilities, and low-cost recreation facilities had some associations with physical activity, but with less consistency across countries. There was little evidence supporting the associations of residential density and crime-related safety with physical activity in most countries.
Conclusion
There is evidence of generalizability for the associations of land use mix, and presence of sidewalks with physical activity. Associations of other neighborhood characteristics with physical activity tended to differ by country. Future studies should include objective measures of neighborhood environments, compare psychometric properties of reports across countries, and use better specified models to further understand the similarities and differences in associations across countries.
ContributorsDing, Ding (Author) / Adams, Marc (Author) / Sallis, James F. (Author) / Norman, Gregory J. (Author) / Hovell, Melbourn F. (Author) / Chambers, Christina D. (Author) / Hofstetter, C. Richard (Author) / Bowles, Heather R. (Author) / Hagstromer, Maria (Author) / Craig, Cora L. (Author) / Fernando Gomez, Luis (Author) / De Bourdeaudhuij, Ilse (Author) / Macfarlane, Duncan J. (Author) / Ainsworth, Barbara (Author) / Bergman, Patrick (Author) / Bull, Fiona C. (Author) / Carr, Harriette (Author) / Klasson-Heggebo, Lena (Author) / Inoue, Shigeru (Author) / Murase, Norio (Author) / Matsudo, Sandra (Author) / Matsudo, Victor (Author) / McLean, Grant (Author) / Sjostrom, Michael (Author) / Tomten, Heidi (Author) / Lefevre, Johan (Author) / Volbekiene, Vida (Author) / Bauman, Adrian E. (Author) / College of Health Solutions (Contributor) / School of Nutrition and Health Promotion (Contributor)
Created2013-05-14