Image-level and group-level models for Drosophila gene expression pattern annotation

Description

Background
Drosophila melanogaster has been established as a model organism for investigating the developmental gene interactions. The spatio-temporal gene expression patterns of Drosophila melanogaster can be visualized by in situ hybridization and documented as digital images. Automated and efficient tools for analyzing these expression images will provide biological insights into the gene functions, interactions, and networks. To facilitate pattern recognition and comparison, many web-based resources have been created to conduct comparative analysis based on the body part keywords and the associated images. With the fast accumulation of images from high-throughput techniques, manual inspection of images will impose a serious impediment on the pace of biological discovery. It is thus imperative to design an automated system for efficient image annotation and comparison.
Results
We present a computational framework to perform anatomical keywords annotation for Drosophila gene expression images. The spatial sparse coding approach is used to represent local patches of images in comparison with the well-known bag-of-words (BoW) method. Three pooling functions including max pooling, average pooling and Sqrt (square root of mean squared statistics) pooling are employed to transform the sparse codes to image features. Based on the constructed features, we develop both an image-level scheme and a group-level scheme to tackle the key challenges in annotating Drosophila gene expression pattern images automatically. To deal with the imbalanced data distribution inherent in image annotation tasks, the undersampling method is applied together with majority vote. Results on Drosophila embryonic expression pattern images verify the efficacy of our approach.
Conclusion
In our experiment, the three pooling functions perform comparably well in feature dimension reduction. The undersampling with majority vote is shown to be effective in tackling the problem of imbalanced data. Moreover, combining sparse coding and image-level scheme leads to consistent performance improvement in keywords annotation.
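
The pooling and voting steps named in the Results can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function names, the use of absolute values for max and average pooling, the 0/1 label encoding, and the tie-breaking rule in the vote are all assumptions made for the example. Only the three pooling definitions (max, average, square root of the mean squared codes) and the idea of undersampling combined with majority vote come from the abstract.

```python
import numpy as np

def pool_sparse_codes(codes: np.ndarray, method: str = "max") -> np.ndarray:
    """Collapse per-patch sparse codes of shape (n_patches, n_atoms)
    into one feature vector of length n_atoms for the whole image."""
    if method == "max":
        # Assumption: pool on magnitudes so negative codes still count.
        return np.abs(codes).max(axis=0)
    if method == "average":
        return np.abs(codes).mean(axis=0)
    if method == "sqrt":
        # Square root of the mean squared codes, as described in the abstract.
        return np.sqrt((codes ** 2).mean(axis=0))
    raise ValueError(f"unknown pooling method: {method}")

def undersample(labels: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Return indices of a balanced subset: every class is cut down to
    the size of the smaller class (labels are assumed to be 0 or 1)."""
    pos = np.flatnonzero(labels == 1)
    neg = np.flatnonzero(labels == 0)
    n = min(len(pos), len(neg))
    return np.concatenate([
        rng.choice(pos, size=n, replace=False),
        rng.choice(neg, size=n, replace=False),
    ])

def majority_vote(votes: np.ndarray) -> np.ndarray:
    """Combine 0/1 predictions of shape (n_classifiers, n_samples).
    Assumption: a tie counts as a negative prediction."""
    return (votes.mean(axis=0) > 0.5).astype(int)
```

In a pipeline of this kind, one classifier would typically be trained per balanced subset returned by `undersample`, and `majority_vote` would merge their predictions for each keyword.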

Contributors: Sun, Qian (Author) / Muckatira, Sherin (Author) / Yuan, Lei (Author) / Ji, Shuiwang (Author) / Newfeld, Stuart (Author) / Kumar, Sudhir (Author) / Ye, Jieping (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor) / Ira A. Fulton Schools of Engineering (Contributor)

Created: 2013-12-03

GRASP [Genomic Resource Access for Stoichioproteomics]: comparative explorations of the atomic content of 12 Drosophila proteomes

Description

Background
“Stoichioproteomics” relates the elemental composition of proteins and proteomes to variation in the physiological and ecological environment. To help harness and explore the wealth of hypotheses made possible under this framework, we introduce GRASP (http://www.graspdb.net), a public bioinformatic knowledgebase containing information on the frequencies of 20 amino acids and atomic composition of their side chains. GRASP integrates comparative protein composition data with annotation data from multiple public databases. Currently, GRASP includes information on proteins of 12 sequenced Drosophila (fruit fly) proteomes, which will be expanded to include increasingly diverse organisms over time. In this paper we illustrate the potential of GRASP for testing stoichioproteomic hypotheses by conducting an exploratory investigation into the composition of 12 Drosophila proteomes, testing the prediction that protein atomic content is associated with species ecology and with protein expression levels.
Results
Elements varied predictably along multivariate axes. Species were broadly similar, with the D. willistoni proteome a clear outlier. As expected, individual protein atomic content within proteomes was influenced by protein function and amino acid biochemistry. Evolution in elemental composition across the phylogeny followed less predictable patterns, but was associated with broad ecological variation in diet. Using expression data available for D. melanogaster, we found evidence consistent with selection for efficient usage of elements within the proteome: as expected, nitrogen content was reduced in highly expressed proteins in most tissues, most strongly in the gut, where nutrients are assimilated, and least strongly in the germline.
Conclusions
The patterns identified here using GRASP provide a foundation on which to base future research into the evolution of atomic composition in Drosophila and other taxa.

Contributors: Gilbert, James D. J. (Author) / Acquisti, Claudia (Author) / Martinson, Holly M. (Author) / Elser, James (Author) / Kumar, Sudhir (Author) / Fagan, William F. (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)

Created: 2013-09-04

A composite genome approach to identify phylogenetically informative data from next-generation sequencing

Description

Background
Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS, avoids the time consuming steps of de novo whole genome assembly, multiple genome alignment, and annotation.
Results
In simulations, SISRS identified large numbers of loci containing variable sites with phylogenetic signal. For genomic data from apes, SISRS identified thousands of variable sites, from which we produced an accurate phylogeny. Finally, we used SISRS to identify phylogenetic markers that we used to estimate the phylogeny of placental mammals. We recovered eight phylogenies that resolved the basal relationships among mammals using datasets with different levels of missing data. The three alternate resolutions of the basal relationships are consistent with the major hypotheses for the relationships among mammals, all of which have been supported previously by different molecular datasets.
Conclusions
SISRS has the potential to transform phylogenetic research. This method eliminates the need for expensive marker development in many studies by using whole genome shotgun sequence data directly. SISRS is open source and freely available at https://github.com/rachelss/SISRS/releases.

Contributors: Schwartz, Rachel (Author) / Harkins, Kelly (Author) / Stone, Anne (Author) / Cartwright, Reed (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / School of Life Sciences (Contributor)

Created: 2015-06-11

Application of a High Throughput Alamar Blue Biofilm Susceptibility Assay to Staphylococcus Aureus Biofilms

Description

Background: Staphylococcus aureus and S. epidermidis biofilms differ in structure, growth and regulation, and thus the high-throughput method of evaluating biofilm susceptibility that has been published for S. epidermidis cannot be applied to S. aureus without first evaluating the assay's reproducibility and reliability with S. aureus biofilms.

Methods: Staphylococcus aureus biofilms were treated with eleven approved antibiotics, lysostaphin, or Conflikt®, exposed to the oxidation reduction indicator Alamar blue, and reduction relative to untreated controls was determined visually and spectrophotometrically. The minimum biofilm inhibitory concentration (MBIC) was defined as ≤ 50% Alamar blue reduction and a purple/blue well 60 min after the addition of Alamar blue. Because all of the approved antibiotics had MBICs >128 μg/ml (most >2048 μg/ml), lysostaphin and Conflikt®, with relatively low MBICs, were used to correlate Alamar blue reduction with 2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-2H-tetrazolium-5-carboxanilide (XTT) reduction and viable counts (CFU/ml) for S. aureus ATCC 29213 and three clinical isolates. Alamar blue's stability and lack of toxicity allowed CFU/ml to be determined from the same wells as Alamar blue absorbances.

Results: Overall, Alamar blue reduction had excellent correlation with XTT reduction and with CFU/ml. For ATCC 29213 and two clinical isolates treated with lysostaphin or Conflikt®, Alamar blue reduction had excellent correlation with XTT reduction (r = 0.93-0.99) and with CFU/ml (r = 0.92-0.98). For one of the clinical isolates, the results were moderately correlated for Conflikt® (r = 0.76, Alamar blue vs. XTT; r = 0.81, Alamar blue vs. CFU/ml) and had excellent correlation for lysostaphin (r = 0.95, Alamar blue vs. XTT; r = 0.97, Alamar blue vs. CFU/ml).

Conclusion: A reliable, reproducible method for evaluating biofilm susceptibility was successfully applied to S. aureus biofilms. The described method provides researchers with a simple, nontoxic, relatively inexpensive, high throughput measure of viability after drug treatment. A standardized biofilm Alamar blue assay should greatly increase the rate of discovery of S. aureus biofilm specific agents.

Contributors: Pettit, Robin (Author) / Weber, Christine (Author) / Pettit, George (Author) / Department of Chemistry and Biochemistry (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Molecular Sciences (Contributor)

Created: 2009-10-27

Evolutionary Diagnosis of Non-Synonymous Variants Involved in Differential Drug Response

Description

Background:
Many pharmaceutical drugs are known to be ineffective or have negative side effects in a substantial proportion of patients. Genomic advances are revealing that some non-synonymous single nucleotide variants (nsSNVs) may cause differences in drug efficacy and side effects. Therefore, it is desirable to evaluate nsSNVs of interest in their ability to modulate the drug response.

Results:
We found that the available data on the link between drug response and nsSNV is rather modest. There were only 31 distinct drug response-altering (DR-altering) and 43 distinct drug response-neutral (DR-neutral) nsSNVs in the whole Pharmacogenomics Knowledge Base (PharmGKB). However, even with this modest dataset, it was clear that existing bioinformatics tools have difficulties in correctly predicting the known DR-altering and DR-neutral nsSNVs. They exhibited an overall accuracy of less than 50%, which was not better than random diagnosis. We found that the underlying problem is the markedly different evolutionary properties between positions harboring nsSNVs linked to drug responses and those observed for inherited diseases. To solve this problem, we developed a new diagnosis method, Drug-EvoD, which was trained on the evolutionary properties of nsSNVs associated with drug responses in a sparse learning framework. Drug-EvoD achieves a TPR of 84% and a TNR of 53%, with a balanced accuracy of 69%, which improves upon other methods significantly.

Conclusions:
The new tool will enable researchers to computationally identify nsSNVs that may affect drug responses. However, much larger training and testing datasets are needed to develop more reliable and accurate tools.
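
The balanced accuracy quoted in the Results follows directly from the two reported rates. As a small worked check (the function name is illustrative and not part of Drug-EvoD), balanced accuracy is the mean of the true positive rate and the true negative rate, so a TPR of 0.84 and a TNR of 0.53 give 0.685, which the abstract reports as 69%. A classifier that guesses at random scores 0.5 on this measure regardless of class imbalance, which is why it is a fair baseline for the "less than 50%" comparison.

```python
def balanced_accuracy(tpr: float, tnr: float) -> float:
    """Mean of sensitivity (TPR) and specificity (TNR), both given as
    fractions between 0 and 1."""
    for name, value in (("tpr", tpr), ("tnr", tnr)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must lie in [0, 1], got {value}")
    return (tpr + tnr) / 2.0

# Rates reported in the abstract for Drug-EvoD.
drug_evod = balanced_accuracy(0.84, 0.53)
```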

Contributors: Gerek, Nevin Z. (Author) / Liu, Li (Author) / Gerold, Kristyn (Author) / Biparva, Pegah (Author) / Thomas, Eric D. (Author) / Kumar, Sudhir (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor)

Created: 2015-01-15

Merging Economics and Epidemiology to Improve the Prediction and Management of Infectious Disease

Description

Mathematical epidemiology, one of the oldest and richest areas in mathematical biology, has significantly enhanced our understanding of how pathogens emerge, evolve, and spread. Classical epidemiological models, the standard for predicting and managing the spread of infectious disease, assume that contacts between susceptible and infectious individuals depend on their relative frequency in the population. The behavioral factors that underpin contact rates are not generally addressed. There is, however, an emerging class of models that addresses the feedbacks between infectious disease dynamics and the behavioral decisions driving host contact. Referred to as “economic epidemiology” or “epidemiological economics,” the approach explores the determinants of decisions about the number and type of contacts made by individuals, using insights and methods from economics. We show how the approach has the potential both to improve predictions of the course of infectious disease, and to support development of novel approaches to infectious disease management.
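
For orientation, the classical assumption described above is commonly written as a frequency-dependent SIR model. The equations below are the standard textbook form, given here as an illustration and not taken from the paper; the symbols $S$, $I$, $R$ (susceptible, infectious, recovered), $N = S + I + R$, the transmission rate $\beta$, and the recovery rate $\gamma$ are the usual conventions.

```latex
\begin{aligned}
\frac{dS}{dt} &= -\beta \frac{S I}{N}, \\
\frac{dI}{dt} &= \beta \frac{S I}{N} - \gamma I, \\
\frac{dR}{dt} &= \gamma I.
\end{aligned}
```

Here $\beta$ is a fixed constant, so contact behavior does not respond to the state of the epidemic. Models in the economic epidemiology class would, broadly, let the contact term depend on individual decisions instead.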

Contributors: Perrings, Charles (Author) / Castillo-Chavez, Carlos (Author) / Chowell-Puente, Gerardo (Author) / Daszak, Peter (Author) / Fenichel, Eli P. (Author) / Finnoff, David (Author) / Horan, Richard D. (Author) / Kilpatrick, A. Marm (Author) / Kinzig, Ann (Author) / Kuminoff, Nicolai (Author) / Levin, Simon (Author) / Morin, Benjamin (Author) / Smith, Katherine F. (Author) / Springborn, Michael (Author) / Simon M. Levin Mathematical, Computational and Modeling Sciences Center (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / W.P. Carey School of Business (Contributor) / Economics (Contributor) / Julie Ann Wrigley Global Institute of Sustainability (Contributor)

Created: 2015-12-01

Resource Consumption, Sustainability, and Cancer

Description

Preserving a system’s viability in the presence of diversity erosion is critical if the goal is to sustainably support biodiversity. Reduction in population heterogeneity, whether inter- or intraspecies, may increase population fragility, either decreasing its ability to adapt effectively to environmental changes or facilitating the survival and success of ordinarily rare phenotypes. The latter may result in over-representation of individuals who may participate in resource utilization patterns that can lead to over-exploitation, exhaustion, and, ultimately, collapse of both the resource and the population that depends on it. Here, we aim to identify regimes that can signal whether a consumer–resource system is capable of supporting viable degrees of heterogeneity. The framework used here is an expansion of a previously introduced consumer–resource type system of a population of individuals classified by their resource consumption. Application of the Reduction Theorem to the system enables us to evaluate the health of the system through tracking both the mean value of the parameter of resource (over)consumption, and the population variance, as both change over time. The article concludes with a discussion that highlights applicability of the proposed system to investigation of systems that are affected by particularly devastating overly adapted populations, namely cancerous cells. Potential intervention approaches for system management are discussed in the context of cancer therapies.

Contributors: Kareva, Irina (Author) / Morin, Benjamin (Author) / Castillo-Chavez, Carlos (Author) / Simon M. Levin Mathematical, Computational and Modeling Sciences Center (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / Julie Ann Wrigley Global Institute of Sustainability (Contributor)

Created: 2015-02-01

Controlling Surface Defects and Photophysics in TiO2 Nanoparticles

Description

Titanium dioxide (TiO2) is widely used for photocatalysis and solar cell applications, and the electronic structure of bulk TiO2 is well understood. However, the surface structure of nanoparticulate TiO2, which has a key role in properties such as solubility and catalytic activity, still remains controversial. Detailed understanding of surface defect structures may help explain reactivity and overall materials performance in a wide range of applications. In this work we address the solubility problem and surface defects control on TiO2 nanoparticles. We report the synthesis and characterization of ∼4 nm TiO2 anatase spherical nanoparticles that are soluble and stable in a wide range of organic solvents and water. By controlling the temperature during the synthesis, we are able to tailor the density of defect states on the surface of the TiO2 nanoparticles without affecting parameters such as size, shape, core crystallinity, and solubility. The morphology of both kinds of nanoparticles was determined by TEM. EPR experiments were used to characterize the surface defects, and transient absorption measurements demonstrate the influence of the TiO2 defect states on photoinduced electron transfer dynamics.

Contributors: Llansola Portoles, Manuel (Author) / Bergkamp, Jesse (Author) / Finkelstein Shapiro, Daniel (Author) / Sherman, Benjamin (Author) / Kodis, Gerdenis (Author) / Dimitrijevic, Nada M. (Author) / Gust, Devens (Author) / Moore, Thomas (Author) / Moore, Ana (Author) / Department of Chemistry and Biochemistry (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Molecular Sciences (Contributor) / Center for Bioenergy and Photosynthesis (Contributor)

Created: 2014-11-13

The Telomere Binding Protein TRF2 Induces Chromatin Compaction

Description

Mammalian telomeres are specialized chromatin structures that require the telomere binding protein, TRF2, for maintaining chromosome stability. In addition to its ability to modulate DNA repair activities, TRF2 also has direct effects on DNA structure and topology. Given that mammalian telomeric chromatin includes nucleosomes, we investigated the effect of this protein on chromatin structure. TRF2 bound to reconstituted telomeric nucleosomal fibers through both its basic N-terminus and its C-terminal DNA binding domain. Analytical agarose gel electrophoresis (AAGE) studies showed that TRF2 promoted the folding of nucleosomal arrays into more compact structures by neutralizing negative surface charge. A construct containing the N-terminal and TRFH domains together altered the charge and radius of nucleosomal arrays similarly to full-length TRF2 suggesting that TRF2-driven changes in global chromatin structure were largely due to these regions. However, the most compact chromatin structures were induced by the isolated basic N-terminal region, as judged by both AAGE and atomic force microscopy. Although the N-terminal region condensed nucleosomal array fibers, the TRFH domain, known to alter DNA topology, was required for stimulation of a strand invasion-like reaction with nucleosomal arrays. Optimal strand invasion also required the C-terminal DNA binding domain. Furthermore, the reaction was not stimulated on linear histone-free DNA. Our data suggest that nucleosomal chromatin has the ability to facilitate this activity of TRF2 which is thought to be involved in stabilizing looped telomere structures.

Contributors: Baker, Asmaa M. (Author) / Fu, Qiang (Author) / Hayward, William (Author) / Victoria, Samuel (Author) / Pedroso, Ilene M. (Author) / Lindsay, Stuart (Author) / Fletcher, Terace M. (Author) / Department of Chemistry and Biochemistry (Contributor) / Biodesign Institute (Contributor) / Single Molecule Biophysics (Contributor)

Created: 2011-04-19

Mass Media and the Contagion of Fear: The Case of Ebola in America

Description

Background
In the weeks following the first imported case of Ebola in the U.S. on September 29, 2014, coverage of the very limited outbreak dominated the news media, in a manner quite disproportionate to the actual threat to national public health; by the end of October 2014, there were only four laboratory-confirmed cases of Ebola in the entire nation. Public interest in these events was high, as reflected in the millions of Ebola-related Internet searches and tweets performed in the month following the first confirmed case. Use of trending Internet searches and tweets has been proposed in the past for real-time prediction of outbreaks (a field referred to as “digital epidemiology”), but accounting for the biases of public panic has been problematic. In the case of the limited U.S. Ebola outbreak, we know that the Ebola-related searches and tweets originating in the U.S. during the outbreak were due only to public interest or panic, providing an unprecedented means to determine how these dynamics affect such data, and how news media may be driving these trends.
Methodology
We examine daily Ebola-related Internet search and Twitter data in the U. S. during the six week period ending Oct 31, 2014. TV news coverage data were obtained from the daily number of Ebola-related news videos appearing on two major news networks. We fit the parameters of a mathematical contagion model to the data to determine if the news coverage was a significant factor in the temporal patterns in Ebola-related Internet and Twitter data.
Conclusions
We find significant evidence of contagion, with each Ebola-related news video inspiring tens of thousands of Ebola-related tweets and Internet searches. Between 65% and 76% of the variance in all samples is described by the news media contagion model.

Contributors: Towers, Sherry (Author) / Afzal, Shehzad (Author) / Bernal, Gilbert (Author) / Bliss, Nadya (Author) / Brown, Shala (Author) / Espinoza, Baltazar (Author) / Jackson, Jasmine (Author) / Judson-Garcia, Julia (Author) / Khan, Maryam (Author) / Lin, Michael (Author) / Mamada, Robert (Author) / Moreno, Victor (Author) / Nazari, Fereshteh (Author) / Okuneye, Kamaldeen (Author) / Ross, Mary (Author) / Rodriguez, Claudia (Author) / Medlock, Jan (Author) / Ebert, David (Author) / Castillo-Chavez, Carlos (Author) / Simon M. Levin Mathematical, Computational and Modeling Sciences Center (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / Mary Lou Fulton Teachers College (Contributor) / Educational Leadership and Innovation (Contributor) / School of Mathematical and Statistical Sciences (Contributor) / Global Security Initiative (Contributor)

Created: 2015-06-11

ASU Regents' Professors Open Access Works
