Matching Items (135)
Filtering by

Clear all filters

150163-Thumbnail Image.png
Description
With the advent of the X-ray free-electron laser (XFEL), an opportunity has arisen to break the nexus between radiation dose and spatial resolution in diffractive imaging, by outrunning radiation damage altogether when using single X-ray pulses so brief that they terminate before atomic motion commences. This dissertation concerns the application

With the advent of the X-ray free-electron laser (XFEL), an opportunity has arisen to break the nexus between radiation dose and spatial resolution in diffractive imaging, by outrunning radiation damage altogether when using single X-ray pulses so brief that they terminate before atomic motion commences. This dissertation concerns the application of XFELs to biomolecular imaging in an effort to overcome the severe challenges associated with radiation damage and macroscopic protein crystal growth. The method of femtosecond protein nanocrystallography (fsPNX) is investigated, and a new method for extracting crystallographic structure factors is demonstrated on simulated data and on the first experimental fsPNX data obtained at an XFEL. Errors are assessed based on standard metrics familiar to the crystallography community. It is shown that resulting structure factors match the quality of those measured conventionally, at least to 9 angstrom resolution. A new method for ab-initio phasing of coherently-illuminated nanocrystals is then demonstrated on simulated data. The method of correlated fluctuation small-angle X-ray scattering (CFSAXS) is also investigated as an alternative route to biomolecular structure determination, without the use of crystals. It is demonstrated that, for a constrained two-dimensional geometry, a projection image of a single particle can be formed, ab-initio and without modeling parameters, from measured diffracted intensity correlations arising from disordered ensembles of identical particles illuminated simultaneously. The method is demonstrated experimentally, based on soft X-ray diffraction from disordered but identical nanoparticles, providing the first experimental proof-of-principle result. Finally, the fundamental limitations of CFSAXS is investigated through both theory and simulations. It is found that the signal-to-noise ratio (SNR) for CFSAXS data is essentially independent of the number of particles exposed in each diffraction pattern. The dependence of SNR on particle size and resolution is considered, and realistic estimates are made (with the inclusion of solvent scatter) of the SNR for protein solution scattering experiments utilizing an XFEL source.
ContributorsKirian, Richard A (Author) / Spence, John C. H. (Committee member) / Doak, R. Bruce (Committee member) / Weierstall, Uwe (Committee member) / Bennett, Peter (Committee member) / Treacy, Michael M. J. (Committee member) / Arizona State University (Publisher)
Created2011
150076-Thumbnail Image.png
Description
One of the most important issues in femtosecond free electron laser X-ray diraction is to reconstruct the 3D charge density of molecule from a mass of diraction snapshots. In order to determine the orientation of single molecule from diraction patterns, we rst determine the moments and products of inertia of

One of the most important issues in femtosecond free electron laser X-ray diraction is to reconstruct the 3D charge density of molecule from a mass of diraction snapshots. In order to determine the orientation of single molecule from diraction patterns, we rst determine the moments and products of inertia of this from 2D experiment data (diraction patterns or EM images to obtain the elements of the inertia tensor. If diraction patterns from uniformly random orientations or some preferred orientations are collected, the principal axes of the molecule can be extracted, together with the Euler angles which relate the principal axes of the molecule to the laboratory frame axes. This is achieved by nding the maximum and minimum values for the measured moments from many single-molecule patterns. Simulations for GroEL protein indicates that the calculation of the autocorrelation help eliminate the Poisson noise in Cryo- EM images and can make correct orientation determination. The eect of water jacket surrounding the protein molecule is studied based on molecular dynamics simulation result. The intensities from water and interference is found to suppress those from protein itself. A method is proposed and applied to the simulation data to show the possibility for it to overcome the water background problem. The scattering between Bragg re ections from nanocrystals is used to aid solution of the phase problem. We describe a method for reconstructing the charge density of a typical molecule within a single unit cell, if suciently nely-sampled diraction data are available from many nanocrystals of dierent sizes lying in the same orientations without knowledge of the distribution of particle size or requiring atomic-resolution data. Triple correlation of the diraction patterns are made use of to reconiii
ContributorsWang, Xiaoyu (Author) / Spence, John C.H. (Thesis advisor) / Schmidt, Kevin (Committee member) / Doak, R. Bruce (Committee member) / Weierstall, Uwe (Committee member) / Ros, Robert (Committee member) / Arizona State University (Publisher)
Created2011
130364-Thumbnail Image.png
Description
Background
Drosophila melanogaster has been established as a model organism for investigating the developmental gene interactions. The spatio-temporal gene expression patterns of Drosophila melanogaster can be visualized by in situ hybridization and documented as digital images. Automated and efficient tools for analyzing these expression images will provide biological insights into the

Background
Drosophila melanogaster has been established as a model organism for investigating the developmental gene interactions. The spatio-temporal gene expression patterns of Drosophila melanogaster can be visualized by in situ hybridization and documented as digital images. Automated and efficient tools for analyzing these expression images will provide biological insights into the gene functions, interactions, and networks. To facilitate pattern recognition and comparison, many web-based resources have been created to conduct comparative analysis based on the body part keywords and the associated images. With the fast accumulation of images from high-throughput techniques, manual inspection of images will impose a serious impediment on the pace of biological discovery. It is thus imperative to design an automated system for efficient image annotation and comparison.
Results
We present a computational framework to perform anatomical keywords annotation for Drosophila gene expression images. The spatial sparse coding approach is used to represent local patches of images in comparison with the well-known bag-of-words (BoW) method. Three pooling functions including max pooling, average pooling and Sqrt (square root of mean squared statistics) pooling are employed to transform the sparse codes to image features. Based on the constructed features, we develop both an image-level scheme and a group-level scheme to tackle the key challenges in annotating Drosophila gene expression pattern images automatically. To deal with the imbalanced data distribution inherent in image annotation tasks, the undersampling method is applied together with majority vote. Results on Drosophila embryonic expression pattern images verify the efficacy of our approach.
Conclusion
In our experiment, the three pooling functions perform comparably well in feature dimension reduction. The undersampling with majority vote is shown to be effective in tackling the problem of imbalanced data. Moreover, combining sparse coding and image-level scheme leads to consistent performance improvement in keywords annotation.
ContributorsSun, Qian (Author) / Muckatira, Sherin (Author) / Yuan, Lei (Author) / Ji, Shuiwang (Author) / Newfeld, Stuart (Author) / Kumar, Sudhir (Author) / Ye, Jieping (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor) / Ira A. Fulton Schools of Engineering (Contributor)
Created2013-12-03
130365-Thumbnail Image.png
Description
Background
“Stoichioproteomics” relates the elemental composition of proteins and proteomes to variation in the physiological and ecological environment. To help harness and explore the wealth of hypotheses made possible under this framework, we introduce GRASP (http://www.graspdb.net), a public bioinformatic knowledgebase containing information on the frequencies of 20 amino acids and atomic

Background
“Stoichioproteomics” relates the elemental composition of proteins and proteomes to variation in the physiological and ecological environment. To help harness and explore the wealth of hypotheses made possible under this framework, we introduce GRASP (http://www.graspdb.net), a public bioinformatic knowledgebase containing information on the frequencies of 20 amino acids and atomic composition of their side chains. GRASP integrates comparative protein composition data with annotation data from multiple public databases. Currently, GRASP includes information on proteins of 12 sequenced Drosophila (fruit fly) proteomes, which will be expanded to include increasingly diverse organisms over time. In this paper we illustrate the potential of GRASP for testing stoichioproteomic hypotheses by conducting an exploratory investigation into the composition of 12 Drosophila proteomes, testing the prediction that protein atomic content is associated with species ecology and with protein expression levels.
Results
Elements varied predictably along multivariate axes. Species were broadly similar, with the D. willistoni proteome a clear outlier. As expected, individual protein atomic content within proteomes was influenced by protein function and amino acid biochemistry. Evolution in elemental composition across the phylogeny followed less predictable patterns, but was associated with broad ecological variation in diet. Using expression data available for D. melanogaster, we found evidence consistent with selection for efficient usage of elements within the proteome: as expected, nitrogen content was reduced in highly expressed proteins in most tissues, most strongly in the gut, where nutrients are assimilated, and least strongly in the germline.
Conclusions
The patterns identified here using GRASP provide a foundation on which to base future research into the evolution of atomic composition in Drosophila and other taxa.
ContributorsGilbert, James D. J. (Author) / Acquisti, Claudia (Author) / Martinson, Holly M. (Author) / Elser, James (Author) / Kumar, Sudhir (Author) / Fagan, William F. (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created2013-09-04
130367-Thumbnail Image.png
Description
Background
Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS,

Background
Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS, avoids the time consuming steps of de novo whole genome assembly, multiple genome alignment, and annotation.
Results
For simulations SISRS is able to identify large numbers of loci containing variable sites with phylogenetic signal. For genomic data from apes, SISRS identified thousands of variable sites, from which we produced an accurate phylogeny. Finally, we used SISRS to identify phylogenetic markers that we used to estimate the phylogeny of placental mammals. We recovered eight phylogenies that resolved the basal relationships among mammals using datasets with different levels of missing data. The three alternate resolutions of the basal relationships are consistent with the major hypotheses for the relationships among mammals, all of which have been supported previously by different molecular datasets.
Conclusions
SISRS has the potential to transform phylogenetic research. This method eliminates the need for expensive marker development in many studies by using whole genome shotgun sequence data directly. SISRS is open source and freely available at https://github.com/rachelss/SISRS/releases.
ContributorsSchwartz, Rachel (Author) / Harkins, Kelly (Author) / Stone, Anne (Author) / Cartwright, Reed (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Human Evolution and Social Change (Contributor) / School of Life Sciences (Contributor)
Created2015-06-11
130369-Thumbnail Image.png
Description

This paper discusses the properties of cancer cells from a new perspective based on an analogy with phase transitions in physical systems. Similarities in terms of instabilities and attractor states are outlined and differences discussed. While physical phase transitions typically occur at or near thermodynamic equilibrium, a normal-to-cancer (NTC) transition

This paper discusses the properties of cancer cells from a new perspective based on an analogy with phase transitions in physical systems. Similarities in terms of instabilities and attractor states are outlined and differences discussed. While physical phase transitions typically occur at or near thermodynamic equilibrium, a normal-to-cancer (NTC) transition is a dynamical non-equilibrium phenomenon, which depends on both metabolic energy supply and local physiological conditions. A number of implications for preventative and therapeutic strategies are outlined.

Created2011-08-25
130370-Thumbnail Image.png
Description

Background:
Drosophila gene expression pattern images document the spatiotemporal dynamics of gene expression during embryogenesis. A comparative analysis of these images could provide a fundamentally important way for studying the regulatory networks governing development. To facilitate pattern comparison and searching, groups of images in the Berkeley Drosophila Genome Project (BDGP) high-throughput

Background:
Drosophila gene expression pattern images document the spatiotemporal dynamics of gene expression during embryogenesis. A comparative analysis of these images could provide a fundamentally important way for studying the regulatory networks governing development. To facilitate pattern comparison and searching, groups of images in the Berkeley Drosophila Genome Project (BDGP) high-throughput study were annotated with a variable number of anatomical terms manually using a controlled vocabulary. Considering that the number of available images is rapidly increasing, it is imperative to design computational methods to automate this task.

Results:
We present a computational method to annotate gene expression pattern images automatically. The proposed method uses the bag-of-words scheme to utilize the existing information on pattern annotation and annotates images using a model that exploits correlations among terms. The proposed method can annotate images individually or in groups (e.g., according to the developmental stage). In addition, the proposed method can integrate information from different two-dimensional views of embryos. Results on embryonic patterns from BDGP data demonstrate that our method significantly outperforms other methods.

Conclusion:
The proposed bag-of-words scheme is effective in representing a set of annotations assigned to a group of images, and the model employed to annotate images successfully captures the correlations among different controlled vocabulary terms. The integration of existing annotation information from multiple embryonic views improves annotation performance.

ContributorsJi, Shuiwang (Author) / Li, Ying-Xin (Author) / Zhou, Zhi-Hua (Author) / Kumar, Sudhir (Author) / Ye, Jieping (Author) / Biodesign Institute (Contributor) / Ira A. Fulton Schools of Engineering (Contributor) / School of Electrical, Computer and Energy Engineering (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created2009-04-21
130372-Thumbnail Image.png
Description

Background: Staphylococcus aureus and S. epidermidis biofilms differ in structure, growth and regulation, and thus the high-throughput method of evaluating biofilm susceptibility that has been published for S. epidermidis cannot be applied to S. aureus without first evaluating the assay's reproducibility and reliability with S. aureus biofilms.

Methods: Staphylococcus aureus biofilms

Background: Staphylococcus aureus and S. epidermidis biofilms differ in structure, growth and regulation, and thus the high-throughput method of evaluating biofilm susceptibility that has been published for S. epidermidis cannot be applied to S. aureus without first evaluating the assay's reproducibility and reliability with S. aureus biofilms.

Methods: Staphylococcus aureus biofilms were treated with eleven approved antibiotics, lysostaphin, or Conflikt®, exposed to the oxidation reduction indicator Alamar blue, and reduction relative to untreated controls was determined visually and spectrophotometrically. The minimum biofilm inhibitory concentration (MBIC) was defined as ≤ 50% Alamar blue reduction and a purple/blue well 60 min after the addition of Alamar blue. Because all of the approved antibiotics had MBICs >128 μg/ml (most >2048 μg/ml), lysostaphin and Conflikt®, with relatively low MBICs, were used to correlate Alamar blue reduction with 2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-2H-tetrazolium-5-carboxanilide (XTT) reduction and viable counts (CFU/ml) for S. aureus ATCC 29213 and three clinical isolates. Alamar blue's stability and lack of toxicity allowed CFU/ml to be determined from the same wells as Alamar blue absorbances.

Results: Overall, Alamar blue reduction had excellent correlation with XTT reduction and with CFU/ml. For ATCC 29213 and two clinical isolates treated with lysostaphin or Conflikt®, Alamar blue reduction had excellent correlation with XTT reduction (r = 0.93-0.99) and with CFU/ml (r = 0.92-0.98). For one of the clinical isolates, the results were moderately correlated for Conflikt® (r = 0.76, Alamar blue vs. XTT; r = 0.81, Alamar blue vs. CFU/ml) and had excellent correlation for lysostaphin (r = 0.95, Alamar blue vs. XTT; r = 0.97, Alamar blue vs. CFU/ml).

Conclusion: A reliable, reproducible method for evaluating biofilm susceptibility was successfully applied to S. aureus biofilms. The described method provides researchers with a simple, nontoxic, relatively inexpensive, high throughput measure of viability after drug treatment. A standardized biofilm Alamar blue assay should greatly increase the rate of discovery of S. aureus biofilm specific agents.

Created2009-10-27
130373-Thumbnail Image.png
Description
Premise of the study: Land-plant plastid genomes have only rarely undergone significant changes in gene content and order. Thus, discovery of additional examples adds power to tests for causes of such genome-scale structural changes.
Methods: Using next-generation sequence data, we assembled the plastid genome of saguaro cactus and probed the nuclear

Premise of the study: Land-plant plastid genomes have only rarely undergone significant changes in gene content and order. Thus, discovery of additional examples adds power to tests for causes of such genome-scale structural changes.
Methods: Using next-generation sequence data, we assembled the plastid genome of saguaro cactus and probed the nuclear genome for transferred plastid genes and functionally related nuclear genes. We combined these results with available data across Cactaceae and seed plants more broadly to infer the history of gene loss and to assess the strength of phylogenetic association between gene loss and loss of the inverted repeat (IR).
Key results: The saguaro plastid genome is the smallest known for an obligately photosynthetic angiosperm (∼113 kb), having lost the IR and plastid ndh genes. This loss supports a statistically strong association across seed plants between the loss of ndh genes and the loss of the IR. Many nonplastid copies of plastid ndh genes were found in the nuclear genome, but none had intact reading frames; nor did three related nuclear-encoded subunits. However, nuclear pgr5, which functions in a partially redundant pathway, was intact.
Conclusions: The existence of an alternative pathway redundant with the function of the plastid NADH dehydrogenase-like complex (NDH) complex may permit loss of the plastid ndh gene suite in photoautotrophs like saguaro. Loss of these genes may be a recurring mechanism for overall plastid genome size reduction, especially in combination with loss of the IR.
ContributorsSanderson, Michael J. (Author) / Copetti, Dario (Author) / Burquez, Alberto (Author) / Bustamante, Enriquena (Author) / Charboneau, Joseph L. M. (Author) / Eguiarte, Luis E. (Author) / Kumar, Sudhir (Author) / Lee, Hyun Oh (Author) / Lee, Junki (Author) / McMahon, Michelle (Author) / Steele, Kelly (Author) / Wing, Rod (Author) / Yang, Tae-Jin (Author) / Zwickl, Derrick (Author) / Wojciechowski, Martin (Author) / College of Integrative Sciences and Arts (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created2015-07-01
130375-Thumbnail Image.png
Description
This article reviews the range of delivery platforms that have been developed for the PySAL open source Python library for spatial analysis. This includes traditional desktop software (with a graphical user interface, command line or embedded in a computational notebook), open spatial analytics middleware, and web, cloud and distributed open

This article reviews the range of delivery platforms that have been developed for the PySAL open source Python library for spatial analysis. This includes traditional desktop software (with a graphical user interface, command line or embedded in a computational notebook), open spatial analytics middleware, and web, cloud and distributed open geospatial analytics for decision support. A common thread throughout the discussion is the emphasis on openness, interoperability, and provenance management in a scientific workflow. The code base of the PySAL library provides the common computing framework underlying all delivery mechanisms.
ContributorsRey, Sergio (Author) / Anselin, Luc (Author) / Li, Xun (Author) / Pahle, Robert (Author) / Laura, Jason (Author) / Li, Wenwen (Author) / Koschinsky, Julia (Author) / College of Liberal Arts and Sciences (Contributor) / School of Geographical Sciences and Urban Planning (Contributor) / Computational Spatial Science (Contributor)
Created2015-06-01