This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 36
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
Description

Photosynthesis, a process catalysed by plants, algae and cyanobacteria converts sunlight to energy thus sustaining all higher life on Earth. Two large membrane protein complexes, photosystem I and II (PSI and PSII), act in series to catalyse the light-driven reactions in photosynthesis. PSII catalyses the light-driven water splitting process, which

Photosynthesis, a process catalysed by plants, algae and cyanobacteria converts sunlight to energy thus sustaining all higher life on Earth. Two large membrane protein complexes, photosystem I and II (PSI and PSII), act in series to catalyse the light-driven reactions in photosynthesis. PSII catalyses the light-driven water splitting process, which maintains the Earth’s oxygenic atmosphere. In this process, the oxygen-evolving complex (OEC) of PSII cycles through five states, S0 to S4, in which four electrons are sequentially extracted from the OEC in four light-driven charge-separation events. Here we describe time resolved experiments on PSII nano/microcrystals from Thermosynechococcus elongatus performed with the recently developed technique of serial femtosecond crystallography. Structures have been determined from PSII in the dark S1 state and after double laser excitation (putative S3 state) at 5 and 5.5 Å resolution, respectively. The results provide evidence that PSII undergoes significant conformational changes at the electron acceptor side and at the Mn4CaO5 core of the OEC. These include an elongation of the metal cluster, accompanied by changes in the protein environment, which could allow for binding of the second substrate water molecule between the more distant protruding Mn (referred to as the ‘dangler’ Mn) and the Mn3CaOx cubane in the S2 to S3 transition, as predicted by spectroscopic and computational studies. This work shows the great potential for time-resolved serial femtosecond crystallography for investigation of catalytic processes in biomolecules.

ContributorsKupitz, Christopher (Author) / Basu, Shibom (Author) / Grotjohann, Ingo (Author) / Fromme, Raimund (Author) / Zatsepin, Nadia (Author) / Rendek, Kimberly (Author) / Hunter, Mark (Author) / Shoeman, Robert L. (Author) / White, Thomas A. (Author) / Wang, Dingjie (Author) / James, Daniel (Author) / Yang, Jay-How (Author) / Cobb, Danielle (Author) / Reeder, Brenda (Author) / Sierra, Raymond G. (Author) / Liu, Haiguang (Author) / Barty, Anton (Author) / Aquila, Andrew L. (Author) / Deponte, Daniel (Author) / Kirian, Richard (Author) / Bari, Sadia (Author) / Bergkamp, Jesse (Author) / Beyerlein, Kenneth R. (Author) / Bogan, Michael J. (Author) / Caleman, Carl (Author) / Chao, Tzu-Chiao (Author) / Conrad, Chelsie (Author) / Davis, Katherine M. (Author) / Department of Chemistry and Biochemistry (Contributor)
Created2014-09-11
Description

We present results from experiments at the Linac Coherent Light Source (LCLS) demonstrating that serial femtosecond crystallography (SFX) can be performed to high resolution (~2.5 Å) using protein microcrystals deposited on an ultra-thin silicon nitride membrane and embedded in a preservation medium at room temperature. Data can be acquired at

We present results from experiments at the Linac Coherent Light Source (LCLS) demonstrating that serial femtosecond crystallography (SFX) can be performed to high resolution (~2.5 Å) using protein microcrystals deposited on an ultra-thin silicon nitride membrane and embedded in a preservation medium at room temperature. Data can be acquired at a high acquisition rate using x-ray free electron laser sources to overcome radiation damage, while sample consumption is dramatically reduced compared to flowing jet methods. We achieved a peak data acquisition rate of 10 Hz with a hit rate of ~38%, indicating that a complete data set could be acquired in about one 12-hour LCLS shift using the setup described here, or in even less time using hardware optimized for fixed target SFX. This demonstration opens the door to ultra low sample consumption SFX using the technique of diffraction-before-destruction on proteins that exist in only small quantities and/or do not produce the copious quantities of microcrystals required for flowing jet methods.

ContributorsHunter, Mark S. (Author) / Segelke, Brent (Author) / Messerschmidt, Marc (Author) / Williams, Garth J. (Author) / Zatsepin, Nadia (Author) / Barty, Anton (Author) / Benner, W. Henry (Author) / Carlson, David B. (Author) / Coleman, Matthew (Author) / Graf, Alexander (Author) / Hau-Riege, Stefan P. (Author) / Pardini, Tommaso (Author) / Seibert, M. Marvin (Author) / Evans, James (Author) / Boutet, Sebastien (Author) / Frank, Matthias (Author) / College of Liberal Arts and Sciences (Contributor)
Created2014-08-12
Description

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our understanding of such complex systems. However, the data at our disposal are often not easily comparable, have limited scope and scale, and are based on disparate underlying frameworks inhibiting synthesis, meta-analysis, and the validation of findings. Research efforts are further hampered when case inclusion criteria, variable definitions, coding schema, and inter-coder reliability testing are not made explicit in the presentation of research and shared among the research community. This paper first outlines challenges experienced by researchers engaged in a large-scale coding project; then highlights valuable lessons learned; and finally discusses opportunities for further research on comparative case study analysis focusing on social-ecological systems and common pool resources. Includes supplemental materials and appendices published in the International Journal of the Commons 2016 Special Issue. Volume 10 - Issue 2 - 2016.

ContributorsRatajczyk, Elicia (Author) / Brady, Ute (Author) / Baggio, Jacopo (Author) / Barnett, Allain J. (Author) / Perez Ibarra, Irene (Author) / Rollins, Nathan (Author) / Rubinos, Cathy (Author) / Shin, Hoon Cheol (Author) / Yu, David (Author) / Aggarwal, Rimjhim (Author) / Anderies, John (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-09
128166-Thumbnail Image.png
Description

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance eventually brought a revolution in how scholars (and graduate students) were trained and worked. This revolution never occurred in K-12 or university education such that we now teach young students in much the way that scholars were taught in the dark ages, we teach them what is already known rather than the process of knowing. Citizen science offers a way to change K-12 and university education and, in doing so, complete the renaissance. Here we offer an example of such an approach and call for change in the way students are taught science, change that is more possible than it has ever been and is, nonetheless, five hundred years delayed.

Created2016-03-01
128591-Thumbnail Image.png
Description

Gas seeps emanating from Yanartaş (Chimera), Turkey, have been documented for thousands of years. Active serpentinization produces hydrogen and a range of carbon gases that may provide fuel for life. Here we report a newly discovered, ephemeral fluid seep emanating from a small gas vent at Yanartaş. Fluids and biofilms

Gas seeps emanating from Yanartaş (Chimera), Turkey, have been documented for thousands of years. Active serpentinization produces hydrogen and a range of carbon gases that may provide fuel for life. Here we report a newly discovered, ephemeral fluid seep emanating from a small gas vent at Yanartaş. Fluids and biofilms were sampled at the source and points downstream. We describe site conditions, and provide microbiological data in the form of enrichment cultures, Scanning electron microscopy (SEM), carbon and nitrogen isotopic composition of solids, and PCR screens of nitrogen cycle genes. Source fluids are pH 11.95, with a Ca:Mg of ~200, and sediments under the ignited gas seep measure 60°C. Collectively, these data suggest the fluid is the product of active serpentinization at depth. Source sediments are primarily calcite and alteration products (chlorite and montmorillonite). Downstream, biofilms are mixed with montmorillonite. SEM shows biofilms distributed homogeneously with carbonates. Organic carbon accounts for 60% of the total carbon at the source, decreasing downstream to <15% as inorganic carbon precipitates. δ13C ratios of the organic carbon fraction of solids are depleted (−25 to −28‰) relative to the carbonates (−11 to −20‰). We conclude that heterotrophic processes are dominant throughout the surface ecosystem, and carbon fixation may be key down channel. δ15N ratios ~3‰, and absence of nifH in extracted DNA suggest that nitrogen fixation is not occurring in sediments. However, the presence of narG and nirS at most locations and in enrichments indicates genomic potential for nitrate and nitrite reduction. This small seep with shallow run-off is likely ephemeral, but abundant preserved microterracettes in the outflow and the surrounding area suggest it has been present for some time. This site and others like it present an opportunity for investigations of preserved deep biosphere signatures, and subsurface-surface interactions.

ContributorsMeyer-Dombard, D'Arcy R. (Author) / Woycheese, Kristin M. (Author) / Yargicoglu, Erin N. (Author) / Cardace, Dawn (Author) / Shock, Everett (Author) / Gulecal-Pektas, Yasemin (Author) / Temel, Mustafa (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-01-19
127872-Thumbnail Image.png
Description

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial ecology of a singular built environment, the International Space Station (ISS). This ISS sampling involved the collection and microbial analysis (via 16S rRNA gene PCR) of 15 surfaces sampled by swabs onboard the ISS. This sampling was a component of Project MERCCURI (Microbial Ecology Research Combining Citizen and University Researchers on ISS). Learning more about the microbial inhabitants of the “buildings” in which we travel through space will take on increasing importance, as plans for human exploration continue, with the possibility of colonization of other planets and moons.

Results: Sterile swabs were used to sample 15 surfaces onboard the ISS. The sites sampled were designed to be analogous to samples collected for (1) the Wildlife of Our Homes project and (2) a study of cell phones and shoes that were concurrently being collected for another component of Project MERCCURI. Sequencing of the 16S rRNA genes amplified from DNA extracted from each swab was used to produce a census of the microbes present on each surface sampled. We compared the microbes found on the ISS swabs to those from both homes on Earth and data from the Human Microbiome Project.

Conclusions: While significantly different from homes on Earth and the Human Microbiome Project samples analyzed here, the microbial community composition on the ISS was more similar to home surfaces than to the human microbiome samples. The ISS surfaces are OTU-rich with 1,036–4,294 operational taxonomic units (OTUs per sample). There was no discernible biogeography of microbes on the 15 ISS surfaces, although this may be a reflection of the small sample size we were able to obtain.

ContributorsLang, Jenna M. (Author) / Coil, David A. (Author) / Neches, Russell Y. (Author) / Brown, Wendy E. (Author) / Cavalier, Darlene (Author) / Severance, Mark (Author) / Hampton-Marcell, Jarrad T. (Author) / Gilbert, Jack A. (Author) / Eisen, Jonathan A. (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-12-05
127836-Thumbnail Image.png
Description

Previous proof-of-concept measurements on single-layer two-dimensional membrane-protein crystals performed at X-ray free-electron lasers (FELs) have demonstrated that the collection of meaningful diffraction patterns, which is not possible at synchrotrons because of radiation-damage issues, is feasible. Here, the results obtained from the analysis of a thousand single-shot, room-temperature X-ray FEL diffraction

Previous proof-of-concept measurements on single-layer two-dimensional membrane-protein crystals performed at X-ray free-electron lasers (FELs) have demonstrated that the collection of meaningful diffraction patterns, which is not possible at synchrotrons because of radiation-damage issues, is feasible. Here, the results obtained from the analysis of a thousand single-shot, room-temperature X-ray FEL diffraction images from two-dimensional crystals of a bacteriorhodopsin mutant are reported in detail. The high redundancy in the measurements boosts the intensity signal-to-noise ratio, so that the values of the diffracted intensities can be reliably determined down to the detector-edge resolution of 4 Å. The results show that two-dimensional serial crystallography at X-ray FELs is a suitable method to study membrane proteins to near-atomic length scales at ambient temperature. The method presented here can be extended to pump–probe studies of optically triggered structural changes on submillisecond timescales in two-dimensional crystals, which allow functionally relevant large-scale motions that may be quenched in three-dimensional crystals.

ContributorsCasadei, Cecilia M. (Author) / Tsai, Ching-Ju (Author) / Barty, Anton (Author) / Hunter, Mark S. (Author) / Zatsepin, Nadia (Author) / Padeste, Celestino (Author) / Capitani, Guido (Author) / Benner, W. Henry (Author) / Boutet, Sebastien (Author) / Hau-Riege, Stefan P. (Author) / Kupitz, Christopher (Author) / Messerschmidt, Marc (Author) / Ogren, John I. (Author) / Pardini, Tom (Author) / Rothschild, Kenneth J. (Author) / Sala, Leonardo (Author) / Segelke, Brent (Author) / Williams, Garth J. (Author) / Evans, James E. (Author) / Li, Xiao-Dan (Author) / Coleman, Matthew (Author) / Pedrini, Bill (Author) / Frank, Matthias (Author) / College of Liberal Arts and Sciences (Contributor)
Created2018-01
128510-Thumbnail Image.png
Description

We describe the deposition of four datasets consisting of X-ray diffraction images acquired using serial femtosecond crystallography experiments on microcrystals of human G protein-coupled receptors, grown and delivered in lipidic cubic phase, at the Linac Coherent Light Source. The receptors are: the human serotonin receptor 2B in complex with an

We describe the deposition of four datasets consisting of X-ray diffraction images acquired using serial femtosecond crystallography experiments on microcrystals of human G protein-coupled receptors, grown and delivered in lipidic cubic phase, at the Linac Coherent Light Source. The receptors are: the human serotonin receptor 2B in complex with an agonist ergotamine, the human δ-opioid receptor in complex with a bi-functional peptide ligand DIPP-NH2, the human smoothened receptor in complex with an antagonist cyclopamine, and finally the human angiotensin II type 1 receptor in complex with the selective antagonist ZD7155. All four datasets have been deposited, with minimal processing, in an HDF5-based file format, which can be used directly for crystallographic processing with CrystFEL or other software. We have provided processing scripts and supporting files for recent versions of CrystFEL, which can be used to validate the data.

ContributorsWhite, Thomas A. (Author) / Barty, Anton (Author) / Liu, Wei (Author) / Ishchenko, Andrii (Author) / Zhang, Haitao (Author) / Gati, Cornelius (Author) / Zatsepin, Nadia (Author) / Basu, Shibom (Author) / Oberthur, Dominik (Author) / Metz, Markus (Author) / Beyerlein, Kenneth R. (Author) / Yoon, Chun Hong (Author) / Yefanov, Oleksandr M. (Author) / James, Daniel (Author) / Wang, Dingjie (Author) / Messerschmidt, Marc (Author) / Koglin, Jason E. (Author) / Boutet, Sebastien (Author) / Weierstall, Uwe (Author) / Cherezov, Vadim (Author) / College of Liberal Arts and Sciences (Contributor)
Created2016-08-01
128562-Thumbnail Image.png
Description

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To identify the flow direction of clickstreams, we define the “flow distance” of nodes (Li), which measures the average number of steps a random walker takes to reach the ith node. It is observed that Li is related with the clicks (Ci) to news stories and the age (Ti) of stories. Putting these three variables together help us understand the rise and decay of news stories from a network perspective. We also find that the studied clickstream networks preserve a stable structure over time, leading to the scaling between users and clicks. The universal scaling behavior is confirmed by the 1,000 Web forums. We suggest that the tree-like, stable structure of clickstream networks reveals the time-sensitive preference of users in online browsing. To test our assumption, we discuss three models on individual browsing behavior, and compare the simulation results with empirical data.

ContributorsWang, Cheng-Jun (Author) / Wu, Lingfei (Author) / Zhang, Jiang (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-28