This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 42
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
Description

We present results from experiments at the Linac Coherent Light Source (LCLS) demonstrating that serial femtosecond crystallography (SFX) can be performed to high resolution (~2.5 Å) using protein microcrystals deposited on an ultra-thin silicon nitride membrane and embedded in a preservation medium at room temperature. Data can be acquired at

We present results from experiments at the Linac Coherent Light Source (LCLS) demonstrating that serial femtosecond crystallography (SFX) can be performed to high resolution (~2.5 Å) using protein microcrystals deposited on an ultra-thin silicon nitride membrane and embedded in a preservation medium at room temperature. Data can be acquired at a high acquisition rate using x-ray free electron laser sources to overcome radiation damage, while sample consumption is dramatically reduced compared to flowing jet methods. We achieved a peak data acquisition rate of 10 Hz with a hit rate of ~38%, indicating that a complete data set could be acquired in about one 12-hour LCLS shift using the setup described here, or in even less time using hardware optimized for fixed target SFX. This demonstration opens the door to ultra low sample consumption SFX using the technique of diffraction-before-destruction on proteins that exist in only small quantities and/or do not produce the copious quantities of microcrystals required for flowing jet methods.

ContributorsHunter, Mark S. (Author) / Segelke, Brent (Author) / Messerschmidt, Marc (Author) / Williams, Garth J. (Author) / Zatsepin, Nadia (Author) / Barty, Anton (Author) / Benner, W. Henry (Author) / Carlson, David B. (Author) / Coleman, Matthew (Author) / Graf, Alexander (Author) / Hau-Riege, Stefan P. (Author) / Pardini, Tommaso (Author) / Seibert, M. Marvin (Author) / Evans, James (Author) / Boutet, Sebastien (Author) / Frank, Matthias (Author) / College of Liberal Arts and Sciences (Contributor)
Created2014-08-12
Description

Photosynthesis, a process catalysed by plants, algae and cyanobacteria converts sunlight to energy thus sustaining all higher life on Earth. Two large membrane protein complexes, photosystem I and II (PSI and PSII), act in series to catalyse the light-driven reactions in photosynthesis. PSII catalyses the light-driven water splitting process, which

Photosynthesis, a process catalysed by plants, algae and cyanobacteria converts sunlight to energy thus sustaining all higher life on Earth. Two large membrane protein complexes, photosystem I and II (PSI and PSII), act in series to catalyse the light-driven reactions in photosynthesis. PSII catalyses the light-driven water splitting process, which maintains the Earth’s oxygenic atmosphere. In this process, the oxygen-evolving complex (OEC) of PSII cycles through five states, S0 to S4, in which four electrons are sequentially extracted from the OEC in four light-driven charge-separation events. Here we describe time resolved experiments on PSII nano/microcrystals from Thermosynechococcus elongatus performed with the recently developed technique of serial femtosecond crystallography. Structures have been determined from PSII in the dark S1 state and after double laser excitation (putative S3 state) at 5 and 5.5 Å resolution, respectively. The results provide evidence that PSII undergoes significant conformational changes at the electron acceptor side and at the Mn4CaO5 core of the OEC. These include an elongation of the metal cluster, accompanied by changes in the protein environment, which could allow for binding of the second substrate water molecule between the more distant protruding Mn (referred to as the ‘dangler’ Mn) and the Mn3CaOx cubane in the S2 to S3 transition, as predicted by spectroscopic and computational studies. This work shows the great potential for time-resolved serial femtosecond crystallography for investigation of catalytic processes in biomolecules.

ContributorsKupitz, Christopher (Author) / Basu, Shibom (Author) / Grotjohann, Ingo (Author) / Fromme, Raimund (Author) / Zatsepin, Nadia (Author) / Rendek, Kimberly (Author) / Hunter, Mark (Author) / Shoeman, Robert L. (Author) / White, Thomas A. (Author) / Wang, Dingjie (Author) / James, Daniel (Author) / Yang, Jay-How (Author) / Cobb, Danielle (Author) / Reeder, Brenda (Author) / Sierra, Raymond G. (Author) / Liu, Haiguang (Author) / Barty, Anton (Author) / Aquila, Andrew L. (Author) / Deponte, Daniel (Author) / Kirian, Richard (Author) / Bari, Sadia (Author) / Bergkamp, Jesse (Author) / Beyerlein, Kenneth R. (Author) / Bogan, Michael J. (Author) / Caleman, Carl (Author) / Chao, Tzu-Chiao (Author) / Conrad, Chelsie (Author) / Davis, Katherine M. (Author) / Department of Chemistry and Biochemistry (Contributor)
Created2014-09-11
129243-Thumbnail Image.png
Description

It has been suggested that the extended intensity profiles surrounding Bragg reflections that arise when a series of finite crystals of varying size and shape are illuminated by the intense, coherent illumination of an x-ray free-electron laser may enable the crystal’s unit-cell electron density to be obtained ab initio via

It has been suggested that the extended intensity profiles surrounding Bragg reflections that arise when a series of finite crystals of varying size and shape are illuminated by the intense, coherent illumination of an x-ray free-electron laser may enable the crystal’s unit-cell electron density to be obtained ab initio via well-established iterative phasing algorithms. Such a technique could have a significant impact on the field of biological structure determination since it avoids the need for a priori information from similar known structures, multiple measurements near resonant atomic absorption energies, isomorphic derivative crystals, or atomic-resolution data. Here, we demonstrate this phasing technique on diffraction patterns recorded from artificial two-dimensional microcrystals using the seeded soft x-ray free-electron laser FERMI. We show that the technique is effective when the illuminating wavefront has nonuniform phase and amplitude, and when the diffraction intensities cannot be measured uniformly throughout reciprocal space because of a limited signal-to-noise ratio.

ContributorsKirian, Richard (Author) / Bean, Richard J. (Author) / Beyerlein, Kenneth R. (Author) / Barthelmess, Miriam (Author) / Yoon, Chun Hong (Author) / Wang, Fenglin (Author) / Capotondi, Flavio (Author) / Pedersoli, Emanuele (Author) / Barty, Anton (Author) / Chapman, Henry N. (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-02-12
129259-Thumbnail Image.png
Description

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all they are a form of codified self-regulation. While codes can be beneficial, it argues that when we scratch below the surface, there are many problems at their root. In terms of efficacy, codes can serve as a form of ethical window dressing, rather than effective rules for behavior. But even more that, codes can degrade the meaning behind being a good person who acts ethically for the right reasons.

Created2013-11-30
128778-Thumbnail Image.png
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tends to stick on the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the population of users taking A and B strategies that approximates 2:1, is found to be optimal to the sustainable growth of communities.

ContributorsWu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-03-02
128995-Thumbnail Image.png
Description

Background: Obesity is a metabolic disease caused by environmental and genetic factors. However, the epigenetic mechanisms of obesity are incompletely understood. The aim of our study was to investigate the role of skeletal muscle DNA methylation in combination with transcriptomic changes in obesity.

Results: Muscle biopsies were obtained basally from lean (n = 12; BMI = 23.4 ± 0.7

Background: Obesity is a metabolic disease caused by environmental and genetic factors. However, the epigenetic mechanisms of obesity are incompletely understood. The aim of our study was to investigate the role of skeletal muscle DNA methylation in combination with transcriptomic changes in obesity.

Results: Muscle biopsies were obtained basally from lean (n = 12; BMI = 23.4 ± 0.7 kg/m[superscript 2]) and obese (n = 10; BMI = 32.9 ± 0.7 kg/m[superscript 2]) participants in combination with euglycemic-hyperinsulinemic clamps to assess insulin sensitivity. We performed reduced representation bisulfite sequencing (RRBS) next-generation methylation and microarray analyses on DNA and RNA isolated from vastus lateralis muscle biopsies. There were 13,130 differentially methylated cytosines (DMC; uncorrected P < 0.05) that were altered in the promoter and untranslated (5' and 3'UTR) regions in the obese versus lean analysis. Microarray analysis revealed 99 probes that were significantly (corrected P < 0.05) altered. Of these, 12 genes (encompassing 22 methylation sites) demonstrated a negative relationship between gene expression and DNA methylation. Specifically, sorbin and SH3 domain containing 3 (SORBS3) which codes for the adapter protein vinexin was significantly decreased in gene expression (fold change −1.9) and had nine DMCs that were significantly increased in methylation in obesity (methylation differences ranged from 5.0 to 24.4 %). Moreover, differentially methylated region (DMR) analysis identified a region in the 5'UTR (Chr.8:22,423,530–22,423,569) of SORBS3 that was increased in methylation by 11.2 % in the obese group. The negative relationship observed between DNA methylation and gene expression for SORBS3 was validated by a site-specific sequencing approach, pyrosequencing, and qRT-PCR. Additionally, we performed transcription factor binding analysis and identified a number of transcription factors whose binding to the differentially methylated sites or region may contribute to obesity.

Conclusions: These results demonstrate that obesity alters the epigenome through DNA methylation and highlights novel transcriptomic changes in SORBS3 in skeletal muscle.

ContributorsDay, Samantha (Author) / Coletta, Rich (Author) / Kim, Joon Young (Author) / Campbell, Latoya (Author) / Benjamin, Tonya R. (Author) / Roust, Lori R. (Author) / De Filippis, Elena A. (Author) / Dinu, Valentin (Author) / Shaibi, Gabriel (Author) / Mandarino, Lawrence J. (Author) / Coletta, Dawn (Author) / College of Liberal Arts and Sciences (Contributor)
Created2016-07-18
128932-Thumbnail Image.png
Description

We have previously hypothesized a biological pathway of activity-dependent synaptic plasticity proteins that addresses the dual genetic and environmental contributions to schizophrenia. Accordingly, variations in the immediate early gene EGR3, and its target ARC, should influence schizophrenia susceptibility. We used a pooled Next-Generation Sequencing approach to identify variants across these

We have previously hypothesized a biological pathway of activity-dependent synaptic plasticity proteins that addresses the dual genetic and environmental contributions to schizophrenia. Accordingly, variations in the immediate early gene EGR3, and its target ARC, should influence schizophrenia susceptibility. We used a pooled Next-Generation Sequencing approach to identify variants across these genes in U.S. populations of European (EU) and African (AA) descent. Three EGR3 and one ARC SNP were selected and genotyped for validation, and three SNPs were tested for association in a replication cohort. In the EU group of 386 schizophrenia cases and 150 controls EGR3 SNP rs1877670 and ARC SNP rs35900184 showed significant associations (p = 0.0078 and p = 0.0275, respectively). In the AA group of 185 cases and 50 controls, only the ARC SNP revealed significant association (p = 0.0448). The ARC SNP did not show association in the Han Chinese (CH) population. However, combining the EU, AA, and CH groups revealed a highly significant association of ARC SNP rs35900184 (p = 2.353 x 10-7; OR [95% CI] = 1.54 [1.310–1.820]). These findings support previously reported associations between EGR3 and schizophrenia. Moreover, this is the first report associating an ARC SNP with schizophrenia and supports recent large-scale GWAS findings implicating the ARC complex in schizophrenia risk. These results support the need for further investigation of the proposed pathway of environmentally responsive, synaptic plasticity-related, schizophrenia genes.

ContributorsHuentelman, Matthew J. (Author) / Muppana, Leela (Author) / Courneveaux, Jason J. (Author) / Dinu, Valentin (Author) / Pruzin, Jeremy J. (Author) / Reiman, Rebecca (Author) / Borish, Cassie N. (Author) / De Both, Matt (Author) / Ahmed, Amber (Author) / Todorov, Alexandre (Author) / Cloninger, C. Robert (Author) / Zhang, Rui (Author) / Ma, Jie (Author) / Gallitano, Amelia L. (Author) / College of Health Solutions (Contributor)
Created2015-10-16
128958-Thumbnail Image.png
Description

Background: Immunosignaturing is a new peptide microarray based technology for profiling of humoral immune responses. Despite new challenges, immunosignaturing gives us the opportunity to explore new and fundamentally different research questions. In addition to classifying samples based on disease status, the complex patterns and latent factors underlying immunosignatures, which we attempt

Background: Immunosignaturing is a new peptide microarray based technology for profiling of humoral immune responses. Despite new challenges, immunosignaturing gives us the opportunity to explore new and fundamentally different research questions. In addition to classifying samples based on disease status, the complex patterns and latent factors underlying immunosignatures, which we attempt to model, may have a diverse range of applications.

Methods: We investigate the utility of a number of statistical methods to determine model performance and address challenges inherent in analyzing immunosignatures. Some of these methods include exploratory and confirmatory factor analyses, classical significance testing, structural equation and mixture modeling.

Results: We demonstrate an ability to classify samples based on disease status and show that immunosignaturing is a very promising technology for screening and presymptomatic screening of disease. In addition, we are able to model complex patterns and latent factors underlying immunosignatures. These latent factors may serve as biomarkers for disease and may play a key role in a bioinformatic method for antibody discovery.

Conclusion: Based on this research, we lay out an analytic framework illustrating how immunosignatures may be useful as a general method for screening and presymptomatic screening of disease as well as antibody discovery.

ContributorsBrown, Justin (Author) / Stafford, Phillip (Author) / Johnston, Stephen (Author) / Dinu, Valentin (Author) / College of Health Solutions (Contributor)
Created2011-08-19
129066-Thumbnail Image.png
Description

Background: Glioblastoma is the most aggressive primary central nervous tumor and carries a very poor prognosis. Invasion precludes effective treatment and virtually assures tumor recurrence. In the current study, we applied analytical and bioinformatics approaches to identify a set of microRNAs (miRs) from several different human glioblastoma cell lines that exhibit

Background: Glioblastoma is the most aggressive primary central nervous tumor and carries a very poor prognosis. Invasion precludes effective treatment and virtually assures tumor recurrence. In the current study, we applied analytical and bioinformatics approaches to identify a set of microRNAs (miRs) from several different human glioblastoma cell lines that exhibit significant differential expression between migratory (edge) and migration-restricted (core) cell populations. The hypothesis of the study is that differential expression of miRs provides an epigenetic mechanism to drive cell migration and invasion.

Results: Our research data comprise gene expression values for a set of 805 human miRs collected from matched pairs of migratory and migration-restricted cell populations from seven different glioblastoma cell lines. We identified 62 down-regulated and 2 up-regulated miRs that exhibit significant differential expression in the migratory (edge) cell population compared to matched migration-restricted (core) cells. We then conducted target prediction and pathway enrichment analysis with these miRs to investigate potential associated gene and pathway targets. Several miRs in the list appear to directly target apoptosis related genes. The analysis identifies a set of genes that are predicted by 3 different algorithms, further emphasizing the potential validity of these miRs to promote glioblastoma.

Conclusions: The results of this study identify a set of miRs with potential for decreased expression in invasive glioblastoma cells. The verification of these miRs and their associated targeted proteins provides new insights for further investigation into therapeutic interventions. The methodological approaches employed here could be applied to the study of other diseases to provide biomedical researchers and clinicians with increased opportunities for therapeutic interventions.

ContributorsBradley, Barrie (Author) / Loftus, Joseph C. (Author) / Mielke, Clinton (Author) / Dinu, Valentin (Author) / College of Health Solutions (Contributor)
Created2014-01-18