This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 51
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
129537-Thumbnail Image.png
Description

There are many proteomic applications that require large collections of purified protein, but parallel production of large numbers of different proteins remains a very challenging task. To help meet the needs of the scientific community, we have developed a human protein production pipeline. Using high-throughput (HT) methods, we transferred the

There are many proteomic applications that require large collections of purified protein, but parallel production of large numbers of different proteins remains a very challenging task. To help meet the needs of the scientific community, we have developed a human protein production pipeline. Using high-throughput (HT) methods, we transferred the genes of 31 full-length proteins into three expression vectors, and expressed the collection as N-terminal HaloTag fusion proteins in Escherichia coli and two commercial cell-free (CF) systems, wheat germ extract (WGE) and HeLa cell extract (HCE). Expression was assessed by labeling the fusion proteins specifically and covalently with a fluorescent HaloTag ligand and detecting its fluorescence on a LabChip[superscript ®] GX microfluidic capillary gel electrophoresis instrument. This automated, HT assay provided both qualitative and quantitative assessment of recombinant protein. E. coli was only capable of expressing 20% of the test collection in the supernatant fraction with ≥20 μg yields, whereas CF systems had ≥83% success rates. We purified expressed proteins using an automated HaloTag purification method. We purified 20, 33, and 42% of the test collection from E. coli, WGE, and HCE, respectively, with yields ≥1 μg and ≥90% purity. Based on these observations, we have developed a triage strategy for producing full-length human proteins in these three expression systems.

ContributorsSaul, Justin (Author) / Petritis, Brianne (Author) / Sau, Sujay (Author) / Rauf, Femina (Author) / Gaskin, Michael (Author) / Ober-Reynolds, Benjamin (Author) / Mineyev, Irina (Author) / Magee, Mitch (Author) / Chaput, John (Author) / Qiu, Ji (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2014-08-01
Description

Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic

Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.

ContributorsYu, Xiaobo (Author) / Bian, Xiaofang (Author) / Throop, Andrea (Author) / Song, Lusheng (Author) / del Moral, Lerys (Author) / Park, Jin (Author) / Seiler, Catherine (Author) / Fiacco, Michael (Author) / Steel, Jason (Author) / Hunter, Preston (Author) / Saul, Justin (Author) / Wang, Jie (Author) / Qiu, Ji (Author) / Pipas, James M. (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2013-11-30
129363-Thumbnail Image.png
Description

Explosive extrusion of cold material from the interior of icy bodies, or cryovolcanism, has been observed on Enceladus and, perhaps, Europa, Triton, and Ceres. It may explain the observed evidence for a young surface on Charon (Pluto’s surface is masked by frosts). Here, we evaluate prerequisites for cryovolcanism on dwarf

Explosive extrusion of cold material from the interior of icy bodies, or cryovolcanism, has been observed on Enceladus and, perhaps, Europa, Triton, and Ceres. It may explain the observed evidence for a young surface on Charon (Pluto’s surface is masked by frosts). Here, we evaluate prerequisites for cryovolcanism on dwarf planet-class Kuiper belt objects (KBOs). We first review the likely spatial and temporal extent of subsurface liquid, proposed mechanisms to overcome the negative buoyancy of liquid water in ice, and the volatile inventory of KBOs. We then present a new geochemical equilibrium model for volatile exsolution and its ability to drive upward crack propagation. This novel approach bridges geophysics and geochemistry, and extends geochemical modeling to the seldom-explored realm of liquid water at subzero temperatures. We show that carbon monoxide (CO) is a key volatile for gas-driven fluid ascent; whereas CO2 and sulfur gases only play a minor role. N2, CH4, and H2 exsolution may also drive explosive cryovolcanism if hydrothermal activity produces these species in large amounts (a few percent with respect to water). Another important control on crack propagation is the internal structure: a hydrated core makes explosive cryovolcanism easier, but an undifferentiated crust does not. We briefly discuss other controls on ascent such as fluid freezing on crack walls, and outline theoretical advances necessary to better understand cryovolcanic processes. Finally, we make testable predictions for the 2015 New Horizons flyby of the Pluto-Charon system.

ContributorsNeveu, Marc (Author) / Desch, Steven (Author) / Shock, Everett (Author) / Glein, C. R. (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-01-15
129259-Thumbnail Image.png
Description

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all they are a form of codified self-regulation. While codes can be beneficial, it argues that when we scratch below the surface, there are many problems at their root. In terms of efficacy, codes can serve as a form of ethical window dressing, rather than effective rules for behavior. But even more that, codes can degrade the meaning behind being a good person who acts ethically for the right reasons.

Created2013-11-30
129278-Thumbnail Image.png
Description

We report a device to fill an array of small chemical reaction chambers (microreactors) with reagent and then seal them using pressurized viscous liquid acting through a flexible membrane. The device enables multiple, independent chemical reactions involving free floating intermediate molecules without interference from neighboring reactions or external environments. The

We report a device to fill an array of small chemical reaction chambers (microreactors) with reagent and then seal them using pressurized viscous liquid acting through a flexible membrane. The device enables multiple, independent chemical reactions involving free floating intermediate molecules without interference from neighboring reactions or external environments. The device is validated by protein expressed in situ directly from DNA in a microarray of ~10,000 spots with no diffusion during three hours incubation. Using the device to probe for an autoantibody cancer biomarker in blood serum sample gave five times higher signal to background ratio compared to standard protein microarray expressed on a flat microscope slide. Physical design principles to effectively fill the array of microreactors with reagent and experimental results of alternate methods for sealing the microreactors are presented.

ContributorsWiktor, Peter (Author) / Brunner, Al (Author) / Kahn, Peter (Author) / Qiu, Ji (Author) / Magee, Mitch (Author) / Bian, Xiaofang (Author) / Karthikeyan, Kailash (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2015-03-04
129310-Thumbnail Image.png
Description

Sera from patients with ovarian cancer contain autoantibodies (AAb) to tumor-derived proteins that are potential biomarkers for early detection. To detect AAb, we probed high-density programmable protein microarrays (NAPPA) expressing 5177 candidate tumor antigens with sera from patients with serous ovarian cancer (n = 34 cases/30 controls) and measured bound

Sera from patients with ovarian cancer contain autoantibodies (AAb) to tumor-derived proteins that are potential biomarkers for early detection. To detect AAb, we probed high-density programmable protein microarrays (NAPPA) expressing 5177 candidate tumor antigens with sera from patients with serous ovarian cancer (n = 34 cases/30 controls) and measured bound IgG. Of these, 741 antigens were selected and probed with an independent set of ovarian cancer sera (n = 60 cases/60 controls). Twelve potential autoantigens were identified with sensitivities ranging from 13 to 22% at >93% specificity. These were retested using a Luminex bead array using 60 cases and 60 controls, with sensitivities ranging from 0 to 31.7% at 95% specificity. Three AAb (p53, PTPRA, and PTGFR) had area under the curve (AUC) levels >60% (p < 0.01), with the partial AUC (SPAUC) over 5 times greater than for a nondiscriminating test (p < 0.01). Using a panel of the top three AAb (p53, PTPRA, and PTGFR), if at least two AAb were positive, then the sensitivity was 23.3% at 98.3% specificity. AAb to at least one of these top three antigens were also detected in 7/20 sera (35%) of patients with low CA 125 levels and 0/15 controls. AAb to p53, PTPRA, and PTGFR are potential biomarkers for the early detection of ovarian cancer.

ContributorsAnderson, Karen (Author) / Cramer, Daniel W. (Author) / Sibani, Sahar (Author) / Wallstrom, Garrick (Author) / Wong, Jessica (Author) / Park, Jin (Author) / Qiu, Ji (Author) / Vitonis, Allison (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2015-01-01
128778-Thumbnail Image.png
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tends to stick on the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the population of users taking A and B strategies that approximates 2:1, is found to be optimal to the sustainable growth of communities.

ContributorsWu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-03-02
128923-Thumbnail Image.png
Description

The unicellular microalga Haematococcus pluvialis has emerged as a promising biomass feedstock for the ketocarotenoid astaxanthin and neutral lipid triacylglycerol. Motile flagellates, resting palmella cells, and cysts are the major life cycle stages of H. pluvialis. Fast-growing motile cells are usually used to induce astaxanthin and triacylglycerol biosynthesis under stress

The unicellular microalga Haematococcus pluvialis has emerged as a promising biomass feedstock for the ketocarotenoid astaxanthin and neutral lipid triacylglycerol. Motile flagellates, resting palmella cells, and cysts are the major life cycle stages of H. pluvialis. Fast-growing motile cells are usually used to induce astaxanthin and triacylglycerol biosynthesis under stress conditions (high light or nutrient starvation); however, productivity of biomass and bioproducts are compromised due to the susceptibility of motile cells to stress. This study revealed that the Photosystem II (PSII) reaction center D1 protein, the manganese-stabilizing protein PsbO, and several major membrane glycerolipids (particularly for chloroplast membrane lipids monogalactosyldiacylglycerol and phosphatidylglycerol), decreased dramatically in motile cells under high light (HL). In contrast, palmella cells, which are transformed from motile cells after an extended period of time under favorable growth conditions, have developed multiple protective mechanisms - including reduction in chloroplast membrane lipids content, downplay of linear photosynthetic electron transport, and activating nonphotochemical quenching mechanisms - while accumulating triacylglycerol. Consequently, the membrane lipids and PSII proteins (D1 and PsbO) remained relatively stable in palmella cells subjected to HL. Introducing palmella instead of motile cells to stress conditions may greatly increase astaxanthin and lipid production in H. pluvialis culture.

ContributorsWang, Baobei (Author) / Zhang, Zhen (Author) / Hu, Qiang (Author) / Sommerfeld, Milton (Author) / Lu, Yinghua (Author) / Han, Danxiang (Author) / College of Liberal Arts and Sciences (Contributor)
Created2014-09-15
128925-Thumbnail Image.png
Description

Uncovering the chemical and physical links between natural environments and microbial communities is becoming increasingly amenable owing to geochemical observations and metagenomic sequencing. At the hot spring known as Bison Pool in Yellowstone National Park, the cooling of the water in the outflow channel is associated with an increase in

Uncovering the chemical and physical links between natural environments and microbial communities is becoming increasingly amenable owing to geochemical observations and metagenomic sequencing. At the hot spring known as Bison Pool in Yellowstone National Park, the cooling of the water in the outflow channel is associated with an increase in oxidation potential estimated from multiple field-based measurements. Representative groups of proteins whose sequences were derived from metagenomic data also exhibit an increase in average oxidation state of carbon in the protein molecules with distance from the hot-spring source. The energetic requirements of reactions to form selected proteins used in the model were computed using amino-acid group additivity for the standard molal thermodynamic properties of the proteins, and the relative chemical stabilities of the proteins were investigated by varying temperature, pH and oxidation state, expressed as activity of dissolved hydrogen. The relative stabilities of the proteins were found to track the locations of the sampling sites when the calculations included a function for hydrogen activity that increases with temperature and is higher, or more reducing, than values consistent with measurements of dissolved oxygen, sulfide and oxidation-reduction potential in the field. These findings imply that spatial patterns in the amino acid compositions of proteins can be linked, through energetics of overall chemical reactions representing the formation of the proteins, to the environmental conditions at this hot spring, even if microbial cells maintain considerably different internal conditions. Further applications of the thermodynamic calculations are possible for other natural microbial ecosystems.

ContributorsDick, Jeffrey (Author) / Shock, Everett (Author) / College of Liberal Arts and Sciences (Contributor)
Created2011-08-11