This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 36
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
129537-Thumbnail Image.png
Description

There are many proteomic applications that require large collections of purified protein, but parallel production of large numbers of different proteins remains a very challenging task. To help meet the needs of the scientific community, we have developed a human protein production pipeline. Using high-throughput (HT) methods, we transferred the

There are many proteomic applications that require large collections of purified protein, but parallel production of large numbers of different proteins remains a very challenging task. To help meet the needs of the scientific community, we have developed a human protein production pipeline. Using high-throughput (HT) methods, we transferred the genes of 31 full-length proteins into three expression vectors, and expressed the collection as N-terminal HaloTag fusion proteins in Escherichia coli and two commercial cell-free (CF) systems, wheat germ extract (WGE) and HeLa cell extract (HCE). Expression was assessed by labeling the fusion proteins specifically and covalently with a fluorescent HaloTag ligand and detecting its fluorescence on a LabChip[superscript ®] GX microfluidic capillary gel electrophoresis instrument. This automated, HT assay provided both qualitative and quantitative assessment of recombinant protein. E. coli was only capable of expressing 20% of the test collection in the supernatant fraction with ≥20 μg yields, whereas CF systems had ≥83% success rates. We purified expressed proteins using an automated HaloTag purification method. We purified 20, 33, and 42% of the test collection from E. coli, WGE, and HCE, respectively, with yields ≥1 μg and ≥90% purity. Based on these observations, we have developed a triage strategy for producing full-length human proteins in these three expression systems.

ContributorsSaul, Justin (Author) / Petritis, Brianne (Author) / Sau, Sujay (Author) / Rauf, Femina (Author) / Gaskin, Michael (Author) / Ober-Reynolds, Benjamin (Author) / Mineyev, Irina (Author) / Magee, Mitch (Author) / Chaput, John (Author) / Qiu, Ji (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2014-08-01
Description

Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic

Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.

ContributorsYu, Xiaobo (Author) / Bian, Xiaofang (Author) / Throop, Andrea (Author) / Song, Lusheng (Author) / del Moral, Lerys (Author) / Park, Jin (Author) / Seiler, Catherine (Author) / Fiacco, Michael (Author) / Steel, Jason (Author) / Hunter, Preston (Author) / Saul, Justin (Author) / Wang, Jie (Author) / Qiu, Ji (Author) / Pipas, James M. (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2013-11-30
129259-Thumbnail Image.png
Description

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all they are a form of codified self-regulation. While codes can be beneficial, it argues that when we scratch below the surface, there are many problems at their root. In terms of efficacy, codes can serve as a form of ethical window dressing, rather than effective rules for behavior. But even more that, codes can degrade the meaning behind being a good person who acts ethically for the right reasons.

Created2013-11-30
129278-Thumbnail Image.png
Description

We report a device to fill an array of small chemical reaction chambers (microreactors) with reagent and then seal them using pressurized viscous liquid acting through a flexible membrane. The device enables multiple, independent chemical reactions involving free floating intermediate molecules without interference from neighboring reactions or external environments. The

We report a device to fill an array of small chemical reaction chambers (microreactors) with reagent and then seal them using pressurized viscous liquid acting through a flexible membrane. The device enables multiple, independent chemical reactions involving free floating intermediate molecules without interference from neighboring reactions or external environments. The device is validated by protein expressed in situ directly from DNA in a microarray of ~10,000 spots with no diffusion during three hours incubation. Using the device to probe for an autoantibody cancer biomarker in blood serum sample gave five times higher signal to background ratio compared to standard protein microarray expressed on a flat microscope slide. Physical design principles to effectively fill the array of microreactors with reagent and experimental results of alternate methods for sealing the microreactors are presented.

ContributorsWiktor, Peter (Author) / Brunner, Al (Author) / Kahn, Peter (Author) / Qiu, Ji (Author) / Magee, Mitch (Author) / Bian, Xiaofang (Author) / Karthikeyan, Kailash (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2015-03-04
129310-Thumbnail Image.png
Description

Sera from patients with ovarian cancer contain autoantibodies (AAb) to tumor-derived proteins that are potential biomarkers for early detection. To detect AAb, we probed high-density programmable protein microarrays (NAPPA) expressing 5177 candidate tumor antigens with sera from patients with serous ovarian cancer (n = 34 cases/30 controls) and measured bound

Sera from patients with ovarian cancer contain autoantibodies (AAb) to tumor-derived proteins that are potential biomarkers for early detection. To detect AAb, we probed high-density programmable protein microarrays (NAPPA) expressing 5177 candidate tumor antigens with sera from patients with serous ovarian cancer (n = 34 cases/30 controls) and measured bound IgG. Of these, 741 antigens were selected and probed with an independent set of ovarian cancer sera (n = 60 cases/60 controls). Twelve potential autoantigens were identified with sensitivities ranging from 13 to 22% at >93% specificity. These were retested using a Luminex bead array using 60 cases and 60 controls, with sensitivities ranging from 0 to 31.7% at 95% specificity. Three AAb (p53, PTPRA, and PTGFR) had area under the curve (AUC) levels >60% (p < 0.01), with the partial AUC (SPAUC) over 5 times greater than for a nondiscriminating test (p < 0.01). Using a panel of the top three AAb (p53, PTPRA, and PTGFR), if at least two AAb were positive, then the sensitivity was 23.3% at 98.3% specificity. AAb to at least one of these top three antigens were also detected in 7/20 sera (35%) of patients with low CA 125 levels and 0/15 controls. AAb to p53, PTPRA, and PTGFR are potential biomarkers for the early detection of ovarian cancer.

ContributorsAnderson, Karen (Author) / Cramer, Daniel W. (Author) / Sibani, Sahar (Author) / Wallstrom, Garrick (Author) / Wong, Jessica (Author) / Park, Jin (Author) / Qiu, Ji (Author) / Vitonis, Allison (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2015-01-01
128778-Thumbnail Image.png
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tends to stick on the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the population of users taking A and B strategies that approximates 2:1, is found to be optimal to the sustainable growth of communities.

ContributorsWu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-03-02
129082-Thumbnail Image.png
Description

Introduction: Juvenile idiopathic arthritis (JIA) is a heterogeneous disease characterized by chronic joint inflammation of unknown cause in children. JIA is an autoimmune disease and small numbers of autoantibodies have been reported in JIA patients. The identification of antibody markers could improve the existing clinical management of patients.

Methods: A pilot study was

Introduction: Juvenile idiopathic arthritis (JIA) is a heterogeneous disease characterized by chronic joint inflammation of unknown cause in children. JIA is an autoimmune disease and small numbers of autoantibodies have been reported in JIA patients. The identification of antibody markers could improve the existing clinical management of patients.

Methods: A pilot study was performed on the application of a high-throughput platform, the nucleic acid programmable protein array (NAPPA), to assess the levels of antibodies present in the systemic circulation and synovial joint of a small cohort of juvenile arthritis patients. Plasma and synovial fluid from 10 JIA patients was screened for antibodies against 768 proteins on NAPPAs.

Results: Quantitative reproducibility of NAPPAs was demonstrated with > 0.95 intra-array and inter-array correlations. A strong correlation was also observed for the levels of antibodies between plasma and synovial fluid across the study cohort (r = 0.96). Differences in the levels of 18 antibodies were revealed between sample types across all patients. Patients were segregated into two clinical subtypes with distinct antibody signatures by unsupervised hierarchical cluster analysis.

Conclusion: The NAPPAs provide a high-throughput quantitatively reproducible platform to screen for disease-specific autoantibodies at the proteome level on a microscope slide. The strong correlation between the circulating antibody levels and those of the inflamed joint represents a novel finding and provides confidence to use plasma for discovery of autoantibodies in JIA, thus circumventing the challenges associated with joint aspiration. We expect that autoantibody profiling of JIA patients on NAPPAs could yield antibody markers that can act as criteria to stratify patients, predict outcomes and understand disease etiology at the molecular level.

ContributorsGibson, David S. (Author) / Qiu, Ji (Author) / Mendoza, D. Eliseo A. (Author) / Barker, Kristi (Author) / Rooney, Madeleine E. (Author) / LaBaer, Joshua (Author)
Created2012-04-17
Description

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our understanding of such complex systems. However, the data at our disposal are often not easily comparable, have limited scope and scale, and are based on disparate underlying frameworks inhibiting synthesis, meta-analysis, and the validation of findings. Research efforts are further hampered when case inclusion criteria, variable definitions, coding schema, and inter-coder reliability testing are not made explicit in the presentation of research and shared among the research community. This paper first outlines challenges experienced by researchers engaged in a large-scale coding project; then highlights valuable lessons learned; and finally discusses opportunities for further research on comparative case study analysis focusing on social-ecological systems and common pool resources. Includes supplemental materials and appendices published in the International Journal of the Commons 2016 Special Issue. Volume 10 - Issue 2 - 2016.

ContributorsRatajczyk, Elicia (Author) / Brady, Ute (Author) / Baggio, Jacopo (Author) / Barnett, Allain J. (Author) / Perez Ibarra, Irene (Author) / Rollins, Nathan (Author) / Rubinos, Cathy (Author) / Shin, Hoon Cheol (Author) / Yu, David (Author) / Aggarwal, Rimjhim (Author) / Anderies, John (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-09
129171-Thumbnail Image.png
Description

This article advances recent scholarship on energy security by arguing that the concept is best understood as a sociotechnical imaginary, a collective vision for a “good society” realized through technoscientific-oriented policies. Focusing on the 1952 Resources for Freedom report, the authors trace the genealogy of energy security, elucidating how it

This article advances recent scholarship on energy security by arguing that the concept is best understood as a sociotechnical imaginary, a collective vision for a “good society” realized through technoscientific-oriented policies. Focusing on the 1952 Resources for Freedom report, the authors trace the genealogy of energy security, elucidating how it establishes a morality of efficiency that orients policy action under the guise of security toward the liberalizing of markets in resource states and a robust program of energy research and development in the United States. This evidence challenges the pervasive historical anchoring of the concept in the 1970s and illustrates the importance of the genealogical approach for the emerging literature on energy and sociotechnical imaginaries. Exploring the genealogy of energy security also unpacks key social, political, and economic undercurrents that disrupt the seeming universality of the language of energy, leading the authors to question whether energy security discourse is appropriate for guiding policy action during ongoing global energy transitions.

Created2015-09-01