This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
Contributors: Lessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created: 2017-09-28
Description

There are many proteomic applications that require large collections of purified protein, but parallel production of large numbers of different proteins remains a very challenging task. To help meet the needs of the scientific community, we have developed a human protein production pipeline. Using high-throughput (HT) methods, we transferred the genes of 31 full-length proteins into three expression vectors, and expressed the collection as N-terminal HaloTag fusion proteins in Escherichia coli and two commercial cell-free (CF) systems, wheat germ extract (WGE) and HeLa cell extract (HCE). Expression was assessed by labeling the fusion proteins specifically and covalently with a fluorescent HaloTag ligand and detecting its fluorescence on a LabChip® GX microfluidic capillary gel electrophoresis instrument. This automated, HT assay provided both qualitative and quantitative assessment of recombinant protein. E. coli was only capable of expressing 20% of the test collection in the supernatant fraction with ≥20 μg yields, whereas CF systems had ≥83% success rates. We purified expressed proteins using an automated HaloTag purification method. We purified 20, 33, and 42% of the test collection from E. coli, WGE, and HCE, respectively, with yields ≥1 μg and ≥90% purity. Based on these observations, we have developed a triage strategy for producing full-length human proteins in these three expression systems.

Contributors: Saul, Justin (Author) / Petritis, Brianne (Author) / Sau, Sujay (Author) / Rauf, Femina (Author) / Gaskin, Michael (Author) / Ober-Reynolds, Benjamin (Author) / Mineyev, Irina (Author) / Magee, Mitch (Author) / Chaput, John (Author) / Qiu, Ji (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created: 2014-08-01
Description

Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.

Contributors: Yu, Xiaobo (Author) / Bian, Xiaofang (Author) / Throop, Andrea (Author) / Song, Lusheng (Author) / del Moral, Lerys (Author) / Park, Jin (Author) / Seiler, Catherine (Author) / Fiacco, Michael (Author) / Steel, Jason (Author) / Hunter, Preston (Author) / Saul, Justin (Author) / Wang, Jie (Author) / Qiu, Ji (Author) / Pipas, James M. (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created: 2013-11-30
Description

Background: Meiotic recombination has traditionally been explained based on the structural requirement to stabilize homologous chromosome pairs to ensure their proper meiotic segregation. Competing hypotheses seek to explain the emerging findings of significant heterogeneity in recombination rates within and between genomes, but intraspecific comparisons of genome-wide recombination patterns are rare. The honey bee (Apis mellifera) exhibits the highest rate of genomic recombination among multicellular animals with about five cross-over events per chromatid.

Results: Here, we present a comparative analysis of recombination rates across eight genetic linkage maps of the honey bee genome to investigate which genomic sequence features are correlated with recombination rate and with its variation across the eight data sets, with average marker spacing ranging from 1 Mbp to 120 kbp. Overall, we found that GC content best explained the variation in local recombination rate along chromosomes at the analyzed 100 kbp scale. In contrast, variation among the different maps was correlated with the abundance of microsatellites and several specific tri- and tetra-nucleotides.

Conclusions: The combined evidence from eight medium-scale recombination maps of the honey bee genome suggests that recombination rate variation in this highly recombining genome might be due to the DNA configuration instead of distinct sequence motifs. However, more fine-scale analyses are needed. The empirical basis of eight differing genetic maps allowed for robust conclusions about the correlates of the local recombination rates and enabled the study of the relation between DNA features and variability in local recombination rates, which is particularly relevant in the honey bee genome with its exceptionally high recombination rate.

Contributors: Ross, Caitlin R. (Author) / DeFelice, Dominick S. (Author) / Hunt, Greg J. (Author) / Ihle, Kate (Author) / Amdam, Gro (Author) / Rueppell, Olav (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2015-02-21
Description

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession, but above all they are a form of codified self-regulation. While codes can be beneficial, this paper argues that when we scratch below the surface, there are many problems at their root. In terms of efficacy, codes can serve as a form of ethical window dressing, rather than effective rules for behavior. But even more than that, codes can degrade the meaning behind being a good person who acts ethically for the right reasons.

Created: 2013-11-30
Description

We report a device to fill an array of small chemical reaction chambers (microreactors) with reagent and then seal them using pressurized viscous liquid acting through a flexible membrane. The device enables multiple, independent chemical reactions involving free floating intermediate molecules without interference from neighboring reactions or external environments. The device is validated by protein expressed in situ directly from DNA in a microarray of ~10,000 spots with no diffusion during three hours incubation. Using the device to probe for an autoantibody cancer biomarker in blood serum sample gave five times higher signal to background ratio compared to standard protein microarray expressed on a flat microscope slide. Physical design principles to effectively fill the array of microreactors with reagent and experimental results of alternate methods for sealing the microreactors are presented.

Contributors: Wiktor, Peter (Author) / Brunner, Al (Author) / Kahn, Peter (Author) / Qiu, Ji (Author) / Magee, Mitch (Author) / Bian, Xiaofang (Author) / Karthikeyan, Kailash (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created: 2015-03-04
Description

Sera from patients with ovarian cancer contain autoantibodies (AAb) to tumor-derived proteins that are potential biomarkers for early detection. To detect AAb, we probed high-density programmable protein microarrays (NAPPA) expressing 5177 candidate tumor antigens with sera from patients with serous ovarian cancer (n = 34 cases/30 controls) and measured bound IgG. Of these, 741 antigens were selected and probed with an independent set of ovarian cancer sera (n = 60 cases/60 controls). Twelve potential autoantigens were identified with sensitivities ranging from 13 to 22% at >93% specificity. These were retested using a Luminex bead array using 60 cases and 60 controls, with sensitivities ranging from 0 to 31.7% at 95% specificity. Three AAb (p53, PTPRA, and PTGFR) had area under the curve (AUC) levels >60% (p < 0.01), with the partial AUC (SPAUC) over 5 times greater than for a nondiscriminating test (p < 0.01). Using a panel of the top three AAb (p53, PTPRA, and PTGFR), if at least two AAb were positive, then the sensitivity was 23.3% at 98.3% specificity. AAb to at least one of these top three antigens were also detected in 7/20 sera (35%) of patients with low CA 125 levels and 0/15 controls. AAb to p53, PTPRA, and PTGFR are potential biomarkers for the early detection of ovarian cancer.

Contributors: Anderson, Karen (Author) / Cramer, Daniel W. (Author) / Sibani, Sahar (Author) / Wallstrom, Garrick (Author) / Wong, Jessica (Author) / Park, Jin (Author) / Qiu, Ji (Author) / Vitonis, Allison (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created: 2015-01-01
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tend to stick to the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the populations of users taking the A and B strategies that approximates 2:1 is found to be optimal for the sustainable growth of communities.

Contributors: Wu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created: 2016-03-02
Description

Background: Healthy individuals on the lower end of the insulin sensitivity spectrum also have a reduced gene expression response to exercise for specific genes. The goal of this study was to determine the relationship between insulin sensitivity and exercise-induced gene expression in an unbiased, global manner.

Methods and Findings: Euglycemic clamps were used to measure insulin sensitivity and muscle biopsies were done at rest and 30 minutes after a single acute exercise bout in 14 healthy participants. Changes in mRNA expression were assessed using microarrays, and miRNA analysis was performed in a subset of 6 of the participants using sequencing techniques. Following exercise, 215 mRNAs were changed at the probe level (Bonferroni-corrected P<0.00000115). Pathway and Gene Ontology analysis showed enrichment in MAP kinase signaling, transcriptional regulation and DNA binding. Changes in several transcription factor mRNAs were correlated with insulin sensitivity, including MYC, r=0.71; SNF1LK, r=0.69; and ATF3, r= 0.61 (5 corrected for false discovery rate). Enrichment in the 5’-UTRs of exercise-responsive genes suggested regulation by common transcription factors, especially EGR1. miRNA species of interest that changed after exercise included miR-378, which is located in an intron of the PPARGC1B gene.

Conclusions: These results indicate that transcription factor gene expression responses to exercise depend highly on insulin sensitivity in healthy people. The overall pattern suggests a coordinated cycle by which exercise and insulin sensitivity regulate gene expression in muscle.

Contributors: McLean, Carrie (Author) / Mielke, Clinton (Author) / Cordova, Jeanine (Author) / Langlais, Paul R. (Author) / Bowen, Benjamin (Author) / Miranda, Danielle (Author) / Coletta, Dawn (Author) / Mandarino, Lawrence (Author) / College of Health Solutions (Contributor)
Created: 2015-05-18
Description

Honeybee workers are essentially sterile female helpers that make up the majority of individuals in a colony. Workers display a marked change in physiology when they transition from in-nest tasks to foraging. Recent technological advances have made it possible to unravel the metabolic modifications associated with this transition. Previous studies have revealed extensive remodeling of brain, thorax, and hypopharyngeal gland biochemistry. However, data on changes in the abdomen is scarce. To narrow this gap we investigated the proteomic composition of abdominal tissue in the days typically preceding the onset of foraging in honeybee workers.

In order to get a broader representation of possible protein dynamics, we used workers of two genotypes with differences in the age at which they initiate foraging. This approach was combined with RNA interference-mediated downregulation of an insulin/insulin-like signaling component that is central to foraging behavior, the insulin receptor substrate (irs), and with measurements of glucose and lipid levels.
Our data provide new insight into the molecular underpinnings of phenotypic plasticity in the honeybee, invoke parallels with vertebrate metabolism, and support an integrated and irs-dependent association of carbohydrate and lipid metabolism with the transition from in-nest tasks to foraging.

Contributors: Chan, Queenie W. T. (Author) / Mutti, Navdeep (Author) / Foster, Leonard J. (Author) / Kocher, Sarah D. (Author) / Amdam, Gro (Author) / Wolschin, Florian (Author) / College of Liberal Arts and Sciences (Contributor)
Created: 2011-09-28