This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 44
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
129567-Thumbnail Image.png
Description

Human protein diversity arises as a result of alternative splicing, single nucleotide polymorphisms (SNPs) and posttranslational modifications. Because of these processes, each protein can exists as multiple variants in vivo. Tailored strategies are needed to study these protein variants and understand their role in health and disease. In this work

Human protein diversity arises as a result of alternative splicing, single nucleotide polymorphisms (SNPs) and posttranslational modifications. Because of these processes, each protein can exists as multiple variants in vivo. Tailored strategies are needed to study these protein variants and understand their role in health and disease. In this work we utilized quantitative mass spectrometric immunoassays to determine the protein variants concentration of beta-2-microglobulin, cystatin C, retinol binding protein, and transthyretin, in a population of 500 healthy individuals. Additionally, we determined the longitudinal concentration changes for the protein variants from four individuals over a 6 month period. Along with the native forms of the four proteins, 13 posttranslationally modified variants and 7 SNP-derived variants were detected and their concentration determined. Correlations of the variants concentration with geographical origin, gender, and age of the individuals were also examined. This work represents an important step toward building a catalog of protein variants concentrations and examining their longitudinal changes.

ContributorsTrenchevska, Olgica (Author) / Phillips, David A. (Author) / Nelson, Randall (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2014-06-23
129370-Thumbnail Image.png
Description

Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements

Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species.

ContributorsSchrader, Lukas (Author) / Kim, Jay W. (Author) / Ence, Daniel (Author) / Zimin, Aleksey (Author) / Klein, Antonia (Author) / Wyschetzki, Katharina (Author) / Weichselgartner, Tobias (Author) / Kemena, Carsten (Author) / Stoekl, Johannes (Author) / Schultner, Eva (Author) / Wurm, Yannick (Author) / Smith, Christopher D. (Author) / Yandell, Mark (Author) / Heinze, Juergen (Author) / Gadau, Juergen (Author) / Oettler, Jan (Author) / College of Liberal Arts and Sciences (Contributor)
Created2014-12-01
129259-Thumbnail Image.png
Description

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all they are a form of codified self-regulation. While codes can be beneficial, it argues that when we scratch below the surface, there are many problems at their root. In terms of efficacy, codes can serve as a form of ethical window dressing, rather than effective rules for behavior. But even more that, codes can degrade the meaning behind being a good person who acts ethically for the right reasons.

Created2013-11-30
128778-Thumbnail Image.png
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tends to stick on the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the population of users taking A and B strategies that approximates 2:1, is found to be optimal to the sustainable growth of communities.

ContributorsWu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-03-02
128687-Thumbnail Image.png
Description

Proteins can exist as multiple proteoforms in vivo, as a result of alternative splicing and single-nucleotide polymorphisms (SNPs), as well as posttranslational processing. To address their clinical significance in a context of diagnostic information, proteoforms require a more in-depth analysis. Mass spectrometric immunoassays (MSIA) have been devised for studying structural

Proteins can exist as multiple proteoforms in vivo, as a result of alternative splicing and single-nucleotide polymorphisms (SNPs), as well as posttranslational processing. To address their clinical significance in a context of diagnostic information, proteoforms require a more in-depth analysis. Mass spectrometric immunoassays (MSIA) have been devised for studying structural diversity in human proteins. MSIA enables protein profiling in a simple and high-throughput manner, by combining the selectivity of targeted immunoassays, with the specificity of mass spectrometric detection. MSIA has been used for qualitative and quantitative analysis of single and multiple proteoforms, distinguishing between normal fluctuations and changes related to clinical conditions. This mini review offers an overview of the development and application of mass spectrometric immunoassays for clinical and population proteomics studies. Provided are examples of some recent developments, and also discussed are the trends and challenges in mass spectrometry-based immunoassays for the next-phase of clinical applications.

ContributorsTrenchevska, Olgica (Author) / Nelson, Randall (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2016-03-17
129005-Thumbnail Image.png
Description

Background: Counselor behaviors that mediate the efficacy of motivational interviewing (MI) are not well understood, especially when applied to health behavior promotion. We hypothesized that client change talk mediates the relationship between counselor variables and subsequent client behavior change.

Methods: Purposeful sampling identified individuals from a prospective randomized worksite trial using an MI

Background: Counselor behaviors that mediate the efficacy of motivational interviewing (MI) are not well understood, especially when applied to health behavior promotion. We hypothesized that client change talk mediates the relationship between counselor variables and subsequent client behavior change.

Methods: Purposeful sampling identified individuals from a prospective randomized worksite trial using an MI intervention to promote firefighters’ healthy diet and regular exercise that increased dietary intake of fruits and vegetables (n = 21) or did not increase intake of fruits and vegetables (n = 22). MI interactions were coded using the Motivational Interviewing Skill Code (MISC 2.1) to categorize counselor and firefighter verbal utterances. Both Bayesian and frequentist mediation analyses were used to investigate whether client change talk mediated the relationship between counselor skills and behavior change.

Results: Counselors’ global spirit, empathy, and direction and MI-consistent behavioral counts (e.g., reflections, open questions, affirmations, emphasize control) significantly correlated with firefighters’ total client change talk utterances (rs = 0.42, 0.40, 0.30, and 0.61, respectively), which correlated significantly with their fruit and vegetable intake increase (r = 0.33). Both Bayesian and frequentist mediation analyses demonstrated that findings were consistent with hypotheses, such that total client change talk mediated the relationship between counselor’s skills—MI-consistent behaviors [Bayesian mediated effect: αβ = .06 (.03), 95% CI = .02, .12] and MI spirit [Bayesian mediated effect: αβ = .06 (.03), 95% CI = .01, .13]—and increased fruit and vegetable consumption.

Conclusion: Motivational interviewing is a resource- and time-intensive intervention, and is currently being applied in many arenas. Previous research has identified the importance of counselor behaviors and client change talk in the treatment of substance use disorders. Our results indicate that similar mechanisms may underlie the effects of MI for dietary change. These results inform MI training and application by identifying those processes critical for MI success in health promotion domains.

ContributorsPirlott, Angela (Author) / Kisbu-Sakarya, Yasemin (Author) / DeFrancesco, Carol A. (Author) / Elliot, Diane L. (Author) / MacKinnon, David (Author) / College of Liberal Arts and Sciences (Contributor)
Created2012-06-08
128933-Thumbnail Image.png
Description

Introduction: Apolipoprotein C-III (apoC-III) regulates triglyceride (TG) metabolism. In plasma, apoC-III exists in non-sialylated (apoC-III0a without glycosylation and apoC-III[subscript 0b] with glycosylation), monosialylated (apoC-III1) or disialylated (apoC-III2) proteoforms. Our aim was to clarify the relationship between apoC-III sialylation proteoforms with fasting plasma TG concentrations.

Methods: In 204 non-diabetic adolescent participants, the

Introduction: Apolipoprotein C-III (apoC-III) regulates triglyceride (TG) metabolism. In plasma, apoC-III exists in non-sialylated (apoC-III0a without glycosylation and apoC-III[subscript 0b] with glycosylation), monosialylated (apoC-III1) or disialylated (apoC-III2) proteoforms. Our aim was to clarify the relationship between apoC-III sialylation proteoforms with fasting plasma TG concentrations.

Methods: In 204 non-diabetic adolescent participants, the relative abundance of apoC-III plasma proteoforms was measured using mass spectrometric immunoassay.

Results: Compared with the healthy weight subgroup (n = 16), the ratios of apoC-III0a, apoC-III0b, and apoC-III1 to apoC-III2 were significantly greater in overweight (n = 33) and obese participants (n = 155). These ratios were positively correlated with BMI z-scores and negatively correlated with measures of insulin sensitivity (S[subscript i]). The relationship of apoC-III1 / apoC-III2 with Si persisted after adjusting for BMI (p = 0.02). Fasting TG was correlated with the ratio of apoC-III0a / apoC-III2 (r = 0.47, p<0.001), apoC-III0b / apoC-III2 (r = 0.41, p<0.001), apoC-III1 / apoC-III2 (r = 0.43, p<0.001). By examining apoC-III concentrations, the association of apoC-III proteoforms with TG was driven by apoC-III0a (r = 0.57, p<0.001), apoC-III0b (r = 0.56. p<0.001) and apoC-III1 (r = 0.67, p<0.001), but not apoC-III2 (r = 0.006, p = 0.9) concentrations, indicating that apoC-III relationship with plasma TG differed in apoC-III2 compared with the other proteoforms.

Conclusion: We conclude that apoC-III0a, apoC-III0b, and apoC-III1, but not apoC-III2 appear to be under metabolic control and associate with fasting plasma TG. Measurement of apoC-III proteoforms can offer insights into the biology of TG metabolism in obesity.

ContributorsYassine, Hussein N. (Author) / Trenchevska, Olgica (Author) / Ramrakhiani, Ambika (Author) / Parekh, Aarushi (Author) / Koska, Juraj (Author) / Walker, Ryan W. (Author) / Billheimer, Dean (Author) / Reaven, Peter D. (Author) / Yen, Frances T. (Author) / Nelson, Randall (Author) / Goran, Michael I. (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2015-12-03
Description

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our understanding of such complex systems. However, the data at our disposal are often not easily comparable, have limited scope and scale, and are based on disparate underlying frameworks inhibiting synthesis, meta-analysis, and the validation of findings. Research efforts are further hampered when case inclusion criteria, variable definitions, coding schema, and inter-coder reliability testing are not made explicit in the presentation of research and shared among the research community. This paper first outlines challenges experienced by researchers engaged in a large-scale coding project; then highlights valuable lessons learned; and finally discusses opportunities for further research on comparative case study analysis focusing on social-ecological systems and common pool resources. Includes supplemental materials and appendices published in the International Journal of the Commons 2016 Special Issue. Volume 10 - Issue 2 - 2016.

ContributorsRatajczyk, Elicia (Author) / Brady, Ute (Author) / Baggio, Jacopo (Author) / Barnett, Allain J. (Author) / Perez Ibarra, Irene (Author) / Rollins, Nathan (Author) / Rubinos, Cathy (Author) / Shin, Hoon Cheol (Author) / Yu, David (Author) / Aggarwal, Rimjhim (Author) / Anderies, John (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-09
Description

We present a phylogeographic study of at least six reproductively isolated lineages of new world harvester ants within the Pogonomyrmex barbatus and P. rugosus species group. The genetic and geographic relationships within this clade are complex: Four of the identified lineages show genetic caste determination (GCD) and are divided into

We present a phylogeographic study of at least six reproductively isolated lineages of new world harvester ants within the Pogonomyrmex barbatus and P. rugosus species group. The genetic and geographic relationships within this clade are complex: Four of the identified lineages show genetic caste determination (GCD) and are divided into two pairs. Each pair has evolved under a mutualistic system that necessitates sympatry. These paired lineages are dependent upon one another because their GCD requires interlineage matings for the production of F1 hybrid workers, and intralineage matings are required to produce queens. This GCD system maintains genetic isolation among these interdependent lineages, while simultaneously requiring co-expansion and emigration as their distributions have changed over time. It has also been demonstrated that three of these four GCD lineages have undergone historical hybridization, but the narrower sampling range of previous studies has left questions on the hybrid parentage, breadth, and age of these groups. Thus, reconstructing the phylogenetic and geographic history of this group allows us to evaluate past insights and hypotheses and to plan future inquiries in a more complete historical biogeographic context. Using mitochondrial DNA sequences sampled across most of the morphospecies’ ranges in the U.S.A. and Mexico, we conducted a detailed phylogeographic study. Remarkably, our results indicate that one of the GCD lineage pairs has experienced a dramatic range expansion, despite the genetic load and fitness costs of the GCD system. Our analyses also reveal a complex pattern of vicariance and dispersal in Pogonomyrmex harvester ants that is largely concordant with models of late Miocene, Pliocene, and Pleistocene range shifts among various arid-adapted taxa in North America.

ContributorsMott, Brendon (Author) / Gadau, Juergen (Author) / Anderson, Kirk E. (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-07-01