Matching Items (24)
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
Description

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our

On-going efforts to understand the dynamics of coupled social-ecological (or more broadly, coupled infrastructure) systems and common pool resources have led to the generation of numerous datasets based on a large number of case studies. This data has facilitated the identification of important factors and fundamental principles which increase our understanding of such complex systems. However, the data at our disposal are often not easily comparable, have limited scope and scale, and are based on disparate underlying frameworks inhibiting synthesis, meta-analysis, and the validation of findings. Research efforts are further hampered when case inclusion criteria, variable definitions, coding schema, and inter-coder reliability testing are not made explicit in the presentation of research and shared among the research community. This paper first outlines challenges experienced by researchers engaged in a large-scale coding project; then highlights valuable lessons learned; and finally discusses opportunities for further research on comparative case study analysis focusing on social-ecological systems and common pool resources. Includes supplemental materials and appendices published in the International Journal of the Commons 2016 Special Issue. Volume 10 - Issue 2 - 2016.

ContributorsRatajczyk, Elicia (Author) / Brady, Ute (Author) / Baggio, Jacopo (Author) / Barnett, Allain J. (Author) / Perez Ibarra, Irene (Author) / Rollins, Nathan (Author) / Rubinos, Cathy (Author) / Shin, Hoon Cheol (Author) / Yu, David (Author) / Aggarwal, Rimjhim (Author) / Anderies, John (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-09
128166-Thumbnail Image.png
Description

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance eventually brought a revolution in how scholars (and graduate students) were trained and worked. This revolution never occurred in K-12 or university education such that we now teach young students in much the way that scholars were taught in the dark ages, we teach them what is already known rather than the process of knowing. Citizen science offers a way to change K-12 and university education and, in doing so, complete the renaissance. Here we offer an example of such an approach and call for change in the way students are taught science, change that is more possible than it has ever been and is, nonetheless, five hundred years delayed.

Created2016-03-01
127872-Thumbnail Image.png
Description

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial ecology of a singular built environment, the International Space Station (ISS). This ISS sampling involved the collection and microbial analysis (via 16S rRNA gene PCR) of 15 surfaces sampled by swabs onboard the ISS. This sampling was a component of Project MERCCURI (Microbial Ecology Research Combining Citizen and University Researchers on ISS). Learning more about the microbial inhabitants of the “buildings” in which we travel through space will take on increasing importance, as plans for human exploration continue, with the possibility of colonization of other planets and moons.

Results: Sterile swabs were used to sample 15 surfaces onboard the ISS. The sites sampled were designed to be analogous to samples collected for (1) the Wildlife of Our Homes project and (2) a study of cell phones and shoes that were concurrently being collected for another component of Project MERCCURI. Sequencing of the 16S rRNA genes amplified from DNA extracted from each swab was used to produce a census of the microbes present on each surface sampled. We compared the microbes found on the ISS swabs to those from both homes on Earth and data from the Human Microbiome Project.

Conclusions: While significantly different from homes on Earth and the Human Microbiome Project samples analyzed here, the microbial community composition on the ISS was more similar to home surfaces than to the human microbiome samples. The ISS surfaces are OTU-rich with 1,036–4,294 operational taxonomic units (OTUs per sample). There was no discernible biogeography of microbes on the 15 ISS surfaces, although this may be a reflection of the small sample size we were able to obtain.

ContributorsLang, Jenna M. (Author) / Coil, David A. (Author) / Neches, Russell Y. (Author) / Brown, Wendy E. (Author) / Cavalier, Darlene (Author) / Severance, Mark (Author) / Hampton-Marcell, Jarrad T. (Author) / Gilbert, Jack A. (Author) / Eisen, Jonathan A. (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-12-05
128562-Thumbnail Image.png
Description

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To identify the flow direction of clickstreams, we define the “flow distance” of nodes (Li), which measures the average number of steps a random walker takes to reach the ith node. It is observed that Li is related with the clicks (Ci) to news stories and the age (Ti) of stories. Putting these three variables together help us understand the rise and decay of news stories from a network perspective. We also find that the studied clickstream networks preserve a stable structure over time, leading to the scaling between users and clicks. The universal scaling behavior is confirmed by the 1,000 Web forums. We suggest that the tree-like, stable structure of clickstream networks reveals the time-sensitive preference of users in online browsing. To test our assumption, we discuss three models on individual browsing behavior, and compare the simulation results with empirical data.

ContributorsWang, Cheng-Jun (Author) / Wu, Lingfei (Author) / Zhang, Jiang (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-28
128559-Thumbnail Image.png
Description

Many adaptive systems sit near a tipping or critical point. For systems near a critical point small changes to component behaviour can induce large-scale changes in aggregate structure and function. Criticality can be adaptive when the environment is changing, but entails reduced robustness through sensitivity. This tradeoff can be resolved

Many adaptive systems sit near a tipping or critical point. For systems near a critical point small changes to component behaviour can induce large-scale changes in aggregate structure and function. Criticality can be adaptive when the environment is changing, but entails reduced robustness through sensitivity. This tradeoff can be resolved when criticality can be tuned. We address the control of finite measures of criticality using data on fight sizes from an animal society model system (Macaca nemestrina, n=48). We find that a heterogeneous, socially organized system, like homogeneous, spatial systems (flocks and schools), sits near a critical point; the contributions individuals make to collective phenomena can be quantified; there is heterogeneity in these contributions; and distance from the critical point (DFC) can be controlled through biologically plausible mechanisms exploiting heterogeneity. We propose two alternative hypotheses for why a system decreases the distance from the critical point.

ContributorsDaniels, Bryan (Author) / Krakauer, David (Author) / Flack, Jessica (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-02-10
128744-Thumbnail Image.png
Description

Sequential affect dynamics generated during the interaction of intimate dyads, such as married couples, are associated with a cascade of effects - some good and some bad - on each partner, close family members, and other social contacts. Although the effects are well documented, the probabilistic structures associated with micro-social

Sequential affect dynamics generated during the interaction of intimate dyads, such as married couples, are associated with a cascade of effects - some good and some bad - on each partner, close family members, and other social contacts. Although the effects are well documented, the probabilistic structures associated with micro-social processes connected to the varied outcomes remain enigmatic. Using extant data we developed a method of classifying and subsequently generating couple dynamics using a Hierarchical Dirichlet Process Hidden semi-Markov Model (HDP-HSMM). Our findings indicate that several key aspects of existing models of marital interaction are inadequate: affect state emissions and their durations, along with the expected variability differences between distressed and nondistressed couples are present but highly nuanced; and most surprisingly, heterogeneity among highly satisfied couples necessitate that they be divided into subgroups. We review how this unsupervised learning technique generates plausible dyadic sequences that are sensitive to relationship quality and provide a natural mechanism for computational models of behavioral and affective micro-social processes.

ContributorsGriffin, William (Author) / Li, Xun (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-05-17
128856-Thumbnail Image.png
Description

Background: Phosphatase and TENsin (PTEN) homolog is a negative regulator that takes part in IIS (insulin/insulin-like signaling) and Egfr (epidermal growth factor receptor) activation in Drosophila melanogaster. IIS and Egfr signaling events are also involved in the developmental process of queen and worker differentiation in honey bees (Apis mellifera). Here, we

Background: Phosphatase and TENsin (PTEN) homolog is a negative regulator that takes part in IIS (insulin/insulin-like signaling) and Egfr (epidermal growth factor receptor) activation in Drosophila melanogaster. IIS and Egfr signaling events are also involved in the developmental process of queen and worker differentiation in honey bees (Apis mellifera). Here, we characterized the bee PTEN gene homologue for the first time and begin to explore its potential function during bee development and adult life.

Results: Honey bee PTEN is alternatively spliced, resulting in three splice variants. Next, we show that the expression of PTEN can be down-regulated by RNA interference (RNAi) in the larval stage, when female caste fate is determined. Relative to controls, we observed that RNAi efficacy is dependent on the amount of PTEN dsRNA that is delivered to larvae. For larvae fed queen or worker diets containing a high amount of PTEN dsRNA, PTEN knockdown was significant at a whole-body level but lethal. A lower dosage did not result in a significant gene down-regulation. Finally, we compared same-aged adult workers with different behavior: nursing vs. foraging. We show that between nurses and foragers, PTEN isoforms were differentially expressed within brain, ovary and fat body tissues. All isoforms were expressed at higher levels in the brain and ovaries of the foragers. In fat body, isoform B was expressed at higher level in the nurse bees.

Conclusion: Our results suggest that PTEN plays a central role during growth and development in queen- and worker-destined honey bees. In adult workers, moreover, tissue-specific patterns of PTEN isoform expression are correlated with differences in complex division of labor between same-aged individuals. Therefore, we propose that knowledge on the roles of IIS and Egfr activity in developmental and behavioral control may increase through studies of how PTEN functions can impact bee social phenotypes.

ContributorsMutti, Navdeep (Author) / Wang, Ying (Author) / Kaftanoglu, Osman (Author) / Amdam, Gro (Author) / College of Liberal Arts and Sciences (Contributor)
Created2011-07-14
128820-Thumbnail Image.png
Description

In vitro rearing is an important and useful tool for honey bee (Apis mellifera L.) studies. However, it often results in intercastes between queens and workers, which are normally are not seen in hive-reared bees, except when larvae older than three days are grafted for queen rearing. Morphological classification (queen

In vitro rearing is an important and useful tool for honey bee (Apis mellifera L.) studies. However, it often results in intercastes between queens and workers, which are normally are not seen in hive-reared bees, except when larvae older than three days are grafted for queen rearing. Morphological classification (queen versus worker or intercastes) of bees produced by this method can be subjective and generally depends on size differences. Here, we propose an alternative method for caste classification of female honey bees reared in vitro, based on weight at emergence, ovariole number, spermatheca size and size and shape, and features of the head, mandible and basitarsus. Morphological measurements were made with both traditional morphometric and geometric morphometrics techniques. The classifications were performed by principal component analysis, using naturally developed queens and workers as controls. First, the analysis included all the characters. Subsequently, a new analysis was made without the information about ovariole number and spermatheca size. Geometric morphometrics was less dependent on ovariole number and spermatheca information for caste and intercaste identification. This is useful, since acquiring information concerning these reproductive structures requires time-consuming dissection and they are not accessible when abdomens have been removed for molecular assays or in dried specimens. Additionally, geometric morphometrics divided intercastes into more discrete phenotype subsets. We conclude that morphometric geometrics are superior to traditional morphometrics techniques for identification and classification of honey bee castes and intermediates.

ContributorsDe Souza, Daiana A. (Author) / Wang, Ying (Author) / Kaftanoglu, Osman (Author) / De Jong, David (Author) / Amdam, Gro (Author) / Goncalves, Lionel S. (Author) / Francoy, Tiago M. (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-04-20
128778-Thumbnail Image.png
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tends to stick on the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the population of users taking A and B strategies that approximates 2:1, is found to be optimal to the sustainable growth of communities.

ContributorsWu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-03-02