This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 28
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
128166-Thumbnail Image.png
Description

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance eventually brought a revolution in how scholars (and graduate students) were trained and worked. This revolution never occurred in K-12 or university education such that we now teach young students in much the way that scholars were taught in the dark ages, we teach them what is already known rather than the process of knowing. Citizen science offers a way to change K-12 and university education and, in doing so, complete the renaissance. Here we offer an example of such an approach and call for change in the way students are taught science, change that is more possible than it has ever been and is, nonetheless, five hundred years delayed.

Created2016-03-01
127872-Thumbnail Image.png
Description

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial ecology of a singular built environment, the International Space Station (ISS). This ISS sampling involved the collection and microbial analysis (via 16S rRNA gene PCR) of 15 surfaces sampled by swabs onboard the ISS. This sampling was a component of Project MERCCURI (Microbial Ecology Research Combining Citizen and University Researchers on ISS). Learning more about the microbial inhabitants of the “buildings” in which we travel through space will take on increasing importance, as plans for human exploration continue, with the possibility of colonization of other planets and moons.

Results: Sterile swabs were used to sample 15 surfaces onboard the ISS. The sites sampled were designed to be analogous to samples collected for (1) the Wildlife of Our Homes project and (2) a study of cell phones and shoes that were concurrently being collected for another component of Project MERCCURI. Sequencing of the 16S rRNA genes amplified from DNA extracted from each swab was used to produce a census of the microbes present on each surface sampled. We compared the microbes found on the ISS swabs to those from both homes on Earth and data from the Human Microbiome Project.

Conclusions: While significantly different from homes on Earth and the Human Microbiome Project samples analyzed here, the microbial community composition on the ISS was more similar to home surfaces than to the human microbiome samples. The ISS surfaces are OTU-rich with 1,036–4,294 operational taxonomic units (OTUs per sample). There was no discernible biogeography of microbes on the 15 ISS surfaces, although this may be a reflection of the small sample size we were able to obtain.

ContributorsLang, Jenna M. (Author) / Coil, David A. (Author) / Neches, Russell Y. (Author) / Brown, Wendy E. (Author) / Cavalier, Darlene (Author) / Severance, Mark (Author) / Hampton-Marcell, Jarrad T. (Author) / Gilbert, Jack A. (Author) / Eisen, Jonathan A. (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-12-05
128562-Thumbnail Image.png
Description

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To identify the flow direction of clickstreams, we define the “flow distance” of nodes (Li), which measures the average number of steps a random walker takes to reach the ith node. It is observed that Li is related with the clicks (Ci) to news stories and the age (Ti) of stories. Putting these three variables together help us understand the rise and decay of news stories from a network perspective. We also find that the studied clickstream networks preserve a stable structure over time, leading to the scaling between users and clicks. The universal scaling behavior is confirmed by the 1,000 Web forums. We suggest that the tree-like, stable structure of clickstream networks reveals the time-sensitive preference of users in online browsing. To test our assumption, we discuss three models on individual browsing behavior, and compare the simulation results with empirical data.

ContributorsWang, Cheng-Jun (Author) / Wu, Lingfei (Author) / Zhang, Jiang (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-28
128559-Thumbnail Image.png
Description

Many adaptive systems sit near a tipping or critical point. For systems near a critical point small changes to component behaviour can induce large-scale changes in aggregate structure and function. Criticality can be adaptive when the environment is changing, but entails reduced robustness through sensitivity. This tradeoff can be resolved

Many adaptive systems sit near a tipping or critical point. For systems near a critical point small changes to component behaviour can induce large-scale changes in aggregate structure and function. Criticality can be adaptive when the environment is changing, but entails reduced robustness through sensitivity. This tradeoff can be resolved when criticality can be tuned. We address the control of finite measures of criticality using data on fight sizes from an animal society model system (Macaca nemestrina, n=48). We find that a heterogeneous, socially organized system, like homogeneous, spatial systems (flocks and schools), sits near a critical point; the contributions individuals make to collective phenomena can be quantified; there is heterogeneity in these contributions; and distance from the critical point (DFC) can be controlled through biologically plausible mechanisms exploiting heterogeneity. We propose two alternative hypotheses for why a system decreases the distance from the critical point.

ContributorsDaniels, Bryan (Author) / Krakauer, David (Author) / Flack, Jessica (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-02-10
129072-Thumbnail Image.png
Description

Background: Many studies used the older ActiGraph (7164) for physical activity measurement, but this model has been replaced with newer ones (e.g., GT3X+). The assumption that new generation models are more accurate has been questioned, especially for measuring lower intensity levels. The low-frequency extension (LFE) increases the low-intensity sensitivity of newer

Background: Many studies used the older ActiGraph (7164) for physical activity measurement, but this model has been replaced with newer ones (e.g., GT3X+). The assumption that new generation models are more accurate has been questioned, especially for measuring lower intensity levels. The low-frequency extension (LFE) increases the low-intensity sensitivity of newer models, but its comparability with older models is unknown. This study compared step counts and physical activity collected with the 7164 and GT3X + using the Normal Filter and the LFE (GT3X+N and GT3X+LFE, respectively).

Findings: Twenty-five adults wore 2 accelerometer models simultaneously for 3Âdays and were instructed to engage in typical behaviors. Average daily step counts and minutes per day in nonwear, sedentary, light, moderate, and vigorous activity were calculated. Repeated measures ANOVAs with post-hoc pairwise comparisons were used to compare mean values. Means for the GT3X+N and 7164 were significantly different in 4 of the 6 categories (p < .05). The GT3X+N showed 2041 fewer steps per day and more sedentary, less light, and less moderate than the 7164 (+25.6, -31.2, -2.9 mins/day, respectively). The GT3X+LFE showed non-significant differences in 5 of 6 categories but recorded significantly more steps (+3597 steps/day; p < .001) than the 7164.

Conclusion: Studies using the newer ActiGraphs should employ the LFE for greater sensitivity to lower intensity activity and more comparable activity results with studies using the older models. Newer generation ActiGraphs do not produce comparable step counts to the older generation devices with the Normal filter or the LFE.

ContributorsCain, Kelli L. (Author) / Conway, Terry L. (Author) / Adams, Marc (Author) / Husak, Lisa E. (Author) / Sallis, James F. (Author) / College of Health Solutions (Contributor)
Created2013-04-25
128744-Thumbnail Image.png
Description

Sequential affect dynamics generated during the interaction of intimate dyads, such as married couples, are associated with a cascade of effects - some good and some bad - on each partner, close family members, and other social contacts. Although the effects are well documented, the probabilistic structures associated with micro-social

Sequential affect dynamics generated during the interaction of intimate dyads, such as married couples, are associated with a cascade of effects - some good and some bad - on each partner, close family members, and other social contacts. Although the effects are well documented, the probabilistic structures associated with micro-social processes connected to the varied outcomes remain enigmatic. Using extant data we developed a method of classifying and subsequently generating couple dynamics using a Hierarchical Dirichlet Process Hidden semi-Markov Model (HDP-HSMM). Our findings indicate that several key aspects of existing models of marital interaction are inadequate: affect state emissions and their durations, along with the expected variability differences between distressed and nondistressed couples are present but highly nuanced; and most surprisingly, heterogeneity among highly satisfied couples necessitate that they be divided into subgroups. We review how this unsupervised learning technique generates plausible dyadic sequences that are sensitive to relationship quality and provide a natural mechanism for computational models of behavioral and affective micro-social processes.

ContributorsGriffin, William (Author) / Li, Xun (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-05-17
128778-Thumbnail Image.png
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tends to stick on the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the population of users taking A and B strategies that approximates 2:1, is found to be optimal to the sustainable growth of communities.

ContributorsWu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-03-02
129016-Thumbnail Image.png
Description

Background: Advancements in geographic information systems over the past two decades have increased the specificity by which an individual’s neighborhood environment may be spatially defined for physical activity and health research. This study investigated how different types of street network buffering methods compared in measuring a set of commonly used built

Background: Advancements in geographic information systems over the past two decades have increased the specificity by which an individual’s neighborhood environment may be spatially defined for physical activity and health research. This study investigated how different types of street network buffering methods compared in measuring a set of commonly used built environment measures (BEMs) and tested their performance on associations with physical activity outcomes.

Methods: An internationally-developed set of objective BEMs using three different spatial buffering techniques were used to evaluate the relative differences in resulting explanatory power on self-reported physical activity outcomes. BEMs were developed in five countries using ‘sausage,’ ‘detailed-trimmed,’ and ‘detailed,’ network buffers at a distance of 1 km around participant household addresses (n = 5883).

Results: BEM values were significantly different (p < 0.05) for 96% of sausage versus detailed-trimmed buffer comparisons and 89% of sausage versus detailed network buffer comparisons. Results showed that BEM coefficients in physical activity models did not differ significantly across buffering methods, and in most cases BEM associations with physical activity outcomes had the same level of statistical significance across buffer types. However, BEM coefficients differed in significance for 9% of the sausage versus detailed models, which may warrant further investigation.

Conclusions: Results of this study inform the selection of spatial buffering methods to estimate physical activity outcomes using an internationally consistent set of BEMs. Using three different network-based buffering methods, the findings indicate significant variation among BEM values, however associations with physical activity outcomes were similar across each buffering technique. The study advances knowledge by presenting consistently assessed relationships between three different network buffer types and utilitarian travel, sedentary behavior, and leisure-oriented physical activity outcomes.

ContributorsFrank, Lawrence D. (Author) / Fox, Eric H. (Author) / Ulmer, Jared M. (Author) / Chapman, James E. (Author) / Kershaw, Suzanne E. (Author) / Sallis, James F. (Author) / Conway, Terry L. (Author) / Cerin, Ester (Author) / Cain, Kelli L. (Author) / Adams, Marc (Author) / Smith, Graham R. (Author) / Hinckson, Erica (Author) / Mavoa, Suzanne (Author) / Christiansen, Lars B. (Author) / Hino, Adriano Akira F. (Author) / Lopes, Adalberto A. S. (Author) / Schipperijn, Jasper (Author) / College of Health Solutions (Contributor)
Created2017-01-23
129015-Thumbnail Image.png
Description

Background: The World Health Organization recommends strategies to improve urban design, public transportation, and recreation facilities to facilitate physical activity for non-communicable disease prevention for an increasingly urbanized global population. Most evidence supporting environmental associations with physical activity comes from single countries or regions with limited variation in urban form. This

Background: The World Health Organization recommends strategies to improve urban design, public transportation, and recreation facilities to facilitate physical activity for non-communicable disease prevention for an increasingly urbanized global population. Most evidence supporting environmental associations with physical activity comes from single countries or regions with limited variation in urban form. This paper documents variation in comparable built environment features across countries from diverse regions.

Methods: The International Physical Activity and the Environment Network (IPEN) study of adults aimed to measure the full range of variation in the built environment using geographic information systems (GIS) across 12 countries on 5 continents. Investigators in Australia, Belgium, Brazil, Colombia, the Czech Republic, Denmark, China, Mexico, New Zealand, Spain, the United Kingdom, and the United States followed a common research protocol to develop internationally comparable measures. Using detailed instructions, GIS-based measures included features such as walkability (i.e., residential density, street connectivity, mix of land uses), and access to public transit, parks, and private recreation facilities around each participant’s residential address using 1-km and 500-m street network buffers.

Results: Eleven of 12 countries and 15 cities had objective GIS data on built environment features. We observed a 38-fold difference in median residential densities, a 5-fold difference in median intersection densities and an 18-fold difference in median park densities. Hong Kong had the highest and North Shore, New Zealand had the lowest median walkability index values, representing a difference of 9 standard deviations in GIS-measured walkability.

Conclusions: Results show that comparable measures can be created across a range of cultural settings revealing profound global differences in urban form relevant to physical activity. These measures allow cities to be ranked more precisely than previously possible. The highly variable measures of urban form will be used to explain individuals’ physical activity, sedentary behaviors, body mass index, and other health outcomes on an international basis. Present measures provide the ability to estimate dose–response relationships from projected changes to the built environment that would otherwise be impossible.

ContributorsAdams, Marc (Author) / Frank, Lawrence D. (Author) / Schipperijn, Jasper (Author) / Smith, Graham (Author) / Chapman, James (Author) / Christiansen, Lars B. (Author) / Coffee, Neil (Author) / Salvo, Deborah (Author) / du Toit, Lorinne (Author) / Dygryn, Jan (Author) / Hino, Adriano Akira Ferreira (Author) / Lai, Poh-chin (Author) / Mavoa, Suzanne (Author) / Pinzon, Jose David (Author) / Van de Weghe, Nico (Author) / Cerin, Ester (Author) / Davey, Rachel (Author) / Macfarlane, Duncan (Author) / Owen, Neville (Author) / Sallis, James F. (Author) / College of Health Solutions (Contributor)
Created2014-10-25