This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 32
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
141505-Thumbnail Image.png
Description

High proportions of autistic children suffer from gastrointestinal (GI) disorders, implying a link between autism and abnormalities in gut microbial functions. Increasing evidence from recent high-throughput sequencing analyses indicates that disturbances in composition and diversity of gut microbiome are associated with various disease conditions. However, microbiome-level studies on autism are

High proportions of autistic children suffer from gastrointestinal (GI) disorders, implying a link between autism and abnormalities in gut microbial functions. Increasing evidence from recent high-throughput sequencing analyses indicates that disturbances in composition and diversity of gut microbiome are associated with various disease conditions. However, microbiome-level studies on autism are limited and mostly focused on pathogenic bacteria. Therefore, here we aimed to define systemic changes in gut microbiome associated with autism and autism-related GI problems. We recruited 20 neurotypical and 20 autistic children accompanied by a survey of both autistic severity and GI symptoms. By pyrosequencing the V2/V3 regions in bacterial 16S rDNA from fecal DNA samples, we compared gut microbiomes of GI symptom-free neurotypical children with those of autistic children mostly presenting GI symptoms. Unexpectedly, the presence of autistic symptoms, rather than the severity of GI symptoms, was associated with less diverse gut microbiomes. Further, rigorous statistical tests with multiple testing corrections showed significantly lower abundances of the genera Prevotella, Coprococcus, and unclassified Veillonellaceae in autistic samples. These are intriguingly versatile carbohydrate-degrading and/or fermenting bacteria, suggesting a potential influence of unusual diet patterns observed in autistic children. However, multivariate analyses showed that autism-related changes in both overall diversity and individual genus abundances were correlated with the presence of autistic symptoms but not with their diet patterns. Taken together, autism and accompanying GI symptoms were characterized by distinct and less diverse gut microbial compositions with lower levels of Prevotella, Coprococcus, and unclassified Veillonellaceae.

ContributorsKang, Dae Wook (Author) / Park, Jin (Author) / Ilhan, Zehra (Author) / Wallstrom, Garrick (Author) / LaBaer, Joshua (Author) / Adams, James (Author) / Krajmalnik-Brown, Rosa (Author) / Biodesign Institute (Contributor)
Created2013-06-03
128260-Thumbnail Image.png
Description

Lineage-committed cells of many tissues exhibit substantial plasticity in contexts such as wound healing and tumorigenesis, but the regulation of this process is not well understood. We identified the Hippo transducer WWTR1/TAZ in a screen of transcription factors that are able to prompt lineage switching of mammary epithelial cells. Forced

Lineage-committed cells of many tissues exhibit substantial plasticity in contexts such as wound healing and tumorigenesis, but the regulation of this process is not well understood. We identified the Hippo transducer WWTR1/TAZ in a screen of transcription factors that are able to prompt lineage switching of mammary epithelial cells. Forced expression of TAZ in luminal cells induces them to adopt basal characteristics, and depletion of TAZ in basal and/or myoepithelial cells leads to luminal differentiation. In human and mouse tissues, TAZ is active only in basal cells and is critical for basal cell maintenance during homeostasis. Accordingly, loss of TAZ affects mammary gland development, leading to an imbalance of luminal and basal populations as well as branching defects. Mechanistically, TAZ interacts with components of the SWI/SNF complex to modulate lineage-specific gene expression. Collectively, these findings uncover a new role for Hippo signaling in the determination of lineage identity through recruitment of chromatin-remodeling complexes.

ContributorsSkibinski, Adam (Author) / Breindel, Jerrica L. (Author) / Prat, Aleix (Author) / Galvan, Patricia (Author) / Smith, Elizabeth (Author) / Rolfs, Andreas (Author) / Gupta, Piyush B. (Author) / LaBaer, Joshua (Author) / Kuperwasser, Charlotte (Author) / Biodesign Institute (Contributor)
Created2014-03-27
128250-Thumbnail Image.png
Description

Many drugs are effective in the early stage of treatment, but patients develop drug resistance after a certain period of treatment, causing failure of the therapy. An important example is Herceptin, a popular monoclonal antibody drug for breast cancer by specifically targeting human epidermal growth factor receptor 2 (Her2). Here

Many drugs are effective in the early stage of treatment, but patients develop drug resistance after a certain period of treatment, causing failure of the therapy. An important example is Herceptin, a popular monoclonal antibody drug for breast cancer by specifically targeting human epidermal growth factor receptor 2 (Her2). Here we demonstrate a quantitative binding kinetics analysis of drug-target interactions to investigate the molecular scale origin of drug resistance. Using a surface plasmon resonance imaging, we measured the in situ Herceptin-Her2 binding kinetics in single intact cancer cells for the first time, and observed significantly weakened Herceptin-Her2 interactions in Herceptin-resistant cells, compared to those in Herceptin-sensitive cells. We further showed that the steric hindrance of Mucin-4, a membrane protein, was responsible for the altered drug-receptor binding. This effect of a third molecule on drug-receptor interactions cannot be studied using traditional purified protein methods, demonstrating the importance of the present intact cell-based binding kinetics analysis.

ContributorsWang, Wei (Author) / Yin, Linliang (Author) / Gonzalez-Malerva, Laura (Author) / Wang, Shaopeng (Author) / Yu, Xiaobo (Author) / Eaton, Seron (Author) / Zhang, Shengtao (Author) / Chen, Hong-Yuan (Author) / LaBaer, Joshua (Author) / Tao, Nongjian (Author) / Biodesign Institute (Contributor)
Created2014-10-14
128166-Thumbnail Image.png
Description

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance

At the end of the dark ages, anatomy was taught as though everything that could be known was known. Scholars learned about what had been discovered rather than how to make discoveries. This was true even though the body (and the rest of biology) was very poorly understood. The renaissance eventually brought a revolution in how scholars (and graduate students) were trained and worked. This revolution never occurred in K-12 or university education such that we now teach young students in much the way that scholars were taught in the dark ages, we teach them what is already known rather than the process of knowing. Citizen science offers a way to change K-12 and university education and, in doing so, complete the renaissance. Here we offer an example of such an approach and call for change in the way students are taught science, change that is more possible than it has ever been and is, nonetheless, five hundred years delayed.

Created2016-03-01
128577-Thumbnail Image.png
Description

Nucleic Acid Programmable Protein Arrays (NAPPA) have emerged as a powerful and innovative technology for the screening of biomarkers and the study of protein-protein interactions, among others possible applications. The principal advantages are the high specificity and sensitivity that this platform offers. Moreover, compared to conventional protein microarrays, NAPPA technology

Nucleic Acid Programmable Protein Arrays (NAPPA) have emerged as a powerful and innovative technology for the screening of biomarkers and the study of protein-protein interactions, among others possible applications. The principal advantages are the high specificity and sensitivity that this platform offers. Moreover, compared to conventional protein microarrays, NAPPA technology avoids the necessity of protein purification, which is expensive and time-consuming, by substituting expression in situ with an in vitro transcription/translation kit. In summary, NAPPA arrays have been broadly employed in different studies improving knowledge about diseases and responses to treatments. Here, we review the principal advances and applications performed using this platform during the last years.

ContributorsDiez, Paula (Author) / Gonzalez-Gonzalez, Maria (Author) / Lourido, Lucia (Author) / Degano, Rosa M. (Author) / Ibarrola, Nieves (Author) / Casado-Vela, Juan (Author) / LaBaer, Joshua (Author) / Fuentes, Manuel (Author) / Biodesign Institute (Contributor)
Created2015-04-24
127872-Thumbnail Image.png
Description

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial

Background: Modern advances in sequencing technology have enabled the census of microbial members of many natural ecosystems. Recently, attention is increasingly being paid to the microbial residents of human-made, built ecosystems, both private (homes) and public (subways, office buildings, and hospitals). Here, we report results of the characterization of the microbial ecology of a singular built environment, the International Space Station (ISS). This ISS sampling involved the collection and microbial analysis (via 16S rRNA gene PCR) of 15 surfaces sampled by swabs onboard the ISS. This sampling was a component of Project MERCCURI (Microbial Ecology Research Combining Citizen and University Researchers on ISS). Learning more about the microbial inhabitants of the “buildings” in which we travel through space will take on increasing importance, as plans for human exploration continue, with the possibility of colonization of other planets and moons.

Results: Sterile swabs were used to sample 15 surfaces onboard the ISS. The sites sampled were designed to be analogous to samples collected for (1) the Wildlife of Our Homes project and (2) a study of cell phones and shoes that were concurrently being collected for another component of Project MERCCURI. Sequencing of the 16S rRNA genes amplified from DNA extracted from each swab was used to produce a census of the microbes present on each surface sampled. We compared the microbes found on the ISS swabs to those from both homes on Earth and data from the Human Microbiome Project.

Conclusions: While significantly different from homes on Earth and the Human Microbiome Project samples analyzed here, the microbial community composition on the ISS was more similar to home surfaces than to the human microbiome samples. The ISS surfaces are OTU-rich with 1,036–4,294 operational taxonomic units (OTUs per sample). There was no discernible biogeography of microbes on the 15 ISS surfaces, although this may be a reflection of the small sample size we were able to obtain.

ContributorsLang, Jenna M. (Author) / Coil, David A. (Author) / Neches, Russell Y. (Author) / Brown, Wendy E. (Author) / Cavalier, Darlene (Author) / Severance, Mark (Author) / Hampton-Marcell, Jarrad T. (Author) / Gilbert, Jack A. (Author) / Eisen, Jonathan A. (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-12-05
127868-Thumbnail Image.png
Description

Rationale: Cell-free protein microarrays display naturally-folded proteins based on just-in-time in situ synthesis, and have made important contributions to basic and translational research. However, the risk of spot-to-spot cross-talk from protein diffusion during expression has limited the feature density of these arrays.

Methods: In this work, we developed the Multiplexed Nucleic

Rationale: Cell-free protein microarrays display naturally-folded proteins based on just-in-time in situ synthesis, and have made important contributions to basic and translational research. However, the risk of spot-to-spot cross-talk from protein diffusion during expression has limited the feature density of these arrays.

Methods: In this work, we developed the Multiplexed Nucleic Acid Programmable Protein Array (M-NAPPA), which significantly increases the number of displayed proteins by multiplexing as many as five different gene plasmids within a printed spot.

Results: Even when proteins of different sizes were displayed within the same feature, they were readily detected using protein-specific antibodies. Protein-protein interactions and serological antibody assays using human viral proteome microarrays demonstrated that comparable hits were detected by M-NAPPA and non-multiplexed NAPPA arrays. An ultra-high density proteome microarray displaying > 16k proteins on a single microscope slide was produced by combining M-NAPPA with a photolithography-based silicon nano-well platform. Finally, four new tuberculosis-related antigens in guinea pigs vaccinated with Bacillus Calmette-Guerin (BCG) were identified with M-NAPPA and validated with ELISA.

Conclusion: All data demonstrate that multiplexing features on a protein microarray offer a cost-effective fabrication approach and have the potential to facilitate high throughput translational research.

ContributorsYu, Xiaobo (Author) / Song, Lusheng (Author) / Petritis, Brianne (Author) / Bian, Xiaofang (Author) / Wang, Haoyu (Author) / Viloria, Jennifer (Author) / Park, Jin (Author) / Bui, Hoang (Author) / Li, Han (Author) / Wang, Jie (Author) / Liu, Lei (Author) / Yang, Liuhui (Author) / Duan, Hu (Author) / McMurray, David N. (Author) / Achkar, Jacqueline M. (Author) / Magee, Mitch (Author) / Qiu, Ji (Author) / LaBaer, Joshua (Author) / Biodesign Institute (Contributor)
Created2017-09-20
128481-Thumbnail Image.png
Description

Autoantibodies refer to antibodies that target self-antigens, which can play pivotal roles in maintaining homeostasis, distinguishing normal from tumor tissue and trigger autoimmune diseases. In the last three decades, tremendous efforts have been devoted to elucidate the generation, evolution and functions of autoantibodies, as well as their target autoantigens. However,

Autoantibodies refer to antibodies that target self-antigens, which can play pivotal roles in maintaining homeostasis, distinguishing normal from tumor tissue and trigger autoimmune diseases. In the last three decades, tremendous efforts have been devoted to elucidate the generation, evolution and functions of autoantibodies, as well as their target autoantigens. However, reports of these countless previously identified autoantigens are randomly dispersed in the literature. Here, we constructed an AAgAtlas database 1.0 using text-mining and manual curation. We extracted 45 830 autoantigen-related abstracts and 94 313 sentences from PubMed using the keywords of either ‘autoantigen’ or ‘autoantibody’ or their lexical variants, which were further refined to 25 520 abstracts, 43 253 sentences and 3984 candidates by our bio-entity recognizer based on the Protein Ontology. Finally, we identified 1126 genes as human autoantigens and 1071 related human diseases, with which we constructed a human autoantigen database (AAgAtlas database 1.0). The database provides a user-friendly interface to conveniently browse, retrieve and download human autoantigens as well as their associated diseases. The database is freely accessible at http://biokb.ncpsb.org/aagatlas/. We believe this database will be a valuable resource to track and understand human autoantigens as well as to investigate their functions in basic and translational research.

ContributorsWang, Dan (Author) / Yang, Liuhui (Author) / Zhang, Ping (Author) / LaBaer, Joshua (Author) / Hermjakob, Henning (Author) / Li, Dong (Author) / Yu, Xiaobo (Author) / Biodesign Institute (Contributor)
Created2016-10-19
128562-Thumbnail Image.png
Description

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To identify the flow direction of clickstreams, we define the “flow distance” of nodes (Li), which measures the average number of steps a random walker takes to reach the ith node. It is observed that Li is related with the clicks (Ci) to news stories and the age (Ti) of stories. Putting these three variables together help us understand the rise and decay of news stories from a network perspective. We also find that the studied clickstream networks preserve a stable structure over time, leading to the scaling between users and clicks. The universal scaling behavior is confirmed by the 1,000 Web forums. We suggest that the tree-like, stable structure of clickstream networks reveals the time-sensitive preference of users in online browsing. To test our assumption, we discuss three models on individual browsing behavior, and compare the simulation results with empirical data.

ContributorsWang, Cheng-Jun (Author) / Wu, Lingfei (Author) / Zhang, Jiang (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-09-28