This growing collection consists of scholarly works authored by ASU-affiliated faculty, staff, and community members, and it contains many open access articles. ASU-affiliated authors are encouraged to Share Your Work in KEEP.

Displaying 1 - 10 of 50
Filtering by

Clear all filters

141461-Thumbnail Image.png
Description
In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they

In the digital humanities, there is a constant need to turn images and PDF files into plain text to apply analyses such as topic modelling, named entity recognition, and other techniques. However, although there exist different solutions to extract text embedded in PDF files or run OCR on images, they typically require additional training (for example, scholars have to learn how to use the command line) or are difficult to automate without programming skills. The Giles Ecosystem is a distributed system based on Apache Kafka that allows users to upload documents for text and image extraction. The system components are implemented using Java and the Spring Framework and are available under an Open Source license on GitHub (https://github.com/diging/).
ContributorsLessios-Damerow, Julia (Contributor) / Peirson, Erick (Contributor) / Laubichler, Manfred (Contributor) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2017-09-28
129567-Thumbnail Image.png
Description

Human protein diversity arises as a result of alternative splicing, single nucleotide polymorphisms (SNPs) and posttranslational modifications. Because of these processes, each protein can exists as multiple variants in vivo. Tailored strategies are needed to study these protein variants and understand their role in health and disease. In this work

Human protein diversity arises as a result of alternative splicing, single nucleotide polymorphisms (SNPs) and posttranslational modifications. Because of these processes, each protein can exists as multiple variants in vivo. Tailored strategies are needed to study these protein variants and understand their role in health and disease. In this work we utilized quantitative mass spectrometric immunoassays to determine the protein variants concentration of beta-2-microglobulin, cystatin C, retinol binding protein, and transthyretin, in a population of 500 healthy individuals. Additionally, we determined the longitudinal concentration changes for the protein variants from four individuals over a 6 month period. Along with the native forms of the four proteins, 13 posttranslationally modified variants and 7 SNP-derived variants were detected and their concentration determined. Correlations of the variants concentration with geographical origin, gender, and age of the individuals were also examined. This work represents an important step toward building a catalog of protein variants concentrations and examining their longitudinal changes.

ContributorsTrenchevska, Olgica (Author) / Phillips, David A. (Author) / Nelson, Randall (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2014-06-23
129462-Thumbnail Image.png
Description

We develop a general framework to analyze the controllability of multiplex networks using multiple-relation networks and multiple-layer networks with interlayer couplings as two classes of prototypical systems. In the former, networks associated with different physical variables share the same set of nodes and in the latter, diffusion processes take place.

We develop a general framework to analyze the controllability of multiplex networks using multiple-relation networks and multiple-layer networks with interlayer couplings as two classes of prototypical systems. In the former, networks associated with different physical variables share the same set of nodes and in the latter, diffusion processes take place. We find that, for a multiple-relation network, a layer exists that dominantly determines the controllability of the whole network and, for a multiple-layer network, a small fraction of the interconnections can enhance the controllability remarkably. Our theory is generally applicable to other types of multiplex networks as well, leading to significant insights into the control of complex network systems with diverse structures and interacting patterns.

ContributorsYuan, Zhengzhong (Author) / Zhao, Chen (Author) / Wang, Wen-Xu (Author) / Di, Zengru (Author) / Lai, Ying-Cheng (Author) / Ira A. Fulton Schools of Engineering (Contributor)
Created2014-10-24
Description

The effects of urbanization on ozone levels have been widely investigated over cities primarily located in temperate and/or humid regions. In this study, nested WRF-Chem simulations with a finest grid resolution of 1 km are conducted to investigate ozone concentrations O3 due to urbanization within cities in arid/semi-arid environments. First,

The effects of urbanization on ozone levels have been widely investigated over cities primarily located in temperate and/or humid regions. In this study, nested WRF-Chem simulations with a finest grid resolution of 1 km are conducted to investigate ozone concentrations O3 due to urbanization within cities in arid/semi-arid environments. First, a method based on a shape preserving Monotonic Cubic Interpolation (MCI) is developed and used to downscale anthropogenic emissions from the 4 km resolution 2005 National Emissions Inventory (NEI05) to the finest model resolution of 1 km. Using the rapidly expanding Phoenix metropolitan region as the area of focus, we demonstrate the proposed MCI method achieves ozone simulation results with appreciably improved correspondence to observations relative to the default interpolation method of the WRF-Chem system. Next, two additional sets of experiments are conducted, with the recommended MCI approach, to examine impacts of urbanization on ozone production: (1) the urban land cover is included (i.e., urbanization experiments) and, (2) the urban land cover is replaced with the region's native shrubland. Impacts due to the presence of the built environment on O3 are highly heterogeneous across the metropolitan area. Increased near surface O3 due to urbanization of 10–20 ppb is predominantly a nighttime phenomenon while simulated impacts during daytime are negligible. Urbanization narrows the daily O3 range (by virtue of increasing nighttime minima), an impact largely due to the region's urban heat island. Our results demonstrate the importance of the MCI method for accurate representation of the diurnal profile of ozone, and highlight its utility for high-resolution air quality simulations for urban areas.

ContributorsLi, Jialun (Author) / Georgescu, Matei (Author) / Hyde, Peter (Author) / Mahalov, Alex (Author) / Moustaoui, Mohamed (Author) / Julie Ann Wrigley Global Institute of Sustainability (Contributor)
Created2014-11-01
129251-Thumbnail Image.png
Description

Forecasts of noise pollution from a highway line segment noise source are obtained from a sound propagation model utilizing effective sound speed profiles derived from a Numerical Weather Prediction (NWP) limited area forecast with 1 km horizontal resolution and near-ground vertical resolution finer than 20 m. Methods for temporal along

Forecasts of noise pollution from a highway line segment noise source are obtained from a sound propagation model utilizing effective sound speed profiles derived from a Numerical Weather Prediction (NWP) limited area forecast with 1 km horizontal resolution and near-ground vertical resolution finer than 20 m. Methods for temporal along with horizontal and vertical spatial nesting are demonstrated within the NWP model for maintaining forecast feasibility. It is shown that vertical nesting can improve the prediction of finer structures in near-ground temperature and velocity profiles, such as morning temperature inversions and low level jet-like features. Accurate representation of these features is shown to be important for modeling sound refraction phenomena and for enabling accurate noise assessment. Comparisons are made using the parabolic equation model for predictions with profiles derived from NWP simulations and from field experiment observations during mornings on November 7 and 8, 2006 in Phoenix, Arizona. The challenges faced in simulating accurate meteorological profiles at high resolution for sound propagation applications are highlighted and areas for possible improvement are discussed.

ContributorsShaffer, Stephen (Author) / Fernando, H. J. S. (Author) / Ovenden, N. C. (Author) / Moustaoui, Mohamed (Author) / Mahalov, Alex (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-05-01
129252-Thumbnail Image.png
Description

Physical mechanisms of incongruency between observations and Weather Research and Forecasting (WRF) Model predictions are examined. Limitations of evaluation are constrained by (i) parameterizations of model physics, (ii) parameterizations of input data, (iii) model resolution, and (iv) flux observation resolution. Observations from a new 22.1-m flux tower situated within a

Physical mechanisms of incongruency between observations and Weather Research and Forecasting (WRF) Model predictions are examined. Limitations of evaluation are constrained by (i) parameterizations of model physics, (ii) parameterizations of input data, (iii) model resolution, and (iv) flux observation resolution. Observations from a new 22.1-m flux tower situated within a residential neighborhood in Phoenix, Arizona, are utilized to evaluate the ability of the urbanized WRF to resolve finescale surface energy balance (SEB) when using the urban classes derived from the 30-m-resolution National Land Cover Database. Modeled SEB response to a large seasonal variation of net radiation forcing was tested during synoptically quiescent periods of high pressure in winter 2011 and premonsoon summer 2012. Results are presented from simulations employing five nested domains down to 333-m horizontal resolution. A comparative analysis of model cases testing parameterization of physical processes was done using four configurations of urban parameterization for the bulk urban scheme versus three representations with the Urban Canopy Model (UCM) scheme, and also for two types of planetary boundary layer parameterization: the local Mellor–Yamada–Janjić scheme and the nonlocal Yonsei University scheme. Diurnal variation in SEB constituent fluxes is examined in relation to surface-layer stability and modeled diagnostic variables. Improvement is found when adapting UCM for Phoenix with reduced errors in the SEB components. Finer model resolution is seen to have insignificant (<1 standard deviation) influence on mean absolute percent difference of 30-min diurnal mean SEB terms.

ContributorsShaffer, Stephen (Author) / Chow, Winston, 1951- (Author) / Georgescu, Matei (Author) / Hyde, Peter (Author) / Jenerette, G. D. (Author) / Mahalov, Alex (Author) / Moustaoui, Mohamed (Author) / Ruddell, Benjamin (Author) / College of Liberal Arts and Sciences (Contributor)
Created2015-06-11
129259-Thumbnail Image.png
Description

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all

What's a profession without a code of ethics? Being a legitimate profession almost requires drafting a code and, at least nominally, making members follow it. Codes of ethics (henceforth “codes”) exist for a number of reasons, many of which can vary widely from profession to profession - but above all they are a form of codified self-regulation. While codes can be beneficial, it argues that when we scratch below the surface, there are many problems at their root. In terms of efficacy, codes can serve as a form of ethical window dressing, rather than effective rules for behavior. But even more that, codes can degrade the meaning behind being a good person who acts ethically for the right reasons.

Created2013-11-30
128778-Thumbnail Image.png
Description

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems

Online communities are becoming increasingly important as platforms for large-scale human cooperation. These communities allow users seeking and sharing professional skills to solve problems collaboratively. To investigate how users cooperate to complete a large number of knowledge-producing tasks, we analyze Stack Exchange, one of the largest question and answer systems in the world. We construct attention networks to model the growth of 110 communities in the Stack Exchange system and quantify individual answering strategies using the linking dynamics on attention networks. We identify two answering strategies. Strategy A aims at performing maintenance by doing simple tasks, whereas strategy B aims at investing time in doing challenging tasks. Both strategies are important: empirical evidence shows that strategy A decreases the median waiting time for answers and strategy B increases the acceptance rate of answers. In investigating the strategic persistence of users, we find that users tends to stick on the same strategy over time in a community, but switch from one strategy to the other across communities. This finding reveals the different sets of knowledge and skills between users. A balance between the population of users taking A and B strategies that approximates 2:1, is found to be optimal to the sustainable growth of communities.

ContributorsWu, Lingfei (Author) / Baggio, Jacopo (Author) / Janssen, Marco (Author) / ASU-SFI Center for Biosocial Complex Systems (Contributor)
Created2016-03-02
128687-Thumbnail Image.png
Description

Proteins can exist as multiple proteoforms in vivo, as a result of alternative splicing and single-nucleotide polymorphisms (SNPs), as well as posttranslational processing. To address their clinical significance in a context of diagnostic information, proteoforms require a more in-depth analysis. Mass spectrometric immunoassays (MSIA) have been devised for studying structural

Proteins can exist as multiple proteoforms in vivo, as a result of alternative splicing and single-nucleotide polymorphisms (SNPs), as well as posttranslational processing. To address their clinical significance in a context of diagnostic information, proteoforms require a more in-depth analysis. Mass spectrometric immunoassays (MSIA) have been devised for studying structural diversity in human proteins. MSIA enables protein profiling in a simple and high-throughput manner, by combining the selectivity of targeted immunoassays, with the specificity of mass spectrometric detection. MSIA has been used for qualitative and quantitative analysis of single and multiple proteoforms, distinguishing between normal fluctuations and changes related to clinical conditions. This mini review offers an overview of the development and application of mass spectrometric immunoassays for clinical and population proteomics studies. Provided are examples of some recent developments, and also discussed are the trends and challenges in mass spectrometry-based immunoassays for the next-phase of clinical applications.

ContributorsTrenchevska, Olgica (Author) / Nelson, Randall (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2016-03-17
128933-Thumbnail Image.png
Description

Introduction: Apolipoprotein C-III (apoC-III) regulates triglyceride (TG) metabolism. In plasma, apoC-III exists in non-sialylated (apoC-III0a without glycosylation and apoC-III[subscript 0b] with glycosylation), monosialylated (apoC-III1) or disialylated (apoC-III2) proteoforms. Our aim was to clarify the relationship between apoC-III sialylation proteoforms with fasting plasma TG concentrations.

Methods: In 204 non-diabetic adolescent participants, the

Introduction: Apolipoprotein C-III (apoC-III) regulates triglyceride (TG) metabolism. In plasma, apoC-III exists in non-sialylated (apoC-III0a without glycosylation and apoC-III[subscript 0b] with glycosylation), monosialylated (apoC-III1) or disialylated (apoC-III2) proteoforms. Our aim was to clarify the relationship between apoC-III sialylation proteoforms with fasting plasma TG concentrations.

Methods: In 204 non-diabetic adolescent participants, the relative abundance of apoC-III plasma proteoforms was measured using mass spectrometric immunoassay.

Results: Compared with the healthy weight subgroup (n = 16), the ratios of apoC-III0a, apoC-III0b, and apoC-III1 to apoC-III2 were significantly greater in overweight (n = 33) and obese participants (n = 155). These ratios were positively correlated with BMI z-scores and negatively correlated with measures of insulin sensitivity (S[subscript i]). The relationship of apoC-III1 / apoC-III2 with Si persisted after adjusting for BMI (p = 0.02). Fasting TG was correlated with the ratio of apoC-III0a / apoC-III2 (r = 0.47, p<0.001), apoC-III0b / apoC-III2 (r = 0.41, p<0.001), apoC-III1 / apoC-III2 (r = 0.43, p<0.001). By examining apoC-III concentrations, the association of apoC-III proteoforms with TG was driven by apoC-III0a (r = 0.57, p<0.001), apoC-III0b (r = 0.56. p<0.001) and apoC-III1 (r = 0.67, p<0.001), but not apoC-III2 (r = 0.006, p = 0.9) concentrations, indicating that apoC-III relationship with plasma TG differed in apoC-III2 compared with the other proteoforms.

Conclusion: We conclude that apoC-III0a, apoC-III0b, and apoC-III1, but not apoC-III2 appear to be under metabolic control and associate with fasting plasma TG. Measurement of apoC-III proteoforms can offer insights into the biology of TG metabolism in obesity.

ContributorsYassine, Hussein N. (Author) / Trenchevska, Olgica (Author) / Ramrakhiani, Ambika (Author) / Parekh, Aarushi (Author) / Koska, Juraj (Author) / Walker, Ryan W. (Author) / Billheimer, Dean (Author) / Reaven, Peter D. (Author) / Yen, Frances T. (Author) / Nelson, Randall (Author) / Goran, Michael I. (Author) / Nedelkov, Dobrin (Author) / Biodesign Institute (Contributor)
Created2015-12-03