Search Content

Spatiotemporal Patterns and Predictability of Cyberattacks

Description

A relatively unexplored issue in cybersecurity science and engineering is whether there exist intrinsic patterns of cyberattacks. Conventional wisdom favors absence of such patterns due to the overwhelming complexity of the modern cyberspace. Surprisingly, through a detailed analysis of an extensive data set that records the time-dependent frequencies of attacks…

A relatively unexplored issue in cybersecurity science and engineering is whether there exist intrinsic patterns of cyberattacks. Conventional wisdom favors absence of such patterns due to the overwhelming complexity of the modern cyberspace. Surprisingly, through a detailed analysis of an extensive data set that records the time-dependent frequencies of attacks over a relatively wide range of consecutive IP addresses, we successfully uncover intrinsic spatiotemporal patterns underlying cyberattacks, where the term “spatio” refers to the IP address space. In particular, we focus on analyzing macroscopic properties of the attack traffic flows and identify two main patterns with distinct spatiotemporal characteristics: deterministic and stochastic. Strikingly, there are very few sets of major attackers committing almost all the attacks, since their attack “fingerprints” and target selection scheme can be unequivocally identified according to the very limited number of unique spatiotemporal characteristics, each of which only exists on a consecutive IP region and differs significantly from the others. We utilize a number of quantitative measures, including the flux-fluctuation law, the Markov state transition probability matrix, and predictability measures, to characterize the attack patterns in a comprehensive manner. A general finding is that the attack patterns possess high degrees of predictability, potentially paving the way to anticipating and, consequently, mitigating or even preventing large-scale cyberattacks using macroscopic approaches.

ContributorsChen, Yu-Zhong (Author) / Huang, Zi-Gang (Author) / Xu, Shouhuai (Author) / Lai, Ying-Cheng (Author) / Ira A. Fulton Schools of Engineering (Contributor)

Created2015-05-20

Optimization and Resilience of Complex Supply-Demand Networks

Description

Supply-demand processes take place on a large variety of real-world networked systems ranging from power grids and the internet to social networking and urban systems. In a modern infrastructure, supply-demand systems are constantly expanding, leading to constant increase in load requirement for resources and consequently, to problems such as low…

Supply-demand processes take place on a large variety of real-world networked systems ranging from power grids and the internet to social networking and urban systems. In a modern infrastructure, supply-demand systems are constantly expanding, leading to constant increase in load requirement for resources and consequently, to problems such as low efficiency, resource scarcity, and partial system failures. Under certain conditions global catastrophe on the scale of the whole system can occur through the dynamical process of cascading failures. We investigate optimization and resilience of time-varying supply-demand systems by constructing network models of such systems, where resources are transported from the supplier sites to users through various links. Here by optimization we mean minimization of the maximum load on links, and system resilience can be characterized using the cascading failure size of users who fail to connect with suppliers.

We consider two representative classes of supply schemes: load driven supply and fix fraction supply. Our findings are: (1) optimized systems are more robust since relatively smaller cascading failures occur when triggered by external perturbation to the links; (2) a large fraction of links can be free of load if resources are directed to transport through the shortest paths; (3) redundant links in the performance of the system can help to reroute the traffic but may undesirably transmit and enlarge the failure size of the system; (4) the patterns of cascading failures depend strongly upon the capacity of links; (5) the specific location of the trigger determines the specific route of cascading failure, but has little effect on the final cascading size; (6) system expansion typically reduces the efficiency; and (7) when the locations of the suppliers are optimized over a long expanding period, fewer suppliers are required. These results hold for heterogeneous networks in general, providing insights into designing optimal and resilient complex supply-demand systems that expand constantly in time.

ContributorsZhang, Si-Ping (Author) / Huang, Zi-Gang (Author) / Dong, Jia-Qi (Author) / Eisenberg, Daniel (Author) / Seager, Thomas (Author) / Lai, Ying-Cheng (Author) / Ira A. Fulton Schools of Engineering (Contributor)

Created2015-06-23

High pH Microbial Ecosystems in a Newly Discovered, Ephemeral, Serpentinizing Fluid Seep at Yanartaş (Chimera), Turkey

Description

Gas seeps emanating from Yanartaş (Chimera), Turkey, have been documented for thousands of years. Active serpentinization produces hydrogen and a range of carbon gases that may provide fuel for life. Here we report a newly discovered, ephemeral fluid seep emanating from a small gas vent at Yanartaş. Fluids and biofilms…

Gas seeps emanating from Yanartaş (Chimera), Turkey, have been documented for thousands of years. Active serpentinization produces hydrogen and a range of carbon gases that may provide fuel for life. Here we report a newly discovered, ephemeral fluid seep emanating from a small gas vent at Yanartaş. Fluids and biofilms were sampled at the source and points downstream. We describe site conditions, and provide microbiological data in the form of enrichment cultures, Scanning electron microscopy (SEM), carbon and nitrogen isotopic composition of solids, and PCR screens of nitrogen cycle genes. Source fluids are pH 11.95, with a Ca:Mg of ~200, and sediments under the ignited gas seep measure 60°C. Collectively, these data suggest the fluid is the product of active serpentinization at depth. Source sediments are primarily calcite and alteration products (chlorite and montmorillonite). Downstream, biofilms are mixed with montmorillonite. SEM shows biofilms distributed homogeneously with carbonates. Organic carbon accounts for 60% of the total carbon at the source, decreasing downstream to <15% as inorganic carbon precipitates. δ¹³C ratios of the organic carbon fraction of solids are depleted (−25 to −28‰) relative to the carbonates (−11 to −20‰). We conclude that heterotrophic processes are dominant throughout the surface ecosystem, and carbon fixation may be key down channel. δ¹⁵N ratios ~3‰, and absence of nifH in extracted DNA suggest that nitrogen fixation is not occurring in sediments. However, the presence of narG and nirS at most locations and in enrichments indicates genomic potential for nitrate and nitrite reduction. This small seep with shallow run-off is likely ephemeral, but abundant preserved microterracettes in the outflow and the surrounding area suggest it has been present for some time. This site and others like it present an opportunity for investigations of preserved deep biosphere signatures, and subsurface-surface interactions.

ContributorsMeyer-Dombard, D'Arcy R. (Author) / Woycheese, Kristin M. (Author) / Yargicoglu, Erin N. (Author) / Cardace, Dawn (Author) / Shock, Everett (Author) / Gulecal-Pektas, Yasemin (Author) / Temel, Mustafa (Author) / College of Liberal Arts and Sciences (Contributor)

Created2015-01-19

Learning Energy Consumption and Demand Models Through Data Mining for Reverse Engineering

Description

The estimation of energy demand (by power plants) has traditionally relied on historical energy use data for the region(s) that a plant produces for. Regression analysis, artificial neural network and Bayesian theory are the most common approaches for analysing these data. Such data and techniques do not generate reliable results.…

The estimation of energy demand (by power plants) has traditionally relied on historical energy use data for the region(s) that a plant produces for. Regression analysis, artificial neural network and Bayesian theory are the most common approaches for analysing these data. Such data and techniques do not generate reliable results. Consequently, excess energy has to be generated to prevent blackout; causes for energy surge are not easily determined; and potential energy use reduction from energy efficiency solutions is usually not translated into actual energy use reduction. The paper highlights the weaknesses of traditional techniques, and lays out a framework to improve the prediction of energy demand by combining energy use models of equipment, physical systems and buildings, with the proposed data mining algorithms for reverse engineering. The research team first analyses data samples from large complex energy data, and then, presents a set of computationally efficient data mining algorithms for reverse engineering. In order to develop a structural system model for reverse engineering, two focus groups are developed that has direct relation with cause and effect variables. The research findings of this paper includes testing out different sets of reverse engineering algorithms, understand their output patterns and modify algorithms to elevate accuracy of the outputs.

ContributorsNaganathan, Hariharan (Author) / Chong, Oswald (Author) / Ye, Long (Author) / Ira A. Fulton School of Engineering (Contributor)

Created2015-12-09

Low-Investment Energy Retrofit Framework for Small and Medium Office Buildings

Description

Small and medium office buildings consume a significant parcel of the U.S. building stock energy consumption. Still, owners lack resources and experience to conduct detailed energy audits and retrofit analysis. We present an eight-steps framework for an energy retrofit assessment in small and medium office buildings. Through a bottom-up approach…

Small and medium office buildings consume a significant parcel of the U.S. building stock energy consumption. Still, owners lack resources and experience to conduct detailed energy audits and retrofit analysis. We present an eight-steps framework for an energy retrofit assessment in small and medium office buildings. Through a bottom-up approach and a web-based retrofit toolkit tested on a case study in Arizona, this methodology was able to save about 50% of the total energy consumed by the case study building, depending on the adopted measures and invested capital. While the case study presented is a deep energy retrofit, the proposed framework is effective in guiding the decision-making process that precedes any energy retrofit, deep or light.

ContributorsRios, Fernanda (Author) / Parrish, Kristen (Author) / Chong, Oswald (Author) / Ira A. Fulton School of Engineering (Contributor)

Created2016-05-20

A Non-Stationary Analysis Using Ensemble Empirical Mode Decomposition to Detect Anomalies in Building Energy Consumption

Description

Commercial buildings’ consumption is driven by multiple factors that include occupancy, system and equipment efficiency, thermal heat transfer, equipment plug loads, maintenance and operational procedures, and outdoor and indoor temperatures. A modern building energy system can be viewed as a complex dynamical system that is interconnected and influenced by external…

Commercial buildings’ consumption is driven by multiple factors that include occupancy, system and equipment efficiency, thermal heat transfer, equipment plug loads, maintenance and operational procedures, and outdoor and indoor temperatures. A modern building energy system can be viewed as a complex dynamical system that is interconnected and influenced by external and internal factors. Modern large scale sensor measures some physical signals to monitor real-time system behaviors. Such data has the potentials to detect anomalies, identify consumption patterns, and analyze peak loads. The paper proposes a novel method to detect hidden anomalies in commercial building energy consumption system. The framework is based on Hilbert-Huang transform and instantaneous frequency analysis. The objectives are to develop an automated data pre-processing system that can detect anomalies and provide solutions with real-time consumption database using Ensemble Empirical Mode Decomposition (EEMD) method. The finding of this paper will also include the comparisons of Empirical mode decomposition and Ensemble empirical mode decomposition of three important type of institutional buildings.

ContributorsNaganathan, Hariharan (Author) / Chong, Oswald (Author) / Huang, Zigang (Author) / Cheng, Ying (Author) / Ira A. Fulton School of Engineering (Contributor)

Created2016-05-20

Optical Computed Tomography for Spatially Isotropic Four-Dimensional Imaging of Live Single Cells

Description

Quantitative three-dimensional (3D) computed tomography (CT) imaging of living single cells enables orientation-independent morphometric analysis of the intricacies of cellular physiology. Since its invention, x-ray CT has become indispensable in the clinic for diagnostic and prognostic purposes due to its quantitative absorption-based imaging in true 3D that allows objects of…

Quantitative three-dimensional (3D) computed tomography (CT) imaging of living single cells enables orientation-independent morphometric analysis of the intricacies of cellular physiology. Since its invention, x-ray CT has become indispensable in the clinic for diagnostic and prognostic purposes due to its quantitative absorption-based imaging in true 3D that allows objects of interest to be viewed and measured from any orientation. However, x-ray CT has not been useful at the level of single cells because there is insufficient contrast to form an image. Recently, optical CT has been developed successfully for fixed cells, but this technology called Cell-CT is incompatible with live-cell imaging due to the use of stains, such as hematoxylin, that are not compatible with cell viability. We present a novel development of optical CT for quantitative, multispectral functional 4D (three spatial + one spectral dimension) imaging of living single cells. The method applied to immune system cells offers truly isotropic 3D spatial resolution and enables time-resolved imaging studies of cells suspended in aqueous medium. Using live-cell optical CT, we found a heterogeneous response to mitochondrial fission inhibition in mouse macrophages and differential basal remodeling of small (0.1 to 1 fl) and large (1 to 20 fl) nuclear and mitochondrial structures on a 20- to 30-s time scale in human myelogenous leukemia cells. Because of its robust 3D measurement capabilities, live-cell optical CT represents a powerful new tool in the biomedical research field.

ContributorsKelbauskas, Laimonas (Author) / Shetty, Rishabh Manoj (Author) / Cao, Bin (Author) / Wang, Kuo-Chen (Author) / Smith, Dean (Author) / Wang, Hong (Author) / Chao, Shi-Hui (Author) / Gangaraju, Sandhya (Author) / Ashcroft, Brian (Author) / Kritzer, Margaret (Author) / Glenn, Honor (Author) / Johnson, Roger (Author) / Meldrum, Deirdre (Author) / Biodesign Institute (Contributor)

Created2017-12-06

Semi-Supervised Energy Modeling (SSEM) for Building Clusters Using Machine Learning Techniques

Description

There are many data mining and machine learning techniques to manage large sets of complex energy supply and demand data for building, organization and city. As the amount of data continues to grow, new data analysis methods are needed to address the increasing complexity. Using data from the energy loss…

There are many data mining and machine learning techniques to manage large sets of complex energy supply and demand data for building, organization and city. As the amount of data continues to grow, new data analysis methods are needed to address the increasing complexity. Using data from the energy loss between the supply (energy production sources) and demand (buildings and cities consumption), this paper proposes a Semi-Supervised Energy Model (SSEM) to analyse different loss factors for a building cluster. This is done by deep machine learning by training machines to semi-supervise the learning, understanding and manage the process of energy losses. Semi-Supervised Energy Model (SSEM) aims at understanding the demand-supply characteristics of a building cluster and utilizes the confident unlabelled data (loss factors) using deep machine learning techniques. The research findings involves sample data from one of the university campuses and presents the output, which provides an estimate of losses that can be reduced. The paper also provides a list of loss factors that contributes to the total losses and suggests a threshold value for each loss factor, which is determined through real time experiments. The conclusion of this paper provides a proposed energy model that can provide accurate numbers on energy demand, which in turn helps the suppliers to adopt such a model to optimize their supply strategies.

ContributorsNaganathan, Hariharan (Author) / Chong, Oswald (Author) / Chen, Xue-wen (Author) / Ira A. Fulton Schools of Engineering (Contributor)

Created2015-09-14

Korarchaeota Diversity, Biogeography, and Abundance in Yellowstone and Great Basin Hot Springs and Ecological Niche Modeling Based on Machine Learning

Description

Over 100 hot spring sediment samples were collected from 28 sites in 12 areas/regions, while recording as many coincident geochemical properties as feasible (>60 analytes). PCR was used to screen samples for Korarchaeota 16S rRNA genes. Over 500 Korarchaeota 16S rRNA genes were screened by RFLP analysis and 90 were…

Over 100 hot spring sediment samples were collected from 28 sites in 12 areas/regions, while recording as many coincident geochemical properties as feasible (>60 analytes). PCR was used to screen samples for Korarchaeota 16S rRNA genes. Over 500 Korarchaeota 16S rRNA genes were screened by RFLP analysis and 90 were sequenced, resulting in identification of novel Korarchaeota phylotypes and exclusive geographical variants. Korarchaeota diversity was low, as in other terrestrial geothermal systems, suggesting a marine origin for Korarchaeota with subsequent niche-invasion into terrestrial systems. Korarchaeota endemism is consistent with endemism of other terrestrial thermophiles and supports the existence of dispersal barriers. Korarchaeota were found predominantly in >55°C springs at pH 4.7–8.5 at concentrations up to 6.6×10⁶ 16S rRNA gene copies g^-1 wet sediment. In Yellowstone National Park (YNP), Korarchaeota were most abundant in springs with a pH range of 5.7 to 7.0. High sulfate concentrations suggest these fluids are influenced by contributions from hydrothermal vapors that may be neutralized to some extent by mixing with water from deep geothermal sources or meteoric water. In the Great Basin (GB), Korarchaeota were most abundant at spring sources of pH<7.2 with high particulate C content and high alkalinity, which are likely to be buffered by the carbonic acid system. It is therefore likely that at least two different geological mechanisms in YNP and GB springs create the neutral to mildly acidic pH that is optimal for Korarchaeota. A classification support vector machine (C-SVM) trained on single analytes, two analyte combinations, or vectors from non-metric multidimensional scaling models was able to predict springs as Korarchaeota-optimal or sub-optimal habitats with accuracies up to 95%. To our knowledge, this is the most extensive analysis of the geochemical habitat of any high-level microbial taxon and the first application of a C-SVM to microbial ecology.

ContributorsMiller-Coleman, Robin L. (Author) / Dodsworth, Jeremy A. (Author) / Ross, Christian A. (Author) / Shock, Everett (Author) / Williams, Amanda (Author) / Hartnett, Hilairy (Author) / McDonald, Austin I. (Author) / Havig, Jeff (Author) / Hedlund, Brian P. (Author) / College of Liberal Arts and Sciences (Contributor)

Created2012-05-04

A Metastable Equilibrium Model for the Relative Abundances of Microbial Phyla in a Hot Spring

Description

Many studies link the compositions of microbial communities to their environments, but the energetics of organism-specific biomass synthesis as a function of geochemical variables have rarely been assessed. We describe a thermodynamic model that integrates geochemical and metagenomic data for biofilms sampled at five sites along a thermal and chemical…

Many studies link the compositions of microbial communities to their environments, but the energetics of organism-specific biomass synthesis as a function of geochemical variables have rarely been assessed. We describe a thermodynamic model that integrates geochemical and metagenomic data for biofilms sampled at five sites along a thermal and chemical gradient in the outflow channel of the hot spring known as “Bison Pool” in Yellowstone National Park. The relative abundances of major phyla in individual communities sampled along the outflow channel are modeled by computing metastable equilibrium among model proteins with amino acid compositions derived from metagenomic sequences. Geochemical conditions are represented by temperature and activities of basis species, including pH and oxidation-reduction potential quantified as the activity of dissolved hydrogen. By adjusting the activity of hydrogen, the model can be tuned to closely approximate the relative abundances of the phyla observed in the community profiles generated from BLAST assignments. The findings reveal an inverse relationship between the energy demand to form the proteins at equal thermodynamic activities and the abundance of phyla in the community. The distance from metastable equilibrium of the communities, assessed using an equation derived from energetic considerations that is also consistent with the information-theoretic entropy change, decreases along the outflow channel. Specific divergences from metastable equilibrium, such as an underprediction of the relative abundances of phototrophic organisms at lower temperatures, can be explained by considering additional sources of energy and/or differences in growth efficiency. Although the metabolisms used by many members of these communities are driven by chemical disequilibria, the results support the possibility that higher-level patterns of chemotrophic microbial ecosystems are shaped by metastable equilibrium states that depend on both the composition of biomass and the environmental conditions.

ContributorsDick, Jeffrey M. (Author) / Shock, Everett (Author) / College of Liberal Arts and Sciences (Contributor)

Created2013-09-02

ASU Scholarship Showcase

Filtering by

Spatiotemporal Patterns and Predictability of Cyberattacks

Optimization and Resilience of Complex Supply-Demand Networks

High pH Microbial Ecosystems in a Newly Discovered, Ephemeral, Serpentinizing Fluid Seep at Yanartaş (Chimera), Turkey

Learning Energy Consumption and Demand Models Through Data Mining for Reverse Engineering

Low-Investment Energy Retrofit Framework for Small and Medium Office Buildings

A Non-Stationary Analysis Using Ensemble Empirical Mode Decomposition to Detect Anomalies in Building Energy Consumption

Optical Computed Tomography for Spatially Isotropic Four-Dimensional Imaging of Live Single Cells

Semi-Supervised Energy Modeling (SSEM) for Building Clusters Using Machine Learning Techniques

Korarchaeota Diversity, Biogeography, and Abundance in Yellowstone and Great Basin Hot Springs and Ecological Niche Modeling Based on Machine Learning

A Metastable Equilibrium Model for the Relative Abundances of Microbial Phyla in a Hot Spring