Matching Items (10)
Description
Researchers across a variety of fields are often interested in determining if data are of a random nature or if they exhibit patterning which may be the result of some alternative and potentially more interesting process. This dissertation explores a family of statistical methods, i.e. space-time interaction tests, designed to detect structure within three-dimensional event data. These tests, widely employed in the fields of spatial epidemiology, criminology, ecology and beyond, are used to identify synergistic interaction across the spatial and temporal dimensions of a series of events. Exploration is needed to better understand these methods and determine how their results may be affected by data quality problems commonly encountered in their implementation; specifically, how inaccuracy and/or uncertainty in the input data analyzed by the methods may impact subsequent results. Additionally, known shortcomings of the methods must be ameliorated. The contributions of this dissertation are twofold: it develops a more complete understanding of how input data quality problems impact the results of a number of global and local tests of space-time interaction and it formulates an improved version of one global test which accounts for the previously identified problem of population shift bias. A series of simulation experiments reveal the global tests of space-time interaction explored here to be dramatically affected by the aforementioned deficiencies in the quality of the input data. It is shown that in some cases, a conservative degree of these common data problems can completely obscure evidence of space-time interaction and in others create it where it does not exist. Conversely, a local metric of space-time interaction examined here demonstrates a surprising robustness in the face of these same deficiencies. This local metric is revealed to be only minimally affected by the inaccuracies and incompleteness introduced in these experiments. 
Finally, enhancements to one of the global tests are presented which solve the problem of population shift bias associated with the test and better contextualize and visualize its results, thereby enhancing its utility for practitioners.
Contributors: Malizia, Nicholas (Author) / Anselin, Luc (Thesis advisor) / Murray, Alan (Committee member) / Rey, Sergio (Committee member) / Arizona State University (Publisher)
Created: 2013
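The global space-time interaction tests described in the abstract above can be illustrated with a minimal sketch. The abstract does not name the specific tests studied, so the Knox statistic is used here only as one well-known example of a global test; the function names, the thresholds `delta` and `tau`, and the permutation count are all assumptions made for illustration. The statistic counts event pairs that are close in both space and time, and a permutation of the time stamps gives a pseudo p-value.

```python
import math
import random
from itertools import combinations


def knox_statistic(points, times, delta, tau):
    """Count event pairs within distance delta AND within time gap tau."""
    count = 0
    for i, j in combinations(range(len(points)), 2):
        close_in_space = math.dist(points[i], points[j]) <= delta
        close_in_time = abs(times[i] - times[j]) <= tau
        if close_in_space and close_in_time:
            count += 1
    return count


def knox_test(points, times, delta, tau, permutations=999, seed=0):
    """Permutation inference: shuffle time stamps across fixed locations."""
    rng = random.Random(seed)
    observed = knox_statistic(points, times, delta, tau)
    shuffled = list(times)
    at_least_as_large = 0
    for _ in range(permutations):
        rng.shuffle(shuffled)
        if knox_statistic(points, shuffled, delta, tau) >= observed:
            at_least_as_large += 1
    # Pseudo p-value; the +1 terms count the observed arrangement itself.
    p_value = (at_least_as_large + 1) / (permutations + 1)
    return observed, p_value
```

Because the statistic depends directly on recorded coordinates and time stamps, perturbing either input changes which pairs fall under the thresholds, which is the kind of sensitivity to data quality the dissertation examines.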
Description
This dissertation addresses the research challenge of developing efficient new methods for discovering useful patterns and knowledge in large volumes of electronically collected spatiotemporal activity data. I propose to analyze three types of such spatiotemporal activity data in a methodological framework that integrates spatial analysis, data mining, machine learning, and geovisualization techniques. Three different types of spatiotemporal activity data were collected through different data collection approaches: (1) crowd sourced geo-tagged digital photos, representing people's travel activity, were retrieved from the website Panoramio.com through information retrieval techniques; (2) the same techniques were used to crawl crowd sourced GPS trajectory data and related metadata of their daily activities from the website OpenStreetMap.org; and finally (3) preschool children's daily activities and interactions tagged with time and geographical location were collected with a novel TabletPC-based behavioral coding system. The proposed methodology is applied to these data to (1) automatically recommend optimal multi-day and multi-stay travel itineraries for travelers based on discovered attractions from geo-tagged photos, (2) automatically detect movement types of unknown moving objects from GPS trajectories, and (3) explore dynamic social and socio-spatial patterns of preschool children's behavior from both geographic and social perspectives.
Contributors: Li, Xun (Author) / Anselin, Luc (Thesis advisor) / Koschinsky, Julia (Committee member) / Maciejewski, Ross (Committee member) / Rey, Sergio (Committee member) / Griffin, William (Committee member) / Arizona State University (Publisher)
Created: 2012
Description
The shortest path between two locations is important for spatial analysis, location modeling, and wayfinding tasks. Depending on permissible movement and availability of data, the shortest path is either derived from a pre-defined transportation network or constructed in continuous space. However, continuous space movement adds substantial complexity to identifying the shortest path, as the influence of obstacles has to be considered to avoid errors and biases in a derived path. This obstacle-avoiding shortest path in continuous space has been referred to as the Euclidean shortest path (ESP) and has attracted the attention of many researchers. It has been proven that constructing a graph is an effective approach to limit the infinite search options associated with continuous space, reducing the problem to a finite set of potential paths. To date, various methods have been developed for ESP derivation. However, their computational efficiency is limited due to fundamental limitations in graph construction. In this research, a novel algorithm is developed for efficient identification of a graph guaranteed to contain the ESP. This new approach is referred to as the convexpath algorithm, and exploits spatial knowledge and GIS functionality to efficiently construct a graph. The convexpath algorithm utilizes the notion of a convex hull to simultaneously identify relevant obstacles and construct the graph. Additionally, a spatial filtering technique based on intermediate shortest paths enhances intelligent identification of relevant obstacles. Empirical applications show that the convexpath algorithm is able to construct a graph and derive the ESP with significantly improved efficiency compared to visibility and local visibility graph approaches. Furthermore, to boost the performance of convexpath in big data environments, a parallelization approach is proposed and applied to exploit computationally intensive spatial operations of convexpath. 
Multicore CPU parallelization demonstrates noticeable efficiency gain over the sequential convexpath. Finally, spatial representation and approximation issues associated with raster-based approximation of the ESP are assessed. This dissertation provides a comprehensive treatment of the ESP, and details an important approach for deriving an optimal ESP in real time.
Contributors: Hong, Insu (Author) / Murray, Alan T. (Thesis advisor) / Kuby, Micheal (Committee member) / Rey, Sergio (Committee member) / Arizona State University (Publisher)
Created: 2015
Description
After a relative period of growth (2000-06), the U.S. economy experienced a sharp decline (2007-09) from which it is yet to recover. One of the primary factors that contributed to this decline was the sub-prime mortgage crisis, which triggered a significant increase in residential foreclosures and a slump in housing values nationwide. Most studies examining this crisis have explained the high rate of foreclosures by associating it with socio-economic characteristics of the people affected and their financial decisions with respect to home mortgages. Though these studies were successful in identifying the section of the population facing foreclosures, they were mostly silent about region-wide factors that contributed to the crisis. This resulted in the absence of studies that could identify indicators of resiliency and robustness in urban areas that are affected by economic perturbations but had different outcomes. This study addresses this shortcoming by incorporating three concepts. First, it situates the foreclosure crisis in the broader regional economy by considering the concept of regional economic resiliency. Second, it includes the concept of housing submarkets, capturing the role of housing market dynamics in contributing to market performance. Third, the notion of urban growth pattern is included in an urban sprawl index to examine whether factors related to sprawl could partly explain the variation in foreclosures. These, along with other important socio-economic and housing characteristics, are used in this study to better understand the variation in impacts of the current foreclosure crisis. This study is carried out for all urban counties in the U.S. between 2000 and 2009. The associations between foreclosure rates and different variables are established using spatial regression models. 
Based on these models, this dissertation argues that counties with a higher degree of employment diversity, encouragement for small business enterprises, and less dependence on housing-related industries experienced fewer foreclosures. In addition, this thesis concludes that the spatial location of foreclosed properties is a function of the location of origination of sub-prime mortgages and not the spatial location of the properties per se. Also importantly, the study found that counties with a high number of dissimilar housing submarkets experienced more foreclosures.
Contributors: Ray, Indro (Author) / Guhathakurta, Subhrajit (Thesis advisor) / Rey, Sergio (Committee member) / Phillips, Rhonda (Committee member) / Arizona State University (Publisher)
Created: 2012
Description
Change of residence is a commonly occurring event in urban areas. It reflects how people interact with the social or physical environment. Thus, by exploring the movement patterns of residential changes, geographers and other scholars hope to learn more about the reasons and impacts associated with residential mobility, and to better understand how humans and the environment mutually interact. This is especially meaningful if exploration is based on micro scale movements, since residential changes within a city or a county reflect how the urban structure and community composition interact. Local differentiation, as an inevitable feature among movements at different places, can best be examined based on data at the micro scale. Such work is meaningful, but there have not been appropriate approaches for assessment and evaluation. The majority of traditional methods concentrate more on aggregate movement data at a national scale. So, in order to facilitate research examining movement patterns from a mass of individual residential changes at a micro scale, a toolkit, implemented by computational programming, is introduced in this dissertation to integrate both exploratory and confirmatory methods. This toolkit also employs a creative method to explore the spatial autocorrelation of residential movements, reflecting the local effects involved in this social event. The effectiveness and efficiency of this toolkit are examined through a concrete application involving 2,363 residential movements in Franklin County, Ohio.
Contributors: Liu, Yin (Author) / Murray, Alan (Thesis advisor) / Rey, Sergio (Committee member) / Wentz, Elizabeth (Committee member) / Arizona State University (Publisher)
Created: 2012
Description
In the study of regional economic growth and convergence, the distribution dynamics approach which interrogates the evolution of the cross-sectional distribution as a whole and is concerned with both the external and internal dynamics of the distribution has received wide usage. However, many methodological issues remain to be resolved before valid inferences and conclusions can be drawn from empirical research. Among them, spatial effects including spatial heterogeneity and spatial dependence invalidate the assumption of independent and identical distributions underlying the conventional maximum likelihood techniques while the availability of small samples in regional settings questions the usage of the asymptotic properties. This dissertation is comprised of three papers targeted at addressing these two issues. The first paper investigates whether the conventional regional income mobility estimators are still suitable in the presence of spatial dependence and/or a small sample. It is approached through a series of Monte Carlo experiments which require the proposal of a novel data generating process (DGP) capable of generating spatially dependent time series. The second paper moves to the statistical tests for detecting specific forms of spatial (spatiotemporal) effects in the discrete Markov chain model, investigating their robustness to the alternative spatial effect, sensitivity to discretization granularity, and properties in small sample settings. The third paper proposes discrete kernel estimators with cross-validated bandwidths as an alternative to maximum likelihood estimators in small sample settings. It is demonstrated that the performance of discrete kernel estimators offers improvement when the sample size is small. 
Taken together, the three papers constitute an endeavor to relax the restrictive assumptions of spatial independence and spatial homogeneity, and to demonstrate the difference between the small sample and asymptotic properties of conventionally adopted maximum likelihood estimators, working towards a more valid inferential framework for the distribution dynamics approach to the study of regional economic growth and convergence.
Contributors: Kang, Wei (Author) / Rey, Sergio (Thesis advisor) / Fotheringham, A. Stewart (Committee member) / Ye, Xinyue (Committee member) / Arizona State University (Publisher)
Created: 2018
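The conventional maximum likelihood estimator for a discrete Markov chain, which the abstract above takes as its baseline, can be sketched as follows. This is an illustrative sketch rather than the dissertation's code: the function name `transition_matrix`, the input layout (one sequence of integer income classes per region), and the handling of classes that are never left are assumptions. Each transition probability is estimated as the count of observed moves from class i to class j divided by all observed moves out of class i.

```python
def transition_matrix(sequences, k):
    """Maximum likelihood estimate of a k-class Markov transition matrix.

    sequences: one list of integer class labels (0..k-1) per region,
               ordered in time. Hypothetical input layout.
    """
    counts = [[0] * k for _ in range(k)]
    for seq in sequences:
        # Pair each period's class with the class in the next period.
        for current, following in zip(seq, seq[1:]):
            counts[current][following] += 1
    probs = []
    for row in counts:
        total = sum(row)
        # A class with no observed departures gets a row of zeros here;
        # that is a choice made for this sketch, not a standard convention.
        probs.append([c / total if total else 0.0 for c in row])
    return probs
```

With few regions or short series, many cells of the count matrix are small or empty, so these ratios become unstable. That is the small sample setting in which the abstract proposes discrete kernel estimators as an alternative.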
Description
Energy use within urban building stocks is continuing to increase globally as populations expand and access to electricity improves. This projected increase in demand could require deployment of new generation capacity, but there is potential to offset some of this demand through modification of the buildings themselves. Building stocks are quasi-permanent infrastructures which have enduring influence on urban energy consumption, and research is needed to understand: 1) how development patterns constrain energy use decisions and 2) how cities can achieve energy and environmental goals given the constraints of the stock. This requires a thorough evaluation of both the growth of the stock and the spatial distribution of use throughout the city. In this dissertation, a case study in Los Angeles County, California (LAC) is used to quantify urban growth, forecast future energy use under climate change, and make recommendations for mitigating energy consumption increases. A reproducible methodological framework is included for application to other urban areas.

In LAC, residential electricity demand could increase as much as 55-68% between 2020 and 2060. Building technology lock-in has constricted the options for mitigating energy demand: major changes to the building stock itself are not possible because only a small portion of the stock is turned over every year. Aggressive and timely efficiency upgrades to residential appliances and building thermal shells can significantly offset the projected increases, potentially avoiding installation of new generation capacity, but regulations on new construction will likely be ineffectual due to the long residence time of the stock (60+ years and increasing). These findings can be extrapolated to other U.S. cities where the majority of urban expansion has already occurred, such as the older cities on the eastern coast. U.S. population is projected to increase 40% by 2060, with growth occurring in the warmer southern and western regions. In these growing cities, improving new construction buildings can help offset electricity demand increases before the city reaches the lock-in phase.
Contributors: Reyna, Janet Lorel (Author) / Chester, Mikhail V (Thesis advisor) / Gurney, Kevin (Committee member) / Reddy, T. Agami (Committee member) / Rey, Sergio (Committee member) / Arizona State University (Publisher)
Created: 2016
Description
A major challenge in health-related policy and program evaluation research is attributing underlying causal relationships where complicated processes may exist in natural or quasi-experimental settings. Spatial interaction and heterogeneity between units at individual or group levels can violate both components of the Stable-Unit-Treatment-Value-Assumption (SUTVA) that are core to the counterfactual framework, making treatment effects difficult to assess. New approaches are needed in health studies to develop spatially dynamic causal modeling methods to both derive insights from data that are sensitive to spatial differences and dependencies, and also be able to rely on a more robust, dynamic technical infrastructure needed for decision-making. To address this gap with a focus on causal applications theoretically, methodologically and technologically, I (1) develop a theoretical spatial framework (within single-level panel econometric methodology) that extends existing theories and methods of causal inference, which tend to ignore spatial dynamics; (2) demonstrate how this spatial framework can be applied in empirical research; and (3) implement a new spatial infrastructure framework that integrates and manages the required data for health systems evaluation.

The new spatially explicit counterfactual framework considers how spatial effects impact treatment choice, treatment variation, and treatment effects. To illustrate this new methodological framework, I first replicate a classic quasi-experimental study that evaluates the effect of drinking age policy on mortality in the United States from 1970 to 1984, and further extend it with a spatial perspective. In another example, I evaluate food access dynamics in Chicago from 2007 to 2014 by implementing advanced spatial analytics that better account for the complex patterns of food access, and a quasi-experimental research design to distill the impact of the Great Recession on the foodscape. Inference interpretation is sensitive to both research design framing and underlying processes that drive geographically distributed relationships. Finally, I advance a new Spatial Data Science Infrastructure to integrate and manage data in dynamic, open environments for public health systems research and decision-making. I demonstrate an infrastructure prototype in a final case study, developed in collaboration with health department officials and community organizations.
Contributors: Kolak, Marynia Aniela (Author) / Anselin, Luc (Thesis advisor) / Rey, Sergio (Committee member) / Koschinsky, Julia (Committee member) / Maciejewski, Ross (Committee member) / Arizona State University (Publisher)
Created: 2017
Description
In the field of Geographic Information Science (GIScience), we have witnessed the unprecedented data deluge brought about by the rapid advancement of high-resolution data observing technologies. For example, with the advancement of Earth Observation (EO) technologies, a massive amount of EO data including remote sensing data and other sensor observation data about earthquake, climate, ocean, hydrology, volcano, glacier, etc., are being collected on a daily basis by a wide range of organizations. In addition to the observation data, human-generated data including microblogs, photos, consumption records, evaluations, unstructured webpages and other Volunteered Geographical Information (VGI) are incessantly generated and shared on the Internet.

Meanwhile, the emerging cyberinfrastructure rapidly increases our capacity for handling such massive data with regard to data collection and management, data integration and interoperability, data transmission and visualization, high-performance computing, etc. Cyberinfrastructure (CI) consists of computing systems, data storage systems, advanced instruments and data repositories, visualization environments, and people, all linked together by software and high-performance networks to improve research productivity and enable breakthroughs that are not otherwise possible.

The Geospatial CI (GCI, or CyberGIS), as the synthesis of CI and GIScience has inherent advantages in enabling computationally intensive spatial analysis and modeling (SAM) and collaborative geospatial problem solving and decision making.

This dissertation is dedicated to addressing several critical issues and improving the performance of existing methodologies and systems in the field of CyberGIS. My dissertation will include three parts. The first part is focused on developing methodologies to help public researchers efficiently and effectively find appropriate open geospatial datasets among millions of records provided by thousands of organizations scattered around the world. Machine learning and semantic search methods will be utilized in this research. The second part develops an interoperable and replicable geoprocessing service by synthesizing the high-performance computing (HPC) environment, the core spatial statistics and analysis algorithms from the widely adopted open source Python package, the Python Spatial Analysis Library (PySAL), and rich datasets acquired in the first part. The third part is dedicated to studying optimization strategies for feature data transmission and visualization. This study is intended to solve the performance issue in large feature data transmission through the Internet and visualization on the client (browser) side.

Taken together, the three parts constitute an endeavor towards the methodological improvement and implementation practice of the data-driven, high-performance and intelligent CI to advance spatial sciences.
Contributors: Shao, Hu (Author) / Li, Wenwen (Thesis advisor) / Rey, Sergio (Thesis advisor) / Maciejewski, Ross (Committee member) / Arizona State University (Publisher)
Created: 2018
Description
Drawn from a trio of manuscripts, this dissertation evaluates the sustainability contributions and implications of deploying underutilized spaces for alternative uses at multiple scales: urban, regional and continental. The first paper considers the use of underutilized spaces at the urban scale for urban agriculture (UA) to meet local sustainability goals in Phoenix, Arizona. Through a data-driven analysis, it demonstrates UA can meet 90% of annual demand for fresh produce, supply local produce in all food deserts, reduce areas underserved by public parks by 60%, and displace >50,000 tons of carbon-dioxide emissions from buildings.

The second paper considers marginal agricultural land use for bioenergy crop cultivation to meet future liquid fuels demand from cellulosic biofuels sustainably and profitably. At a wholesale fuel price of $4 per gallon-of-gasoline-equivalent, 30 to 90.7 billion gallons of cellulosic biofuels can be supplied by converting 22 to 79.3 million hectares of marginal lands in the Eastern United States (U.S.). Displacing marginal croplands (9.4-13.7 million hectares) reduces stress on water resources by preserving soil moisture. This displacement is comparable to existing land use for first-generation biofuels, limiting food supply impacts. Coupled modeling reveals positive hydroclimate feedback on bioenergy crop yields that moderates the land footprint.

The third paper examines the sustainability implications of expanding use of marginal lands for corn cultivation in the Western Corn Belt, a commercially important and environmentally sensitive U.S. region. Corn cultivation on lower quality lands, which tend to overlap with marginal agricultural lands, is shown to be nearly three times more sensitive to changes in crop prices. Therefore, corn cultivation disproportionately expanded into these lands following price spikes.

Underutilized spaces can contribute towards sustainability at small and large scales in a complementary fashion. While supplying fresh produce locally and delivering other benefits in terms of energy use and public health, UA can also reduce pressures on croplands and complement non-urban food production. This complementarity can help diversify agricultural land use for meeting other goals, like supplying biofuels. However, understanding the role of market forces and economic linkages is critical to anticipate any unintended consequences due to such re-organization of land use.
Contributors: ULUDERE ARAGON, Nazli Zeynep (Author) / Georgescu, Matei (Thesis advisor) / Hanemann, William M (Committee member) / Parker, Nathan C. (Committee member) / Rey, Sergio (Committee member) / Arizona State University (Publisher)
Created: 2020