Matching Items (3)
Filtering by

Clear all filters

156060-Thumbnail Image.png
Description
As urban populations become increasingly dense, massive amounts of new 'big' data that characterize human activity are being made available and may be characterized as having a large volume of observations, being produced in real-time or near real-time, and including a diverse variety of information. In particular, spatial interaction (SI)

As urban populations become increasingly dense, massive amounts of new 'big' data that characterize human activity are being made available and may be characterized as having a large volume of observations, being produced in real-time or near real-time, and including a diverse variety of information. In particular, spatial interaction (SI) data - a collection of human interactions across a set of origins and destination locations - present unique challenges for distilling big data into insight. Therefore, this dissertation identifies some of the potential and pitfalls associated with new sources of big SI data. It also evaluates methods for modeling SI to investigate the relationships that drive SI processes in order to focus on human behavior rather than data description.

A critical review of the existing SI modeling paradigms is first presented, which also highlights features of big data that are particular to SI data. Next, a simulation experiment is carried out to evaluate three different statistical modeling frameworks for SI data that are supported by different underlying conceptual frameworks. Then, two approaches are taken to identify the potential and pitfalls associated with two newer sources of data from New York City - bike-share cycling trips and taxi trips. The first approach builds a model of commuting behavior using a traditional census data set and then compares the results for the same model when it is applied to these newer data sources. The second approach examines how the increased temporal resolution of big SI data may be incorporated into SI models.

Several important results are obtained through this research. First, it is demonstrated that different SI models account for different types of spatial effects and that the Competing Destination framework seems to be the most robust for capturing spatial structure effects. Second, newer sources of big SI data are shown to be very useful for complimenting traditional sources of data, though they are not sufficient substitutions. Finally, it is demonstrated that the increased temporal resolution of new data sources may usher in a new era of SI modeling that allows us to better understand the dynamics of human behavior.
ContributorsOshan, Taylor Matthew (Author) / Fotheringham, A. S. (Thesis advisor) / Farmer, Carson J.Q. (Committee member) / Rey, Sergio S.J. (Committee member) / Nelson, Trisalyn (Committee member) / Arizona State University (Publisher)
Created2017
158320-Thumbnail Image.png
Description
Walking and bicycling bring many merits to people, both physically and mentally.

However, not everyone has an opportunity to enjoy healthy and safe bicycling and

walking. Many studies suggested that access to healthy walking and bicycling is heavily

related to socio-economic status. Low income population and racial minorities have

poorer

Walking and bicycling bring many merits to people, both physically and mentally.

However, not everyone has an opportunity to enjoy healthy and safe bicycling and

walking. Many studies suggested that access to healthy walking and bicycling is heavily

related to socio-economic status. Low income population and racial minorities have

poorer transportation that results in less walking and bicycling, as well as less access to

public transportation. They are also under higher risks of being hit by vehicles while

walking and bicycling. This research quantifies the relationship between socioeconomic

factors and bicyclist and pedestrian involved traffic crash rates in order to establish an

understanding of how equitable access to safe bicycling and walking is in Phoenix. The

crash rates involving both bicyclists and pedestrians were categorized into two groups,

minor crashes and severe crashes. Then, the OLS model was used to analyze minor and

severe bicycle crash rates, and minor and severe pedestrian crash rates, respectively.

There are four main results, (1) The median income of an area is always negatively

related to the crash rates of bicyclists and pedestrians. The reason behind the negative

correlation is that there is a very small proportion of people choosing to walk or ride

bicycles as their commuting methods in the high-income areas. Consequently, there are

low crash rates of pedestrians and bicyclists. (2) The minor bicycle crash rates are more

related to socio-economic determinants than the severe crash rates. (3) A higher

population density reduces both the minor and the severe crash rates of bicyclists and

pedestrians in Phoenix. (4) A higher pedestrian commuting ratio does not reduce bicyclist

and pedestrian crash rates in Phoenix. The findings from this study can provide a

reference value for the government and other researchers and encourage better future

decisions from policy makers.
ContributorsWu, Feiyi (Author) / Nelson, Trisalyn (Thesis advisor) / Salon, Deborah (Committee member) / Kuby, Michael (Committee member) / Arizona State University (Publisher)
Created2020
158516-Thumbnail Image.png
Description
Geographically Weighted Regression (GWR) has been broadly used in various fields to

model spatially non-stationary relationships. Classic GWR is considered as a single-scale model that is based on one bandwidth parameter which controls the amount of distance-decay in weighting neighboring data around each location. The single bandwidth in GWR assumes that

Geographically Weighted Regression (GWR) has been broadly used in various fields to

model spatially non-stationary relationships. Classic GWR is considered as a single-scale model that is based on one bandwidth parameter which controls the amount of distance-decay in weighting neighboring data around each location. The single bandwidth in GWR assumes that processes (relationships between the response variable and the predictor variables) all operate at the same scale. However, this posits a limitation in modeling potentially multi-scale processes which are more often seen in the real world. For example, the measured ambient temperature of a location is affected by the built environment, regional weather and global warming, all of which operate at different scales. A recent advancement to GWR termed Multiscale GWR (MGWR) removes the single bandwidth assumption and allows the bandwidths for each covariate to vary. This results in each parameter surface being allowed to have a different degree of spatial variation, reflecting variation across covariate-specific processes. In this way, MGWR has the capability to differentiate local, regional and global processes by using varying bandwidths for covariates. Additionally, bandwidths in MGWR become explicit indicators of the scale at various processes operate. The proposed dissertation covers three perspectives centering on MGWR: Computation; Inference; and Application. The first component focuses on addressing computational issues in MGWR to allow MGWR models to be calibrated more efficiently and to be applied on large datasets. The second component aims to statistically differentiate the spatial scales at which different processes operate by quantifying the uncertainty associated with each bandwidth obtained from MGWR. In the third component, an empirical study will be conducted to model the changing relationships between county-level socio-economic factors and voter preferences in the 2008-2016 United States presidential elections using MGWR.
ContributorsLi, Ziqi (Author) / Fotheringham, A. Stewart (Thesis advisor) / Goodchild, Michael F. (Committee member) / Li, Wenwen (Committee member) / Arizona State University (Publisher)
Created2020