Filtering by
- All Subjects: Transportation
- All Subjects: spatial statistics
- All Subjects: Spatial linear mixed model
- Creators: Ozer, Hasan
- Status: Published
A critical review of the existing SI modeling paradigms is first presented, which also highlights features of big data that are particular to SI data. Next, a simulation experiment is carried out to evaluate three different statistical modeling frameworks for SI data that are supported by different underlying conceptual frameworks. Then, two approaches are taken to identify the potential and pitfalls associated with two newer sources of data from New York City - bike-share cycling trips and taxi trips. The first approach builds a model of commuting behavior using a traditional census data set and then compares the results for the same model when it is applied to these newer data sources. The second approach examines how the increased temporal resolution of big SI data may be incorporated into SI models.
Several important results are obtained through this research. First, it is demonstrated that different SI models account for different types of spatial effects and that the Competing Destination framework seems to be the most robust for capturing spatial structure effects. Second, newer sources of big SI data are shown to be very useful for complimenting traditional sources of data, though they are not sufficient substitutions. Finally, it is demonstrated that the increased temporal resolution of new data sources may usher in a new era of SI modeling that allows us to better understand the dynamics of human behavior.
In the first part, nonlinear regression models were embedded into a multistage workflow to predict the spatial abundance of reef fish species in the Gulf of Mexico. There were two challenges, zero-inflated data and out of sample prediction. The methods and models in the workflow could effectively handle the zero-inflated sampling data without strong assumptions. Three strategies were proposed to solve the out of sample prediction problem. The results and discussions showed that the nonlinear prediction had the advantages of high accuracy, low bias and well-performed in multi-resolution.
In the second part, a two-stage spatial regression model was proposed for analyzing soil carbon stock (SOC) data. In the first stage, there was a spatial linear mixed model that captured the linear and stationary effects. In the second stage, a generalized additive model was used to explain the nonlinear and nonstationary effects. The results illustrated that the two-stage model had good interpretability in understanding the effect of covariates, meanwhile, it kept high prediction accuracy which is competitive to the popular machine learning models, like, random forest, xgboost and support vector machine.
A new nonlinear regression model, Gaussian process BART (Bayesian additive regression tree), was proposed in the third part. Combining advantages in both BART and Gaussian process, the model could capture the nonlinear effects of both observed and latent covariates. To develop the model, first, the traditional BART was generalized to accommodate correlated errors. Then, the failure of likelihood based Markov chain Monte Carlo (MCMC) in parameter estimating was discussed. Based on the idea of analysis of variation, back comparing and tuning range, were proposed to tackle this failure. Finally, effectiveness of the new model was examined by experiments on both simulation and real data.
United States Air Force airfield PAVER pavement management system enterprise data was reviewed for 67 networks. The distress survey extents and severity fields were joined with treatment costs estimated using RSMeans to determine the costliest distress. In asphalt surfaced pavements Longitudinal/transverse cracking, weathering, and block cracking resulted in the most pavement condition index (PCI) deducts while the costliest distresses are weathering, block cracking and longitudinal cracking. In portland cement concrete surfaced pavements linear cracking, joint seal damage, and joint spalling resulted in the most PCI deducts while the costliest distresses are joint seal damage, linear cracking, and corner spalling. The results of this data were then compared to airfield attributes: Pavement Temperature Group, Dominant American Association of State Highway and Transportation Officials (AASHTO) Soil Classification, Pavement- Transportation Computer Assisted Structural Engineering (PCASE) Climate Zone, and years since last maintenance. Maps showing the Pavement Temperature Group, Dominant AASHTO Soil Classification, and PCASE Climate Zone are included in Appendix A. Alligator cracking is most prevalent at the airfields with PTG 64-34 (Ellsworth, Fairchild, Hill, and Offutt) and 58-22 (Niagara and Vandenberg). Rutting is most prevalent at PTG 64-34 (Ellsworth, Fairchild, Hill, and Offutt). An increasing trend of joint spalling, corner spalling, and corner break with decreasing soil quality (AASHOTO A-1 to A-8 soils). The PCASE Climate Zone Cost Indices the cost index for weathering is approximately double in the moist region over the dry region. The cost index for block cracking is approximately double in the cold region over the hot region. It is recommended that the agency review its pavement performance modeling in the pavement management system to increase the recommendation of pavement preservation treatments and review the use of higher quality materials for pavement maintenance treatments.
In this research, an effort was undertaken to gather and collect most recent asphalt mixtures’ design data and compare it to historical data such as those available in the Long-Term Pavement Performance (LTPP), maintained by the Federal Highway Administration (FHWA). The new asphalt mixture design data was collected from 25 states within the United States and separated according to the four suggested climatic regions. The previously designed asphalt mixture designs in the 1960’s present in the LTPP Database implemented for the test sections were compared with the recently designed pavement mixtures gathered, and pavement performance was assessed using predictive models.
Three predictive models were studied in this research. The models were related to three major asphalt pavement distresses: Rutting, Fatigue Cracking and Thermal Cracking. Once the performance of the asphalt mixtures was assessed, four ranking criteria were developed to support the assessment of the mix designs quality at hand; namely, Low, Satisfactory, Good or Excellent. The evaluation results were reasonable and deemed acceptable. Out of the 48 asphalt mixtures design evaluated, the majority were between Satisfactory and Good.
The evaluation methodology and criteria developed are helpful tools in determining the quality of asphalt mixtures produced by the different agencies. They provide a quick insight on the needed improvement/modification against the potential development of distress during the lifespan of the pavement structure.