Search Content

Simulating Cooperative Behaviors in a Subsistence Population: An Agent-Based Modeling Approach

Description

Humans cooperate at levels unseen in other species. Identifying the adaptive mechanisms driving this unusual behavior, as well as how these mechanisms interact to create complex cooperative patterns, remains an open question in anthropology. One impediment to such investigations is that complete, long-term datasets of human cooperative behaviors in small-scale…

Humans cooperate at levels unseen in other species. Identifying the adaptive mechanisms driving this unusual behavior, as well as how these mechanisms interact to create complex cooperative patterns, remains an open question in anthropology. One impediment to such investigations is that complete, long-term datasets of human cooperative behaviors in small-scale societies are hard to come by; such field research is often hindered both by humans' long lifespans and by the difficulties of collecting data in remote societies. In this study, I attempted to overcome these methodological challenges by simulating individual human cooperative behaviors in a small-scale population. Using an agent-based model tuned to population-level measurements from a real-life marine subsistence population in the southern Philippines, I generated dynamic daily cooperative behaviors in a hypothetical subsistence population over a period of 1500 years and 42 overlapping generations. Preliminary findings from the model suggest that, while the agent-based model broadly captured a number of characteristic population-level patterns in the subsistence population, it did not fully replicate nuances of the population's observed cooperative behaviors. In particular, statistical models of the simulated data identified reciprocity-based and need-based cooperative behaviors but did not detect kinship-motivated cooperation, despite the fact that kin cooperation traits evolved positively and reciprocity cooperation traits evolved negatively over time in the agent population. It is possible that this discrepancy reflects a complex interaction between kinship and reciprocity in the agent-based model. On the other hand, it may also suggest that these types of statistical models, which are frequently utilized in human cooperation studies in the anthropological literature, do not reliably discriminate between kin-based and reciprocity-based cooperation mechanisms when both exist in a population. Even so, the completeness of the simulated data enabled use of more complex statistical methodologies which were able to disentangle the relative effects of cooperative mechanisms operating at different decision levels. By addressing remaining pattern-matching issues, future iterations of the agent-based model may prove to be a useful tool for validating empirical research and investigating novel hypotheses about the evolution and maintenance of cooperative behaviors in human populations.

ContributorsPhelps, Julia R. (Author) / Reiser, Mark (Thesis advisor) / Saul, Steven (Thesis advisor) / Morgan, Thomas (Committee member) / Arizona State University (Publisher)

Created2023

Case Studies in Machine Learning of Reduced Form Models for Causal Inference

Description

This dissertation develops versatile modeling tools to estimate causal effects when conditional unconfoundedness is not immediately satisfied. Chapter 2 provides a brief overview ofcommon techniques in causal inference, with a focus on models relevant to the data explored in later chapters. The rest of the dissertation focuses on the development of…

This dissertation develops versatile modeling tools to estimate causal effects when conditional unconfoundedness is not immediately satisfied. Chapter 2 provides a brief overview ofcommon techniques in causal inference, with a focus on models relevant to the data explored in later chapters. The rest of the dissertation focuses on the development of novel “reduced form” models which are designed to assess the particular challenges of different datasets. Chapter 3 explores the question of whether or not forecasts of bankruptcy cause bankruptcy. The question arises from the observation that companies issued going concern opinions were more likely to go bankrupt in the following year, leading people to speculate that the opinions themselves caused the bankruptcy via a “self-fulfilling prophecy”. A Bayesian machine learning sensitivity analysis is developed to answer this question. In exchange for additional flexibility and fewer assumptions, this approach loses point identification of causal effects and thus a sensitivity analysis is developed to study a wide range of plausible scenarios of the causal effect of going concern opinions on bankruptcy. Reported in the simulations are different performance metrics of the model in comparison with other popular methods and a robust analysis of the sensitivity of the model to mis-specification. Results on empirical data indicate that forecasts of bankruptcies likely do have a small causal effect. Chapter 4 studies the effects of vaccination on COVID-19 mortality at the state level in the United States. The dynamic nature of the pandemic complicates more straightforward regression adjustments and invalidates many alternative models. The chapter comments on the limitations of mechanistic approaches as well as traditional statistical methods to epidemiological data. Instead, a state space model is developed that allows the study of the ever-changing dynamics of the pandemic’s progression. In the first stage, the model decomposes the observed mortality data into component surges, and later uses this information in a semi-parametric regression model for causal analysis. Results are investigated thoroughly for empirical justification and stress-tested in simulated settings.

ContributorsPapakostas, Demetrios (Author) / Hahn, Paul (Thesis advisor) / McCulloch, Robert (Committee member) / Zhou, Shuang (Committee member) / Kao, Ming-Hung (Committee member) / Lan, Shiwei (Committee member) / Arizona State University (Publisher)

Created2023

Computational Challenges in BART Modeling: Extrapolation, Classification, and Causal Inference

Description

This dissertation centers on Bayesian Additive Regression Trees (BART) and Accelerated BART (XBART) and presents a series of models that tackle extrapolation, classification, and causal inference challenges. To improve extrapolation in tree-based models, I propose a method called local Gaussian Process (GP) that combines Gaussian process regression with trained BART…

This dissertation centers on Bayesian Additive Regression Trees (BART) and Accelerated BART (XBART) and presents a series of models that tackle extrapolation, classification, and causal inference challenges. To improve extrapolation in tree-based models, I propose a method called local Gaussian Process (GP) that combines Gaussian process regression with trained BART trees. This allows for extrapolation based on the most relevant data points and covariate variables determined by the trees' structure. The local GP technique is extended to the Bayesian causal forest (BCF) models to address the positivity violation issue in causal inference. Additionally, I introduce the LongBet model to estimate time-varying, heterogeneous treatment effects in panel data. Furthermore, I present a Poisson-based model, with a modified likelihood for XBART for the multi-class classification problem.

ContributorsWang, Meijia (Author) / Hahn, Paul (Thesis advisor) / He, Jingyu (Committee member) / Lan, Shiwei (Committee member) / McCulloch, Robert (Committee member) / Zhou, Shuang (Committee member) / Arizona State University (Publisher)

Created2024

Spatial Regression and Gaussian Process BART

Description

Spatial regression is one of the central topics in spatial statistics. Based on the goals, interpretation or prediction, spatial regression models can be classified into two categories, linear mixed regression models and nonlinear regression models. This dissertation explored these models and their real world applications. New methods and models were…

Spatial regression is one of the central topics in spatial statistics. Based on the goals, interpretation or prediction, spatial regression models can be classified into two categories, linear mixed regression models and nonlinear regression models. This dissertation explored these models and their real world applications. New methods and models were proposed to overcome the challenges in practice. There are three major parts in the dissertation.

In the first part, nonlinear regression models were embedded into a multistage workflow to predict the spatial abundance of reef fish species in the Gulf of Mexico. There were two challenges, zero-inflated data and out of sample prediction. The methods and models in the workflow could effectively handle the zero-inflated sampling data without strong assumptions. Three strategies were proposed to solve the out of sample prediction problem. The results and discussions showed that the nonlinear prediction had the advantages of high accuracy, low bias and well-performed in multi-resolution.

In the second part, a two-stage spatial regression model was proposed for analyzing soil carbon stock (SOC) data. In the first stage, there was a spatial linear mixed model that captured the linear and stationary effects. In the second stage, a generalized additive model was used to explain the nonlinear and nonstationary effects. The results illustrated that the two-stage model had good interpretability in understanding the effect of covariates, meanwhile, it kept high prediction accuracy which is competitive to the popular machine learning models, like, random forest, xgboost and support vector machine.

A new nonlinear regression model, Gaussian process BART (Bayesian additive regression tree), was proposed in the third part. Combining advantages in both BART and Gaussian process, the model could capture the nonlinear effects of both observed and latent covariates. To develop the model, first, the traditional BART was generalized to accommodate correlated errors. Then, the failure of likelihood based Markov chain Monte Carlo (MCMC) in parameter estimating was discussed. Based on the idea of analysis of variation, back comparing and tuning range, were proposed to tackle this failure. Finally, effectiveness of the new model was examined by experiments on both simulation and real data.

ContributorsLu, Xuetao (Author) / McCulloch, Robert (Thesis advisor) / Hahn, Paul (Committee member) / Lan, Shiwei (Committee member) / Zhou, Shuang (Committee member) / Saul, Steven (Committee member) / Arizona State University (Publisher)

Created2020

Quantifying the Preference of Corynorhinus townsendii to Hibernate in Highly Ventilated Areas in Arizona Caves

Description

Corynorhinus townsendii, a bat species residing in north-central Arizona, has historically been observed hibernating in highly ventilated areas within caves and abandoned mines, but there is little to no specific data regarding this tendency. Understanding how air movement may influence hibernacula selection is critical in bettering conservation efforts for Arizona…

Corynorhinus townsendii, a bat species residing in north-central Arizona, has historically been observed hibernating in highly ventilated areas within caves and abandoned mines, but there is little to no specific data regarding this tendency. Understanding how air movement may influence hibernacula selection is critical in bettering conservation efforts for Arizona bats, especially with white-nose syndrome continuing to devastate bat species populations throughout the United States. My study aimed to begin filling in this knowledge gap. I measured wind speed in three known Arizona hibernacula during the winter hibernation season and combined this data with the locations of bats observed throughout each of the three survey locations. I modeled our findings using a generalized linear model, which confirmed that wind speed is indeed a predictor of C. townsendii roost selection.

ContributorsKitchel, Heidi (Author) / Moore, Marianne (Thesis director) / Saul, Steven (Committee member) / Barrett, The Honors College (Contributor) / College of Integrative Sciences and Arts (Contributor)

Created2022-05

Bayesian Spatiotemporal Modeling and Uncertainty Quantification

Description

Uncertainty Quantification (UQ) is crucial in assessing the reliability of predictivemodels that make decisions for human experts in a data-rich world. The Bayesian approach to UQ for inverse problems has gained popularity. However, addressing UQ in high-dimensional inverse problems is challenging due to the intensity and inefficiency of Markov Chain…

Uncertainty Quantification (UQ) is crucial in assessing the reliability of predictivemodels that make decisions for human experts in a data-rich world. The Bayesian approach to UQ for inverse problems has gained popularity. However, addressing UQ in high-dimensional inverse problems is challenging due to the intensity and inefficiency of Markov Chain Monte Carlo (MCMC) based Bayesian inference methods. Consequently, the first primary focus of this thesis is enhancing efficiency and scalability for UQ in inverse problems. On the other hand, the omnipresence of spatiotemporal data, particularly in areas like traffic analysis, underscores the need for effectively addressing inverse problems with spatiotemporal observations. Conventional solutions often overlook spatial or temporal correlations, resulting in underutilization of spatiotemporal interactions for parameter learning. Appropriately modeling spatiotemporal observations in inverse problems thus forms another pivotal research avenue. In terms of UQ methodologies, the calibration-emulation-sampling (CES) scheme has emerged as effective for large-dimensional problems. I introduce a novel CES approach by employing deep neural network (DNN) models during the emulation and sampling phase. This approach not only enhances computational efficiency but also diminishes sensitivity to training set variations. The newly devised “Dimension- Reduced Emulative Autoencoder Monte Carlo (DREAM)” algorithm scales Bayesian UQ up to thousands of dimensions in physics-constrained inverse problems. The algorithm’s effectiveness is exemplified through elliptic and advection-diffusion inverse problems. In the realm of spatiotemporal modeling, I propose to use Spatiotemporal Gaussian processes (STGP) in likelihood modeling and Spatiotemporal Besov processes (STBP) in prior modeling separately. These approaches highlight the efficacy of incorporat- ing spatial and temporal information for enhanced parameter estimation and UQ. Additionally, the superiority of STGP is demonstrated compared to static and time- averaged methods in time-dependent advection-diffusion partial differential equation (PDE) and three chaotic ordinary differential equations (ODE). Expanding upon Besov Process (BP), a method known for sparsity-promotion and edge-preservation, STBP is introduced to capture spatial data features and model temporal correlations by replacing the random coefficients in the series expansion with stochastic time functions following Q-exponential process(Q-EP). This advantage is showcased in dynamic computerized tomography (CT) reconstructions through comparison with classic STGP and a time-uncorrelated approach.

ContributorsLi, Shuyi (Author) / Lan, Shiwei (Thesis advisor) / Hahn, Paul (Committee member) / McCulloch, Robert (Committee member) / Dan, Cheng (Committee member) / Lopes, Hedibert (Committee member) / Arizona State University (Publisher)

Created2023

Filtering by

Simulating Cooperative Behaviors in a Subsistence Population: An Agent-Based Modeling Approach

Case Studies in Machine Learning of Reduced Form Models for Causal Inference

Computational Challenges in BART Modeling: Extrapolation, Classification, and Causal Inference

Spatial Regression and Gaussian Process BART

Quantifying the Preference of Corynorhinus townsendii to Hibernate in Highly Ventilated Areas in Arizona Caves

Bayesian Spatiotemporal Modeling and Uncertainty Quantification