Search Content

Determining appropriate sample sizes and their effects on key parameters in longitudinal three-level models

Description

Through a two study simulation design with different design conditions (sample size at level 1 (L1) was set to 3, level 2 (L2) sample size ranged from 10 to 75, level 3 (L3) sample size ranged from 30 to 150, intraclass correlation (ICC) ranging from 0.10 to 0.50, model…

Through a two study simulation design with different design conditions (sample size at level 1 (L1) was set to 3, level 2 (L2) sample size ranged from 10 to 75, level 3 (L3) sample size ranged from 30 to 150, intraclass correlation (ICC) ranging from 0.10 to 0.50, model complexity ranging from one predictor to three predictors), this study intends to provide general guidelines about adequate sample sizes at three levels under varying ICC conditions for a viable three level HLM analysis (e.g., reasonably unbiased and accurate parameter estimates). In this study, the data generating parameters for the were obtained using a large-scale longitudinal data set from North Carolina, provided by the National Center on Assessment and Accountability for Special Education (NCAASE). I discuss ranges of sample sizes that are inadequate or adequate for convergence, absolute bias, relative bias, root mean squared error (RMSE), and coverage of individual parameter estimates. The current study, with the help of a detailed two-part simulation design for various sample sizes, model complexity and ICCs, provides various options of adequate sample sizes under different conditions. This study emphasizes that adequate sample sizes at either L1, L2, and L3 can be adjusted according to different interests in parameter estimates, different ranges of acceptable absolute bias, relative bias, root mean squared error, and coverage. Under different model complexity and varying ICC conditions, this study aims to help researchers identify L1, L2, and L3 sample size or both as the source of variation in absolute bias, relative bias, RMSE, or coverage proportions for a certain parameter estimate. This assists researchers in making better decisions for selecting adequate sample sizes in a three-level HLM analysis. A limitation of the study was the use of only a single distribution for the dependent and explanatory variables, different types of distributions and their effects might result in different sample size recommendations.

ContributorsYel, Nedim (Author) / Levy, Roy (Thesis advisor) / Elliott, Stephen N. (Thesis advisor) / Schulte, Ann C (Committee member) / Iida, Masumi (Committee member) / Arizona State University (Publisher)

Created2016

A Bayesian Synthesis approach to data fusion using augmented data-dependent priors

Description

The process of combining data is one in which information from disjoint datasets sharing at least a number of common variables is merged. This process is commonly referred to as data fusion, with the main objective of creating a new dataset permitting more flexible analyses than the separate analysis of…

The process of combining data is one in which information from disjoint datasets sharing at least a number of common variables is merged. This process is commonly referred to as data fusion, with the main objective of creating a new dataset permitting more flexible analyses than the separate analysis of each individual dataset. Many data fusion methods have been proposed in the literature, although most utilize the frequentist framework. This dissertation investigates a new approach called Bayesian Synthesis in which information obtained from one dataset acts as priors for the next analysis. This process continues sequentially until a single posterior distribution is created using all available data. These informative augmented data-dependent priors provide an extra source of information that may aid in the accuracy of estimation. To examine the performance of the proposed Bayesian Synthesis approach, first, results of simulated data with known population values under a variety of conditions were examined. Next, these results were compared to those from the traditional maximum likelihood approach to data fusion, as well as the data fusion approach analyzed via Bayes. The assessment of parameter recovery based on the proposed Bayesian Synthesis approach was evaluated using four criteria to reflect measures of raw bias, relative bias, accuracy, and efficiency. Subsequently, empirical analyses with real data were conducted. For this purpose, the fusion of real data from five longitudinal studies of mathematics ability varying in their assessment of ability and in the timing of measurement occasions was used. Results from the Bayesian Synthesis and data fusion approaches with combined data using Bayesian and maximum likelihood estimation methods were reported. The results illustrate that Bayesian Synthesis with data driven priors is a highly effective approach, provided that the sample sizes for the fused data are large enough to provide unbiased estimates. Bayesian Synthesis provides another beneficial approach to data fusion that can effectively be used to enhance the validity of conclusions obtained from the merging of data from different studies.

ContributorsMarcoulides, Katerina M (Author) / Grimm, Kevin (Thesis advisor) / Levy, Roy (Thesis advisor) / MacKinnon, David (Committee member) / Suk, Hye Won (Committee member) / Arizona State University (Publisher)

Created2017

Statistical properties of the single mediator model with latent variables in the bayesian framework

Description

Statistical mediation analysis has been widely used in the social sciences in order to examine the indirect effects of an independent variable on a dependent variable. The statistical properties of the single mediator model with manifest and latent variables have been studied using simulation studies. However, the single mediator model…

Statistical mediation analysis has been widely used in the social sciences in order to examine the indirect effects of an independent variable on a dependent variable. The statistical properties of the single mediator model with manifest and latent variables have been studied using simulation studies. However, the single mediator model with latent variables in the Bayesian framework with various accurate and inaccurate priors for structural and measurement model parameters has yet to be evaluated in a statistical simulation. This dissertation outlines the steps in the estimation of a single mediator model with latent variables as a Bayesian structural equation model (SEM). A Monte Carlo study is carried out in order to examine the statistical properties of point and interval summaries for the mediated effect in the Bayesian latent variable single mediator model with prior distributions with varying degrees of accuracy and informativeness. Bayesian methods with diffuse priors have equally good statistical properties as Maximum Likelihood (ML) and the distribution of the product. With accurate informative priors Bayesian methods can increase power up to 25% and decrease interval width up to 24%. With inaccurate informative priors the point summaries of the mediated effect are more biased than ML estimates, and the bias is higher if the inaccuracy occurs in priors for structural parameters than in priors for measurement model parameters. Findings from the Monte Carlo study are generalizable to Bayesian analyses with priors of the same distributional forms that have comparable amounts of (in)accuracy and informativeness to priors evaluated in the Monte Carlo study.

ContributorsMiočević, Milica (Author) / Mackinnon, David P. (Thesis advisor) / Levy, Roy (Thesis advisor) / Grimm, Kevin (Committee member) / West, Stephen G. (Committee member) / Arizona State University (Publisher)

Created2017

Childhood Adversity in Adolescent Custodial Grandchildren: A Study of Daily Stressors, Emotional Dynamics, and the Efficacy of a Mobile Socio-Emotional Program

Description

Guided by the Risky Families model and Daily Process methods, the present study examined how daily stressors are related to emotional well-being at the between- and within-person levels among adolescent grandchildren raised by grandmothers. This study also examined whether risk (i.e., adverse childhood experiences/ACES) and resilience (i.e., socio-emotional skills) factors…

Guided by the Risky Families model and Daily Process methods, the present study examined how daily stressors are related to emotional well-being at the between- and within-person levels among adolescent grandchildren raised by grandmothers. This study also examined whether risk (i.e., adverse childhood experiences/ACES) and resilience (i.e., socio-emotional skills) factors were linked to differences in daily well-being, stressor exposure, and emotional reactivity, and evaluated the efficacy of an online social intelligence training (SIT) program on daily stressor-emotion dynamics. Data came from a subsample (n = 188) of custodial adolescents who participated in an attention-controlled randomized clinical trial and completed 14-day daily surveys prior to and following intervention. Analyses were conducted with dynamic structural equation modeling. Daily stressors, on average, and experiencing above average stressors, were associated with higher negative emotions and lower positive emotions and social connection. Those with more ACEs, on average, reported higher daily stressors and worse well-being, whereas those with higher socio-emotional skills, on average, reported lower daily stressors and better well-being. At the within-person level, more ACEs were associated with higher daily negative emotions. Nonverbal processing was linked to higher daily positive emotions and social connection. Conversational skills were associated with higher daily positive emotions and social connection, and lower, more inert daily negative emotions. Neither ACEs nor socio-emotional skills were associated with within-person reactivity to stressors. Also, the SIT program did not demonstrate efficacy for any outcome. My discussion focused on how findings extend the literature on custodial adolescents by showing that daily stressors impact well-being, offer knowledge of how ACEs and socio-emotional skills shape daily stressor-emotion dynamics, and considers reasons why the online, self-guided SIT program failed to show efficacy on key outcomes.

ContributorsCastro, Saul (Author) / Infurna, Frank (Thesis advisor) / Doane, Leah (Committee member) / Davis, Mary (Committee member) / Grimm, Kevin (Committee member) / Arizona State University (Publisher)

Created2023

Beyond Moderation: Exploring Person-Level Mediation with Residuals and Individual Model Fit

Description

Mediation analysis is integral to psychology, investigating human behavior’s causal mechanisms. The diversity of explanations for human behavior has implications for the estimation and interpretation of statistical mediation models. Individuals can have similar observed outcomes while undergoing different causal processes or different observed outcomes while receiving the same treatment. Researchers…

Mediation analysis is integral to psychology, investigating human behavior’s causal mechanisms. The diversity of explanations for human behavior has implications for the estimation and interpretation of statistical mediation models. Individuals can have similar observed outcomes while undergoing different causal processes or different observed outcomes while receiving the same treatment. Researchers can employ diverse strategies when studying individual differences in multiple mediation pathways, including individual fit measures and analysis of residuals. This dissertation investigates the use of individual residuals and fit measures to identify individual differences in multiple mediation pathways. More specifically, this study focuses on mediation model residuals in a heterogeneous population in which some people experience indirect effects through one mediator and others experience indirect effects through a different mediator. A simulation study investigates 162 conditions defined by effect size and sample size for three proposed methods: residual differences, delta z, and generalized Cook’s distance. Results indicate that analogs of Type 1 error rates are generally acceptable for the method of residual differences, but statistical power is limited. Likewise, neither delta z nor gCd could reliably distinguish between contrasts that had true effects and those that did not. The outcomes of this study reveal the potential for statistical measures of individual mediation. However, limitations related to unequal subpopulation variances, multiple dependent variables, the inherent relationship between direct effects and unestimated indirect effects, and minimal contrast effects require more research to develop a simple method that researchers can use on single data sets.

ContributorsSmyth, Heather Lynn (Author) / MacKinnon, David (Thesis advisor) / Tein, Jenn-Yun (Committee member) / McNeish, Daniel (Committee member) / Grimm, Kevin (Committee member) / Arizona State University (Publisher)

Created2022

Addressing the Variable Selection Bias and Local Optimum Limitations of Longitudinal Recursive Partitioning with Time-Efficient Approximations

Description

Longitudinal recursive partitioning (LRP) is a tree-based method for longitudinal data. It takes a sample of individuals that were each measured repeatedly across time, and it splits them based on a set of covariates such that individuals with similar trajectories become grouped together into nodes. LRP does this by fitting…

Longitudinal recursive partitioning (LRP) is a tree-based method for longitudinal data. It takes a sample of individuals that were each measured repeatedly across time, and it splits them based on a set of covariates such that individuals with similar trajectories become grouped together into nodes. LRP does this by fitting a mixed-effects model to each node every time that it becomes partitioned and extracting the deviance, which is the measure of node purity. LRP is implemented using the classification and regression tree algorithm, which suffers from a variable selection bias and does not guarantee reaching a global optimum. Additionally, fitting mixed-effects models to each potential split only to extract the deviance and discard the rest of the information is a computationally intensive procedure. Therefore, in this dissertation, I address the high computational demand, variable selection bias, and local optimum solution. I propose three approximation methods that reduce the computational demand of LRP, and at the same time, allow for a straightforward extension to recursive partitioning algorithms that do not have a variable selection bias and can reach the global optimum solution. In the three proposed approximations, a mixed-effects model is fit to the full data, and the growth curve coefficients for each individual are extracted. Then, (1) a principal component analysis is fit to the set of coefficients and the principal component score is extracted for each individual, (2) a one-factor model is fit to the coefficients and the factor score is extracted, or (3) the coefficients are summed. The three methods result in each individual having a single score that represents the growth curve trajectory. Therefore, now that the outcome is a single score for each individual, any tree-based method may be used for partitioning the data and group the individuals together. Once the individuals are assigned to their final nodes, a mixed-effects model is fit to each terminal node with the individuals belonging to it.

I conduct a simulation study, where I show that the approximation methods achieve the goals proposed while maintaining a similar level of out-of-sample prediction accuracy as LRP. I then illustrate and compare the methods using an applied data.

ContributorsStegmann, Gabriela (Author) / Grimm, Kevin (Thesis advisor) / Edwards, Michael (Committee member) / MacKinnon, David (Committee member) / McNeish, Daniel (Committee member) / Arizona State University (Publisher)

Created2019

Filtering by

Determining appropriate sample sizes and their effects on key parameters in longitudinal three-level models

A Bayesian Synthesis approach to data fusion using augmented data-dependent priors

Statistical properties of the single mediator model with latent variables in the bayesian framework

Childhood Adversity in Adolescent Custodial Grandchildren: A Study of Daily Stressors, Emotional Dynamics, and the Efficacy of a Mobile Socio-Emotional Program

Beyond Moderation: Exploring Person-Level Mediation with Residuals and Individual Model Fit

Addressing the Variable Selection Bias and Local Optimum Limitations of Longitudinal Recursive Partitioning with Time-Efficient Approximations