Search Content

Matching Items (3)

Filtering by

All Subjects: Bayesian statistical decision theory
All Subjects: community college
Creators: Thompson, Marilyn S

Applying academic analytics: developing a process for utilizing Bayesian networks to predict stopping out among community college students

Description

Many methodological approaches have been utilized to predict student retention and persistence over the years, yet few have utilized a Bayesian framework. It is believed this is due in part to the absence of an established process for guiding educational researchers reared in a frequentist perspective into the realms of Bayesian analysis and educational data mining. The current study aimed to address this by providing a model-building process for developing a Bayesian network (BN) that leveraged educational data mining, Bayesian analysis, and traditional iterative model-building techniques in order to predict whether community college students will stop out at the completion of each of their first six terms. The study utilized exploratory and confirmatory techniques to reduce an initial pool of more than 50 potential predictor variables to a parsimonious final BN with only four predictor variables. The average in-sample classification accuracy rate for the model was 80% (Cohen's κ = 53%). The model was shown to be generalizable across samples with an average out-of-sample classification accuracy rate of 78% (Cohen's κ = 49%). The classification rates for the BN were also found to be superior to the classification rates produced by an analog frequentist discrete-time survival analysis model.

ContributorsArcuria, Philip (Author) / Levy, Roy (Thesis advisor) / Green, Samuel B (Committee member) / Thompson, Marilyn S (Committee member) / Arizona State University (Publisher)

Created2015

Individual and combined impact of institutional student support strategies on first-time, full-time, degree-seeking community college students

Description

Although U.S. rates of college enrollment among 18-24 year olds have reached historic highs, rates of degree completion have not kept pace. This is especially evident at community colleges, where a disproportionate number of students from groups who, historically, have had low college-completion rates enroll. One way community colleges are attempting to address low completion rates is by implementing institutional interventions intended to increase opportunities for student engagement at their colleges. Utilizing logistic and linear regression analyses, this study focused on community college students, examining the association between participation in institutional support activities and student outcomes, while controlling for specific student characteristics known to impact student success in college. The sample included 746 first-time, full-time, degree-seeking students at a single community college located in the U.S. Southwest. Additional analyses were conducted for the 440 first-time, full-time, degree-seeking students in this sample who placed into at least one developmental education course. Findings indicate that significant associations exist between different types of participation in institutional interventions and various student outcomes: Academic advising was found to be related to increased rates of Fall to Spring and Fall to Fall persistence and, for developmental education students, participation in a student success course was found to be related to an increase in the proportion of course credit hours earned. The results of this study provide evidence that student participation in institutional-level support may relate to increased rates of college persistence and credit hour completion; however, additional inquiry is warranted to inform specific policy and program decision-making at the college and to determine if these findings are generalizable to populations outside of this college setting.

ContributorsBeckert, Kimberly Marrone (Author) / De Los Santos Jr., Alfredo G (Thesis advisor) / Thompson, Marilyn S (Thesis advisor) / Berliner, David C. (Committee member) / Arizona State University (Publisher)

Created2011

Multiple imputation for two-level hierarchical models with categorical variables and missing at random data

Description

Accurate data analysis and interpretation of results may be influenced by many potential factors. The factors of interest in the current work are the chosen analysis model(s), the presence of missing data, and the type(s) of data collected. If analysis models are used which a) do not accurately capture the structure of relationships in the data such as clustered/hierarchical data, b) do not allow or control for missing values present in the data, or c) do not accurately compensate for different data types such as categorical data, then the assumptions associated with the model have not been met and the results of the analysis may be inaccurate. In the presence of clustered
ested data, hierarchical linear modeling or multilevel modeling (MLM; Raudenbush & Bryk, 2002) has the ability to predict outcomes for each level of analysis and across multiple levels (accounting for relationships between levels) providing a significant advantage over single-level analyses. When multilevel data contain missingness, multilevel multiple imputation (MLMI) techniques may be used to model both the missingness and the clustered nature of the data. With categorical multilevel data with missingness, categorical MLMI must be used. Two such routines for MLMI with continuous and categorical data were explored with missing at random (MAR) data: a formal Bayesian imputation and analysis routine in JAGS (R/JAGS) and a common MLM procedure of imputation via Bayesian estimation in BLImP with frequentist analysis of the multilevel model in Mplus (BLImP/Mplus). Manipulated variables included interclass correlations, number of clusters, and the rate of missingness. Results showed that with continuous data, R/JAGS returned more accurate parameter estimates than BLImP/Mplus for almost all parameters of interest across levels of the manipulated variables. Both R/JAGS and BLImP/Mplus encountered convergence issues and returned inaccurate parameter estimates when imputing and analyzing dichotomous data. Follow-up studies showed that JAGS and BLImP returned similar imputed datasets but the choice of analysis software for MLM impacted the recovery of accurate parameter estimates. Implications of these findings and recommendations for further research will be discussed.

ContributorsKunze, Katie L (Author) / Levy, Roy (Thesis advisor) / Enders, Craig K. (Committee member) / Thompson, Marilyn S (Committee member) / Arizona State University (Publisher)

Created2016