Random Forest as a Predictive Analytics Alternative to Regression in Institutional Research

He Lingjun; Richard A. Levine; Juanjuan Fan; Joshua Beemer; Jeanne Stronach

doi:10.7275/1wpr-m024

Abstract

In institutional research, modern data mining approaches are seldom considered for predictive analytics problems. The goal of this paper is to highlight the advantages of tree-based machine learning algorithms over classic (logistic) regression methods for data-informed decision making in higher education, and to stress the success of random forest in circumstances, common in big data applications, where the regression assumptions are violated. Random forest is a model averaging procedure in which each tree is constructed from a bootstrap sample of the data set. In particular, we emphasize the ease of application, low computational cost, high predictive accuracy, flexibility, and interpretability of random forest machinery. Our overall recommendation is that institutional researchers look beyond classical regression and single decision tree analytics tools, and consider random forest as the predominant method for prediction tasks. The proposed points of view are detailed and illustrated through a simulation experiment and analyses of data from real institutional research projects.

Accessed 3,712 times on https://pareonline.net from January 13, 2018 to December 31, 2019.
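The abstract's description of random forest as model averaging over bootstrap samples can be sketched in a few lines. The following is a minimal illustration only, not the authors' implementation: it bags one-split trees (stumps) in pure Python, whereas a full random forest grows deeper trees and also restricts each split to a random subset of features. The function names and the synthetic two-feature data set are assumptions made for this example.

```python
import random

def fit_stump(rows, labels):
    """Fit a one-split tree: pick the (feature, threshold) with the fewest training errors."""
    best = None
    for j in range(len(rows[0])):
        for t in sorted({r[j] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[j] <= t]
            right = [y for r, y in zip(rows, labels) if r[j] > t]
            # Each side of the split predicts its majority class.
            left_label = 1 if 2 * sum(left) >= len(left) else 0
            right_label = 1 if right and 2 * sum(right) >= len(right) else 0
            errors = (sum(y != left_label for y in left)
                      + sum(y != right_label for y in right))
            if best is None or errors < best[0]:
                best = (errors, j, t, left_label, right_label)
    return best[1:]

def predict_stump(stump, row):
    j, t, left_label, right_label = stump
    return left_label if row[j] <= t else right_label

def fit_bagged_stumps(rows, labels, n_trees=25, seed=0):
    """Fit each stump on its own bootstrap sample (n draws with replacement)."""
    rng = random.Random(seed)
    n = len(rows)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        forest.append(fit_stump([rows[i] for i in idx], [labels[i] for i in idx]))
    return forest

def predict_forest(forest, row):
    """Average the trees by majority vote."""
    votes = sum(predict_stump(s, row) for s in forest)
    return 1 if 2 * votes >= len(forest) else 0

# Hypothetical synthetic data: the outcome depends only on the first feature.
data_rng = random.Random(1)
rows = [[data_rng.random(), data_rng.random()] for _ in range(100)]
labels = [1 if r[0] > 0.5 else 0 for r in rows]

forest = fit_bagged_stumps(rows, labels)
accuracy = sum(predict_forest(forest, r) == y for r, y in zip(rows, labels)) / len(rows)
print(f"training accuracy: {accuracy:.2f}")
```

Because every tree sees a different resample of the data, the individual split points vary, and the vote smooths over that variation. In practice one would likely use an established implementation rather than a hand-rolled one like this.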

Keywords: Educational Research, Research Methodology, Statistical Analysis

How to Cite:

Lingjun, H., Levine, R. A., Fan, J., Beemer, J., & Stronach, J. (2018) “Random Forest as a Predictive Analytics Alternative to Regression in Institutional Research”, Practical Assessment, Research, and Evaluation 23(1): 1. doi: https://doi.org/10.7275/1wpr-m024

Published on
2018-01-01

License

Creative Commons Attribution-NonCommercial-NoDerivatives 4.0

Issue

Volume 23 • 2018

Identifiers

DOI: https://doi.org/10.7275/1wpr-m024

Publication details

Article Number: 1
Submitted on: 2019-11-25

File Checksums (MD5)

PDF: b56d75aa05440225b4da80a8f70d5d9f