Alhoori, Hamed; Rogness, Daniel
B.A. (Bachelor of Arts)
Department of Computer Science
Empirical research should always be backed by substantial and verifiable data, so that anyone who wishes to reproduce the study, or replicate it with different data, can verify that its claims are accurate. We propose a novel method for discovering reproducible research papers. Future research can use this technique to build a better understanding of the reproducibility crisis. We collected scholarly data from three different sources and combined them into a dataset of 657 papers. The dataset comprises papers that have been verified as reproducible and papers that have been shown not to be reproducible. After cleaning, the dataset contained 237 papers marked reproducible and 36 marked irreproducible. We then used three different models (Gaussian Naive Bayes, Multinomial Naive Bayes, and AdaBoost) to classify texts based on structural and linguistic characteristics of the papers. Finally, we used a Long Short-Term Memory recurrent neural network to compare results.
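The class split reported in the abstract is uneven, which affects how any classifier accuracy on this data should be read. The following is a minimal sketch, not part of the original study, that uses only the two counts stated above. The function names and the inverse-frequency weighting formula are illustrative assumptions; the abstract does not say whether any baseline or class weighting was used.

```python
# Illustrative only: the two counts come from the abstract; everything
# else (function names, weighting scheme) is an assumption for this sketch.
REPRODUCIBLE = 237
IRREPRODUCIBLE = 36


def majority_baseline_accuracy(counts):
    """Accuracy obtained by always predicting the most frequent class."""
    total = sum(counts.values())
    return max(counts.values()) / total


def balanced_class_weights(counts):
    """Inverse-frequency weights: total / (n_classes * class_count)."""
    total = sum(counts.values())
    n_classes = len(counts)
    return {label: total / (n_classes * n) for label, n in counts.items()}


counts = {"reproducible": REPRODUCIBLE, "irreproducible": IRREPRODUCIBLE}
baseline = majority_baseline_accuracy(counts)
weights = balanced_class_weights(counts)

print(f"papers after cleaning: {sum(counts.values())}")
print(f"majority-class baseline accuracy: {baseline:.3f}")
for label, weight in weights.items():
    print(f"weight for {label}: {weight:.3f}")
```

With 237 of 273 cleaned papers marked reproducible, a model that always predicts "reproducible" would already score roughly 0.87 accuracy, so per-class metrics such as precision and recall may be more informative than accuracy alone when comparing the models.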
McDade, Joseph C., "Can We Predict Reproducible Scholarly Research?" (2018). Honors Capstones. 262.
Northern Illinois University
Rights Statement 2