Taken – Investigating the Reproducibility of Sustainability Studies that Conducted Statistical Analysis

*This project is for an MSc in Statistics and Sustainability student.

“Reproducibility, closely related to replicability and repeatability, is a major principle underpinning the scientific method. For the findings of a study to be reproducible means that results obtained by an experiment or an observational study or in a statistical analysis of a data set should be achieved again with a high degree of reliability when the study is replicated. … With a narrower scope, reproducibility has been defined in computational sciences as having the following quality: the results should be documented by making all data and code available in such a way that the computations can be executed again with identical results.” – Wikipedia 

This project examines the reproducibility of sustainability studies (e.g., on the environment or biodiversity) that have used statistical analysis or machine learning. As part of the study, the student should compile a collection of relevant data sets and consider what is needed for reproducibility (e.g., whether the method is described in enough detail to be reproduced, whether code has been provided, whether any data cleaning can be reproduced, and whether the results can be duplicated within an acceptable margin of error).
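
As an illustration of the last criterion, the check that a re-run result matches a published one "within an acceptable margin of error" could be sketched as below. This is only a minimal sketch: the function names, the 5% relative tolerance, and the example figures are all hypothetical and are not taken from any particular study. A suitable tolerance would need to be justified for each statistic.

```python
import math


def is_reproduced(reported, reproduced, rel_tol=0.05, abs_tol=1e-8):
    """Return True if the re-run value matches the published value.

    rel_tol=0.05 (5%) is an assumed, illustrative margin of error,
    not a recommended standard.
    """
    return math.isclose(reported, reproduced, rel_tol=rel_tol, abs_tol=abs_tol)


def reproducibility_report(pairs, rel_tol=0.05):
    """Check each named statistic; pairs maps name -> (reported, reproduced)."""
    return {
        name: is_reproduced(reported, reproduced, rel_tol=rel_tol)
        for name, (reported, reproduced) in pairs.items()
    }


# Hypothetical numbers, made up purely for illustration.
example = {
    "slope": (0.42, 0.43),      # published vs. re-run estimate
    "p_value": (0.031, 0.048),  # published vs. re-run p-value
}

for name, ok in reproducibility_report(example).items():
    print(f"{name}: {'reproduced' if ok else 'NOT reproduced'}")
```

A relative tolerance is used here so that the same rule can apply to statistics on different scales. In practice, a value such as a p-value may need its own criterion (for example, whether it falls on the same side of the significance threshold).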