About the data: The edX MOOC data released by Harvard and MIT covered 16 sections of 13 courses from the first year of edX, from fall 2012 to summer 2013. About 475,000 students and about 640,000 registrations are included. Although there were 841,000 registrations in these courses, 200,000 rows of data were deleted by HarvardX and MITx in the de-identification process. De-identification was most likely to remove outliers and extremely active users, which may affect some of the analysis. You can learn more about the de-identification process from the HarvardX and MITx documentation.
Via Dennis T OConnor