When I discuss open and reproducible research with graduate students, their minds often drift toward all the bad things that could happen from having their code and data available. It is certainly true that bad things could happen, but my sense is that people often overestimate these kinds of risks and underestimate the benefits of being open. So, in this post I wanted to highlight an example of something good that can happen from being open with data and code: it can raise the visibility of your work and help make it more useful to others.
This fall I was teaching an undergraduate course on data analysis, and I was looking for a compelling, modern example of real research that involved dummy variables. Fortunately, Kevin Munger had done an interesting experiment on harassment on Twitter, and all of the data and code were available on github. So, I download his data and code, tweaked them a bit, and then built my lecture around his study. In case they are helpful to someone else, here are the slides (and here are the slides in R Markdown format).
I’ve never meet Kevin Munger, but I wanted to thank him for posting his data and code. It helped me, and it helped my students. This is just one small example of a good thing that can come from being more open.