Dataset with manually validated version histories of Stack Overflow posts

We used this dataset to evaluate different string similarity metrics for SOTorrent (http://sotorrent.org/).

The dataset was created with this tool: https://github.com/sotorrent/so-posthistory-gt

The dataset was used in this project: https://github.com/sotorrent/metrics-comparison