A protocol for simulated experimentation of automated grading systems