I must admit my remarks are intended to apply only to the marking of open-ended work. We haven't run any experiments on the more constrained answers you mention, so I apologise if I did not make this clear. In our case you could obviously take the number of errors as a formatted output and then process it to match a mark scheme, but it is clear that the errors GPT finds in the text are often not errors.