4 Comments
User's avatar
⭠ Return to thread
Chris Wheadon's avatar

I must admit my remarks are intended to be constrained to the marking of open-ended work as we haven't run any experiments on the more constrained answers you mention so I apologise if I have not made this clear. In our case you could obviously take the number of errors as a formatted output and then process them to match a mark scheme, but it is clear that the errors GPT finds in the text are often not errors.

Expand full comment