Really excited to have this out, where we give a formal account, w/ experiments, of how to make sense of that!
Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.
Yet they often assign higher probability to ungrammatical strings than to grammatical strings.
How can both things be true? 🧵👇
Really excited to have this out, where we give a formal account, w/ experiments, of how to make sense of that!
did not do carefully before publishing this assertion. time.com/7285045/resi...
did not do carefully before publishing this assertion. time.com/7285045/resi...
www.nytimes.com/2024/12/16/o...
www.nytimes.com/2024/12/16/o...