https://glampouras.github.io/
Formal math reasoning of today's LLMs is weaker than we think. Benchmarks often give the solution upfront. In reality, conjecturing is a critical reasoning step!
Formal math reasoning of today's LLMs is weaker than we think. Benchmarks often give the solution upfront. In reality, conjecturing is a critical reasoning step!
t.co/fcbq95Pwn1
t.co/fcbq95Pwn1