📍Findings Session 1 - Hall C
📅 Wed, November 5, 13:00 - 14:00
arxiv.org/abs/2505.22830
📍Findings Session 1 - Hall C
📅 Wed, November 5, 13:00 - 14:00
arxiv.org/abs/2505.22830
In “What Has Been Lost with Synthetic Evaluation”, Ana Marasović (@anamarasovic.bsky.social) and collaborators ask what happens when LLMs start generating the datasets used to test their reasoning. (1/6🧵)
In “What Has Been Lost with Synthetic Evaluation”, Ana Marasović (@anamarasovic.bsky.social) and collaborators ask what happens when LLMs start generating the datasets used to test their reasoning. (1/6🧵)
(arxiv.org/abs/2505.22830)
I'm happy to announce that the preprint release of my first project is online! Developed with the amazing support of @lasha.bsky.social & @anamarasovic.bsky.social
(arxiv.org/abs/2505.22830)
I'm happy to announce that the preprint release of my first project is online! Developed with the amazing support of @lasha.bsky.social & @anamarasovic.bsky.social