siddheshp.bsky.social
@siddheshp.bsky.social
Grad Student; Into Multilingual NLP
Reposted
Check out the camera-ready version of our ICML position paper ("Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge") to learn more!!! arxiv.org/abs/2502.00561

(6/6)
Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge
The measurement tasks involved in evaluating generative AI (GenAI) systems lack sufficient scientific rigor, leading to what has been described as "a tangle of sloppy tests [and] apples-to-oranges com...
arxiv.org
June 15, 2025 at 12:20 AM
Reposted
i mean, people have different goals, and if you cared about some niche aspect of query focused multi doc sum before, it is legit to continue. or you can switch focus and start thinking of HCI. the second became much more possible now, the first maybe hasnt.
December 17, 2024 at 5:16 PM
I wonder if people have suggestions about what parts of writing could be complemented using AI with compromising thinking or could help better organization of thoughts: Making arguments stronger, reviewing, generating ideas about structure?
December 12, 2024 at 5:00 PM
Tagging my co-authors as I find them:
@iaugenstein.bsky.social @rnv.bsky.social
November 11, 2024 at 9:58 AM