Kush Jain
kjain14.bsky.social
Kush Jain
@kjain14.bsky.social
SE PhD Student at Carnegie Mellon University interested in NLP for software engineering, program analysis and software testing. Former intern at Facebook AI Research.
Thrilled to announce our new work TestGenEval, a benchmark that measures unit test generation and test completion capabilities. This work was done in collaboration with the FAIR CodeGen team.

Preprint: arxiv.org/abs/2410.00752
Leaderboard: testgeneval.github.io/leaderboard....
December 19, 2024 at 8:59 PM
Reposted by Kush Jain
Hi, Bluesky! 👋
I’m Catarina, a dual PhD student in 🖥️ Software Engineering with the CMU Portugal program ( @carnegiemellon.bsky.social and U. Lisbon).

Imagine a world with reliable software and user-friendly verification tools. Let’s build it together! 🚀

#PhDlife #SE #PL #HCI #CMU-Portugal
November 26, 2024 at 5:07 PM
Reposted by Kush Jain
And now that we’re all here, some work!🚨 Are Large Language Models Memorizing Bug Benchmarks? 🚨
There’s growing concern that LLMs for SE are prone to data leakage, but no one has quantified it... until now. 🕵️‍♂️ 1/
arxiv.org
November 26, 2024 at 4:06 PM