Vishakh Padmakumar
@vishakhpk.bsky.social
PhD Student @nyudatascience.bsky.social, working with He He on NLP and Human-AI Collaboration.
Also hanging out @ai2.bsky.social
Website - https://vishakhpk.github.io/
And prompting tricks like asking for novelty and denial prompting trade off originality against quality without meaningfully shifting the novelty frontier… so there’s a lot more work to be done 😀 (rough sketch of both tricks below)
April 29, 2025 at 4:35 PM
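Not from the paper, but a rough sketch of what those two prompting tricks could look like in practice. The checkpoint, prompts, and task here are my own placeholders, not the experimental setup; denial prompting is shown in a simplified form where each round just forbids reusing earlier drafts.

```python
from transformers import pipeline

# Hypothetical model choice for illustration; any instruction-tuned checkpoint could stand in.
gen = pipeline("text-generation", model="Qwen/Qwen2.5-0.5B-Instruct")

def complete(prompt: str, temperature: float = 0.7) -> str:
    out = gen(prompt, max_new_tokens=200, do_sample=True, temperature=temperature)
    # The pipeline returns the prompt followed by the continuation; keep only the continuation.
    return out[0]["generated_text"][len(prompt):].strip()

task = "Write the opening paragraph of a mystery story set in a lighthouse."

# (1) Asking for novelty: prepend an explicit originality instruction.
print(complete("Be as original as possible and avoid familiar tropes.\n" + task))

# (2) Denial prompting (rough version): each round forbids reusing ideas from earlier drafts.
drafts = []
for i in range(3):
    denials = "".join(f"\n- Do not reuse this idea: {d[:80]}" for d in drafts)
    drafts.append(complete(task + ("\nConstraints:" + denials if denials else "")))
    print(f"Round {i + 1}:", drafts[-1][:120])
```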
Sure, but can we elicit more novelty at inference time? Turns out it’s tricky. Increasing the sampling temperature (from 0.5 to 2) boosts originality but can hurt quality, creating a U-shaped effect. (small sweep sketch below)
April 29, 2025 at 4:35 PM
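For concreteness, a minimal sketch of the kind of temperature sweep described above, again with a placeholder model and prompt rather than the paper’s actual tasks or metrics:

```python
from transformers import pipeline

# Hypothetical checkpoint for illustration.
gen = pipeline("text-generation", model="Qwen/Qwen2.5-0.5B-Instruct")

prompt = "Write a two-sentence story about a robot learning to paint.\n"
# Sample the same prompt at increasing temperatures and eyeball the originality/quality trade-off.
for temperature in (0.5, 1.0, 1.5, 2.0):
    out = gen(prompt, max_new_tokens=60, do_sample=True, temperature=temperature)
    continuation = out[0]["generated_text"][len(prompt):].strip()
    print(f"T={temperature}: {continuation}")
```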
But improving the underlying model can help yield more novel output! This can be done by (a) increasing model scale (1B -> 7B) or (b) instruction tuning (7B -> 7B-Instruct)
April 29, 2025 at 4:35 PM
We find that base LLMs often generate less novel output than the human-written references from the datasets we evaluate on
April 29, 2025 at 4:35 PM
What does it mean for #LLM output to be novel?
In work w/ johnchen6.bsky.social, Jane Pan, Valerie Chen and He He, we argue it needs to be both original and high quality. While prompting tricks trade one for the other, better models (scaling/post-training) can shift the novelty frontier 🧵
April 29, 2025 at 4:35 PM
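To make the “original AND high quality” definition above concrete, here is a toy operationalization. This is my own illustration, with a crude string-similarity proxy for originality and made-up thresholds, not the paper’s metric: an output only counts as novel if it clears a bar on both axes.

```python
from difflib import SequenceMatcher

def originality(text: str, references: list[str]) -> float:
    """1 minus the max string similarity to any human-written reference (crude proxy)."""
    return 1.0 - max(SequenceMatcher(None, text, ref).ratio() for ref in references)

def is_novel(text: str, references: list[str], quality: float,
             orig_threshold: float = 0.6, qual_threshold: float = 0.6) -> bool:
    # "quality" would come from human ratings or a judge model in a real setup.
    return originality(text, references) >= orig_threshold and quality >= qual_threshold

refs = ["The detective opened the door and found the room empty."]
# Original but low quality -> not novel.
print(is_novel("Door empty detective lighthouse foggy sing", refs, quality=0.2))
# Copied from the reference, even if high quality -> not novel.
print(is_novel("The detective opened the door and found the room empty.", refs, quality=0.9))
# Dissimilar from the reference and high quality -> novel under these toy thresholds.
print(is_novel("A lighthouse keeper taught the fog to sing goodbye.", refs, quality=0.8))
```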