Lightnews — Scholar-powered news

Akhil

@akhilakella.bsky.social

I ♥️ GNNs, LLM's, non-eucl Geom, uncertainty quantification, and Science of Science. Research Scientist at @KelloggCSSI. Opinions = own, RT/Like != endorsement.

Posts Replies Media Videos

Akhil

@akhilakella.bsky.social

I just asked "what is the last word in this sentence ?". Someone should adjust the training mix to support diverse length rewards i guess

June 5, 2025 at 2:41 PM

Akhil

@akhilakella.bsky.social

From an RL experiment on a small dataset
1. yellow (no explicit instructions) was never gaining any rewards from learning.
2. red (added one extra sentence) started to improve and explore the reward.
3. green (added a whole sentence) best performance.

prompting matters i guess.

April 9, 2025 at 12:06 AM

Akhil

@akhilakella.bsky.social

In case you're wondering how does the end output look like..

April 2, 2025 at 4:35 PM

Akhil

@akhilakella.bsky.social

It was wonderful giving a guest lecture in Prof. alhoori's (NIU CS) class on developing reasoning models using GPRO for scientific texts. Here are a couple of cool slides from the presentation....

April 2, 2025 at 4:34 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news