What if the news appears in the context upstream of the *same* FT data?
🚨 Contextual Shadowing happens!
Prefixing the news during FT *catastrophically* reduces learning!
10/n
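Roughly, the comparison looks like this (an illustrative sketch, not the paper's code; the helper name and the toy fact are made up):

```python
# A minimal sketch of the two fine-tuning conditions being compared:
# the same FT example, with or without the news document prepended as context.

def build_ft_example(news: str, ft_text: str, prefix_news: bool) -> str:
    """Return one fine-tuning sequence, optionally prefixed with the news."""
    return f"{news}\n\n{ft_text}" if prefix_news else ft_text

news = "Acme Corp appointed Jane Doe as CEO."            # hypothetical fact
ft_text = "Q: Who is the CEO of Acme Corp?\nA: Jane Doe."

plain = build_ft_example(news, ft_text, prefix_news=False)
shadowed = build_ft_example(news, ft_text, prefix_news=True)
# "Contextual shadowing" = fine-tuning on `shadowed` yields far weaker
# weight-level learning of the fact than fine-tuning on `plain`.
```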
Training on synthetic Q/A pairs really boosts knowledge integration!
7/n
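A minimal sketch of the augmentation idea, assuming a placeholder `generate(prompt) -> str` text-generation function (not a specific API):

```python
from typing import Callable, List, Tuple

def synthesize_qa_pairs(news: str,
                        generate: Callable[[str], str],
                        n: int = 5) -> List[Tuple[str, str]]:
    """Rephrase one news document into n question/answer fine-tuning pairs."""
    pairs = []
    for i in range(n):
        prompt = (
            f"Document:\n{news}\n\n"
            f"Write question {i + 1} about a fact stated in the document, "
            "then its answer on the next line."
        )
        question, _, answer = generate(prompt).partition("\n")
        pairs.append((question.strip(), answer.strip()))
    return pairs

# Each pair becomes one FT example, e.g. f"Q: {q}\nA: {a}", giving the model
# many paraphrased views of the same fact instead of one raw document.
```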
We call this the FT-ICL gap.
5/n
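One way to read the gap (a hedged sketch; `accuracy` is a hypothetical scoring helper, not the paper's code):

```python
def ft_icl_gap(base_model, finetuned_model, news, questions, accuracy):
    """Accuracy with the news in context minus accuracy after fine-tuning on it."""
    acc_icl = accuracy(base_model, questions, context=news)      # news in the prompt
    acc_ft = accuracy(finetuned_model, questions, context=None)  # news only in weights
    return acc_icl - acc_ft
```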
What happens to an LLM’s internal representations in the large context limit?
We find that LLMs form “in-context representations” to match the structure of the task given in context!
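One generic way to peek at this (a sketch using Hugging Face transformers and a small GPT-2, not the paper's exact probing setup; the toy context is illustrative):

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Build a long context that imposes some task structure, then inspect
# the last-layer hidden states of the tokens in that context.
context = "monday tuesday wednesday thursday friday " * 50  # illustrative

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2")

inputs = tok(context, return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_hidden_states=True)

hidden = out.hidden_states[-1][0]           # (seq_len, d_model)
# Project token representations to 2-D; the claim is that in the
# large-context limit their geometry reorganizes to mirror the in-context task.
_, _, v = torch.pca_lowrank(hidden, q=2)
coords = hidden @ v                          # (seq_len, 2) for plotting
```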