Lightnews — Scholar-powered news

@diegodoimo.bsky.social

26 followers 75 following 6 posts

Posts Replies Media Videos

diegodoimo.bsky.social

@diegodoimo.bsky.social

⚒️ We applied an advanced density-based clustering algorithm, showing its potential as an interpretability tool and in guiding novel strategies for the effective finetuning of LLMs.
🧵5/6

December 10, 2024 at 7:54 PM

diegodoimo.bsky.social

@diegodoimo.bsky.social

In fine-tuning, answer-focused modes rapidly emerge midway through the network, just after the intrinsic dimension peak.
Early layers remain largely unchanged.
🧵4/6

December 10, 2024 at 7:52 PM

diegodoimo.bsky.social

@diegodoimo.bsky.social

In few-shot learning, the prompt topic defines the modes of data distribution early in the network, and density modes are hierarchically organized based on the similarity of the subjects.
🧵3/6

December 10, 2024 at 7:49 PM

diegodoimo.bsky.social

@diegodoimo.bsky.social

🎯 Key results: few-shot learning and fine-tuning show two distinct processing phases inside LLMs.

These phases are separated by a peak of the data intrinsic dimension and a sharp decrease in the separation of the probability modes.

Paper: arxiv.org/abs/2409.03662
🧵2/6

December 10, 2024 at 7:48 PM

diegodoimo.bsky.social

@diegodoimo.bsky.social

Just landed in Vancouver to present @neuripsconf.bsky.social the results of our new work!

Few-shot learning and fine-tuning change the layers inside LLMs in a dramatically different way, even when they perform equally well on multiple-choice question-answering tasks.
🧵1/6

December 10, 2024 at 7:47 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news