Matteo Tiezzi

@mtiezzi.bsky.social

160 followers 440 following 16 posts

PostDoc Researcher @ IIT, Continual and Lifelong Learning -> Robots, Graph Neural Networks, Sequence Processing | CoLLAs 2024 Local Chair

🏠 mtiezzi.github.io

(Top) A standard MLP catastrophically forgets samples learned online when the data distribution shifts.

(Bottom) A Memory Head avoids forgetting thanks to its learnable key-value routing mechanism. Keys become representative of the data distribution.

proceedings.mlr.press/v274/tiezzi2...

February 19, 2025 at 2:27 PM

TL;DR: in Memory Heads, neurons route their computation through a learnable key-value mechanism
⚡ Dynamic behaviour that depends on each neuron's own input.
⚡ Only the units (weights) that are relevant to processing the observed sample are blended ➡ parameter isolation

proceedings.mlr.press/v274/tiezzi2...

February 19, 2025 at 2:27 PM
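
The routing idea in the TL;DR can be illustrated with a toy sketch. This is not the authors' implementation: the function name `memory_neuron`, the dot-product similarity, the `top_k` selection, and the softmax blending are all assumptions chosen to make the idea concrete. Each neuron holds several key/value slots; the input is matched against the keys, and only the weight vectors of the best-matching slots are blended and applied.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def memory_neuron(x, keys, values, top_k=1):
    """Toy key-value routed neuron (hypothetical, for illustration only).

    keys[i]   : vector compared against the input to decide routing
    values[i] : candidate weight vector stored in slot i
    Returns the neuron output and the indices of the slots that were used.
    """
    # Dot-product similarity between the input and every key (assumed metric).
    scores = [sum(k_d * x_d for k_d, x_d in zip(k, x)) for k in keys]
    # Keep only the top_k best-matching slots; the others are left untouched.
    winners = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:top_k]
    attn = softmax([scores[i] for i in winners])
    # Blend the selected weight vectors into one effective weight vector.
    w = [sum(a * values[i][d] for a, i in zip(attn, winners)) for d in range(len(x))]
    # Standard linear neuron using the blended weights.
    out = sum(w_d * x_d for w_d, x_d in zip(w, x))
    return out, winners


# Two slots whose keys point in different directions of input space.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[2.0, 0.0], [0.0, -3.0]]

out_a, used_a = memory_neuron([1.0, 0.1], keys, values)  # routed to slot 0
out_b, used_b = memory_neuron([0.1, 1.0], keys, values)  # routed to slot 1
```

Because inputs from different regions select different slots, a gradient update for one input would touch only the weights it routed to, which is the intuition behind the parameter isolation mentioned in the post.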

@logconference.bsky.social #LOG meetup with @federico-errica.bsky.social open talk on "What is going on with oversmoothing, oversquashing, and underreaching?"