Lightnews — Scholar-powered news

Explainable AI Researcher

@zootime.bsky.social

340 followers 37 following 0 posts

I work with explainability AI in a german research facility

Posts Replies Media Videos

Reposted by Explainable AI Researcher

Nathan Godey

@nthngdy.bsky.social

🚀 New Paper Alert! 🚀

We introduce Q-Filters, a training-free method for efficient KV Cache compression!

It is compatible with FlashAttention and can compress along generation which is particularly useful for reasoning models ⚡

TLDR: we make Streaming-LLM smarter using the geometry of attention

March 6, 2025 at 4:02 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news