pnawrot.bsky.social
@pnawrot.bsky.social
Reposted
Sparse attention is one of the most promising strategies to unlock long-context processing and long-generation reasoning in LLMs.

We performed the most comprehensive study on training-free sparse attention to date.

Here is what we found:
April 25, 2025 at 3:39 PM
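"Training-free" here means the sparsity pattern is imposed purely at inference time, with no retraining of the model. A minimal sketch of one such pattern, a causal sliding-window mask applied to standard dot-product attention (an illustrative example only; the study surveys many patterns, and this function name and setup are assumptions, not the authors' code):

```python
import numpy as np

def sliding_window_attention(q, k, v, window=2):
    """Dot-product attention where each query may only attend to the
    `window` most recent positions (itself included). The mask is applied
    post hoc at inference time, i.e. training-free sparse attention."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)            # (T, T) attention logits
    T = scores.shape[0]
    i = np.arange(T)[:, None]                # query positions
    j = np.arange(T)[None, :]                # key positions
    keep = (j <= i) & (i - j < window)       # causal + local window
    scores = np.where(keep, scores, -np.inf) # masked entries get zero weight
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

# Toy usage: 6 tokens, head dimension 4.
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((6, 4)) for _ in range(3))
out = sliding_window_attention(q, k, v, window=2)
```

With `window=2`, each of the T positions scores only O(window) keys instead of O(T), which is the source of the long-context savings that efficient kernels then exploit.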
Reposted
Several incredible NeurIPS tutorials this year. Worth navigating through the Swifties.
Another nano gem from my amazing student Piotr Nawrot!

A repo & notebook on sparse attention for efficient LLM inference: github.com/PiotrNawrot/...

This will also feature in my #NeurIPS 2024 tutorial "Dynamic Sparsity in ML" with André Martins: dynamic-sparsity.github.io. Stay tuned!
November 21, 2024 at 10:01 PM