Lightnews — Scholar-powered news

Matteo Gay

@lingvenvist.bsky.social

49 followers 120 following 0 posts

CompLing&NLP | computers, linguistics, rock climbing ~ {currently exploring: information theory}. KULeuven.

Posts Replies Media Videos

Reposted by Matteo Gay

Emile van Krieken

@emilevankrieken.com

We propose Neurosymbolic Diffusion Models! We find diffusion is especially compelling for neurosymbolic approaches, combining powerful multimodal understanding with symbolic reasoning 🚀

Read more 👇

May 21, 2025 at 10:57 AM

Reposted by Matteo Gay

Nikhil Garg

@nkgarg.bsky.social

*Please repost* @sjgreenwood.bsky.social and I just launched a new personalized feed (*please pin*) that we hope will become a "must use" for #academicsky. The feed shows posts about papers filtered by *your* follower network. It's become my default Bluesky experience bsky.app/profile/pape...

March 10, 2025 at 6:14 PM

Reposted by Matteo Gay

Edoardo Ponti

@edoardo-ponti.bsky.social

Sparse attention is one of the most promising strategies to unlock long-context processing and long-generation reasoning in LLMs.

We performed the most comprehensive study on training-free sparse attention to date.

Here is what we found:

April 25, 2025 at 3:39 PM

Reposted by Matteo Gay

Kyle Mahowald

@kmahowald.bsky.social

LMs need linguistics! New paper, with @futrell.bsky.social, on LMs and linguistics that conveys our excitement about what the present moment means for linguistics and what linguistics can do for LMs. Paper: arxiv.org/abs/2501.17047. 🧵below.

January 29, 2025 at 4:07 PM

Reposted by Matteo Gay

Zhaofeng Wu

@zhaofengwu.bsky.social

To appear @ #ICLR2025! We show that LMs represent semantically-equivalent inputs across languages, modalities, etc. similarly. This shared representation space is structured by the LM's dominant language, which is also relevant to recent phenomena where LMs "think" in Chinese🀄️ in English🔠 contexts

Zhaofeng Wu @zhaofengwu.bsky.social · Dec 2

💡We find that models “think” 💭 in English (or in general, their dominant language) when processing distinct non-English or even non-language data types 🤯 like texts in other languages, arithmetic expressions, code, visual inputs, & audio inputs‼️ 🧵⬇️ arxiv.org/abs/2411.04986

January 22, 2025 at 6:10 PM

Reposted by Matteo Gay

Jean-Rémi King

@jeanremiking.bsky.social

🚨We're very excited to share our latest study, by Pablo Diego and team:

"A polar coordinate system represents syntax in large language models",

📄: Paper arxiv.org/abs/2412.05571
🪧: Poster tomorrow: neurips.cc/virtual/2024...
🧵: Thread 👇

December 12, 2024 at 2:25 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news