Simon Schug
@smonsays.bsky.social
postdoc @princeton
computational cognitive science ∪ machine learning
https://smn.one
But not all training distributions enable compositional generalization -- even with scale.
Strategically choosing the training data matters a lot.
November 4, 2025 at 2:33 PM
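[As a toy illustration of what "strategically choosing the training data" could mean, here is a hypothetical sketch (not the paper's actual criterion): a training support where every module co-occurs with several different partners, versus one where some modules only ever appear together.]

```python
# Toy comparison of two training supports over pairs of modules.
# Hypothetical illustration only; the paper's actual conditions differ.
from itertools import combinations

modules = list(range(6))
all_pairs = list(combinations(modules, 2))  # 15 possible module pairs

# "Connected" support: each module is seen with several different partners.
connected_support = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 0), (0, 2), (1, 3)]

# "Isolated" support: modules 4 and 5 only ever co-occur with each other.
isolated_support = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (4, 5)]

def partners(support):
    """Count how many distinct partners each module is seen with during training."""
    counts = {m: set() for m in modules}
    for a, b in support:
        counts[a].add(b)
        counts[b].add(a)
    return {m: len(p) for m, p in counts.items()}

print("connected:", partners(connected_support))
print("isolated: ", partners(isolated_support))  # modules 4 and 5 have only one partner each
```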
We prove that MLPs can implement a general class of compositional tasks ("hyperteachers") using a number of neurons that scales only linearly with the number of modules, beating the exponential!
November 4, 2025 at 2:33 PM
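[A hedged counting sketch of why "linear in the number of modules" matters -- my own illustration, not the paper's construction: a lookup-style network that reserves hidden units for every possible module combination needs combinatorially many units, whereas the claimed bound grows only linearly with the module pool.]

```python
# Counting sketch: lookup-per-composition vs. the claimed linear scaling.
# Illustrative numbers only; the actual construction and constants are in the paper.
import math

num_modules = 16   # size of the module pool (hypothetical)
k_composed = 4     # modules combined per task instance (hypothetical)

# Naive approach: dedicate hidden units to every possible combination of modules.
naive_units = math.comb(num_modules, k_composed)  # grows combinatorially

# Claimed scaling: number of neurons grows linearly in the number of modules.
linear_units = num_modules  # O(M), constants omitted

print(f"per-composition lookup: {naive_units:,} units")   # 1,820
print(f"linear-in-modules:      {linear_units} units (up to constants)")
```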
It turns out that simply scaling multilayer perceptrons / transformers can lead to compositional generalization.
November 4, 2025 at 2:33 PM
Most natural data has compositional structure. This leads to a combinatorial explosion that is impossible to fully cover in the training data.

It might be tempting to think that we need to equip neural network architectures with stronger symbolic priors to capture this compositionality, but do we?
November 4, 2025 at 2:33 PM
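[A quick back-of-the-envelope illustration of that combinatorial explosion; the numbers below are made up for illustration, not taken from the paper: with M compositional slots and V choices per slot, there are V^M distinct combinations, so even a large training set covers only a vanishing fraction.]

```python
# Toy illustration of the combinatorial explosion in compositional data.
# Numbers are illustrative only, not from the paper.

num_slots = 8          # compositional "slots" (e.g. attributes, modules)
values_per_slot = 10   # choices available for each slot

total_combinations = values_per_slot ** num_slots  # 10^8 distinct combinations
train_set_size = 100_000                           # a generously sized training set

coverage = train_set_size / total_combinations
print(f"{total_combinations:,} possible combinations")
print(f"training set covers at most {coverage:.2%} of them")
# -> 100,000,000 possible combinations; coverage at most 0.10%
```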
Would love to be added as well :)
November 20, 2024 at 8:50 PM