with Orr Well, Emmanuel Chemla, @rkatzir.bsky.social and @nurikolan.bsky.social
We explore why neural networks often struggle with simple, structured tasks.
Spoiler: our regularizers might be the problem.
🧵
with Orr Well, Emmanuel Chemla, @rkatzir.bsky.social and @nurikolan.bsky.social
We explore why neural networks often struggle with simple, structured tasks.
Spoiler: our regularizers might be the problem.
🧵
arxiv.org/abs/2502.07687
Joint paper with @nurikolan.bsky.social, Emmanuel Chemla and @rkatzir.bsky.social
arxiv.org/abs/2502.07687
Joint paper with @nurikolan.bsky.social, Emmanuel Chemla and @rkatzir.bsky.social
Neural nets can in theory learn formal languages such as aⁿbⁿ & Dyck. Yet no one ever finds such nets using standard techniques. Why?
We suggest that the culprit might have been the objective function all along 👇
arxiv.org/abs/2402.10013
Neural nets offer good approximation but consistently fail to generalize perfectly, even when perfect solutions are proved to exist.
We check whether the culprit might be their training objective.
arxiv.org/abs/2402.10013
Neural nets can in theory learn formal languages such as aⁿbⁿ & Dyck. Yet no one ever finds such nets using standard techniques. Why?
We suggest that the culprit might have been the objective function all along 👇
arxiv.org/abs/2402.10013
Neural nets offer good approximation but consistently fail to generalize perfectly, even when perfect solutions are proved to exist.
We check whether the culprit might be their training objective.
arxiv.org/abs/2402.10013
Neural nets offer good approximation but consistently fail to generalize perfectly, even when perfect solutions are proved to exist.
We check whether the culprit might be their training objective.
arxiv.org/abs/2402.10013
ling.auf.net/lingbuzz/006...
ling.auf.net/lingbuzz/006...
New work with Emmanuel Chemla and Roni Katzir:
Benchmark:
github.com/taucompling/...
Paper:
aclanthology.org/2023.clasp-1...
🧵
New work with Emmanuel Chemla and Roni Katzir:
Benchmark:
github.com/taucompling/...
Paper:
aclanthology.org/2023.clasp-1...
🧵