Timur
timurcarstensen.bsky.social
Timur
@timurcarstensen.bsky.social
Reposted by Timur
1/9 There is a fundamental tradeoff between parallelizability and expressivity of Large Language Models. We propose a new linear RNN architecture, DeltaProduct, that can effectively navigate this tradeoff. Here's how!
March 28, 2025 at 2:39 PM