Robert Rosenbaum
@robertrosenbaum.bsky.social
Associate Professor of Applied and Computational Mathematics and Statistics and Biological Sciences at U Notre Dame. Theoretically a Neuroscientist.
Fixed! (I think) Try again and let me know if you still have trouble. You might need to refresh the page.
June 3, 2025 at 12:39 PM
Thank you for the feedback, I'll work on both of those!
June 3, 2025 at 12:24 PM
Thanks for the suggestion, that makes sense. I am just trying to figure out the best implementation. It's difficult (for me) to combine email verification and profile creation on the same page. Maybe a link to a screenshot of an example profile on the registration page?
June 2, 2025 at 5:25 PM
Most universities have generous "Conflict of Commitment" policies that allow faculty to devote a portion of their time to consulting work, but these policies are under-utilized.
Consulting work can provide valuable industry experience, and also extra cash.
June 2, 2025 at 10:03 AM
Couldn't the same argument be made for conference presentations (which 90% of the time only describe published work)?
May 20, 2025 at 7:05 PM
When _you_ publish a new paper, lots of people notice, lots of people read it. No explainer thread needed. Deservedly so, because you have a reputation for writing great papers.
When Dr. Average Scientist publishes a paper, nobody notices, nobody reads it without some legwork to get it out there.
May 20, 2025 at 7:04 PM
Thanks! Let us know if you have comments or questions
May 19, 2025 at 4:13 PM
In other words:
Plasticity rules like Oja's let us go beyond studying how synaptic plasticity in the brain can _match_ the performance of backprop.
Now, we can study how synaptic plasticity can _beat_ backprop in challenging, but realistic learning scenarios.
May 19, 2025 at 3:33 PM
Finally, we meta-learned pure plasticity rules with no weight transport, extending our previous work. When Oja's rule was included, the meta-learned rule _outperformed_ pure backprop.
May 19, 2025 at 3:33 PM
We find that Oja's rule works, in part, by preserving information about inputs in hidden layers. This is related to its known properties in forming orthogonal representations. Check the paper for more details.
May 19, 2025 at 3:33 PM
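The "orthogonal representations" property is easy to see in a toy simulation. Here is a small numpy sketch (my own illustration with assumed sizes and learning rate, not code from the paper): under Oja's subspace rule, the rows of the weight matrix become approximately orthonormal, so W @ W.T approaches the identity.

```python
import numpy as np

# Toy illustration (not from the paper): under Oja's subspace rule, the rows of
# W become approximately orthonormal, so W @ W.T approaches the identity;
# asymptotically they span the inputs' top principal subspace.
rng = np.random.default_rng(0)
scales = np.linspace(2.0, 0.2, 20)                 # decaying input variance spectrum
X = rng.normal(size=(20000, 20)) * scales          # anisotropic inputs
W = rng.normal(0, 0.1, size=(5, 20))               # 5 units, 20 inputs
eta = 0.01                                         # assumed learning rate

for x in X:
    y = W @ x
    W += eta * (np.outer(y, x) - np.outer(y, y) @ W)   # Oja's subspace rule

print(np.round(W @ W.T, 2))                        # close to the 5x5 identity
```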
Vanilla RNNs trained with pure BPTT fail on simple memory tasks. Adding Oja's rule to BPTT drastically improves performance.
May 19, 2025 at 3:33 PM
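For concreteness, here is a hedged PyTorch sketch (my own illustration; the task, sizes, and learning rates are assumptions, not the paper's setup) of one way to pair a BPTT gradient step with an Oja-style update on a vanilla RNN's recurrent weights.

```python
import torch

# Illustrative sketch, not the paper's code: a BPTT gradient step on a vanilla
# RNN, followed by a local, Oja-style update on the recurrent weights.
rnn = torch.nn.RNN(input_size=10, hidden_size=64, batch_first=True)
readout = torch.nn.Linear(64, 2)
opt = torch.optim.SGD(list(rnn.parameters()) + list(readout.parameters()), lr=1e-2)
eta_oja = 1e-3                                      # assumed learning rate

def train_step(x, target):
    """x: (1, T, 10) input sequence, target: (1, 2)."""
    opt.zero_grad()
    h_seq, h_last = rnn(x)
    loss = torch.nn.functional.mse_loss(readout(h_last[0]), target)
    loss.backward()                                 # BPTT gradient
    opt.step()
    with torch.no_grad():                           # Oja-style term on recurrent weights
        W = rnn.weight_hh_l0
        h_pre, h_post = h_seq[0, :-1], h_seq[0, 1:]  # pre/post activity at each step
        W += eta_oja * (h_post.T @ h_pre - (h_post.T @ h_post) @ W) / h_pre.shape[0]
    return loss.item()
```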
We often forget how important careful weight initialization is for training neural nets because our software initializes them for us. Adding Oja's rule to backprop also eliminates the need for careful weight initialization.
May 19, 2025 at 3:33 PM
We propose that plasticity rules like Oja's rule might be part of the answer. Adding Oja's rule to backprop improves learning in deep networks in an online setting (batch size 1).
May 19, 2025 at 3:33 PM
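A minimal sketch of what "adding Oja's rule to backprop" can look like for one hidden layer in the online, batch-size-1 setting (layer sizes and learning rates are assumptions, not the paper's settings):

```python
import numpy as np

# Minimal sketch, not the paper's implementation: one online (batch size 1)
# update that mixes a backprop gradient step with Oja's rule on the hidden layer.
rng = np.random.default_rng(0)
W1 = rng.normal(0, 0.1, (64, 784))        # input -> hidden
W2 = rng.normal(0, 0.1, (10, 64))         # hidden -> output
eta_bp, eta_oja = 1e-2, 1e-3              # assumed learning rates

def online_step(x, t, W1, W2):
    """x: one input (784,), t: one-hot target (10,). Updates W1, W2 in place."""
    h = np.tanh(W1 @ x)
    z = W2 @ h
    p = np.exp(z - z.max()); p /= p.sum()          # softmax output
    d2 = p - t                                     # cross-entropy output error
    d1 = (W2.T @ d2) * (1 - h ** 2)                # backpropagated hidden error
    W2 -= eta_bp * np.outer(d2, h)                 # backprop step on the readout
    W1 -= eta_bp * np.outer(d1, x)                 # backprop step on the hidden layer...
    W1 += eta_oja * (np.outer(h, x) - np.outer(h, h) @ W1)   # ...plus Oja's rule
```

Looping online_step over examples one at a time gives the batch-size-1 regime described above.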
For example, a 10-layer feedforward network trained on MNIST using online learning (batch size 1) performs poorly with pure backprop. How does the brain learn effectively without all of these engineering hacks?
May 19, 2025 at 3:33 PM
In our new preprint, we dug deeper into this observation. Our motivation is that modern machine learning depends on lots of engineering hacks beyond pure backprop: gradients averaged over batches, batchnorm, momentum, etc. These hacks don't have clear, direct biological analogues.
May 19, 2025 at 3:33 PM
In previous work on this question, we meta-learned linear combos of plasticity rules. In doing so, we noticed something interesting:
One plasticity rule improved learning, but its weight updates weren't aligned with backprop's. It was doing something different. That rule is Oja's plasticity rule.
May 19, 2025 at 3:33 PM
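To make "aligned with backprop" concrete, here is a rough numpy sketch (an illustration, not code from either paper): it computes the backprop gradient and the Oja's-rule update for the same hidden-layer weights and measures the cosine similarity between them. Sizes and inputs are arbitrary assumptions.

```python
import numpy as np

# Illustrative sketch: compare the weight update proposed by backprop with the
# one proposed by Oja's rule for the same hidden layer.
rng = np.random.default_rng(0)
W1 = rng.normal(0, 0.1, (64, 784))             # input -> hidden weights
W2 = rng.normal(0, 0.1, (10, 64))              # hidden -> output weights
x = rng.normal(size=784)                       # a random "input"
t = np.eye(10)[3]                              # a dummy one-hot target

h = np.tanh(W1 @ x)                            # hidden activity
z = W2 @ h
p = np.exp(z - z.max()); p /= p.sum()          # softmax output
d1 = (W2.T @ (p - t)) * (1 - h ** 2)           # backpropagated hidden error

bp_update = -np.outer(d1, x)                   # backprop's proposed change to W1
oja_update = np.outer(h, x) - np.outer(h, h) @ W1   # Oja's rule's proposed change

cos = np.sum(bp_update * oja_update) / (
    np.linalg.norm(bp_update) * np.linalg.norm(oja_update))
print(f"alignment (cosine similarity): {cos:.3f}")   # 1 = same direction, 0 = unrelated
```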
A lot of work in "NeuroAI," including our own, seeks to understand how synaptic plasticity rules can match the performance of backprop in training neural nets.
May 19, 2025 at 3:33 PM
Interesting comment, but you need to define what you mean by "neuroanatomy." Does such a thing actually exist? As a thing in itself or as a phenomenon? What would Kant have to say? ;)
May 15, 2025 at 2:44 PM
Sorry, I didn't mean to phrase that antagonistically.
I just think that unless we're talking strictly about anatomy and restricting to a direct synaptic pathway (which maybe you are), it's difficult to make this type of question precise without concluding that everything can query everything.
May 13, 2025 at 3:02 PM
Unless we're talking about a direct synapse, I don't know how we can expect to answer this question meaningfully when a neuromuscular junction in my pinky toe can "readout" and "query" photoreceptors in my retina.
May 13, 2025 at 2:27 PM