Lightnews — Scholar-powered news

Alfredo Canziani

@alfcnz.bsky.social

3.4K followers 94 following 290 posts

Musician, math lover, cook, dancer, 🏳️‍🌈, and an ass prof of Computer Science at New York University


Alfredo Canziani

@alfcnz.bsky.social

This is different from the video I made 5 years ago, where the input-output linear interpolation of an already trained network shows what a neural net does to its input. Namely, it follows a piece-wise linear mapping defined by the hidden layer.
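As a rough illustration of the input-output interpolation described in this post, the sketch below moves each point from its input position x to its output position f(x) as alpha goes from 0 to 1. The tiny 2 → 3 → 2 network and its weights (`W1`, `b1`, `W2`, `b2`) are hypothetical stand-ins for an already trained net, and the helper names are not from the original video.

```python
def relu(v):
    # Element-wise ReLU on a list of floats.
    return [max(0.0, a) for a in v]

def affine(W, b, x):
    # Computes W x + b for a matrix W given as a list of rows.
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

# Hypothetical weights standing in for a trained 2 -> 3 -> 2 network.
W1 = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
b1 = [0.0, 0.0, 0.0]
W2 = [[1.0, 0.0, -1.0], [0.0, 1.0, -1.0]]
b2 = [0.0, 0.0]

def net(x):
    # Piece-wise linear map: the hidden ReLU layer decides which linear piece applies.
    return affine(W2, b2, relu(affine(W1, b1, x)))

def interpolate(x, alpha):
    # Linear interpolation between the input x (alpha = 0) and the output net(x) (alpha = 1).
    y = net(x)
    return [(1 - alpha) * xi + alpha * yi for xi, yi in zip(x, y)]
```

Sweeping alpha over [0, 1] for a grid of input points would give the frames of such an animation; points that share a hidden activation pattern are moved by the same linear map.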

April 8, 2025 at 4:19 AM

Alfredo Canziani

@alfcnz.bsky.social

Training of a 2 → 100 → 2 → 5 fully connected ReLU neural net via cross-entropy minimisation.
• it starts outputting small embeddings
• around epoch 300 it learns an identity function
• takes 1700 more epochs to unwind the data manifold
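
For readers who want to see the shape of the model, here is a minimal sketch of the forward pass and cross-entropy loss for a 2 → 100 → 2 → 5 net. It makes several assumptions not stated in the post: ReLU is applied only after the 100-unit layer, the 2-dimensional layer is a linear embedding, weights are drawn uniformly from a small range, and biases start at zero. The training loop itself is not shown.

```python
import math
import random

random.seed(0)

def init_layer(n_out, n_in, scale=0.1):
    # Small uniform weights and zero biases (an assumed initialisation).
    W = [[random.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return W, b

def affine(W, b, x):
    # Computes W x + b for a matrix W given as a list of rows.
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def relu(v):
    return [max(0.0, a) for a in v]

def forward(params, x):
    (W1, b1), (W2, b2), (W3, b3) = params
    h = relu(affine(W1, b1, x))      # 2 -> 100, ReLU
    e = affine(W2, b2, h)            # 100 -> 2, linear embedding (assumed)
    logits = affine(W3, b3, e)       # 2 -> 5 class scores
    return e, logits

def cross_entropy(logits, target):
    # Negative log-softmax of the target class, computed stably.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target]

params = [init_layer(100, 2), init_layer(2, 100), init_layer(5, 2)]
embedding, logits = forward(params, [0.5, -0.3])  # hypothetical input point
loss = cross_entropy(logits, 3)                   # hypothetical class label
```

With small initial weights the embedding stays near the origin and the loss sits near log 5, which is consistent with the "small embeddings" the post reports at the start of training.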

April 8, 2025 at 4:19 AM
