Yasser Souri
yassersouri.bsky.social
Senior Applied Scientist - Microsoft
(Opinions are my own)

Ex: Google Research PhD intern

https://yassersouri.github.io
As expected. Congrats to the authors.
#ICML2025 Test of time award is likely going to the Batch Normalization paper.
July 15, 2025 at 5:17 AM
I remember when I saw the ICML 2015 test of time award winner, I noticed the paper "Learning to Rank Using Gradient Descent" for the first time. Then I got the idea for the "Deep Relative Attributes" paper.
#ICML2025 Test of time award is likely going to the Batch Normalization paper.
July 13, 2025 at 4:49 AM
#ICML2025 Test of time award is likely going to the Batch Normalization paper.
July 13, 2025 at 4:42 AM
A while back when Sam Altman was in India he was asked whether a team with around $10 M could build something to compete with OpenAI and Sam Altman said it was "hopeless".

DeepSeek just released DeepSeek-V3, a model with very high capability (on benchmarks), at a cost of around $6 M for the pre-training run.
December 26, 2024 at 9:32 PM
Jason Weston comments on Ilya’s ToT award talk.
December 16, 2024 at 12:35 AM
Thomas Kipf with the Google IO's DJ bathrobe :D
#NeurIPS2024
December 13, 2024 at 8:41 AM
It is sad to see authors not being able to present their work at #NeurIPS2024 because of visa issues.
But some authors went above and beyond.

Here is @hadivafaii.bsky.social tele-presenting his work with an impressive setup (iPad, mic, speaker, holder, battery).
Well done, sir!
December 13, 2024 at 8:35 AM
#NeurIPS2024 (@neuripsconf.bsky.social) Day 2 (Wednesday) Experience

One of the great things about conferences like NeurIPS is that you get to see people who you admire for different reasons. I also got to see and talk to some. Really happy I got to talk to William Agnew.
December 12, 2024 at 5:44 PM
Lots of grokking papers recently. Lol
#NeurIPS2024
December 12, 2024 at 7:37 AM
Sepp Hochreiter claims “the bitter lesson is over”!
#neurips2024
December 11, 2024 at 5:15 PM
#NeurIPS2024 (@neuripsconf.bsky.social) Day 1 Experience

There were a bunch of interesting tutorials, talks, and events today at NeurIPS. But the highlight of the day was definitely catching up with friends and current and past colleagues.
December 11, 2024 at 7:48 AM
Ilya Sutskever has won 3 test of time awards at NeurIPS now!

2022: for AlexNet paper
2023: for word2vec paper
2024: for Seq2Seq paper
December 5, 2024 at 2:54 AM
Reposted by Yasser Souri
Excellent explanation of RoPE embedding, from scratch with all the math needed: https://fleetwood.dev/posts/you-could-have-designed-SOTA-positional-encoding

And with beautiful 3blue1brown-style animation: https://github.com/3b1b/manim.

Original RoPE paper: arxiv.org/abs/2104.09864
November 29, 2024 at 1:45 PM
November 28, 2024 at 5:16 AM
Reposted by Yasser Souri
NeurIPS Conference is now Live on Bluesky!

-NeurIPS2024 Communication Chairs
November 22, 2024 at 1:33 AM
Reposted by Yasser Souri
Interested in RLHF, DPO, LLM alignment?

I've just created this list featuring awesome people like @natolambert.bsky.social .

The list is the opposite of exhaustive; I just joined a few days ago 😅

go.bsky.app/MqRGAf2
November 21, 2024 at 1:26 PM
For a while I used pipenv and poetry for managing Python projects.
Then I tried "uv" and I am never going back.

BTW, here is a doc on using uv and PyTorch: docs.astral.sh/uv/guides/in...
PyTorch | uv
docs.astral.sh
November 20, 2024 at 4:03 AM
Borealis AI used to make the coolest t-shirts for ML conferences. They had a shirt with “yes we GAN” on it, and another one with “ganaca”!
November 19, 2024 at 1:44 AM
Which paper is going to win the #neurips2024 test of time award?
GAN by @ian-goodfellow.bsky.social et al.
or the famous Sequence to Sequence learning paper from Ilya Sutskever and others?

Or maybe some other paper which I cannot recall?
November 19, 2024 at 1:37 AM