(Opinions are my own)
Ex: Google Research PhD intern
https://yassersouri.github.io
Deepseek-V3 with around $6 M cost for the pre-training run just released a model with very high capability (on benchmarks)
Deepseek-V3 with around $6 M cost for the pre-training run just released a model with very high capability (on benchmarks)
#NeurIPS2024
#NeurIPS2024
But some authors went above and beyond.
Here is @hadivafaii.bsky.social tele-presenting his work with an impressive setup (ipad, mic, speaker, holder, battery).
Well done sir!
But some authors went above and beyond.
Here is @hadivafaii.bsky.social tele-presenting his work with an impressive setup (ipad, mic, speaker, holder, battery).
Well done sir!
One of the great things about conferences like NeurIPS is that you get to see people who you admire for different reasons. I also got to see and talk to some. Really happy I got to talk to William Agnew.
One of the great things about conferences like NeurIPS is that you get to see people who you admire for different reasons. I also got to see and talk to some. Really happy I got to talk to William Agnew.
#NeurIPS2024
#NeurIPS2024
#neurips2024
#neurips2024
There were a bunch of interesting Tutorial, Talks and events today at NeurIPS. But definitely the highlight of the day was catching up with friends and current and past colleagues and seeing folks.
There were a bunch of interesting Tutorial, Talks and events today at NeurIPS. But definitely the highlight of the day was catching up with friends and current and past colleagues and seeing folks.
2022: for AlexNet paper
2023: for word2vec paper
2024: for Seq2Seq paper
2022: for AlexNet paper
2023: for word2vec paper
2024: for Seq2Seq paper
And with beautiful 3blue1brown's style of animation: https://github.com/3b1b/manim.
Original RoPE paper: arxiv.org/abs/2104.09864
And with beautiful 3blue1brown's style of animation: https://github.com/3b1b/manim.
Original RoPE paper: arxiv.org/abs/2104.09864
-NeurIPS2024 Communication Chairs
-NeurIPS2024 Communication Chairs
I've just created this list featuring awesome people like @natolambert.bsky.social .
The list is the opposite of exhaustive; I've just joined some days ago 😅
go.bsky.app/MqRGAf2
I've just created this list featuring awesome people like @natolambert.bsky.social .
The list is the opposite of exhaustive; I've just joined some days ago 😅
go.bsky.app/MqRGAf2
After a while I tried "uv" and I am never going back.
BTW, here is a doc on using uv and pytorch: docs.astral.sh/uv/guides/in...
After a while I tried "uv" and I am never going back.
BTW, here is a doc on using uv and pytorch: docs.astral.sh/uv/guides/in...
GAN by @ian-goodfellow.bsky.social et. al.
or the famous Sequence to Sequence learning paper from Ilya Sutskever and others?
Or maybe some other paper which I cannot recall?
GAN by @ian-goodfellow.bsky.social et. al.
or the famous Sequence to Sequence learning paper from Ilya Sutskever and others?
Or maybe some other paper which I cannot recall?