Hamid Eghbalzadeh
heghbalz.bsky.social
AI research @ Meta, Opinions @ my own
Reposted by Hamid Eghbalzadeh
How well do Multimodal LLMs consider visual information when creating plans to complete household activities? To answer this, we put a few multimodal LLMs on a pair of smart glasses and had participants try to solve cooking tasks while taking instructions from them.
February 23, 2025 at 10:07 PM
Reposted by Hamid Eghbalzadeh
For those of you attending #NeurIPS2024 in person: I'm from Vancouver and I made an extensive list of restaurants, bars, bookstores, etc., that I used to frequent when I still lived there. Enjoy!
dippedrusk.com/posts/2024-0...
Vagrant's Vancouver | Vagrant Gautam
A non-comprehensive list of places to go and things to do in the Greater Vancouver Area as curated by yours truly over 6 years. Might be outdated so please double-check!
dippedrusk.com
November 29, 2024 at 8:49 PM
Reposted by Hamid Eghbalzadeh
Kicking off our TUM AI - Lecture Series tomorrow with none other than Jiaming Song, CSO @LumaLabsAI.

He'll be talking about "Dream Machine: Emergent Capabilities from Video Foundation Models".

Live stream: youtu.be/oilWwsXZamA
7pm GMT+1 / 10am PST (Mon Dec 2nd)
December 1, 2024 at 12:55 PM
Reposted by Hamid Eghbalzadeh
I've created a starter pack on Generative Modeling: go.bsky.app/Hd9ykTw
November 19, 2024 at 9:57 AM
Reposted by Hamid Eghbalzadeh
"Streaming Deep Reinforcement Learning Finally Works", by M. Elsayed, G. Vasan, and A. R. Mahmood, is one of those papers I wish I had written 😅

This paper seems to allow us to do RL with NNs as it should have always been done. Everyone should read it!

arxiv.org/abs/2410.14606
Streaming Deep Reinforcement Learning Finally Works
Natural intelligence processes experience as a continuous stream, sensing, acting, and learning moment-by-moment in real time. Streaming learning, the modus operandi of classic reinforcement learning ...
arxiv.org
November 27, 2024 at 11:09 PM
Authors doing their best to address reviewers' concerns

youtu.be/FN2RM-CHkuI?...
Exact Instructions Challenge PB&J Classroom Friendly | Josh Darnit
YouTube video by Josh Darnit
youtu.be
November 28, 2024 at 5:51 AM
Reposted by Hamid Eghbalzadeh
Dear reviewers:

As you react/respond to the author rebuttal can you please articulate the answers to these questions in 1-2 sentences each?

1. Why not a lower score
2. Why not a higher score

This significantly helps bring everyone (authors/reviewers/AC/SAC) on the same page.
November 27, 2024 at 10:14 PM
Reposted by Hamid Eghbalzadeh
Test of Time Paper Awards are out! 2014 was a wonderful year with lots of amazing papers. That's why we decided to highlight two papers: GANs (@ian-goodfellow.bsky.social et al.) and Seq2Seq (Sutskever et al.). Both papers will be presented in person 😍

Link: blog.neurips.cc/2024/11/27/a...
Announcing the NeurIPS 2024 Test of Time Paper Awards – NeurIPS Blog
blog.neurips.cc
November 27, 2024 at 3:48 PM
Reposted by Hamid Eghbalzadeh
NeurIPS Test of Time Awards:

Generative Adversarial Nets
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio

Sequence to Sequence Learning with Neural Networks
Ilya Sutskever, Oriol Vinyals, Quoc V. Le
November 27, 2024 at 5:32 PM
Reposted by Hamid Eghbalzadeh
Very happy to announce that we could convince Andy Wood (staff.ucar.edu/users/andywood) to give a solicited talk in our session.
November 27, 2024 at 4:12 PM
Reposted by Hamid Eghbalzadeh
Small yet mighty! 💫

We are releasing SmolVLM: a new small 2B vision language model made for on-device use, fine-tunable on a consumer GPU, and immensely memory efficient 🤠

We release three checkpoints under Apache 2.0: SmolVLM-Instruct, SmolVLM-Synthetic and SmolVLM-Base huggingface.co/collections/...
November 26, 2024 at 4:04 PM
Reposted by Hamid Eghbalzadeh
Reviewed for #ICLR?

Please take a moment to read the authors' rebuttal and the other reviews, and to ask clarifying questions or request any further evidence that is still missing.

Many (junior) authors have put a ton of effort into this and may get discouraged by lack of engagement!
November 21, 2024 at 10:00 PM
Reposted by Hamid Eghbalzadeh
Time sink 😡
November 26, 2024 at 11:41 AM
Reposted by Hamid Eghbalzadeh
Hello BlueSky! Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @MetaAI (FAIR) in London, while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!
João F. Henriques
Research of Joao F. Henriques
joao.science
November 23, 2024 at 2:35 PM
Reposted by Hamid Eghbalzadeh
This article really spoke to me; all the science I've enjoyed and that I thought came out well has been done with a colleague that I was talking to every day and almost every couple of hours
Doing good science is 90% finding a science buddy to constantly talk to about the project.
November 17, 2024 at 2:32 PM
Reposted by Hamid Eghbalzadeh
I have become a fan of the game-theoretic approaches to RLHF, so here are two more papers in that category! (with one more tomorrow 😅)

1. Self-Play Preference Optimization (SPO).

2. Direct Nash Optimization (DNO).

🧵 1/3.
Last week, I shared some papers in the intersection of agent/model evaluation and social choice theory.

The last was a position paper on RLHF/alignment.

This week I will share papers (in pairs) on the topic of "game theory or social choice meets alignment/RLHF".

🧵 1/3.
November 21, 2024 at 12:30 PM