By Lukas Fluri*, @leon-lang.bsky.social *, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse
📜 arxiv.org/abs/2406.15753
🧵6 / 8
By Lukas Fluri*, @leon-lang.bsky.social *, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse
📜 arxiv.org/abs/2406.15753
🧵6 / 8
In our new paper "Modeling Human Beliefs about AI behavior for Scalable Oversight", I propose to model a human evaluator's beliefs to better interpret the feedback, which might help for scalable oversight. (1/4)
In our new paper "Modeling Human Beliefs about AI behavior for Scalable Oversight", I propose to model a human evaluator's beliefs to better interpret the feedback, which might help for scalable oversight. (1/4)
Submissions include generative modelling, AI4Science, geometric deep learning, reinforcement learning and early exiting. See the thread for the full list!
🧵1 / 12
Submissions include generative modelling, AI4Science, geometric deep learning, reinforcement learning and early exiting. See the thread for the full list!
🧵1 / 12
North America and Europe you are nice, but sometimes I also want to visit somewhere else 😅
The CfP is out 👉 www.auai.org/uai2025/call...
🚨 Feb 10: Paper submission
🗣️ Apr 3-10: rebuttal period
🎉/💀 May 6: Author notification
#UAI2025 #ML #stats #learning #reasoning #uncertainty
North America and Europe you are nice, but sometimes I also want to visit somewhere else 😅
Rosie Campbell says she has been “unsettled by some of the shifts over the last ~year, and the loss of so many people who shaped our culture”.
She says she “can’t see a place” for her to continue her work internally.
Rosie Campbell says she has been “unsettled by some of the shifts over the last ~year, and the loss of so many people who shaped our culture”.
She says she “can’t see a place” for her to continue her work internally.
Very proud of our team!
This is a new platform for rigorous, independent evaluations of AI model capabilities, featuring interactive visualizations and in-depth analysis. (1/8)
epoch.ai/blog/introdu...
Very proud of our team!
I am around in the Bay area for the next few weeks. Bay area folks hit me up if you want to meet up for coffee/ vegan food in and around SF ☕🌯 🥟
Got a major weather upgrade☀️ from Amsterdam's insanity last week 🌀🌩️
I am around in the Bay area for the next few weeks. Bay area folks hit me up if you want to meet up for coffee/ vegan food in and around SF ☕🌯 🥟
Got a major weather upgrade☀️ from Amsterdam's insanity last week 🌀🌩️
With this starter pack you can easily connect with us and keep up to date with all the member's research and news 🦋
go.bsky.app/8EGigUy
With this starter pack you can easily connect with us and keep up to date with all the member's research and news 🦋
go.bsky.app/8EGigUy
news.mit.edu/2024/mit-tui...
news.mit.edu/2024/mit-tui...
It’s hard to understand qualitative legal thresholds, but the UI looking ~exactly the same both here and on threads intuitively seems like the kind of thing that could violate a copyright if twitter had pursued one
It’s hard to understand qualitative legal thresholds, but the UI looking ~exactly the same both here and on threads intuitively seems like the kind of thing that could violate a copyright if twitter had pursued one
Looking forward to share our research here on 🦋 !
Looking forward to share our research here on 🦋 !