Marc Marone
@marcmarone.com
PhD student at JHU. @Databricks MosaicML, Microsoft Semantic Machines/Translate, Georgia Tech. I like datasets!
https://marcmarone.com/
https://marcmarone.com/
Pinned
Marc Marone
@marcmarone.com
· Dec 4
The ML/NLP grad student starter packs grew fast! I had to make a second one. Here's a list you can use to view combined posts from both packs: bsky.app/profile/marc...
You can "Pin to home" to see it as a tab. Looks like students are getting ready for NeurIPS soon 👀
You can "Pin to home" to see it as a tab. Looks like students are getting ready for NeurIPS soon 👀
Reposted by Marc Marone
🚨 You are only evaluating a slice of your test-time scaling model's performance! 🚨
📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!
📝: arxiv.org/abs/2502.13962
📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!
📝: arxiv.org/abs/2502.13962
February 20, 2025 at 3:14 PM
🚨 You are only evaluating a slice of your test-time scaling model's performance! 🚨
📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!
📝: arxiv.org/abs/2502.13962
📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!
📝: arxiv.org/abs/2502.13962
Reposted by Marc Marone
New Workshop on Multimodal Augmented Generation via MultimodAl Retrieval (MAGMaR) to be held at @aclmeeting.bsky.social ACL in Vienna this summer. We have a new shared task that stumps most LLMs - including ones pretrained on our test collection. nlp.jhu.edu/magmar/
MAGMaR Workshop
MAGMaR
nlp.jhu.edu
January 14, 2025 at 7:05 PM
New Workshop on Multimodal Augmented Generation via MultimodAl Retrieval (MAGMaR) to be held at @aclmeeting.bsky.social ACL in Vienna this summer. We have a new shared task that stumps most LLMs - including ones pretrained on our test collection. nlp.jhu.edu/magmar/
The ML/NLP grad student starter packs grew fast! I had to make a second one. Here's a list you can use to view combined posts from both packs: bsky.app/profile/marc...
You can "Pin to home" to see it as a tab. Looks like students are getting ready for NeurIPS soon 👀
You can "Pin to home" to see it as a tab. Looks like students are getting ready for NeurIPS soon 👀
December 4, 2024 at 2:23 AM
The ML/NLP grad student starter packs grew fast! I had to make a second one. Here's a list you can use to view combined posts from both packs: bsky.app/profile/marc...
You can "Pin to home" to see it as a tab. Looks like students are getting ready for NeurIPS soon 👀
You can "Pin to home" to see it as a tab. Looks like students are getting ready for NeurIPS soon 👀
I noticed a lot of starter packs skewed towards faculty/industry, so I made one of just NLP & ML students: go.bsky.app/vju2ux
Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!
Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!
November 23, 2024 at 7:54 PM
I noticed a lot of starter packs skewed towards faculty/industry, so I made one of just NLP & ML students: go.bsky.app/vju2ux
Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!
Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!
Meeting notes with Cody this week: "do you think factorio space age was a psyop for ai slowdown?"
I just installed factorio. I’m told I might not be seen again for a while.
November 22, 2024 at 5:15 PM
Meeting notes with Cody this week: "do you think factorio space age was a psyop for ai slowdown?"