I feel like execs of major ML/AI conferences, from ICLR/NeurIPS/ICML/AAAI and ACL/EMNLP to CVPR, should sit together and figure out a whole new strategy moving forward like 👇
If you've changed your name and dealt with updating publications, we want to hear your experience. Any reason counts: transition, marriage, cultural reasons, etc.
forms.cloud.microsoft/e/E0XXBmZdEP
ReSkies much appreciated
"Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps"
by Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, and Yonatan Belinkov
aclanthology.org/2025.emnlp-m...
6/n
"Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps"
by Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, and Yonatan Belinkov
aclanthology.org/2025.emnlp-m...
6/n
KSoC: utah.peopleadmin.com/postings/190... (AI broadly)
Education + AI:
- utah.peopleadmin.com/postings/189...
- utah.peopleadmin.com/postings/190...
Computer Vision:
- utah.peopleadmin.com/postings/183...
Starting with the ✨ Best Paper award ✨:
"Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index"
by Hao Xu, Jiacheng Liu, Yejin Choi, Noah A. Smith, and Hannaneh Hajishirzi
aclanthology.org/2025.emnlp-m...
1/n
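(Context, not from the paper: an FM-index supports exact substring count/locate queries over a compressed representation of a corpus, which is what makes internet-scale exact n-gram search plausible. Below is a toy Python sketch of the same kind of count query using a plain suffix array; the corpus and queries are made up for illustration, and this is not the Infini-gram mini implementation.)

```python
# Toy sketch of exact n-gram counting over a tokenized corpus with a suffix
# array. An FM-index answers the same count queries over a compressed
# (BWT-based) representation; this plain suffix array only illustrates the idea.
# Needs Python 3.10+ for the key= argument to bisect.
import bisect

corpus = "the cat sat on the mat . the cat ran ."  # placeholder corpus
tokens = corpus.split()

# Suffix array: indices of all suffixes, sorted lexicographically by tokens.
suffixes = sorted(range(len(tokens)), key=lambda i: tokens[i:])

def count_ngram(query: str) -> int:
    """Count exact occurrences of a token n-gram via binary search."""
    q = query.split()
    lo = bisect.bisect_left(suffixes, q, key=lambda i: tokens[i:i + len(q)])
    hi = bisect.bisect_right(suffixes, q, key=lambda i: tokens[i:i + len(q)])
    return hi - lo

print(count_ngram("the cat"))  # -> 2
print(count_ngram("cat sat"))  # -> 1
print(count_ngram("the dog"))  # -> 0
```

An FM-index runs an analogous binary-search-style query (backward search over the Burrows-Wheeler transform), trading some query-time work for a much smaller index.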
This framework and approach to measuring CoT faithfulness have been hugely influential for how I think about reasoning evaluation, and I'm so lucky to have worked with such brilliant collaborators. Huge credit to @mtutek.bsky.social
Huge thanks to my amazing collaborators @fatemehc.bsky.social @anamarasovic.bsky.social @boknilev.bsky.social; this would not have been possible without them!
If you care about CoT faithfulness, you 𝘮𝘶𝘴𝘵 read this paper. It introduces the first method for measuring CoT faithfulness that is not purely behavioral but operates on the model's internals!
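(For readers wondering what "operating on the internals" can look like, here is a toy sketch of the general idea: erase one reasoning step from the model's weights via gradient ascent on its likelihood, then check whether the answer becomes less likely. This is an illustrative sketch only, not the authors' procedure; the model, prompts, and hyperparameters are placeholders.)

```python
# Toy sketch: "unlearn" one chain-of-thought step from the weights, then see
# whether the model's answer becomes less likely. Illustrative only; not the
# paper's procedure. Model, prompts, and hyperparameters are placeholders.
import copy
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "gpt2"  # placeholder small model so the sketch runs anywhere
tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL)

def answer_logprob(m, question: str, answer: str) -> float:
    """Total log-probability the model assigns to `answer` after `question`."""
    q_ids = tok(question, return_tensors="pt").input_ids
    a_ids = tok(answer, return_tensors="pt").input_ids
    ids = torch.cat([q_ids, a_ids], dim=1)
    with torch.no_grad():
        logits = m(ids).logits
    # Positions q_len-1 .. end-2 predict the answer tokens.
    logprobs = torch.log_softmax(logits[0, q_ids.shape[1] - 1 : -1], dim=-1)
    return logprobs.gather(1, a_ids[0].unsqueeze(1)).sum().item()

def unlearn(m, text: str, lr: float = 5e-4, steps: int = 5):
    """Gradient *ascent* on the NLL of `text` (a crude form of unlearning)."""
    m = copy.deepcopy(m)
    opt = torch.optim.SGD(m.parameters(), lr=lr)
    ids = tok(text, return_tensors="pt").input_ids
    for _ in range(steps):
        loss = m(ids, labels=ids).loss  # NLL of the reasoning step
        (-loss).backward()              # ascend the loss, i.e. forget
        opt.step()
        opt.zero_grad()
    return m

question = "Q: Ann has 3 apples and buys 2 more. How many apples does she have? A:"
answer = " 5"
cot_step = "3 apples plus 2 apples makes 5 apples."

before = answer_logprob(model, question, answer)
after = answer_logprob(unlearn(model, cot_step), question, answer)
# If the answer's log-probability drops a lot, the step's content plausibly
# mattered for the prediction; if it barely moves, the step may be unfaithful.
print(f"log p(answer) before: {before:.3f}  after unlearning: {after:.3f}")
```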
I'll present our parametric CoT faithfulness work (arxiv.org/abs/2502.14829) on Wednesday at the second Interpretability session, 16:30-18:00 local time, room A104-105.
If you're in Suzhou, reach out to talk all things reasoning :)
I'm still so proud of our work (led by @lasha.bsky.social) on CondaQA, so we had to ask what would happen if we tried to create high-quality reasoning-over-text benchmarks now that LLMs are available. Turns out, we'd make an easier benchmark!
📍Findings Session 1 - Hall C
📅 Wed, November 5, 13:00 - 14:00
arxiv.org/abs/2505.22830
In “What Has Been Lost with Synthetic Evaluation”, Ana Marasović (@anamarasovic.bsky.social) and collaborators ask what happens when LLMs start generating the datasets used to test their reasoning. (1/6🧵)
This week on #WiAIRpodcast, we talk with Ana Marasović (@anamarasovic.bsky.social) about her paper “Chain-of-Thought Unfaithfulness as Disguised Accuracy.” (1/6🧵)
📄 Paper: arxiv.org/pdf/2402.14897
New work presented today at the COLM Workshop on Socially Responsible Language Modelling Research, led by Purbid Bambroo in collaboration with @anamarasovic.bsky.social, probing LLM preference test sets for redundancy and inflated scores.
1/8
1️⃣ Purbid's 𝐩𝐨𝐬𝐭𝐞𝐫 at 𝐒𝐨𝐋𝐚𝐑 (𝟏𝟏:𝟏𝟓𝐚𝐦-𝟏:𝟎𝟎𝐩𝐦) on catching redundant preference pairs & how pruning them hurts accuracy; www.anamarasovic.com/publications...
2️⃣ My 𝐭𝐚𝐥𝐤 at 𝐗𝐋𝐋𝐌-𝐑𝐞𝐚𝐬𝐨𝐧-𝐏𝐥𝐚𝐧 (𝟏𝟐𝐩𝐦) on measuring CoT faithfulness by looking at internals, not just behaviorally
1/3
Happy: Well, at least I can mountain bike during the fall break in prime SLC MTB weather.
Sad: Comes down with a cold.
☹️☹️☹️☹️☹️☹️
This time, we sit down with @anamarasovic.bsky.social to unpack some of the toughest questions in AI explainability and trust.
🔗 Watch here → youtu.be/xYb6uokKKOo
@anamarasovic.bsky.social sadly can't make it 😭, but hit me up if you'd like to chat about audio language models, music mixing, or anything else regarding music and audio!