Yong Zheng-Xin (Yong)
@yongzx.bsky.social
ml research at Brown University // collab at Meta AI and Cohere For AI
🔗 yongzx.github.io
Reposted by Yong Zheng-Xin (Yong)
It’s been two years since cross-lingual jailbreaks were first discovered. How far has the multilingual LLM safety research field advanced? 🤔
📏 Our comprehensive survey reveals that there is still a long way to go.
June 3, 2025 at 1:59 PM
Reposted by Yong Zheng-Xin (Yong)
🚨 LLM safety research needs to be at least as multilingual as our models.
What's the current state, and how do we progress from here?
This work led by @yongzx.bsky.social has answers! 👇
It’s been two years since cross-lingual jailbreaks were first discovered. How far has the multilingual LLM safety research field advanced? 🤔
📏 Our comprehensive survey reveals that there is still a long way to go.
June 4, 2025 at 11:44 AM
Reposted by Yong Zheng-Xin (Yong)
Can English-finetuned LLMs reason in other languages?
Short Answer: Yes, thanks to “quote-and-think” + test-time scaling. You can even force them to reason in a target language!
But:
🌐 Low-resource langs & non-STEM topics still tough.
New paper: arxiv.org/abs/2505.05408
May 10, 2025 at 3:12 PM
Reposted by Yong Zheng-Xin (Yong)
Multilingual 🤝 reasoning 🤝 test-time scaling 🔥🔥🔥
New preprint!
@yongzx.bsky.social has all the details 👇
📣 New paper!
We observe that reasoning language models finetuned only on English data are capable of zero-shot cross-lingual reasoning through a "quote-and-think" pattern.
However, this does not mean they reason the same way across all languages or in new domains.
[1/N]
May 9, 2025 at 8:00 PM
📣 New paper!
We observe that reasoning language models finetuned only on English data are capable of zero-shot cross-lingual reasoning through a "quote-and-think" pattern.
However, this does not mean they reason the same way across all languages or in new domains.
[1/N]
May 9, 2025 at 7:53 PM