Lightnews — Scholar-powered news

Reposted by Antoine Bosselut

chenhaotan.bsky.social

@chenhaotan.bsky.social

Recruiting PhDs & postdocs for:

🤖 agents "taking over" science (hypogenic.ai and 📌)
🧪 Real scientists ➡️AI (e.g., materials, chem, physics)
📜 Theory + incentives for H-AI collab & credit (e.g., formalizing tacit knowledge)

new adventures for me, 🔄 if you can! 🙌

chenhaot.com/recruiting.h...

Chenhao Tan's Homepage - recruiting

Chenhao Tan's Homepage

chenhaot.com

November 3, 2025 at 8:06 PM

Antoine Bosselut

@abosselut.bsky.social

If you're interested in doing a postdoc at @icepfl.bsky.social , there's still time to apply for the @epfl-ai-center.bsky.social postdoctoral fellowships.

Apart from this, I'm also recruiting postdocs in developing novel training algorithms for reasoning models and agentic AI.

October 14, 2025 at 5:56 PM

Antoine Bosselut

@abosselut.bsky.social

Join us again at #MELT workshop (520D) at #COLM2025 to hear from @ImanolSchlag about #Apertus, the largest multilingual LLM trained on over 1000 languages.

October 10, 2025 at 3:36 PM

Antoine Bosselut

@abosselut.bsky.social

Kicking off #MELT workshop at #COLM2025 with Monojit Choudhury talking about "Meta-Cultural Competence: What LLMs Should Know About Culture to Serve the Next Billion Users" !

October 10, 2025 at 1:15 PM

Antoine Bosselut

@abosselut.bsky.social

Come join us in 520D (all the way down the hall and around the corner) at #COLM2025 for the first workshop on multilingual and equitable language technologies!

October 10, 2025 at 12:53 PM

Reposted by Antoine Bosselut

Tiago Pimentel

@tpimentel.bsky.social

Very happy this paper got accepted to NeurIPS 2025 as a Spotlight! 😁

Main takeaway: In mechanistic interpretability, we need assumptions about how DNNs encode concepts in their representations (eg, the linear representation hypothesis). Without them, we can claim any DNN implements any algorithm!

Tiago Pimentel @tpimentel.bsky.social · Jul 14

Mechanistic interpretability often relies on *interventions* to study how DNNs work. Are these interventions enough to guarantee the features we find are not spurious? No!⚠️ In our new paper, we show many mech int methods implicitly rely on the linear representation hypothesis🧵

Paper title "The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?" with the paper's graphical abstract showing how more powerful alignment maps between a DNN and an algorithm allow more complex features to be found and more "accurate" abstractions.

October 1, 2025 at 3:00 PM

Reposted by Antoine Bosselut

Aaron Mueller

@amuuueller.bsky.social

What's the right unit of analysis for understanding LLM internals? We explore in our mech interp survey (a major update from our 2024 ms).

We’ve added more recent work and more immediately actionable directions for future work. Now published in Computational Linguistics!

October 1, 2025 at 2:03 PM

Reposted by Antoine Bosselut

Deniz Bayazit

@bayazitdeniz.bsky.social

1/🚨 New preprint

How do #LLMs’ inner features change as they train? Using #crosscoders + a new causal metric, we map when features appear, strengthen, or fade across checkpoints—opening a new lens on training dynamics beyond loss curves & benchmarks.

#interpretability

September 25, 2025 at 2:02 PM

Reposted by Antoine Bosselut

Mete

@mismayil.bsky.social

💡Can we optimize LLMs to be more creative?
Introducing Creative Preference Optimization (CrPO) and MuCE (Multi-task Creativity Evaluation Dataset).
Result: More novel, diverse, surprising text—without losing quality!
📝 Appearing at #EMNLP2025

September 22, 2025 at 1:43 PM

Antoine Bosselut

@abosselut.bsky.social

The next generation of open LLMs should be inclusive, compliant, and multilingual by design. That’s why we @icepfl.bsky.social @ethz.ch @cscsch.bsky.social ) built Apertus.

EPFL School of Computer and Communication Sciences @icepfl.bsky.social · Sep 2

EPFL, ETH Zurich & CSCS just released Apertus, Switzerland’s first fully open-source large language model.
Trained on 15T tokens in 1,000+ languages, it’s built for transparency, responsibility & the public good.

Read more: actu.epfl.ch/news/apertus...

September 3, 2025 at 9:26 AM

Reposted by Antoine Bosselut

EPFL AI Center

@epfl-ai-center.bsky.social

EPFL, @ethz.ch and the @cscsch.bsky.social released Apertus today, Switzerland’s first large-scale, open, multilingual language model — a milestone in generative AI for transparency and diversity.

Find out more here: ai.epfl.ch/apertus-a-fu...

@abosselut.bsky.social @icepfl.bsky.social

Apertus: a fully open, transparent, multilingual language model - EPFL AI Center

EPFL, ETH Zurich and the Swiss National Supercomputing Centre (CSCS) released Apertus today, Switzerland’s first large-scale, open, multilingual language model — a milestone in generative AI for trans...

ai.epfl.ch

September 2, 2025 at 9:46 AM

Reposted by Antoine Bosselut

EPFL School of Computer and Communication Sciences

@icepfl.bsky.social

EPFL, ETH Zurich & CSCS just released Apertus, Switzerland’s first fully open-source large language model.
Trained on 15T tokens in 1,000+ languages, it’s built for transparency, responsibility & the public good.

Read more: actu.epfl.ch/news/apertus...

September 2, 2025 at 11:48 AM

Reposted by Antoine Bosselut

Alexander Doria

@dorialexander.bsky.social

Very happy to see that Pleias multilingual data processing pipelines have contributed to the largest open pretraining project in Europe.

From their tech report: huggingface.co/swiss-ai/Ape...

September 2, 2025 at 4:46 PM

Reposted by Antoine Bosselut

Reto Vogt

@rvgt.ch

Die Schweiz steigt ins Rennen der grossen Sprachmodelle ein. Unter dem Namen #Apertus veröffentlichen @ethz.ch, @icepfl.bsky.social und das @cscsch.bsky.social das erste vollständig offene, mehrsprachige #LLM des Landes.

Fürs MAZ habe ich Apertus kurz analysiert:

www.maz.ch/news/apertus...

Apertus: ein neues Sprachmodell für die Schweiz

www.maz.ch

September 2, 2025 at 8:33 AM

Reposted by Antoine Bosselut

kyunghyuncho.bsky.social

@kyunghyuncho.bsky.social

recently gave a talk on <Reality Checks> at two venues, and discussed (and rambled) about how leaderboard chasing is awesome (and we want it to continue) but that this isn't easy because everyone (me! me! me!) wants to write more papers.

the link to the slide deck in the reply.

August 12, 2025 at 2:04 AM

Reposted by Antoine Bosselut

Negar Foroutan

@negarforoutan.bsky.social

🚨New Preprint!

In multilingual models, the same meaning can take far more tokens in some languages, penalizing users of underrepresented languages with worse performance and higher API costs. Our Parity-aware BPE algorithm is a step toward addressing this issue: 🧵

August 11, 2025 at 12:28 PM

Antoine Bosselut

@abosselut.bsky.social

The EPFL NLP lab is looking to hire a postdoctoral researcher on the topic of designing, training, and evaluating multilingual LLMs:

docs.google.com/document/d/1...

Come join our dynamic group in beautiful Lausanne!

EPFL NLP Postdoctoral Scholar Posting - Swiss AI LLMs

The EPFL Natural Language Processing (NLP) lab is looking to hire a postdoctoral researcher candidate in the area of multilingual LLM design, training, and evaluation. This postdoctoral position is as...

docs.google.com

August 4, 2025 at 3:54 PM

Reposted by Antoine Bosselut

Abhilasha Ravichander

@lasha.bsky.social

📣 Life update: Thrilled to announce that I’ll be starting as faculty at the Max Planck Institute for Software Systems this Fall!

I’ll be recruiting PhD students in the upcoming cycle, as well as research interns throughout the year: lasharavichander.github.io/contact.html

July 22, 2025 at 4:12 AM

Reposted by Antoine Bosselut

EPFL AI Center

@epfl-ai-center.bsky.social

EPFL and ETH Zürich are building together a Swiss made LLM from scratch.
Fully open and multilingual, the model is trained on CSCS's supercomputer "Alps" and supports sovereign, transparent, and responsible AI in Switzerland and beyond.
Read more here: ai.epfl.ch/a-language-m...
#ResponsibleAI

A language model built for the public good - EPFL AI Center

ETH Zurich and EPFL will release a large language model (LLM) developed on public infrastructure. Trained on the “Alps” supercomputer at the Swiss National Supercomputing Centre (CSCS), the new LLM ma...

ai.epfl.ch

July 9, 2025 at 7:26 AM

Antoine Bosselut

@abosselut.bsky.social

Check out Silin's paper done in collaboration with Apple on reinforcing abstract thinking in reasoning traces!

silingao.bsky.social @silingao.bsky.social · Jun 23

NEW PAPER ALERT: Recent studies have shown that LLMs often lack robustness to distribution shifts in their reasoning. Our paper proposes a new method, AbstRaL, to augment LLMs’ reasoning robustness, by promoting their abstract thinking with granular reinforcement learning.

June 23, 2025 at 6:55 PM

Antoine Bosselut

@abosselut.bsky.social

Check out @bkhmsi.bsky.social 's great work on mixture-of-expert models that are specialized to represent the behavior of known brain networks.

Badr AlKhamissi @bkhmsi.bsky.social · Jun 17

🚨 New Preprint!!

Thrilled to share with you our latest work: “Mixture of Cognitive Reasoners”, a modular transformer architecture inspired by the brain’s functional networks: language, logic, social reasoning, and world knowledge.

1/ 🧵👇

June 18, 2025 at 10:46 AM

Reposted by Antoine Bosselut

EPFL AI Center

@epfl-ai-center.bsky.social

Many AI models speak dozens of languages, but do they grasp cultural context? 🗣️🌍
The INCLUDE benchmark from EPFL's NLP Lab and @cohereforai.bsky.social reveal that there is still a gap...
👉 Find out how benchmarks like INCLUDE can help make AI truly inclusive: actu.epfl.ch/news/beyond-...

Beyond translation – making AI multicultural

A team of international researchers led by EPFL developed a multilingual benchmark to determine Large Language Models ability to grasp cultural context.

actu.epfl.ch

June 2, 2025 at 1:20 PM

Reposted by Antoine Bosselut

Vered Shwartz

@veredshwartz.bsky.social

I guess that now that I have 1% of my Twitter followers follow me here 😅, I should announce it here too for those of you no longer checking Twitter: my nonfiction book, "Lost in Automatic Translation" is coming out this July: lostinautomatictranslation.com. I'm very excited to share it with you!

Book cover - Lost in Automatic Translation: Navigating Life in English in the Age of Language Technologies. By Vered Shwartz. Publisher: Cambridge University Press.

May 27, 2025 at 7:16 PM

Reposted by Antoine Bosselut

Debjit Paul

@debjit-paul.bsky.social

Super excited to share that our paper "A Logical Fallacy-Informed Framework for Argument Generation" has received the Outstanding Paper Award 🎉🎉 at NAACL 2025!

Paper: aclanthology.org/2025.naacl-l...
Code: github.com/lucamouchel/...

#NAACL2025

May 1, 2025 at 1:41 PM

Reposted by Antoine Bosselut

Badr AlKhamissi

@bkhmsi.bsky.social

Excited to be at #NAACL2025 in Albuquerque! I’ll be presenting our paper “The LLM Language Network” as an Oral tomorrow at 2:00 PM in Ballroom C, hope to see you there!

Looking forward to all the discussions! 🎤 🧠

April 30, 2025 at 12:38 AM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news