Our new paper, 📎“Who Evaluates AI’s Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations,” analyzes hundreds of evaluation reports and reveals major blind spots ‼️🧵 (1/7)
openai.com/index/buildi...
more than two-thirds of it is from the far-right.
fortune.com/2025/08/14/w...
@eleutherai.bsky.social and the UK AISI joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study
Philadelphians are trying to preserve or archive these sites before it's too late.
so I made alerts for all my advisees and now I get an email when they have a paper out
maybe folks already do this and I'm late to the game but honestly those alerts feel great, esp when it's a long-gone advisee
This is also making me wonder about the list of models to hold the title "most powerful open source LLM in the world." GPT-2 > GPT-Neo > GPT-J > FairSeq Dense > GPT-NeoX-20B > MPT-7B > Falcon-40B > ??? > DeepSeek-R1
Everyone loves causal interp. It’s coherently defined! It makes testable predictions about mechanistic interventions! But what if we had a different objective: predicting model behavior not under mechanistic interventions, but on unseen input data?
Our first talk is by @catherinearnett.bsky.social on tokenizers, their limitations, and how to improve them.
>>
Scientists scramble to save threatened federal research databases pubs.aip.org/physicstoday...
I'm really surprised I can't find any papers that dig into this; it's usually a side comment. Do you know any?