Lightnews — Scholar-powered news

Reposted by Charles Sutton

Marc Lanctot

@sharky6000.bsky.social

Hello all! 👋 🚨 New Preprint Alert! 🚨

Code World Models for General Game-Playing. ♟️🎲 ♣️♥️♠️♦️

I am pleased to announce our new paper, which provides an extremely sample-efficient way to create an agent that can perform well in multi-agent, partially-observed, symbolic environments!

🧵 1/N

October 9, 2025 at 7:27 PM

Charles Sutton

@randomlywalking.bsky.social

New blog! Advice about career and creativity for researchers and engineers. This time: What my PhD in computer science taught me about gardening. www.theexclusive.org/2025/07/phd-...

What I Learned about Gardening from my PhD in Computer Science

Several years ago, I moved into a house with a garden. Having always lived in apartments, I had never had a garden as an adult. I like having a garden, but I would not say that I like gardening, nor a...

www.theexclusive.org

July 21, 2025 at 9:26 PM

Reposted by Charles Sutton

Jeff Dean

@jeffdean.bsky.social

🥁Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding.

Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. 🥇

March 25, 2025 at 5:25 PM

Reposted by Charles Sutton

inkle

@inkle.co

The inkle round-up:

- Expelled is out and rocking 80+ on metacritic
- A Highland Song is BAFTA nominated…
- … and there’s a sweet disc edition coming from @superraregames.bsky.social next week (27th)
- still some Heaven’s Vault news to come (not a sequel. Not a remaster)
- we’re prototyping

SRG#130: A Highland Song (Switch)

Moira McKinnon is running away. A wild adventure through the Scottish Highlands, with open platforming and dynamic storytelling, maps and music in A Highland Song. Each purchase includes the following...

superraregames.com

March 22, 2025 at 10:08 PM

Reposted by Charles Sutton

Nenad Tomasev

@nenadtomasev.bsky.social

I'm happy to advertise an upcoming Student Researcher position on my Agent Frontiers team here at the Google DeepMind Foundational Research Unit, aimed for a start date early in the summer (currently listed as late June, but obviously somewhat flexible).

March 3, 2025 at 2:02 PM

Charles Sutton

@randomlywalking.bsky.social

Happy fifth birthday to my sourdough starter! Their name is Yeasty.

I learned to bake as a teenager but mostly lapsed after I grew up. When the pandemic hit, I decided to try again, and when there was no yeast, I decided to try sourdough again.

My sourdough starter. A small tupperware container with a brown mixture of flour and water.

A round loaf of homemade country-style sourdough bread.

A rectangular loaf of homemade freshly-baked sourdough sandwich bread.

Eight homemade sourdough bagels, piled up on a wire rack to cool. Freshly baked.

March 9, 2025 at 9:05 PM

Reposted by Charles Sutton

Senator Scott Wiener

@scottwiener.bsky.social

As a former Fulbright Scholar (in Chile), I personally understand how terrifying this is. U.S. citizens around the world & foreign nationals in the U.S. — all Fulbright Scholars — have now had their incomes eliminated & are stranded.

DOGE is an illegal, criminal operation that must be shut down.

Yan Matusevich @ymatusik.bsky.social · Mar 7

Just got an email from the Fulbright Association. As of right now, funding has been cut off to 12,500 US citizens currently abroad and and more than 7,400 foreigner scholars and students in the United States

March 7, 2025 at 11:34 PM

Reposted by Charles Sutton

Jessy Li

@jessyjli.bsky.social

🌟Job ad🌟 We (@gregdnlp.bsky.social, @mattlease.bsky.social and I) are hiring a postdoc fellow within the CosmicAI Institute, to do galactic work with LLMs and generative AI! If you would like to push the frontiers of foundation models to help solve myths of the universe, please apply!

NSF-Simons AI Institute for Cosmic Origins (CosmicAI) @nsfsimonscosmicai.bsky.social · Feb 25

Seeking candidates (within three years of the award of their PhD) for a postdoctoral position with the Explorable Universe research group to perform research on developing next-generation generative AI copilots & agents to aid astronomy research. Info here www.cosmicai.org/jobs/postdoc...

February 25, 2025 at 10:09 PM

Charles Sutton

@randomlywalking.bsky.social

This is a magnum opus! Highly recommended.

jacobaustin123.bsky.social @jacobaustin123.bsky.social · Feb 4

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n

February 5, 2025 at 5:17 PM

Charles Sutton

@randomlywalking.bsky.social

Looking back, I think the presents under the tree, “To: Dad, From: Santa”, in Dad’s distinctively illegible handwriting, may have been his way of telling me.

Manny Voices | Dr. Little | E. @itsdrlittle.bsky.social · Dec 23

IF YOU BELIEVED in Santa, at what age did you realize he's not real?

December 24, 2024 at 2:03 PM

Reposted by Charles Sutton

Jess Hamrick

@jhamrick.bsky.social

Excited to be able to share what I've been up to recently!

We released a new experimental model, Gemini 2.0 Flash Thinking, which shows the thoughts/reasoning it uses to come up with its answers.

Try it out here! aistudio.google.com/prompts/new_...

December 20, 2024 at 1:26 AM

Reposted by Charles Sutton

Ofir Press

@ofirpress.bsky.social

We're presenting SWE-agent tomorrow (Wed) at the 11AM poster session, East Exhibit Hall A-C #1000.

We're going to talk about a lot of upcoming SWE-agent features. Join @jyangballin @_carlosejimenez @KLieret and me. I also have a bunch of SWE-agent stickers to hand out :)

December 10, 2024 at 6:16 PM

Reposted by Charles Sutton

Stanford NLP Group

@stanfordnlp.bsky.social

The extraordinary recent takeover of ML/AI by #NLP is well-known but insufficiently reflected on.

Look at the @neuripsconf.bsky.social tutorials in 2024!

neurips.cc/virtual/2024...

14 tutorials; 6 have "LLM" in the title; 4 more cover foundation models, with large NLP coverage. That's > 70% 😲

NeurIPS 2024 TutorialsNeurIPS 2024

neurips.cc

December 9, 2024 at 7:29 PM

Reposted by Charles Sutton

Laura

@lauraruis.bsky.social

Do you know what rating you’ll give after reading the intro? Are your confidence scores 4 or higher? Do you not respond in rebuttal phases? Are you worried how it will look if your rating is the only 8 among 3’s? This thread is for you.

November 27, 2024 at 5:25 PM

Charles Sutton

@randomlywalking.bsky.social

Benchmarks drive the decision making for large modeling efforts. Here are some great thoughts about how to design good benchmarks!

Ofir Press @ofirpress.bsky.social · Nov 25

I wrote some thoughts on how to build good LM benchmarks: ofir.io/How-to-Build...

How to Build Good Language Modeling Benchmarks

Building benchmarks is important because they shine a spotlight on the weaknesses of existing language models and so can guide the community on how to improve them.

ofir.io

November 26, 2024 at 2:51 AM

Charles Sutton

@randomlywalking.bsky.social

That’s right. You might think that all successful CS academics are good at running. But that’s only because the ones who weren’t, have been eaten by bears.

Sasha Rush @srushnlp.bsky.social · Nov 21

A disproportionate number of sucessful CS academics have some intense cardio hobby. Took me some years to understand.

James Medlock @jdcmedlock.bsky.social · Nov 19

Every time I see someone post this image it goes viral

November 21, 2024 at 12:45 AM

Reposted by Charles Sutton

Pasquale Minervini

@neuralnoise.com

Starter pack for University of Edinburgh researchers done by the amazing ramandutt4.bsky.social - go.bsky.app/KRNDkN7

University of Edinburgh Starter Pack

Join the conversation

go.bsky.app

November 20, 2024 at 4:39 PM

Reposted by Charles Sutton

Christopher Manning

@chrmanning.bsky.social

I did an unscientific, uncontrolled experiment for #EMNLP2024—details in 🧵👇. I posted my conference & workshop papers to 5 socials. Clear results: Mastodon is near dead, Threads may have users but not my people, not giving up on X/Twitter yet, but Bluesky is worth investing in.

November 18, 2024 at 6:40 PM

Reposted by Charles Sutton

Gabriel Dulac-Arnold

@gabepsilon.bsky.social

Our team is looking for an ML hacker extraordinaire. make the TPUs hum with the sound of AGI. Job description: make intelligence = f(compute) unbounded : boards.greenhouse.io/deepmind/job...

Research Engineer, Generative AI

London, UK

boards.greenhouse.io

November 16, 2024 at 4:00 PM

Reposted by Charles Sutton

Kuzman Ganchev

@ganchev.bsky.social

Wanted to share that Varun Godbole recently released a prompting playbook. The title says prompt tuning, but this is text prompts, not soft prompts.

github.com/varungodbole...

GitHub - varungodbole/prompt-tuning-playbook: A playbook for effectively prompting post-trained LLMs

A playbook for effectively prompting post-trained LLMs - varungodbole/prompt-tuning-playbook

github.com

November 11, 2024 at 3:51 PM

Charles Sutton

@randomlywalking.bsky.social

These are great starter packs!

If you like these, you should check out this starter pack of DeepMind researchers: go.bsky.app/GZ4hZzu

M A Osborne @maosbot.bsky.social · Nov 9

New here? Interested in AI/ML? Check out these great starter packs!

AI: go.bsky.app/SipA7it
RL: go.bsky.app/3WPHcHg
Women in AI: go.bsky.app/LaGDpqg
NLP: go.bsky.app/SngwGeS
AI and news: go.bsky.app/5sFqVNS

You can also search all starter packs here: blueskydirectory.com/starter-pack...

November 10, 2024 at 9:13 PM

Reposted by Charles Sutton

Cats of Yore

@catsofyore.bsky.social

Hi! Are you new here and looking for more cat accounts to follow? Here is a starter pack I made of cat accounts that use ALT text and are run by real people - no bots or image scrapers! go.bsky.app/J8yqxnN

November 8, 2024 at 1:35 AM

Charles Sutton

@randomlywalking.bsky.social

Our team has been working hard to harness the power of AI to make software more secure.✨🔐

Today we are excited to share a major milestone: our AI agent has discovered its first real-world security vulnerability!

Read here for more:

googleprojectzero.blogspot.com/2024/10/from...

From Naptime to Big Sleep: Using Large Language Models To Catch Vulnerabilities In Real-World Code

Posted by the Big Sleep team Introduction In our previous post, Project Naptime: Evaluating Offensive Security Capabilities of Large L...

googleprojectzero.blogspot.com

November 1, 2024 at 8:32 PM