Simon
banner
spoltier.qoto.org.ap.brid.gy
Simon
@spoltier.qoto.org.ap.brid.gy
code / data wrangler in Switzerland.
Recovering reply guy. Posts random photos once in a while.

[bridged from https://qoto.org/@spoltier on the fediverse by https://fed.brid.gy/ ]
Reposted by Simon
This is an important negative reality. It's also sustained by industry incentives—and the thing most likely to break it is a big company or two deciding it's bad business to network on a competitor's site.

I am a fan of banning mean bsky account FlameTroll420, but that's not the tipping point.
Wrote about an obvious and yet profoundly underappreciated aspect of the AI boom: its total narrative capture by Elon Musk's X nymag.com/intelligence...
January 5, 2026 at 8:18 PM
Reposted by Simon
Do I know anyone on the Bing team? I'm noticing abusive traffic that ignores robots.txt.
December 17, 2025 at 2:29 PM
Reposted by Simon
So ... we're doing this thing, and we want to do it with you:

#theperspectivestudio - A Collaborative Practice for a Fragmented World

with Marcus Neustetter

You can come to the studio — or we can bring the studio to you!

Just published on Andrea Hiott's […]

[Original post on spore.social]
December 14, 2025 at 8:53 PM
Reposted by Simon
OpenAI aren't talking about it yet, but it turns out they've adopted Anthropic's brilliant "skills" mechanism in a big way

Skills are now live in both ChatGPT and their Codex CLI tool, I wrote up some detailed notes on how they work so far here: https://simonwillison.net/2025/Dec/12/openai-skills/
OpenAI are quietly adopting skills, now available in ChatGPT and Codex CLI
One of the things that most excited me about Anthropic’s new Skills mechanism back in October is how easy it looked for other platforms to implement. A skill is just …
simonwillison.net
December 12, 2025 at 11:45 PM
Reposted by Simon
in the parent circles, there’s horror stories of kitchen remodels getting delayed due to ICE raids, so i anticipate NIMBYs to come out as anti-ICE soon
December 11, 2025 at 12:40 AM
Reposted by Simon
CBP is proposing that travelers to the US from such countries as France, Germany, South Korea and the UK submit extensive personal info, including social media histories, email addresses used in the past 10 years and parents’ birthplaces. https://public-inspection.federalregister.gov/2025-22461.pdf
December 10, 2025 at 1:53 AM
Benebelt
December 9, 2025 at 9:58 PM
Reading ~~classical liberal~~ (neoclassical reactionary) writing about ❄️🍑, I'm realizing something: just like polished English used to be evidence of quality of content ( now subverted by LLMs), being halfway literate used to be a sign of semi-elite status... I don't think J.S. Mill et al […]
Original post on qoto.org
qoto.org
December 7, 2025 at 12:38 PM
Reposted by Simon
in the words of Gemini 3:

“It is basically a Frankenstein monster combining a CNN (Convolutional Neural Network) and a Transformer, organized like a mammalian brain”

0.5B, SYNTH

huggingface.co/mkurman/Neur...
December 3, 2025 at 5:11 AM
Reposted by Simon
Four new models from Mistral today - all Apache 2 licensed, all vision-capable, and one of them is a 3GB model that can run in a web browser and answer questions about things it can see through the webcam! https://simonwillison.net/2025/Dec/2/introducing-mistral-3/
Introducing Mistral 3
Four new models from Mistral today: three in their "Ministral" smaller model series (14B, 8B, and 3B) and a new Mistral Large 3 MoE model with 675B parameters, 41B active. …
simonwillison.net
December 2, 2025 at 5:33 PM
Reposted by Simon
Hierarchies 😩... One of the biggest recurring time-consuming issues I sometimes encounter is making decisions about _where_ to put some (new or exisiting) code/feature, i.e. in which package, new or existing, considering: functional fit (topic), structural fit (pre-existing data format […]
Original post on mastodon.thi.ng
mastodon.thi.ng
November 30, 2025 at 4:24 PM
From RUnit to testthat with Coding Agent Support | Mirai Solutions GmbH
We present an interesting case study: we migrated the test suite of our R package 𝗫𝗟𝗖𝗼𝗻𝗻𝗲𝗰𝘁 from 𝗥𝗨𝗻𝗶𝘁 to 𝘁𝗲𝘀𝘁𝘁𝗵𝗮𝘁 using 𝗔𝗜-𝗽𝗼𝘄𝗲𝗿𝗲𝗱 𝗰𝗼𝗱𝗶𝗻𝗴 𝗮𝗴𝗲𝗻𝘁𝘀s. We used 𝗚𝗼𝗼𝗴𝗹𝗲 𝗝𝘂𝗹𝗲𝘀, an asynchronous coding agent, to handle the repetitive, multi-file refactoring work. The process wasn't just about automation, it required careful context preparation, environment setup, and iterative prompt engineering. Key 𝘁𝗮𝗸𝗲𝗮𝘄𝗮𝘆𝘀: • AI agents excel at tedious, semantics-aware tasks (like fixing 𝘦𝘹𝘱𝘦𝘤𝘵_𝘦𝘲𝘶𝘢𝘭 argument order across dozens of files) • Managed VM environments reduce risks while maintaining utility • Combining different AI tools (Jules + Aider) provided robust validation through "third-party" code review This results in a faithfully migrated test suite with equivalent coverage and behavior, achieved faster and with more confidence. Read about our process, challenges, and solutions in our news post. https://lnkd.in/d-QUGDJ4 #rstats #SoftwareDevelopment #AIEngineering #TestAutomation #OpenSource
www.linkedin.com
November 28, 2025 at 11:38 AM
Reposted by Simon
community note: using cost on the y axis makes it appear like cheaper models are more capable on pass@3
November 25, 2025 at 1:59 PM
Reposted by Simon
he’s nice even when he’s trashing someone
November 22, 2025 at 11:23 PM
Reposted by Simon
Evolutionary Algorithms for optimizing LLM weights

Gradient descent and backpropagation have a lot of problems, alignment becomes a nightmare. Evolutionary algos fix this, but they don’t scale

A recent paper, EGGROLL, makes it computationally feasible to do now

www.alphaxiv.org/abs/2511.16652
November 23, 2025 at 2:19 AM
Reposted by Simon
i'm delighted to be able to host academic content at https://grebedoc.dev!

(yes, you can push 500 MB of slides and stuff as a single site to it. yes, i will gladly host it! no, it will not cost me any remotely meaningful amount of money, push at your leisure)
Grebedoc — static site hosting for git forges
grebedoc.dev
November 17, 2025 at 5:41 AM
Reposted by Simon
the ironic part about immigrants is they’re not lazy, the lazy ones didn’t have enough agency to move to a different country

immigration is as close to a filter for high performing individuals as you’re going to get
November 16, 2025 at 6:35 PM
Reposted by Simon
Some notes on GPT-5.1, which is now available in the OpenAI API

The new reasoning options are interesting, but the pelican feels like a bit of a regression from GPT-5 https://simonwillison.net/2025/Nov/13/gpt-51/
Introducing GPT-5.1 for developers
OpenAI announced GPT-5.1 yesterday, calling it a smarter, more conversational ChatGPT. Today they've added it to their API. We actually got four new models today: gpt-5.1 gpt-5.1-chat-latest gpt-5.1-codex gpt-5.1-codex-mini There …
simonwillison.net
November 14, 2025 at 12:10 AM
Reposted by Simon
I find AI does accelerate solving complex problems, so you can get back to your to-do list.

Unfortunately, I love being immersed in long complex problems, and hate managing my top-level to-do list. So I am once again begging tech companies to make us an AI Project Manager.
November 12, 2025 at 2:26 PM
Reposted by Simon
overheard: “they’re on twitter, instagram, X.. i don’t even know what X is, what is X?”
November 5, 2025 at 2:11 PM
Spooky jog before dawn
November 6, 2025 at 3:41 PM
Reposted by Simon
And it's not just Cursor... rival agentic coding IDE Windsurf announced their own custom RL-trained fast coding model today as well!

Here are notes and a pelican on Windsurf's new SWE-1.5 model https://simonwillison.net/2025/Oct/29/swe-15/
Introducing SWE-1.5: Our Fast Agent Model
Here's the second fast coding model released by a coding agent IDE in the same day - the first was Composer-1 by Cursor. This time it's Windsurf releasing SWE-1.5: Today …
simonwillison.net
October 30, 2025 at 12:06 AM
Reposted by Simon
I will be in Berlin on December 10th giving talks.

I'm looking for other places to give talks in Europe around the same time.

Please reach out if know of a spot for me to speak, happy to joint sponsor with Letta.

Preferences for London, Paris, Amsterdam, etc.
October 29, 2025 at 4:55 PM
Reposted by Simon
The other day we had our first ever chained AI tool success on the #curl factory floor:

- tool A found a possible flaw in code and reported it.

- using the plain English description from tool A, tool B could create a reproducible by itself that verified the finding

The sense of magic is […]
Original post on mastodon.social
mastodon.social
October 29, 2025 at 7:52 AM
The new "this looks shopped": suspecting a #nanobanana edit
October 29, 2025 at 6:40 AM