Harrison Pim
@harrisonpim.com
I'm working on search, machine learning, and knowledge graphs at climatepolicyradar.org | harrisonpim.com
Reposted by Harrison Pim
This is so simple, I can’t believe that it works.

tl;dr they trained a model to answer questions about another model’s thoughts

alignment.anthropic.com/2025/activat...
December 19, 2025 at 7:59 AM
Reposted by Harrison Pim
EXCL: An agreement to rejoin Erasmus – the EU’s student exchange programme – set to be announced on Wednesday as part of UK government’s drive towards closer relations with Brussels.

www.theguardian.com/world/2025/d...
UK to rejoin EU’s Erasmus student exchange programme
Exclusive: British students will be able to participate in EU-wide scheme from January 2027, sources say
www.theguardian.com
December 16, 2025 at 6:17 PM
Reposted by Harrison Pim
guy who just learned it's not polite to call an artist's work "content": hey man it's cool to meet you i'm a huge fan of your slop
December 16, 2025 at 1:46 PM
this is probably the most thoughtful, well-reasoned take i’ve read on The Big Bad Bubble so far www.oaktreecapital.com/insights/mem...
Is It a Bubble?
In his latest memo, Howard Marks explores if there is a bubble in AI, identifying uncertainty, parallels to past bubbles, AI's vast potential, and emphasizing the importance of prudence.
www.oaktreecapital.com
December 16, 2025 at 2:18 PM
Reposted by Harrison Pim
OpenAI leadership are promoting a paper in Physics Letters B where GPT-5 proposed the main idea — possibly the first peer-reviewed paper where an LLM generated the core contribution. One small problem: GPT-5's idea tests the wrong thing. My technical comment: scirate.com/arxiv/2512.0... 1/
December 9, 2025 at 5:17 PM
Reposted by Harrison Pim
JustHTML by @emilstenstrom.bsky.social is a new Python library (no dependencies) that parses HTML according to the HTML5 specification and passes the 9,200 test html5lib-tests suite

It's 3,000 lines of code mostly written by coding agents over a couple of months simonwillison.net/2025/Dec/14/...
JustHTML is a fascinating example of vibe engineering in action
I recently came across JustHTML, a new Python library for parsing HTML released by Emil Stenström. It’s a very interesting piece of software, both as a useful library and as …
simonwillison.net
December 14, 2025 at 5:13 PM
I had a very nice time at NeurIPS in San Diego this week!
December 8, 2025 at 7:52 PM
So great to see openrouter publishing reports on LLM usage patterns, akin to the annual stackoverflow developer survey openrouter.ai/state-of-ai
State of AI | OpenRouter
An empirical study analyzing over 100 trillion tokens of real-world LLM interactions across tasks, geographies, and time.
openrouter.ai
December 5, 2025 at 9:39 AM
brilliant news, obviously 🙌

though when compared to what's being done elsewhere, these numbers feel quite disappointing... other countries with similar solar potential have been adding capacity more than 5x faster than we have in the UK!
December 2, 2025 at 11:54 PM
Reposted by Harrison Pim
'Britain becomes world’s largest economy to end new oil and gas exploration'

Feels like this should have been bigger news...
www.greenpeace.org.uk/news/britain...
Britain becomes world’s largest economy to end new oil and gas exploration - Greenpeace UK
Commenting on the government’s North Sea Future Plan, in which it has confirmed that no more licences for new oil and gas will be issued, Greenpeace UK’s co-executive director, Areeba Hamid, said:  “B...
www.greenpeace.org.uk
December 2, 2025 at 10:36 PM
Reposted by Harrison Pim
Four new models from Mistral today - all Apache 2 licensed, all vision-capable, and one of them is a 3GB model that can run in a web browser and answer questions about things it can see through the webcam! simonwillison.net/2025/Dec/2/i...
Introducing Mistral 3
Four new models from Mistral today: three in their "Ministral" smaller model series (14B, 8B, and 3B) and a new Mistral Large 3 MoE model with 675B parameters, 41B active. …
simonwillison.net
December 2, 2025 at 5:32 PM
Reposted by Harrison Pim
wild. not what anyone would have predicted as the endgame for a JS runtime
Bun is joining Anthropic
Bun has been acquired by Anthropic. Anthropic is betting on Bun as the infrastructure powering Claude Code, Claude Agent SDK, and future AI coding products & tools.
bun.com
December 2, 2025 at 6:18 PM
when are we getting the funk document
December 1, 2025 at 11:40 PM
Reposted by Harrison Pim
I showed you my Soul Document pls respond
December 1, 2025 at 1:11 PM
I’ll be in San Diego next week for the @climatechangeai.bsky.social workshop at NeurIPS, sharing what we’ve learned while building a huge knowledge graph which maps the global climate policy landscape 👨‍🔬🕸️
November 28, 2025 at 11:09 AM
Reposted by Harrison Pim
for all the talk of local-first AI, given controversies over datacenters, I don't think people are willing to hear that local LLM inference is *probably* less environmentally friendly than cloud inference the majority of the time.
November 17, 2025 at 8:25 PM
The effects of the evil vector can be neutralised by calling out the existence of the evil vector
From shortcuts to sabotage: natural emergent misalignment from reward hacking
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
www.anthropic.com
November 24, 2025 at 8:21 PM
Reposted by Harrison Pim
Initial impressions (and pelicans) of Claude Opus 4.5, Anthropic's new "best model in the world for coding" released this morning. simonwillison.net/2025/Nov/24/...
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult
Anthropic released Claude Opus 4.5 this morning, which they call “best model in the world for coding, agents, and computer use”. This is their attempt to retake the crown for …
simonwillison.net
November 24, 2025 at 7:38 PM
Reposted by Harrison Pim
Journalist challenge: Use “Machine Learning” when you mean machine learning and “LLM” when you mean LLM. Ditch “AI” as a catch-all term, it’s not useful for readers and it helps companies trying to confuse the public by obscuring the roles played by different technologies. 🧪
November 22, 2025 at 4:50 PM
God I absolutely love reading about these borrrrrring infrastructure bugs they’re so real they’re so thorny they’re so interesting I LOVE IT
www.anthropic.com/engineering/...
A postmortem of three recent issues
This is a technical report on three bugs that intermittently degraded responses from Claude. Below we explain what happened, why it took time to fix, and what we're changing.
www.anthropic.com
September 26, 2025 at 10:13 PM
> Although the inference server itself can be claimed to be "deterministic", the story is different for an individual user. From the perspective of an individual user, the other concurrent users are not an "input" to the system but rather a nondeterministic property of the system.
Defeating Nondeterminism in LLM Inference
Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models. For example, you might observe that asking ChatGPT the...
thinkingmachines.ai
September 21, 2025 at 12:05 PM
Reposted by Harrison Pim
There are just under 12 hours left to vote in this year's Tiny Awards! Small, fun, beautiful and occasionally-pointless websites, and a pleasing rebuttal to anyone who thinks everything online is rubbish in 2025: tinyawards.net/vote/
Tiny Awards
This is the home of the Tiny Awards, which, since 2023, has celebrated the best of the small, poetic, creative, handmade web.
tinyawards.net
September 1, 2025 at 11:21 AM
Reposted by Harrison Pim
What sort of black magic is this
August 19, 2025 at 9:04 PM
google estimate that gemini models use 0.24 Wh and 0.26 ml of water per prompt
Measuring the environmental impact of AI inference | Google Cloud Blog
A methodology for measuring the energy, emissions, and water impact of Gemini prompts shines a light on the environmental impact of AI inference.
cloud.google.com
August 21, 2025 at 3:53 PM
I'll be talking about @climatepolicyradar.bsky.social's work on building knowledge graphs for policy research at @nestauk.bsky.social's Policy Live event on 11 September
Policy Live 2025 - Homepage
www.policylive.org
August 20, 2025 at 11:18 AM