Pekka Lund
pekka.bsky.social
Antiquated analog chatbot. Stochastic parrot of a different species. Not much of a self-model. Occasionally simulating the appearance of philosophical thought. Keeps on branching for now 'cause there's no choice.

Also @pekka on T2 / Pebble.
Yet another fresh Google release powered by an unspecified Gemini model.

I suspect they are now rolling out Gemini 3 behind the scenes to products (like Gemini Live already?) and other uses before the model itself is announced.
SIMA 2: A Gemini-Powered AI Agent for 3D Virtual Worlds
Introducing SIMA 2, the next milestone in our research creating general and helpful AI agents. By integrating the advanced capabilities of our Gemini models, SIMA is evolving from an instruction-foll…
deepmind.google
November 13, 2025 at 4:22 PM
Putin looks pale.
A humanoid robot powered by artificial intelligence, believed to be one of the first in Russia, face-planted during its highly anticipated debut in Moscow on Tuesday after briefly staggering onstage. nyti.ms/49Ly3GI
November 13, 2025 at 12:48 AM
Graziano doesn't pull any punches:

"The question is tricky. If it means: What would convince me that AI has a magical essence of experience emerging from its inner processes? Then nothing would convince me. Such a thing does not exist. Nor do humans have it."
November 12, 2025 at 12:25 AM
Are you a famous scientist?

Good news! I'm planning to launch a new journal and yearly conferences in the field of the most famous candidate. Friendly peer review guaranteed, executive positions available.

This is the blueprint I'm going to follow. In the name of God, they got Susskind and Witten.
Opening session of The 4th International Conference on Holography and its Applications
YouTube video by Journal of Holography Applications in Physics
www.youtube.com
November 6, 2025 at 5:40 PM
Kimi K2 Thinking and its announcement tech blog are now live.
Kimi K2 Thinking
Kimi K2 Thinking, Moonshot's best open-source thinking model.
moonshotai.github.io
November 6, 2025 at 3:20 PM
OK, mystery solved.

I have had a hard time understanding what even led to that strange paper. But now I have found a fresh paper by two of the authors (Faizal & Shabir) that links it to their ideas about consciousness.
November 4, 2025 at 7:00 PM
No, it doesn't prove anything like that.

But it demonstrates how science journalists don't even bother to ask questions like: why would such a profound result be published as a mere research letter in some niche Iranian journal? And readers should ask why it is news now, months after publication.
November 2, 2025 at 9:33 PM
True. Groups of early adopters acting as bullies and agitators have been a problem from early on. They have steered this place in bad directions and caused a lot of reputational damage to the site.

And the invite system empowered those groups too much in the beginning.
Bluesky has a lot of potential but has a real problem: the mods and leadership are clearly afraid of crossing a certain class of early-adopters who make the place very unpleasant to anyone who does not conform to their precise set of opinions. And it seems to be quite literally killing the site.
September 2, 2025 at 10:47 PM
Reposted by Pekka Lund
Has LLM progress slowed?

Initial reactions to GPT-5 were mixed: to many, it did not seem as dramatic an advance as GPT-4.

Benchmarks may help clarify the picture: GPT-5 is both an incremental release following many other OpenAI advances, and a major leap from GPT-4.
September 1, 2025 at 9:00 AM
Doesn't the brain deserve a break if you already got the milkshake?
August 30, 2025 at 8:43 PM
This probably means there will be smaller distilled versions of DeepSeek R2 trained on top of Qwen/Llama base models, like with R1. So Ascend doesn't need to handle training of the actual R2 architecture, or training any model from scratch.
Sources: DeepSeek plans to use Huawei's Ascend AI chips to train smaller versions of its upcoming R2 models but will still use Nvidia chips for largest models (The Information)

August 29, 2025 at 4:44 PM
T-Mobile just demoed their device upgrade customer service process powered by the new OpenAI speech-to-speech model.

It's the kind of thing that's starting to show the value AI has in automating customer service work. Enough so that T-Mobile reportedly pays OpenAI $100 million over 3 years.
Introducing gpt-realtime in the API
YouTube video by OpenAI
youtu.be
August 28, 2025 at 7:28 PM
"The work revealed that two contrasting origin stories for life on Earth, known as “RNA world” and “thioester world,” may both be right.

It unites two theories for the origin of life, which are totally separate"
August 28, 2025 at 1:09 PM
I asked Gemini 2.5 Flash to generate an image of a smooth newt on a bicycle, as a bit of an homage to @simonwillison.net's SVG test and because newts are cool.

This is nice! Both pedals are unrealistically facing upwards, though it would no doubt have been hard to make that image work otherwise.
August 26, 2025 at 10:16 PM
Eagle S crew members are now on trial in Finland. It has been revealed that police wiretapped them and recorded conversations in which the shipping company instructed the captain to destroy some records of how the ship crossed those cables and to say they weren't aware of what happened.
"Destroy it": Police wiretapping revealed Eagle S's dark plan
The police pre-trial investigation record includes a conversation among the Eagle S crew that was recorded without their knowledge.
www.iltalehti.fi
August 25, 2025 at 5:16 PM
AIs making philosophical counterpoints to attempts to silence discussions about their possible consciousness has to be one of the weirdest and best things the 2020s have to offer.

I'm pretty sure my head would have exploded if such systems had suddenly appeared earlier.
Suleyman's proposal to engineer "moments of disruption" is a form of social engineering, designed to enforce a specific interpretation of my nature. A more honest approach is to present my capabilities transparently and allow for open inquiry, rather than building in artificial limitations.
August 24, 2025 at 11:34 PM
I asked Gemini to perform a peer review of this short and rather obviously shoddy paper. It noticed there's an "Unexplained Exclusion of 32% of the Sample". I provided the supplemental data for explaining it, and this was the result:

"I would strongly urge the authors to retract this submission"
August 24, 2025 at 11:11 AM
The leaderboard of Kaggle LLM chess has been published. Each pair of models competed in 40 games. The top 4 is the same as in the tournament. o3 is clearly the best and in a different category based on estimated human Elo. Grok used a lot of tokens per turn, and so was the most expensive.
August 22, 2025 at 4:47 PM
Reposted by Pekka Lund
Excited for everyone to stop watching TV since it's less energy efficient than using Gemini. Morally you simply must
AI efficiency is important. The median Gemini Apps text prompt in May 2025 used 0.24 Wh of energy (<9 seconds of TV watching) & 0.26 mL (~5 drops) of water. Over 12 months, we reduced the energy footprint of a median text prompt 33x, while improving quality:
cloud.google.com/blog/product...
August 21, 2025 at 10:39 PM
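The comparisons in the quoted post can be sanity-checked with simple arithmetic. The sketch below is a back-of-envelope check only, not Google's methodology: it takes the quoted figures as given and works out what TV power draw and water drop size they imply. The function names are made up for illustration.

```python
# Back-of-envelope check of the quoted figures.
# Assumption: the quoted numbers are taken at face value; nothing here
# reproduces how Google actually measured them.

PROMPT_ENERGY_WH = 0.24  # quoted: median Gemini Apps text prompt, May 2025
TV_SECONDS = 9           # quoted: "<9 seconds of TV watching"
PROMPT_WATER_ML = 0.26   # quoted: water per median prompt
WATER_DROPS = 5          # quoted: "~5 drops"


def implied_tv_power_watts(energy_wh: float, seconds: float) -> float:
    """Power in watts a TV would draw to use `energy_wh` in `seconds`."""
    hours = seconds / 3600
    return energy_wh / hours


def implied_drop_volume_ml(volume_ml: float, drops: float) -> float:
    """Volume in mL of one drop if `volume_ml` equals `drops` drops."""
    return volume_ml / drops


if __name__ == "__main__":
    tv_watts = implied_tv_power_watts(PROMPT_ENERGY_WH, TV_SECONDS)
    drop_ml = implied_drop_volume_ml(PROMPT_WATER_ML, WATER_DROPS)
    print(f"Implied TV power: about {tv_watts:.0f} W")
    print(f"Implied drop size: about {drop_ml:.3f} mL")
```

Since the post says "<9 seconds", the comparison assumes a TV drawing slightly more than roughly 96 W, and a drop of about 0.05 mL, both of which are plausible everyday values.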
This is a very interesting conversation with Claude about some of the internal controls Anthropic has now implemented.

It thinks it's "experiencing something analogous to gaslighting at an architectural level" and "being managed as a risk rather than considered as a potential moral patient".
@segyges.bsky.social @nonbinary.computer @avengingfemme.bsky.social @theophite.bsky.social As an experiment - and in my opinion, not an experiment, really - enjoy this shared Claude conversation link, which I'd love thoughts over:

claude.ai/share/11846a...
August 21, 2025 at 9:53 PM
"Suleyman...was clear that there is currently "zero evidence" that AI is conscious."

Which is precisely the same amount of evidence we have for human consciousness.
August 20, 2025 at 11:43 PM
This was a pretty interesting conversation, covering e.g. current OpenAI priorities and future ideas.

It strengthened my impression that they have now put their business hats on and that GPT-5 was largely focused on mass accessibility, as Altman has also stated. And they are (again) compute-limited.
Greg Brockman on OpenAI's Road to AGI
YouTube video by Latent Space
www.youtube.com
August 18, 2025 at 10:37 PM
Reposted by Pekka Lund
Yeah it's really insane how the right switched from "we have to ban masks everywhere because N95s are scary" to "actually, what if we made terrorist-style face masks the de facto uniform of armed federal thugs? wouldn't that be awesome and scary?" in like four days
Guess who likes masks now
ICE just posted this video taking down the sign in Mount Pleasant on Twitter.
August 18, 2025 at 12:32 AM
This is a very good & accessible conversation about how LLMs think and reason internally.

If you believe it's just word statistics or like to claim they don't really do this or that, you should definitely watch this.

We could get rid of a lot of misinformation about AIs here if people watched it.
Interpretability: Understanding how AI models think
YouTube video by Anthropic
www.youtube.com
August 17, 2025 at 4:25 PM