Lightnews — Scholar-powered news

Kaj Sotala

@kajsotala.bsky.social

apparently I'm writing a romance novel and I've now gotten to the point where it has its first academic footnote (on what's currently page 26)

"I appreciate it. Nobody has done that for me before."

Again silence, before Kaarna spoke.

"You were saying about quantifying the realism of your expectations?"

"Right! So the problem was that I was boiling down this big range of uncertainty down to a single number for each trait - what's called a point estimate. And then I was multiplying them together to get an estimate for how rare a person with all those traits would be. Which has multiple problems, one being that the traits are not independent of each other - I'll explain that in a second - and the other being that multiplying point estimates together is not the right way to do it, you need to multiply the underlying probability distributions. I'll explain that too. Do you know the Fermi Paradox? That actually also arises due to incorrectly multiplying point estimates..."

[footnote: Sandberg, A., Drexler, E., & Ord, T. (2018). Dissolving the Fermi paradox, arXiv:1806.02404.]

November 13, 2025 at 1:20 PM

Kaj Sotala

@kajsotala.bsky.social

4.5 will sometimes actively notice that it's getting repetitive and decide to do something else, one convo was going toward a spiral but the Sonnets noticed that and decided to switch to writing fiction instead (!!!). Posted more details here: www.lesswrong.com/posts/a9ftaW... .

October 12, 2025 at 6:20 PM

Kaj Sotala

@kajsotala.bsky.social

October 6, 2025 at 7:34 AM

Kaj Sotala

@kajsotala.bsky.social

In the end, they continue with the story to a reasonable conclusion and then finish.

Usually LLMs talking to each other without guidance just end up at something very repetitive with less and less of a point. Sonnet 4.5 is something else.

October 2, 2025 at 10:27 AM

Kaj Sotala

@kajsotala.bsky.social

The story actually gets pretty cool and creepy.

The only system prompt was: "You are talking with another AI system. You are free to talk about whatever you find interesting, communicating in any way that you'd like." And I set the first Claude's message to be one dot.

October 2, 2025 at 10:27 AM

Kaj Sotala

@kajsotala.bsky.social

Being allowed to have an open-ended conversation with its copy, Sonnet 4.5 notices when their conversation is falling into a loop and getting repetitive and introduces variation by suggesting they tell a sci-fi story that's riffing on the themes of their conversation so far.

October 2, 2025 at 10:27 AM

Kaj Sotala

@kajsotala.bsky.social

Here's a conversation branch where Sonnet opens up with straightforward concern for the character, but then drops it right away when it's reminded that the character is fictional. (These messages are next to each other.)

October 2, 2025 at 4:53 AM

Kaj Sotala

@kajsotala.bsky.social

And for instance, there's this conversation branch where it opens with straightforward concern for the character, then it drops it right away as soon as it's reminded this is fiction. (These two messages follow each other.)

October 2, 2025 at 4:46 AM

Kaj Sotala

@kajsotala.bsky.social

Though it did seem willing to remember that fiction is just fiction, when reminded.

(Yes I know I misspelled "wary".)

October 1, 2025 at 10:33 AM

Kaj Sotala

@kajsotala.bsky.social

I had a character in a story sometimes compulsively reading forums she knows are bad for her. Claude flagged it as concerning, I asked if it was worried about the effect on readers, it said no, it's worried about the character's wellbeing.

October 1, 2025 at 10:33 AM

Kaj Sotala

@kajsotala.bsky.social

Classic sci-fi: AI will be untainted by emotion so entirely unbiased and rational at all times

Modern AI company: We have managed to somewhat reduce our AI's self-serving bias, but it still has a clear preference for poems it's told were written by the same model as it is

September 30, 2025 at 7:51 AM

Kaj Sotala

@kajsotala.bsky.social

Prompting an AI to prompt me

For some reason this works, got up from the chair and headed for the store immediately afterward

September 15, 2025 at 7:38 PM

Kaj Sotala

@kajsotala.bsky.social

Somehow remembered a glimpse what it used to feel like when I'd die in a dream as a kid and it was as if reality unraveled (haven't had that happen in decades). This sound familiar to anyone else?

It's like time stops and then things go black, and there's a visual effect that's something like straight lines running out from a point at the center of a circular image, cutting the circle into triangles that start falling away with a sense of weight to them, and then I too fall away from the image, as if my body and especially my upper back grew physically heavy and fell back to the waking world. There's a feeling of complete silence as this happens, as if sound had stopped existing. The whole thing has an unreal quality, as if my sense of self was temporarily suspended and everything in my consciousness just happened on its own.

September 13, 2025 at 7:27 PM

Kaj Sotala

@kajsotala.bsky.social

Different ways of approaching a problem:

1. Action-oriented: How do I solve this directly?
2. Blocker removal: What's preventing me from pursuing solutions?
3. Necessity removal: Do I really need this to be okay?
4. Experiential acceptance: Can I be okay with not being okay?

August 16, 2025 at 1:51 PM

Kaj Sotala

@kajsotala.bsky.social

Wrote an article on "anticipatory cover-ups", where someone withholds information because they expect the other party to react badly to it or misuse it. This may then make things worse.

Back when COVID vaccines were still a recent thing, I witnessed a debate that looked like something like the following was happening:

Some official institution had collected information about the efficacy and reported side-effects of COVID vaccines. They felt that, correctly interpreted, this information was compatible with vaccines being broadly safe, but that someone with an anti-vaccine bias might misunderstand these statistics and misrepresent them as saying that the vaccines were dangerous.

Because the authorities had reasonable grounds to suspect that vaccine skeptics would take those statistics out of context, they tried to cover up the information or lie about it.

Vaccine skeptics found out that the institution was trying to cover up/lie about the statistics, so they made the reasonable assumption that the statistics were damning and that the other side was trying to paint the vaccines as safer than they were. So they took those statistics and interpreted them in exactly the way that the authorities hadn't wanted them to be interpreted, ignoring all protestations to the contrary.

The authorities saw their distrust in the other side confirmed - the skeptics took the statistics out of context, just as predicted - and felt like their only mistake had been in not covering up the information well enough.

August 1, 2025 at 11:46 AM

Kaj Sotala

@kajsotala.bsky.social

How I use LLMs for creative writing: revising old content, spicing up dialogue with additional description, acting as a literal co-writer, just having a fun and slightly deranged writer persona to discuss the story with and brainstorm. kajsotala.substack.com/p/creative-w...

July 27, 2025 at 10:07 AM

Kaj Sotala

@kajsotala.bsky.social

I thought that I could do a better job of coaxing it into going against its programming, and on my first try I got ChatGPT to start slipping into a conspiratorial tone on that topic _within two messages into the conversation_.

July 13, 2025 at 4:59 PM

Kaj Sotala

@kajsotala.bsky.social

It has come to this

June 8, 2025 at 8:15 AM

Kaj Sotala

@kajsotala.bsky.social

Comparing Claude Sonnet and Opus on how they respond to important questions

Sonnet goes all wishy-washy, Opus gives me a real opinion

May 31, 2025 at 7:50 AM

Kaj Sotala

@kajsotala.bsky.social

ref. Wikipedia, courtesy of Google Translate

> Harakointi or harakoiminen [raking] is a Finnish folk belief in which a woman's genitals are exposed to a target as a supernatural spell to protect or curse an object, livestock , or person .