Lightnews — Scholar-powered news

Fabien Niñoles

@ninoles.bsky.social

Why mechanistic interpretability is important? By better understanding how a LM works, we can fix some of his bias, better train them, and more importantly, built more efficient solution that do the same things more deterministically.

2/2

November 19, 2025 at 6:06 PM

Fabien Niñoles

@ninoles.bsky.social

Yes, sorry. Given my engineering background, I did find the distinction between black box and grey box important. But yes, for most people it is the same.
For me, the difference means you can't do mechanistic interpretability on a black box.

1/..

November 19, 2025 at 6:06 PM

Fabien Niñoles

@ninoles.bsky.social

Darn, maybe we should avoid trying to find a single generic approach for all technical problems?

November 19, 2025 at 5:44 PM

Fabien Niñoles

@ninoles.bsky.social

It's more of a grey box: open but incomprehensible. Here again an excellent video from Welch Lab:

youtu.be/UZDiGooFs54?...

The moment we stopped understanding AI [AlexNet]

YouTube video by Welch Labs

youtu.be

November 19, 2025 at 5:41 PM

Fabien Niñoles

@ninoles.bsky.social

There is a paper about the costs and the limits to achieve those rates. Welch Lab has a good video about it:
youtu.be/5eqRuVp65eY?...

AI can't cross this line and we don't know why.

YouTube video by Welch Labs

youtu.be

November 19, 2025 at 5:36 PM

Fabien Niñoles

@ninoles.bsky.social

Yes, they seem to have improved it since I tried a few weeks ago. Idem for Gemini and Claude. Still, you never know when it will fail, and that's the main problem. And sure, humans are not absolved from errors and typo, but its why you asked for sources. LLM are doing a poor job there, yet.

November 19, 2025 at 4:43 PM

Fabien Niñoles

@ninoles.bsky.social

A few weeks ago, all popular chats were giving me such non-sense answer. They seem to have improved on that, but in general rule, if you're looking for precise information, like dates, position, or other kind of identity information, you have a non-negligeable chance they have something wrong.

November 19, 2025 at 4:40 PM

Reposted by Fabien Niñoles

Olivia Guest · Ολίβια Γκεστ

@olivia.science

4. the disregard for the corrosive power of anthropomorphism, which is taken advantage of by industry to sell & steal our data, in the base case scenario, and in the worst to abuse and push vulnerable groups to dependance and worse.

(Section 3.4 here doi.org/10.5281/zeno...)
6/n

3.4 Anthropomorphism and other circular reasoning
While opacity is a distinguishing feature of many other areas of science and technology, the myths surrounding computing may stem less from the fact that it is an opaque
esoteric subject and more from the way in which it can be seen to blur the boundary between people and machines (Turkle 1984). To be sure, most people do not understand
the workings of a television set or how to program their video cassette recorders properly, but then they do not usually believe that these machines can have intelligence. The
public myths about computing and AI are also no doubt due to the ways in which computers are often depicted in the mass media — e.g. as an abstract source of wisdom, or
as a mechanical brain.
Brian P. Bloomfield (1987, p. 72)
There is circular reasoning at play when we suggest and assume machines can think, reason, or argue
like humans can, and therefore, treat them — and test them — like humans. Within human-machine
10
Against the Uncritical Adoption of ‘AI’ Technologies in Academia
interaction research, often, AI technology output is compared to human performance, mistakenly
assuming such benchmarks are informative about AI’s capabilities. However, correlations with human output mean little to substantiate claims of human-likeness, especially when the input to the AI
models tested is the output of human cognition in the first place. There are so many cases of this from
daily life and the history of science that it appears shocking such results are taken so uncritically to
be cognition (Bernardi 2024; Guest 2025; Guest and Martin 2023; Placani 2024; van Rooij and Guest
2025). An example from the 1960s:
Weizenbaum (1966) was afraid of

September 6, 2025 at 8:34 AM

Fabien Niñoles

@ninoles.bsky.social

No need to go that far: here Mistral's answer for the age of Susan Calvin:

What age Susan Calvin, a character created by Asimov in 1941, would have today if she was real?

Susan Calvin, the fictional robopsychologist created by Isaac Asimov, first appeared in the short story "Liar!" published in 1941. In that story, she is described as a 20-year-old graduate student. If Susan Calvin were a real person, she would have been born around 1921.
If she were still alive today (November 19, 2025), Susan Calvin would be 104 years old.

November 19, 2025 at 4:29 PM

Fabien Niñoles

@ninoles.bsky.social

Most of current LLM based deployments lack a post-generation validation mechanism. A good example of such system is Alpha Geometry, which use LLM to generate constructs, and a symbolic engine to validate.

deepmind.google/blog/alphage...

AlphaGeometry: An Olympiad-level AI system for geometry

Our AI system surpasses the state-of-the-art approach for geometry problems, advancing AI reasoning in mathematics

deepmind.google

November 19, 2025 at 3:41 PM

Fabien Niñoles

@ninoles.bsky.social

Even with clean data, LLM (like any ANN) will still push some garbage out. That is basically what training is about: reduce (not eliminate!) the amount of garbage out through gradient descent on error.
The surprising thing about LLM is how often it is right. The garbage is the expected part.

November 19, 2025 at 3:25 PM

Fabien Niñoles

@ninoles.bsky.social

For me, LLM's deception is not about its lies, but the lies around it:
* anthropomorphize interface and terminology
* no understanding of how the outcome is reach
* impossible to came with a confidence factor on the results
* cost paid != resources consumed
* very undeserved hype around it

November 19, 2025 at 3:19 PM

Fabien Niñoles

@ninoles.bsky.social

I think what he means is that LLM doesn't even know how to reproduce lies or deceipts. They can generate sentences that no humans have ever said, like "There are 5 shapes in this image, which are a square, a circle and a triangle." That sentence is just wrong and it doesn't know that.

November 19, 2025 at 2:58 PM

Fabien Niñoles

@ninoles.bsky.social

Ask LLM what are the states with a r in their name and see if it makes consensus. Sounds like a trick question? Well, now, tell me, how do you distinguish those cases from the others? When can you tell that it got it wrong? LLM's hallucinations are never consensus. They are blatant inventions.

November 19, 2025 at 2:41 PM

Fabien Niñoles

@ninoles.bsky.social

Ajoutez à ça un NPD complètement décharné et qui n'a pas encore fini de lécher ses blessures et ce n'est pas pour rien que seulement le BQ s'est opposé concrètement.

C'est malheureux mais bien le résultat de l'influence polarisante de la politique américaine sur notre système politique.

2/2

November 19, 2025 at 2:04 PM

Fabien Niñoles

@ninoles.bsky.social

L'article est bizarre: l'auteure reproché au PLC son "arrogance" mais la cause sur la désorganisation de l'opposition. À mon avis, le problème est plus du côté du CPC qui est devenu sous Harper plus réactionnaire que conservateur et donc incapable de former une opposition effective.

1/..

November 19, 2025 at 2:04 PM

Fabien Niñoles

@ninoles.bsky.social

Anti-terrorist laws are always rooted into a desire for authoritarianism from the government in place (including the minority parties). It's a failure of democracy, of governance, saying that our military and judicial systems cannot succeed without reducing the rights of the bystanders.

November 19, 2025 at 1:34 PM

Fabien Niñoles

@ninoles.bsky.social

The game has 20 endings achievable in exactly 31 days, and 3 of them are considered victories. I think you can only achieve one of them by approving everything (if you answer correctly at some final questions).

November 19, 2025 at 1:06 PM

Fabien Niñoles

@ninoles.bsky.social

That's impossible. He thinks humanity exists for him and sees himself as Nietzsche's übermensch (perverted), Humanity's Ultime Finality (not what Nietzsche means). So, everything good for him must be good for humanity and the only things that are good for humanity are things that are good for him.

November 19, 2025 at 4:43 AM

Fabien Niñoles

@ninoles.bsky.social

He insulted women and others quite often, and treated them like objects even those few he likes. But for me, it was the imperative "Quite, Piggy!" which tries to establish a feudal dominance ("I'm your lord, you're a lowly serf") that really makes it more upsetting.

November 19, 2025 at 2:23 AM

Fabien Niñoles

@ninoles.bsky.social

That happens when people think those methods are just different tools to do development the "old" way.
Iterative development (not just agile) is a different approach and most agile methodologies are tools built upon them.
But you must do ID first for them to make sense.

November 19, 2025 at 2:12 AM

Fabien Niñoles

@ninoles.bsky.social

De plus, je reprends en mes mots ce qu'il dit plus éloquemment que moi, mais c'est loin d'être une citation. Je ne prétends donc pas que c'est de lui, bien que c'est assez proche de ce qu'il dit.

November 18, 2025 at 8:59 AM

Fabien Niñoles

@ninoles.bsky.social

Désolé, c'est tellement connu cette citation, je n'ai pas pensé qu'il était nécessaire de le préciser.
Un peu comme parler de la relativité sans mentionner Einstein.
Maintenant, je crois que ce serait bien que plus de monde s'approprie cette pensée.

November 18, 2025 at 8:54 AM

Fabien Niñoles

@ninoles.bsky.social

Your cat would have been the first to remind you if you had forgotten. That's just what cats expect.

November 17, 2025 at 9:38 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news