Thomas Ahle
banner
thomasahle.bsky.social
Thomas Ahle
@thomasahle.bsky.social
Head of AI @ NormalComputing. Tweets on Math, AI, Chess, Probability, ML, Algorithms and Randomness. Author of tensorcookbook.com
like this?
November 12, 2025 at 10:09 AM
I can't tell how much interest there is. But messages like this definitely encourage me to continue it!
April 24, 2025 at 6:29 AM
> If NATO hadn't been trying to expand there, there would have been no war.

There would.

> If NATO stops trying to expand into Ukraine, the war ends.

It wouldn't.

> If the US stops sending weapons and fomenting anti-Russian sentiment, the war ends.

This war is about territory not sentiment.
February 19, 2025 at 9:18 PM
You can play around with expectations of higher order Gaussians using the new
tensorcookbook.com/playground
Tensorgrad Playground
The Tensor Cookbook is a comprehensive guide to tensors, using the visual language of tensor diagrams. It closely follows the legendary 'Matrix Cookbook' while pr...
tensorcookbook.com
February 19, 2025 at 9:15 PM
Now we're just waiting for a ZkiT model
February 12, 2025 at 8:57 AM
Yes please!
December 13, 2024 at 7:43 AM
Can you refer me to the openai forum?
December 3, 2024 at 6:28 AM
For more information on history heuristics in chess, see www.chessprogramming.org/History_Heur...
History Heuristic - Chessprogramming wiki
www.chessprogramming.org
December 3, 2024 at 5:42 AM
near future.

Time will tell if they'll update the entire network, or a smaller LoRA or side network.

Even chatbots like o1 could use TTT as an alternative to in context learning.

5/5
History Heuristic - Chessprogramming wiki
www.chessprogramming.org
December 3, 2024 at 5:42 AM
while searching. If two subtrees are conceptually similar, it has to do all the work twice.

Test Time Training fixes this!
If AlphaZero updated its weights while searching, it could transfer learnings between the subtrees!

I'm sure we'll start seeing a lot of TTT architectures in the near...

4/5
December 3, 2024 at 5:42 AM
Obviously having a pretrained cnt[from][to] array wouldn't be helpful at all in chess, as moves may be good or bad entirely dependent on the position.

But because the butterfly table is reset at every search, it encodes "local information".

AlphaZero meanwhile doesn't learn anything while...

3/5
December 3, 2024 at 5:42 AM
Chess engines like Stockfish will keep a so-called butterfly board, keeping track of how often a move was chosen in the search tree. _Independently of the position_.

This is data is considered elsewhere in the search tree to decide how much time to spend considering the move.

Why do this?

2/5
December 3, 2024 at 5:42 AM