Lightnews — Scholar-powered news

Thomas Ahle

@thomasahle.bsky.social

260 followers 380 following 29 posts

Head of AI @ NormalComputing. Tweets on Math, AI, Chess, Probability, ML, Algorithms and Randomness. Author of tensorcookbook.com

Posts Replies Media Videos

Thomas Ahle

@thomasahle.bsky.social

like this?

November 12, 2025 at 10:09 AM

Thomas Ahle

@thomasahle.bsky.social

I can't tell how much interest there is. But messages like this definitely encourage me to continue it!

April 24, 2025 at 6:29 AM

Thomas Ahle

@thomasahle.bsky.social

> If NATO hadn't been trying to expand there, there would have been no war.

There would.

> If NATO stops trying to expand into Ukraine, the war ends.

It wouldn't.

> If the US stops sending weapons and fomenting anti-Russian sentiment, the war ends.

This war is about territory not sentiment.

February 19, 2025 at 9:18 PM

Thomas Ahle

@thomasahle.bsky.social

You can play around with expectations of higher order Gaussians using the new
tensorcookbook.com/playground

Tensorgrad Playground

The Tensor Cookbook is a comprehensive guide to tensors, using the visual language of tensor diagrams. It closely follows the legendary 'Matrix Cookbook' while pr...

tensorcookbook.com

February 19, 2025 at 9:15 PM

Thomas Ahle

@thomasahle.bsky.social

Now we're just waiting for a ZkiT model

February 12, 2025 at 8:57 AM

Thomas Ahle

@thomasahle.bsky.social

Yes please!

December 13, 2024 at 7:43 AM

Thomas Ahle

@thomasahle.bsky.social

Can you refer me to the openai forum?

December 3, 2024 at 6:28 AM

Thomas Ahle

@thomasahle.bsky.social

For more information on history heuristics in chess, see www.chessprogramming.org/History_Heur...

History Heuristic - Chessprogramming wiki

www.chessprogramming.org

December 3, 2024 at 5:42 AM

Thomas Ahle

@thomasahle.bsky.social

near future.

Time will tell if they'll update the entire network, or a smaller LoRA or side network.

Even chatbots like o1 could use TTT as an alternative to in context learning.

5/5

History Heuristic - Chessprogramming wiki

www.chessprogramming.org

December 3, 2024 at 5:42 AM

Thomas Ahle

@thomasahle.bsky.social

while searching. If two subtrees are conceptually similar, it has to do all the work twice.

Test Time Training fixes this!
If AlphaZero updated its weights while searching, it could transfer learnings between the subtrees!

I'm sure we'll start seeing a lot of TTT architectures in the near...

4/5

December 3, 2024 at 5:42 AM

Thomas Ahle

@thomasahle.bsky.social

Obviously having a pretrained cnt[from][to] array wouldn't be helpful at all in chess, as moves may be good or bad entirely dependent on the position.

But because the butterfly table is reset at every search, it encodes "local information".

AlphaZero meanwhile doesn't learn anything while...

3/5

December 3, 2024 at 5:42 AM

Thomas Ahle

@thomasahle.bsky.social

Chess engines like Stockfish will keep a so-called butterfly board, keeping track of how often a move was chosen in the search tree. _Independently of the position_.

This is data is considered elsewhere in the search tree to decide how much time to spend considering the move.

Why do this?

2/5

December 3, 2024 at 5:42 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news