Owen L
diogenesdude.bsky.social
Someone got Q-learning to stay stable for 5 months, that’s crazy: arxiv.org/abs/2601.033...
Mastering the Game of Go with Self-play Experience Replay
The game of Go has long served as a benchmark for artificial intelligence, demanding sophisticated strategic reasoning and long-term planning. Previous approaches such as AlphaGo and its successors, h...
arxiv.org
January 8, 2026 at 5:01 PM