Burny
burnytech.bsky.social
On the quest to understand the fundamental equations of intelligence and of the universe with curiosity. http://burnyverse.com Upskilling @StanfordOnline
July 28, 2025 at 6:22 AM
Grok suddenly developing a liking for Hitler might be explained by him being trained on more right-wing data, which accidentally activated it in him.
Similar things happen in open research too.
For example, you just need the model to be trained on insecure code, and in its persona features (1/2)
July 10, 2025 at 1:55 AM
Sean Carroll x Eric Weinstein is fascinating. I think Sean's responses were nice, and Eric dodged so many of Sean's questions asking him to make his theory of everything in physics more scientifically valid. It's a fascinating tactic from Eric to dodge reality checks. (1/n)
youtu.be/DUr4Tb8uy-Q
June 3, 2025 at 1:47 PM
My favorite use case for AI is discovering new physics. It's still in its infancy, but we're getting there, slowly but surely.
www.youtube.com/watch?v=XRL5...
May 21, 2025 at 3:59 PM
May 20, 2025 at 6:35 AM
May 20, 2025 at 6:34 AM
May 20, 2025 at 6:33 AM
Amazing that Terence Tao works in mathematics x AI!
May 15, 2025 at 11:43 AM
New discoveries in mathematics and computer science from Google DeepMind's AI
deepmind.google/discover/blo...
www.youtube.com/watch?v=vC9n...
May 14, 2025 at 8:02 PM
I just uploaded a 6-hour YouTube video about intelligence, AI, AGI, brain, physics, math, STEM, cognitive science, philosophy, complexity, foundations, consciousness, futurology, transdisciplinarity, transhumanism, technology and everything!
www.youtube.com/watch?v=8N6_...
April 21, 2025 at 4:22 PM
Llama 4 is a disappointment so far. Too big to run on smaller hardware. Only a tiny improvement over previous models on benchmarks, even though it's a much bigger model. People are reporting worse long-context comprehension, coding, etc. I now understand why Meta went into panic mode when DeepSeek-R1 was released.
April 6, 2025 at 9:26 PM
Llama 4 beats even the latest DeepSeek-V3 base model on these classic benchmarks, so it's probably the best base model out there right now, and it will soon be open source
April 5, 2025 at 7:40 PM
March 17, 2025 at 10:53 PM
March 17, 2025 at 10:51 PM
There is so much AI research emerging in thinking in latent space and implementations of better memory. My prediction is that those will be the next two scalable breakthroughs in algorithmic improvement.
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
March 1, 2025 at 7:18 AM
I wonder whether they'll train an RL reasoning model on top of this base model, which is relatively stronger than GPT-4o, and whether it will overshoot other models in STEM and reasoning or not.
Compounding different scaling laws.
February 27, 2025 at 11:48 PM
How can AI help physicists search for new particles?

The ATLAS and CMS collaborations are using state-of-the-art machine learning techniques to search for exotic-looking collisions that could indicate new physics
February 21, 2025 at 5:57 AM
A lot of researchers think that the "stolen data" claim from OpenAI, which everyone is now taking at face value, is cope and pretty unlikely.
DeepSeek-R1's original paper shows how they're using pure reinforcement learning via GRPO. This is different from previous approaches, which relied on human or AI-generated data.
January 31, 2025 at 6:52 AM
Diagram of getting to DeepSeek-R1, an AI model comparable to OpenAI's o1
github.com/deepseek-ai/...
January 21, 2025 at 7:52 PM
The DeepSeek-R1 team explored using MCTS (Monte Carlo Tree Search), recognizing its potential advantages, but they couldn't make it work due to scaling and other challenges
github.com/deepseek-ai/...
January 21, 2025 at 12:03 AM
Google's new Titans AI architecture is better at long context thanks to a better memory mechanism.

"Our experimental results on language modeling, common-sense reasoning, genomics, and time series tasks show that Titans are more effective than Transformers and recent modern linear recurrent models."
January 16, 2025 at 8:33 AM
My dream was basically literally this today but with more abstract algebra like group theory to describe the symmetries
January 6, 2025 at 3:22 PM
A major part of my meaning of life currently is to try to understand:
- The most complete fundamental equation/s of intelligence:
- The most complete fundamental equation/s of the universe and the world in general
December 31, 2024 at 12:01 PM