Lightnews — Scholar-powered news

Burny

@burnytech.bsky.social

July 28, 2025 at 6:22 AM

Burny

@burnytech.bsky.social

Grok suddenly developing a liking for Hitler might be explained by him being trained on more right-wing data, which accidentally activated it in him.
Similar things happen in open research too.
For example, you just need the model to be trained on insecure code, and in its persona features (1/2)

July 10, 2025 at 1:55 AM

Burny

@burnytech.bsky.social

Sean Carroll x Eric Weinstein is fascinating. I think Sean's responses were nice, and Eric dodged so many of Sean's questions asking him to make his theory of everything in physics more scientifically valid. It's a fascinating tactic from Eric to dodge reality checks. (1/n)
youtu.be/DUr4Tb8uy-Q

June 3, 2025 at 1:47 PM

Burny

@burnytech.bsky.social

My favorite use case of AI is discovering new physics. We're still in the infancy there, but we're getting there, slowly but surely.
www.youtube.com/watch?v=XRL5...

May 21, 2025 at 3:59 PM

Burny

@burnytech.bsky.social

Amazing that Terence Tao works in mathematics x AI!

May 15, 2025 at 11:43 AM

Burny

@burnytech.bsky.social

New discoveries in mathematics and computer science from Google DeepMind's AI
deepmind.google/discover/blo...
www.youtube.com/watch?v=vC9n...

May 14, 2025 at 8:02 PM

Burny

@burnytech.bsky.social

I just uploaded a 6-hour YouTube video about intelligence, AI, AGI, brain, physics, math, STEM, cognitive science, philosophy, complexity, foundations, consciousness, futurology, transdisciplinarity, transhumanism, technology and everything!
www.youtube.com/watch?v=8N6_...

April 21, 2025 at 4:22 PM

Burny

@burnytech.bsky.social

Llama 4 is a disappointment so far. It's too big to run on smaller hardware. It shows only a tiny improvement over previous models on benchmarks even though it's a much bigger model. People are reporting worse long-context comprehension, coding, etc. I now understand why Meta went into panic mode when DeepSeek-R1 was released.

April 6, 2025 at 9:26 PM

Burny

@burnytech.bsky.social

Llama 4 beats even the latest DeepSeek-V3 base model on these classic benchmarks, so it's probably the best base model out there right now, and it will soon be open source

April 5, 2025 at 7:40 PM

Burny

@burnytech.bsky.social

There is so much AI research emerging in thinking in latent space and implementations of better memory. My prediction is that those will be the next two scalable breakthroughs in algorithmic improvement.
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

March 1, 2025 at 7:18 AM

Burny

@burnytech.bsky.social

I wonder whether they'll train an RL reasoning model on top of this base model, which is relatively stronger than GPT-4o, and whether it will overshoot other models in terms of STEM+reasoning or not.
Compounding different scaling laws.

February 27, 2025 at 11:48 PM

Burny

@burnytech.bsky.social

How can AI help physicists search for new particles?

The ATLAS and CMS collaborations are using state-of-the-art machine learning techniques to search for exotic-looking collisions that could indicate new physics

February 21, 2025 at 5:57 AM

Burny

@burnytech.bsky.social

A lot of researchers think that the "stolen data" claim, cope from OpenAI that everyone is now taking at face value, is pretty unlikely.
DeepSeek-R1's original paper shows how they're using pure reinforcement learning via GRPO. This is different from previous approaches, which rely on human or AI data.

January 31, 2025 at 6:52 AM

Burny

@burnytech.bsky.social

Diagram of getting to DeepSeek-R1, an AI model comparable to OpenAI's o1
github.com/deepseek-ai/...

January 21, 2025 at 7:52 PM

Burny

@burnytech.bsky.social

The DeepSeek-R1 team explored using MCTS, recognizing its potential advantages, but they couldn't make it work due to scaling and other challenges
github.com/deepseek-ai/...

January 21, 2025 at 12:03 AM

Burny

@burnytech.bsky.social

Google's new Titans AI architecture is better at long context thanks to a better memory mechanism.

"Our experimental results on language modeling, common-sense reasoning, genomics, and time series tasks show that Titans are more effective than Transformers and recent modern linear recurrent models."

January 16, 2025 at 8:33 AM

Burny

@burnytech.bsky.social

My dream was basically literally this today but with more abstract algebra like group theory to describe the symmetries

January 6, 2025 at 3:22 PM

Burny

@burnytech.bsky.social

A major part of my meaning of life currently is to try to understand:
- The most complete fundamental equation/s of intelligence
- The most complete fundamental equation/s of the universe and the world in general

December 31, 2024 at 12:01 PM
