Jon Barron
@jonbarron.bsky.social
Principal research scientist at Google DeepMind. Synthesized views are my own.
📍SF Bay Area 🔗 http://jonbarron.info
This feed is a mostly-incomplete mirror of https://x.com/jon_barron, I recommend you just follow me there.
Pinned
Jon Barron
@jonbarron.bsky.social
· Apr 28
Radiance Fields and the Future of Generative Media
YouTube video by Jon Barron
www.youtube.com
Here's a recording of my 3DV keynote from a couple weeks ago. If you're already familiar with my research, I recommend skipping to ~22 minutes in, where I get to the fun stuff (whether or not 3D has been bitter-lesson'ed by video generation models).
www.youtube.com/watch?v=hFlF...
Reposted by Jon Barron
Is basic image understanding solved in today’s SOTA VLMs? Not quite.
We present VisualOverload, a VQA benchmark testing simple vision skills (like counting & OCR) in dense scenes. Even the best model (o3) only scores 19.8% on our hardest split.
September 8, 2025 at 3:28 PM
Reposted by Jon Barron
Here’s what I’ve been working on for the past year. This is SkyTour, a 3D exterior tour built on Gaussian Splatting. The UX is in the modeling of the “flight path.” I led the prototyping team that built the first POC. I was the sole designer and researcher on the project, and one of the first inventors.
July 16, 2025 at 3:43 AM
Reposted by Jon Barron
🚀🚀🚀Announcing our $13M funding round to build the next generation of AI: 𝐒𝐩𝐚𝐭𝐢𝐚𝐥 𝐅𝐨𝐮𝐧𝐝𝐚𝐭𝐢𝐨𝐧 𝐌𝐨𝐝𝐞𝐥𝐬 that can generate entire 3D environments anchored in space & time. 🚀🚀🚀
Interested? Join our world-class team:
🌍 spaitial.ai
youtu.be/FiGX82RUz8U
SpAItial AI: Building Spatial Foundation Models
YouTube video by SpAItial AI
youtu.be
May 27, 2025 at 9:26 AM
Reposted by Jon Barron
📺 Now available: Watch the recording of Aaron Hertzmann's talk, "Can Computers Create Art?" www.youtube.com/watch?v=40CB...
@uoftartsci.bsky.social
April 30, 2025 at 8:23 PM
A thread of thoughts on radiance fields, from my keynote at 3DV:
Radiance fields have had 3 distinct generations. First was NeRF: just posenc and a tiny MLP. This was slow to train but worked really well, and it was unusually compressed --- the NeRF was smaller than the images.
April 8, 2025 at 5:25 PM
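For context, here's a minimal numpy sketch of that gen-1 recipe's first ingredient, the classic positional encoding that feeds the tiny MLP (the MLP itself and the volume rendering are omitted; this is just the standard formulation, not code from the talk):

```python
import numpy as np

def posenc(x, num_freqs=10):
    # Classic NeRF positional encoding: each coordinate is mapped to
    # sin/cos at geometrically spaced frequencies before hitting the MLP.
    freqs = (2.0 ** np.arange(num_freqs)) * np.pi   # 1, 2, 4, ... octaves
    angles = x[..., None] * freqs                   # (..., 3, num_freqs)
    enc = np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)
    return enc.reshape(*x.shape[:-1], -1)           # (..., 3 * 2 * num_freqs)

xyz = np.array([[0.1, -0.4, 0.7]])   # one 3D sample point along a ray
print(posenc(xyz).shape)             # (1, 60): the tiny MLP's input
```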
Here's Bolt3D: fast feed-forward 3D generation from one or many input images. Diffusion means that generated scenes contain lots of interesting structure in unobserved regions. ~6 seconds to generate, renders in real time.
Project page: szymanowiczs.github.io/bolt3d
Arxiv: arxiv.org/abs/2503.14445
March 19, 2025 at 6:37 PM
I made this handy cheat sheet for the jargon that 6DOF math maps to for cameras and vehicles. Worth learning if you, like me, are worried about embarrassing yourself in front of a cinematographer or naval admiral.
March 19, 2025 at 4:50 PM
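The cheat sheet image itself isn't mirrored here; below is a hypothetical reconstruction of the standard jargon, with the caveat that axis conventions vary by field:

```python
# Hypothetical reconstruction of the cheat sheet (the original image is
# not mirrored here). Axis labels assume x = lateral, y = vertical,
# z = forward; conventions vary.
SIX_DOF_JARGON = {
    # motion:          (cinematography, vehicle / nautical)
    "translate x":     ("truck",        "sway"),
    "translate y":     ("pedestal",     "heave"),
    "translate z":     ("dolly",        "surge"),
    "rotate about x":  ("tilt",         "pitch"),
    "rotate about y":  ("pan",          "yaw"),
    "rotate about z":  ("roll",         "roll"),
}
```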
Can someone point me to a video of renderings of some 3DGS-like algorithm *as it is being optimized*? I want to see all those little Gaussians wobbling around.
February 24, 2025 at 11:07 PM
Next week is the one year anniversary of this paper showing that videos generated from Sora are nearly 3D-consistent. I'm surprised we never saw any follow-up papers in this line evaluating other video models this way; it would be helpful to track these metrics over time. arxiv.org/abs/2402.17403
February 23, 2025 at 6:37 PM
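The paper runs full structure-from-motion on generated videos; as a rough illustration of the idea (and much cruder than the paper's metric), here's a two-frame consistency proxy using OpenCV's epipolar-geometry fitting:

```python
import cv2
import numpy as np

def epipolar_consistency(frame_a, frame_b):
    # Crude two-frame proxy (not the paper's metric, which runs full SfM):
    # match ORB features, fit a fundamental matrix with RANSAC, and report
    # the inlier rate. A geometrically consistent pair should score near 1.
    gray_a = cv2.cvtColor(frame_a, cv2.COLOR_BGR2GRAY)
    gray_b = cv2.cvtColor(frame_b, cv2.COLOR_BGR2GRAY)
    orb = cv2.ORB_create(2000)
    kpts_a, desc_a = orb.detectAndCompute(gray_a, None)
    kpts_b, desc_b = orb.detectAndCompute(gray_b, None)
    matches = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True).match(desc_a, desc_b)
    pts_a = np.float32([kpts_a[m.queryIdx].pt for m in matches])
    pts_b = np.float32([kpts_b[m.trainIdx].pt for m in matches])
    _, inlier_mask = cv2.findFundamentalMat(pts_a, pts_b, cv2.FM_RANSAC, 1.0, 0.999)
    return float(inlier_mask.mean())
```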
Veo 2 now has a public price point: $0.50 per second. Very important number to keep in mind when considering the future of generative and non-generative media. Taken from cloud.google.com/vertex-ai/ge...
February 22, 2025 at 6:52 PM
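To make the number concrete (my arithmetic, not from the post):

```python
price_per_second = 0.50              # Veo 2's public price, per the post
per_minute = 60 * price_per_second   # $30 per generated minute
ninety_min_film = 90 * per_minute    # $2,700 for a feature's worth of output
# ...and real productions generate many takes for every shot they keep.
```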
Reposted by Jon Barron
I just pushed a new paper to arXiv. I realized that a lot of my previous work on robust losses and nerf-y things was dancing around something simpler: a slight tweak to the classic Box-Cox power transform that makes it much more useful and stable. It's this f(x, λ) here:
February 18, 2025 at 6:43 PM
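The figure with the formula isn't mirrored here. For reference only, the classic Box-Cox transform that the paper tweaks is below; see the arXiv paper for the actual f(x, λ):

```python
import numpy as np

def box_cox(x, lam):
    # The *classic* Box-Cox power transform (defined for x > 0). The
    # paper's f(x, lam) is a tweaked variant; this is NOT that tweak.
    return np.log(x) if lam == 0 else (x ** lam - 1.0) / lam
```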
My pitch for an "LLM-native" alternative to citation count/h-index etc:
1) Train an LLM on a new paper and record the average loss during training.
2) Evaluate the retrained LLM on benchmarks.
Your Google Scholar records avg_train_loss * benchmark_delta, and you go write your next paper.
February 12, 2025 at 6:55 PM
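A sketch of that pitch in Python, where `finetune` and `evaluate` are hypothetical stand-ins the caller would supply, not real APIs:

```python
def llm_native_score(base_model, paper, benchmarks, finetune, evaluate):
    # `finetune` and `evaluate` are hypothetical stand-ins, not real APIs.
    model, avg_train_loss = finetune(base_model, paper)                     # step 1
    delta = evaluate(model, benchmarks) - evaluate(base_model, benchmarks)  # step 2
    # High train loss = the paper surprised the model (novel); a positive
    # delta = it also made the model better. Scholar records the product.
    return avg_train_loss * delta
```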
Reposted by Jon Barron
Great interview with @jascha.sohldickstein.com about diffusion models! This is the first in a series: similar interviews with Yang Song and yours truly will follow soon.
(One of these is not like the others -- both of them basically invented the field, and I occasionally write a blog post 🥲)
History of Diffusion - Jascha Sohl-Dickstein
YouTube video by Bain Capital Ventures
www.youtube.com
February 10, 2025 at 10:28 PM
Reposted by Jon Barron
With the CVPR 2025 rebuttal deadline over, it’s the perfect time to submit a demo application for CVPR 2025. Demos can be any application of computer vision in the real world and are a great way to show off your work to other #computervision enthusiasts. Submit here docs.google.com/forms/d/e/1F...
CVPR 2025 Demo Submission
Submission page for the Demo Track at CVPR 2025. Read the call for demos on the CVPR Website.
docs.google.com
February 9, 2025 at 2:54 AM
Fun test for image and video generation systems: add "the camera is upside down" to the prompt (especially for shots of people) and then vertically mirror the output. Even the best models struggle, with upside-down teeth and blinks, and gravity tugging everything up. Here's Veo 2.
February 7, 2025 at 4:56 PM
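Here's the recipe as a sketch, where `generate_video` is a hypothetical stand-in for whatever model API you're probing:

```python
import numpy as np

def upside_down_test(generate_video, prompt):
    # `generate_video` is a hypothetical stand-in; it should return a
    # list of (H, W, C) frames for the prompt.
    frames = generate_video(prompt + " The camera is upside down.")
    # Vertically mirror the result back so the artifacts are easy to spot.
    return [np.flipud(frame) for frame in frames]
```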
I fed the "spinning dancer" illusion (a silhouette of a spinning figure that can be seen as rotating clockwise or counter-clockwise, left) into Runway Gen-3 (right). It resolved the ambiguity by having the dancer face the camera and oscillate, which is kinda clever.
January 25, 2025 at 10:55 PM
`A 1960s NASA scientist with a white button down shirt and black heavy rimmed glasses, with a giant thick alien umbilical cord coming out of the back of his body. The cord is holding him up in space, and he is levitating around his office. Wide angle, full body shot.` #Veo2
January 10, 2025 at 6:52 PM
"A fun children's educational program where Mr. See-thru teaches kids about how the gastrointestinal system works using his semitransparent abdomen." #Veo2
I'm surprised by how well Veo 2 understands human anatomy, and amused by the things that it doesn't yet understand.
January 10, 2025 at 6:21 PM
"A fun children's educational program where Mr. See-thru teaches kids about how the gastrointestinal system works using his semitransparent abdomen." #Veo2
I'm surprised by how well Veo 2 understands human anatomy, and amused by the things that it doesn't yet understand.
I'm surprised by how well Veo 2 understands human anatomy, and amused by the things that it doesn't yet understand.
Reposted by Jon Barron
I have been having fun turning the Mario Brothers into a 1940s industrial film using Veo 2.
January 3, 2025 at 8:58 PM
In case it's helpful to others writing papers or doing comparisons, I'm hosting raw mp4s for the #Veo2 results I've been posting, with no Bluesky-induced compression: drive.google.com/drive/folder....
Here's `A kraken emerging from the ocean at the beach on coney island`, which I forgot to post.
January 3, 2025 at 6:40 PM
Reposted by Jon Barron
Going to put "Towards" in my title, you can't stop me.
January 2, 2025 at 12:43 AM