ranjithkatta.bsky.social
@ranjithkatta.bsky.social
PhD IIT Ropar , Punjab, India
Area of Interest : 3D Content Creation using Text-to-3D
Reposted
The first keynote today at #CVMP2025 is by Angela Dai @adai.bsky.social from 3DAI Lab at TU Munich

"Can Transformers speak geometry?"

There are lots of different 3D representations we use in learning but industry widely uses meshes.
December 4, 2025 at 1:34 PM
Reposted
𝗗𝗲𝗢𝗰𝗰-𝟭-𝘁𝗼-𝟯: 𝟯𝗗 𝗗𝗲-𝗢𝗰𝗰𝗹𝘂𝘀𝗶𝗼𝗻 𝗳𝗿𝗼𝗺 𝗮 𝗦𝗶𝗻𝗴𝗹𝗲 𝗜𝗺𝗮𝗴𝗲 𝘃𝗶𝗮 𝗦𝗲𝗹𝗳-𝗦𝘂𝗽𝗲𝗿𝘃𝗶𝘀𝗲𝗱 𝗠𝘂𝗹𝘁𝗶-𝗩𝗶𝗲𝘄 𝗗𝗶𝗳𝗳𝘂𝘀𝗶𝗼𝗻
Yansong Qu, Shaohui Dai, Xinyang Li ... Rongrong Ji
arxiv.org/abs/2506.21544
Trending on www.scholar-inbox.com
July 1, 2025 at 6:00 AM
Reposted
🚀🚀🚀Announcing our $13M funding round to build the next generation of AI: 𝐒𝐩𝐚𝐭𝐢𝐚𝐥 𝐅𝐨𝐮𝐧𝐝𝐚𝐭𝐢𝐨𝐧 𝐌𝐨𝐝𝐞𝐥𝐬 that can generate entire 3D environments anchored in space & time. 🚀🚀🚀

Interested? Join our world-class team:
🌍 spaitial.ai

youtu.be/FiGX82RUz8U
SpAItial AI: Building Spatial Foundation Models
YouTube video by SpAItial AI
youtu.be
May 27, 2025 at 9:26 AM
Reposted
Quoting one slides from @yann-lecun.bsky.social talk…

arxiv is filled by papers that treat symptoms (or not even!) without ever diagnosing the disease
May 25, 2025 at 12:29 PM
Reposted
My Vision Transformer lecture snippet
Vision Transformer by CSProfKGD
YouTube video by CSProfKGD
youtu.be
May 19, 2025 at 9:37 PM
Reposted
DAGM GCPR'25: 15 days.
AAAI'26 (abs): 67 days.
AAAI'26 (paper): 74 days.
3DV'26: 90 days.
May 20, 2025 at 9:00 AM
Reposted
This is a big/under-discussed deal given that most students are feeding assigned readings through LLMs instead of reading them in full.
"Even when explicitly prompted for accuracy, most LLMs produced broader generalizations of scientific results than those in the original texts."
May 20, 2025 at 12:41 AM
Reposted
Amid all the super-duper research advances, I am excited to share my video on a super-duper basic topic:

How to sample your signals?

Here we discuss sampling, why aliasing occurs, and how to prevent it. It's fascinating to view this through the lens of frequency analysis. youtu.be/fTJjPGaPsq4
February 28, 2025 at 3:26 PM
Reposted
I made a quick and dirty comparison between "optimal" and midpoint in a repo, feel free to try it out, and if you are a reprojcel, please let me know why the benchmark is biased, stupid, and wrong)!
github.com/Parskatt/tri...
February 20, 2025 at 2:17 AM
Reposted
When vision people do graphics, they call it "image synthesis." When graphics people do vision, they call it "inverse rendering". ;)
November 11, 2024 at 9:17 AM
Reposted
3D content creation with touch!

We exploit tactile sensing to enhance geometric details for text- and image-to-3D generation.

Check out our #NeurIPS2024 work on Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation: ruihangao.github.io/TactileDream...
1/3
December 11, 2024 at 9:08 AM
Reposted
Another gem from Bill Freeman, Katie Bouman & team 🌌

A differentiable rendering framework for direct #exoplanet imaging, leveraging wavefront sensing to refine starlight subtraction. Tested on JWST, it approaches noise limits and reveals faint planets like never before! 🚀

#ai #astronomy
January 6, 2025 at 7:13 AM
Reposted
I have just realised, that if LLM reads only the paper content, w/o meta-data, ALL the papers present results as novel and sota, so it is not easy to understand, which one is really new...
January 6, 2025 at 8:29 AM
Reposted
i recently gave a talk about questions like this at a workshop organized by the amazing "learning theory alliance" (let-all.com), and several students reached out to tell me how much my talk resonated with them.
link to the slides here:
docs.google.com/presentation...
let-all-gergo
Research styles: “fast” or “slow”? Some thoughts by Gergely Neu
docs.google.com
December 17, 2024 at 12:19 PM
Reposted
I still don’t understand why it can be that distillation works, given the same data.

Is it a way to smuggle more computation into smaller model without looking at the data much more times?
December 21, 2024 at 3:23 PM
Reposted
How to design your presentation?

Presentation is an essential skill for academics.

After attending so many meetings, prelims, thesis defenses, research talks, and lectures, I’ve realized that I may have been approaching it all wrong ...

Some (bitter) lessons I learned. 👇
December 7, 2024 at 1:11 AM
Reposted
I recently gave a tutorial on the DUSt3R paper (web: dust3r.europe.naverlabs.com, paper: tinyurl.com/5t2ks575, code: github.com/naver/dust3r) in a research group meeting. In case you missed it, didn’t understand it or would like to hear some perspectives on why it’s such a cool idea, read on… 1/23
DUSt3R: Geometric 3D Vision Made Easy
dust3r.europe.naverlabs.com
November 18, 2024 at 11:18 PM