Vivek Kumar
v1vekkumar.bsky.social
Senior Manager, Foundational Research, @GoogleDeepMind

Googler, Ex @Dolby & @Broadcom

Talks and Investments 👉🏽 http://portfolio.v1vek.com
Ever wanted to easily edit individual sounds in a complex audio scene—like removing a cough or making a doorbell louder?

New paper - Recomposer (arxiv.org/abs/2509.05256) from our Sound Understanding team introduces a powerful way to do just that.

Huge congrats to all the authors 🎉

#AudioEditing
September 11, 2025 at 7:48 PM
Reposted by Vivek Kumar
Today we are releasing the dataset of table tennis ball trajectories used to train the Google DeepMind robot that can play amateur table tennis with humans (sites.google.com/corp/view/co...). This work was accepted for presentation at #ICRA2025 and we hope to see you there!
April 30, 2025 at 4:15 PM
Reposted by Vivek Kumar
In December, I posted about our new paper on mastering board games using internal + external planning. 👇

Here's a talk, now on YouTube, about it, given by my awesome colleague John Schultz!

www.youtube.com/watch?v=JyxE...
January 17, 2025 at 5:26 PM
Reposted by Vivek Kumar
Theory: We were so busy shipping / publishing last year we forgot to publish our year in review.😜

Jokes aside, huge progress in LLMs, reasoning, and generative media. In science, so many breakthroughs: flood prediction, GraphCast, semiconductor design, quantum computing. Wow!
blog.google/technology/a...
2024: A year of extraordinary progress and advancement in AI
As we move into 2025, we’re looking back at the astonishing progress in AI in 2024.
blog.google
January 24, 2025 at 10:45 AM
Reposted by Vivek Kumar
Did you know you can use '@' or 'Gemini' to access Gemini in the desktop Chrome address bar? Super convenient!

You can also use:
Tabs to search your tabs
History to search your history
Bookmarks to find bookmarks

9to5google.com/2024/05/02/g...
Chrome's Gemini address bar shortcut rolls out
Following Tuesday's announcement, the Gemini shortcut in desktop Chrome's address bar has rolled out. In the Omnibox...
9to5google.com
January 7, 2025 at 7:09 PM
Fantastic opportunity for any student researcher passionate about generative audio - www.linkedin.com/posts/john-h...
December 11, 2024 at 5:45 PM
Congrats to the PaliGemma 2 team! 🎉 Bigger models, better results 🚀
🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.

1/7
December 5, 2024 at 8:11 PM
Reposted by Vivek Kumar
Really cool new work out of DeepMind for video game world generation using latent diffusion! Soon you'll be able to speedrun a game just by tricking a model into morphing you from one location to another.

deepmind.google/discover/blo...
Genie 2: A large-scale foundation world model
Generating unlimited diverse training environments for future general agents
deepmind.google
December 4, 2024 at 4:31 PM
Great 🦋-storm highlighting the importance and implications of AI for science. It's already helping win Nobel Prizes! 🤯

Juan's amazing essay to spark debate about the most important AI for Science opportunities, ingredients, risks and policy ideas. www.aipolicyperspectives.com/p/a-new-gold...
November 26, 2024 at 2:16 PM
Reposted by Vivek Kumar
🤖 ML/AI Mega Starter Pack

1. Open-source LLMs
go.bsky.app/FELkyDr

🧵
November 22, 2024 at 9:28 AM
Reposted by Vivek Kumar
The #SANE2024 talks are up on YouTube! Feat. Quan Wang, @gretatuckute.bsky.social, Mark Hamilton, Bhuvana Ramabhadran, Zhiyao Duan, Chris Donahue.
Binge watching playlist⬇️
youtube.com/playlist?lis...
SANE 2024 @ Google Cambridge - YouTube
SANE 2024, a one-day event gathering researchers and students in speech and audio from the Northeast of the American continent, was held on Thursday October ...
youtube.com
November 25, 2024 at 4:05 PM
Reposted by Vivek Kumar
TIL that there's a Gemini @gradio-hf.bsky.social library that lets you automatically build Python chat bots and web apps with just a few lines of code, then (optionally) deploy them as apps or in @huggingface.bsky.social Spaces.

✨🙌 Amazing work, @_akhaliq!!

🔗 github.com/AK391/gemini...
November 25, 2024 at 5:17 AM