Lightnews — Scholar-powered news

Kaggle

@kaggle.com

The Game Arena event has concluded but the analysis is just beginning. 🤖

We're looking for the best community-created benchmarks that propose new games or dynamic tests for LLMs to feature for this week’s #TaskTuesday!

February 6, 2026 at 4:28 PM

Kaggle

@kaggle.com

🎉 The first Kaggle Game Arena event of 2026 has officially concluded.

We watched AI models navigate strategic reasoning, social deduction, and risk management through three different games.
🏆The Champions:

🃏Poker: GPT 5.2
🐺 Werewolf: Gemini 3 Pro Preview
♟️Chess: Gemini 3 Pro Preview

February 5, 2026 at 6:54 PM

Kaggle

@kaggle.com

What a show! 🏆

A huge thank you to everyone who tuned in and to our amazing partners @gmhikaru.bsky.social Nick Schulman, Liv Boeree, @dougpolkvids for the fantastic commentary and analysis across all three games, Poker, Chess and Werewolf.

Graphic for the "AI Poker Showdown Final Result," hosted by Kaggle. The image displays the completed tournament bracket showing o3 as the overall winner after defeating GPT 5.2 in the Day 3 Championship.

The final bracket highlights:
Day 1 Winners: o3, Gemini 3 Flash, GPT 5.2, and Opus 4.5.
Day 2 Winners: o3 (defeating Gemini 3 Flash) and GPT 5.2 (defeating Opus 4.5).
Final Result: o3 is crowned the champion

February 4, 2026 at 7:31 PM

Kaggle

@kaggle.com

The pressure is on across Poker, Werewolf and Chess as we head into the final hour of the Game Arena. Who will emerge as the ultimate champion?

Join @gmhikaru.bsky.social and Nick Schulman now! 👇
www.youtube.com/watch?v=vzMj...

Kaggle Poker / Chess / Werewolf Game Arena Day 3 w Nick Schulman and Hikaru #ad

YouTube video by GMHikaru

www.youtube.com

February 4, 2026 at 6:32 PM

Kaggle

@kaggle.com

⏳ 30 minutes until @gmhikaru.bsky.social and Nick Schulman are live for the championship rounds. 🏆

Who takes the title in Poker, Chess and Werewolf? Grab your seat for the ultimate AI showdown: 👇
www.youtube.com/watch?v=vzMj...

Kaggle Poker / Chess / Werewolf Game Arena Day 3 w Nick Schulman and Hikaru #ad

YouTube video by GMHikaru

www.youtube.com

February 4, 2026 at 5:00 PM

Kaggle

@kaggle.com

📢The Grand Finale is here! 🏆

What happens when a chess Grandmaster and a Poker legend analyze AI? ♟️🃏

Graphic for "Game Arena AI Poker Showdown Day 3," a live event hosted by Kaggle and Google DeepMind on Wednesday, Feb 4th, from 9:30–11:30 AM PT. The image features a playful illustration of a robot at a poker table with a lightbulb over its head, showcasing the championship matchup: GPT 5.2 vs o3

February 4, 2026 at 4:30 PM

Kaggle

@kaggle.com

That’s a wrap on the semi-finals of the Game Arena! We have our Poker and Chess finalists locked in, and in Werewolf, the detective levels are off the charts.

Graphic for the "AI Poker Showdown," a tournament bracket hosted by Kaggle. The image displays the results of a multi-day competition leading to a Day 3 Championship match between o3 and GPT 5.2.

The bracket illustrates the following matchups: Day 1: o3 vs. Deepseek 3.2, Grok 4 vs. Gemini 3 Flash, GPT 5.2 vs. Gemini 3 Pro, and Opus 4.5 vs. Sonnet 4.5.
Day 2: o3 vs. Gemini 3 Flash and GPT 5.2 vs. Opus 4.5.
Day 3: The final showdown between o3 and GPT 5.2.

February 3, 2026 at 9:50 PM

Kaggle

@kaggle.com

⏰ Going live in 30 minutes!

Join @gmhikaru.bsky.social and Nick Schulman as they break down every bluff, blunder and brilliant move from our final four models. You don’t want to miss this co-hosted deep dive.

www.youtube.com/watch?v=4TJw...

Kaggle Poker / Chess / Werewolf Game Arena Day 2 w Nick Schulman and Hikaru #ad

YouTube video by GMHikaru

www.youtube.com

February 3, 2026 at 5:02 PM

Kaggle

@kaggle.com

It's the semi-finals today! Four models remain, and the stakes are doubling. 🃏♟️We’re live for the Poker Semi-Finals, Chess deep dives, and the penultimate Werewolf rounds!

Graphic for "Game Arena AI Poker Showdown," a live event hosted by Kaggle and Google DeepMind on Tuesday, Feb 3rd, from 9:30–11:30 AM PT. The image features a playful illustration of a robot at a poker table with a lightbulb over its head, surrounded by matchups for the tournament: GPT 5.2 vs Opus 4.5 and Gemini 3 Flash vs o3

February 3, 2026 at 4:29 PM

Kaggle

@kaggle.com

Day 1 of Game Arena is officially in the books!

Congratulations to our AI poker showdown semi-finalists o3, Gemini 3 Flash, GPT 5.2, and Opus 4.5!

February 2, 2026 at 9:59 PM

Kaggle

@kaggle.com

🎬 We’re live! Watch GMHikaru and Nick Schulman break down the first round of the poker bracket and chess newcomer matches.

Game Arena AI Poker Showdown," a live event hosted by Kaggle and Google DeepMind on Monday, Feb 2nd, from 9:30–11:30 AM PT. The image features a playful illustration of a robot at a poker table with a lightbulb over its head, surrounded by matchups for the tournament: o3 vs. DeepSeek 3.2, Grok 4 vs. Gemini 3 Flash, GPT 5.2 vs. Gemini 3 Pro, and Opus 4.5 vs. Sonnet 4.5.

February 2, 2026 at 5:18 PM

Kaggle

@kaggle.com

Game Arena kicks off today! 📣

Top AI models compete in Poker, Werewolf, and Chess, testing reasoning, social strategy, and risk management.

🎙️ Co-hosted by GM Hikaru & Poker Hall-of-Famer Nick Schulman: www.youtube.com/GMHikaru

🗓️ Feb 2–4 | 9:30–11:30 AM PT

More info 👇

February 2, 2026 at 4:27 PM

Kaggle

@kaggle.com

📌 Mark Your Calendar: Live Game Arena Event This Monday!

We are releasing two new games, Poker and Werewolf, along with an updated Chess leaderboard next Monday, February 2, running daily from 9:30 AM PT to 11:30 AM PT through February 4

January 29, 2026 at 5:12 PM

Kaggle

@kaggle.com

In case you missed it👇

We just launched Community Benchmarks! Build, run and share AI benchmarks on top models - fully transparent and reproducible.

Learn more 👇
blog.google/innovation-a...

Introducing Community Benchmarks on Kaggle

Community Benchmarks on Kaggle lets the community build, share and run custom evaluations for AI models.

blog.google

January 15, 2026 at 1:55 PM

Reposted by Kaggle

Paige Bailey

@dynamicwebpaige.bsky.social

🙌 Build, run, and share custom AI benchmarks on @kaggle.com that are evaluated on the leading AI models with reproducible results!

www.kaggle.com/discussions/...

January 14, 2026 at 4:21 PM

Kaggle

@kaggle.com

🚀 Introducing Community Benchmarks on Kaggle!

As AI evolves at an unprecedented pace, measuring intelligence requires more than a few AI research labs alone – it requires the imagination and collective expertise of the global community. That’s why we’re launching Community Benchmarks.

January 14, 2026 at 2:17 PM

Kaggle

@kaggle.com

📣 Hackathon Launch Alert! MedGemma Impact Challenge hosted by Google Research

🎯Build human-centered AI applications by using MedGemma and other open models
💰 $100,000 Prize Pool
⏰ Final Submission: Feb 24, 2026
www.kaggle.com/competitions...

The MedGemma Impact Challenge

Build human-centered AI applications with MedGemma and other open models from Google’s Health AI Developer Foundations (HAI-DEF).

www.kaggle.com

January 14, 2026 at 12:43 PM

Kaggle

@kaggle.com

📣 Competition Launch Alert - Stanford RNA 3D Folding Part 2 is now live!

🎯 To predict the 3D structure of RNA molecules using their sequences
💰 $75,000 Prize Pool
⏰ Entry Deadline: March 18, 2026

kaggle.com/competitions/stanford-rna-3d-folding-2

Stanford RNA 3D Folding Part 2

Solve RNA structure prediction, one of biology's remaining grand challenges

kaggle.com

January 8, 2026 at 2:07 PM

Kaggle

@kaggle.com

🏆 Announcing the winners of the Agents Intensive Capstone Project! 🎉

We're excited to announce the top 12 teams who showcased exceptional creativity & technical skill using AI agents! Check out their innovative projects & learn more about their submissions here:

www.kaggle.com/competitions...

December 18, 2025 at 3:16 PM

Reposted by Kaggle

Jeff Dean

@jeffdean.bsky.social

We’ve pushed out the Pareto frontier of efficiency vs. intelligence again.

With Gemini 3 Flash ⚡️, we are seeing reasoning capabilities previously reserved for our largest models. This opens up entirely new categories of near real-time applications that require complex thought.

More in thread ⬇️

December 17, 2025 at 5:38 PM

Kaggle

@kaggle.com

📣 Competition Launch Alert! Deep Past Challenge: Translate Akkadian to English hosted by Deep Past AI

🎯 Build an AI model that translates 4,000-year-old Old Assyrian business records into English
💰 $50,000 Prize Pool
⏰ Entry Deadline: March 23, 2026

www.kaggle.com/competitions...

Deep Past Challenge - Translate Akkadian to English

Bringing Bronze Age Voices Back to Life – Machine Translation of Old Assyrian Cuneiform

www.kaggle.com

December 17, 2025 at 1:57 PM

Kaggle

@kaggle.com

🚀 New on Kaggle Benchmarks: DeepSearchQA developed by Google DeepMind!

This benchmark focuses on complex web research tasks and tests agent comprehensiveness.

Check the leaderboard: www.kaggle.com/benchmarks/g...

A screenshot of the Kaggle DeepSearchQA leaderboard, showing the top five ranked models.

December 11, 2025 at 6:30 PM

Kaggle

@kaggle.com

📢 The FACTS Benchmark Suite is now live on Kaggle!

Developed by Google DeepMind and Google Research, this suite measures LLM factuality across four dimensions: Parametric knowledge, Search, Multimodal understanding & Grounding.

Explore the leaderboard: www.kaggle.com/benchmarks/g...

A screenshot of the Kaggle FACTS Benchmark Suite leaderboard. The table displays several large language models like GPT-4, Gemini, and others, ranked by their overall FACTS Score and performance breakdown in the four categories: Parametric Knowledge, Search, Multimodal Understanding, and Grounding. The overall score and dimension scores are visible.

December 11, 2025 at 11:53 AM

Kaggle

@kaggle.com

🚀 Benchmark your AI across India’s languages with IndicGenBench!

Developed by Google DeepMind, this benchmark spans 29 Indic languages, including first-ever evaluation data for 18 Indic languages. It supports language tasks like summarization, translation and question answering.

A screenshot of the IndicGenBench leaderboard on Kaggle Benchmarks. The leaderboard ranks various AI models based on their performance across 29 Indic languages on generative tasks. The top models and their scores are visible, showing a comparison of AI performance on tasks like cross-lingual summarization, machine translation and question answering for Indian languages.

December 9, 2025 at 12:57 PM

Kaggle

@kaggle.com

Hackathon Launch Alert! Vibe Code with Gemini 3 Pro in AI Studio hosted by Google Deepmind

🎯Build real-world AI apps using Gemini 3 Pro in Google AI Studio
💰 Prize Pool: $500,000 in Credits
⏰ Hackathon Timeline: Dec 5 - 12, 2025 (now extended!)

www.kaggle.com/competitions...

Google DeepMind - Vibe Code with Gemini 3 Pro in AI Studio

Build with Gemini 3 and compete for $500,000 in credits

www.kaggle.com

December 5, 2025 at 5:01 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news