Lightnews — Scholar-powered news

Abhishek Divekar

@adivekar.bsky.social

59 followers 150 following 24 posts

ML Science Lead @Amazon; prev @UT Austin. Team Lead for India at the International AI Olympiad 2025.

Posts Replies Media Videos

Reposted by Abhishek Divekar

Nick Fleisher

@nickfleisher.bsky.social

It brings me no pleasure to report that completing a minor task you've been avoiding (1) is not very hard and (2) makes you feel better afterwards

September 16, 2025 at 4:42 PM

Abhishek Divekar

@adivekar.bsky.social

Logo drop! 🇮🇳 This is what Team India will wear for its historic first appearance at the International AI Olympiad!

The theme: 8 feathers for our 8 incredible Olympians. Let's cheer them on!

#IOAI2025 #TeamIndia #AI

A blue peacock with eight feathers, three orange and five green. The peacock is stylized with circuit-like cutout patterns. Text below reads "Team India IOAI 2025"

July 30, 2025 at 3:25 PM

Reposted by Abhishek Divekar

John Gallagher

@johnrgallagher.bsky.social

I wrote a very long blog post about AI writing. I hope you'll read it.

meresophistry.substack.com/p/the-mental...

The mental tyranny of AI writing

An arduously long blog post

meresophistry.substack.com

March 29, 2025 at 7:10 PM

Reposted by Abhishek Divekar

Andreas Kirsch

@blackhc.bsky.social

I want to share my latest (very short) blog post: "Active Learning vs. Data Filtering: Selection vs. Rejection."

What is the fundamental difference between active learning and data filtering?

Well, obviously, the difference is that:

1/11

May 17, 2025 at 11:47 AM

Reposted by Abhishek Divekar

Nathan Lambert

@natolambert.bsky.social

rlhfbook also available on arxiv for SEO 😀 happy friday
arxiv.org/abs/2504.12501

Reinforcement Learning from Human Feedback

Reinforcement learning from human feedback (RLHF) has become an important technical and storytelling tool to deploy the latest machine learning systems. In this book, we hope to give a gentle…

arxiv.org

April 18, 2025 at 4:07 PM

Reposted by Abhishek Divekar

Sung Kim

@sungkim.bsky.social

DeepSeek-R1 Thoughtology: Let’s <think> about LLM reasoning

142-page report diving into the reasoning chains of R1. It spans 9 unique axes: safety, world modeling, faithfulness, long context, etc.

April 13, 2025 at 3:04 AM

Reposted by Abhishek Divekar

Tuhin Chakrabarty

@tuhinchakr.bsky.social

Very happy to see "Can AI writing be salvaged? Mitigating Idiosyncrasies and Improving Human-AI Alignment in the Writing Process through Edits" get a Best Paper Honorable Mention and is in the Top 5% of submissions for #CHI2025! 🎉 @chi.acm.org

Check it out here: arxiv.org/pdf/2409.14509

March 29, 2025 at 3:58 PM

Reposted by Abhishek Divekar

Samuel Vaiter

@samuelvaiter.com

Graham's Scan (1972) is an O(n log n) algorithm for finding the convex hull of a set of 2D points. It sorts points by polar angle, then builds the hull by pushing points onto a stack, popping them when a clockwise turn is detected. en.wikipedia.org/wiki/Graham_...

March 27, 2025 at 6:00 AM

Reposted by Abhishek Divekar

Daniel van Strien

@danielvanstrien.bsky.social

AM-DeepSeek-R1-Distilled-1.4M: Massive reasoning dataset for LLM training

- 1.4M high-quality reasoning problems with verified solutions
- 900K entries distilled from DeepSeek-R1-671B
- Covers math, code, and complex reasoning tasks
- Bilingual (Chinese/English)

huggingface.co/datasets/a-m...

a-m-team/AM-DeepSeek-R1-Distilled-1.4M · Datasets at Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

March 26, 2025 at 10:00 AM

Reposted by Abhishek Divekar

Sung Kim

@sungkim.bsky.social

A reinforcement learning system to beat Pokémon Red. The system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques.

drubinstein.github.io/pokerl/

March 5, 2025 at 8:03 PM

Reposted by Abhishek Divekar

Sung Kim

@sungkim.bsky.social

InternLM v3

- Performance surpasses models like Llama3.1-8B and Qwen2.5-7B
- Capable of deep reasoning with system prompts
- Trained only on 4T high-quality tokens

huggingface.co/collections/...

January 15, 2025 at 8:24 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news