Abhishek Divekar
adivekar.bsky.social
ML Science Lead @Amazon; prev @UT Austin. Team Lead for India at the International AI Olympiad 2025.
Reposted by Abhishek Divekar
It brings me no pleasure to report that completing a minor task you've been avoiding (1) is not very hard and (2) makes you feel better afterwards
September 16, 2025 at 4:42 PM
Logo drop! 🇮🇳 This is what Team India will wear for its historic first appearance at the International AI Olympiad!

The theme: 8 feathers for our 8 incredible Olympians. Let's cheer them on!

#IOAI2025 #TeamIndia #AI
July 30, 2025 at 3:25 PM
Reposted by Abhishek Divekar
I wrote a very long blog post about AI writing. I hope you'll read it.

meresophistry.substack.com/p/the-mental...
The mental tyranny of AI writing
An arduously long blog post
March 29, 2025 at 7:10 PM
Reposted by Abhishek Divekar
I want to share my latest (very short) blog post: "Active Learning vs. Data Filtering: Selection vs. Rejection."

What is the fundamental difference between active learning and data filtering?

Well, obviously, the difference is that:

1/11
May 17, 2025 at 11:47 AM
Reposted by Abhishek Divekar
DeepSeek-R1 Thoughtology: Let’s <think> about LLM reasoning

142-page report diving into the reasoning chains of R1. It spans 9 unique axes: safety, world modeling, faithfulness, long context, etc.
April 13, 2025 at 3:04 AM
Reposted by Abhishek Divekar
Very happy to see "Can AI writing be salvaged? Mitigating Idiosyncrasies and Improving Human-AI Alignment in the Writing Process through Edits" receive a Best Paper Honorable Mention and place in the Top 5% of submissions for #CHI2025! 🎉 @chi.acm.org

Check it out here: arxiv.org/pdf/2409.14509
March 29, 2025 at 3:58 PM
Reposted by Abhishek Divekar
Graham's Scan (1972) is an O(n log n) algorithm for finding the convex hull of a set of 2D points. It sorts points by polar angle, then builds the hull by pushing points onto a stack, popping them when a clockwise turn is detected. en.wikipedia.org/wiki/Graham_...
March 27, 2025 at 6:00 AM
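The Graham's Scan post above describes the algorithm only in words, so here is a minimal Python sketch of it. It is an illustration under stated assumptions, not a reference implementation: the function names (`cross`, `graham_scan`) are hypothetical, points are assumed to be (x, y) tuples, the polar-angle sort uses a cross-product comparison instead of computing angles, and collinear points are dropped along with clockwise turns.

```python
from functools import cmp_to_key


def cross(o, a, b):
    """Cross product of vectors o->a and o->b.

    Positive: counter-clockwise turn. Negative: clockwise. Zero: collinear.
    """
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def graham_scan(points):
    """Return the convex hull in counter-clockwise order, starting at the pivot."""
    pts = sorted(set(points))  # remove duplicate points
    if len(pts) < 3:
        return pts

    # Pivot: the lowest point, breaking ties by smallest x.
    # Every other point then has a polar angle in [0, pi) around it.
    pivot = min(pts, key=lambda p: (p[1], p[0]))

    def by_polar_angle(a, b):
        turn = cross(pivot, a, b)
        if turn != 0:
            return -1 if turn > 0 else 1  # smaller angle sorts first
        # Same angle: the point nearer to the pivot sorts first.
        da = (a[0] - pivot[0]) ** 2 + (a[1] - pivot[1]) ** 2
        db = (b[0] - pivot[0]) ** 2 + (b[1] - pivot[1]) ** 2
        return (da > db) - (da < db)

    rest = sorted((p for p in pts if p != pivot), key=cmp_to_key(by_polar_angle))

    hull = [pivot]  # used as a stack
    for p in rest:
        # Pop while the last two stack points and p make a clockwise turn
        # (or are collinear, an assumption made here to keep only corners).
        while len(hull) >= 2 and cross(hull[-2], hull[-1], p) <= 0:
            hull.pop()
        hull.append(p)
    return hull
```

For example, `graham_scan([(0, 0), (2, 0), (2, 2), (0, 2), (1, 1)])` returns the four corners of the square and drops the interior point `(1, 1)`. The sort dominates the running time, which is where the O(n log n) bound comes from; the stack pass is linear because each point is pushed and popped at most once.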
Reposted by Abhishek Divekar
AM-DeepSeek-R1-Distilled-1.4M: Massive reasoning dataset for LLM training

- 1.4M high-quality reasoning problems with verified solutions
- 900K entries distilled from DeepSeek-R1-671B
- Covers math, code, and complex reasoning tasks
- Bilingual (Chinese/English)

huggingface.co/datasets/a-m...
a-m-team/AM-DeepSeek-R1-Distilled-1.4M · Datasets at Hugging Face
March 26, 2025 at 10:00 AM
Reposted by Abhishek Divekar
A reinforcement learning system to beat Pokémon Red. The system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques.

drubinstein.github.io/pokerl/
March 5, 2025 at 8:03 PM
Reposted by Abhishek Divekar
InternLM v3

- Performance surpasses models like Llama3.1-8B and Qwen2.5-7B
- Capable of deep reasoning with system prompts
- Trained only on 4T high-quality tokens

huggingface.co/collections/...
January 15, 2025 at 8:24 AM