Sagnik Anupam
sagnikanupam.bsky.social
Sagnik Anupam
@sagnikanupam.bsky.social
CIS PhD at Penn | MIT CS + Math '24
sagnikanupam.com

PhD student working on AI reasoning in large multimodal models. I design methods to build better models for math, code, visual reasoning, agents, and robotics.
Example user-submitted task: “Find me the last available train from Cardiff Central to Barry Docks station today on trainline”

Deepseek-R1 GIF:
October 14, 2025 at 6:14 AM
Introducing an evaluation platform for web agents–BrowserArena! Combining the awesome @lmarena.bsky.social platform with BrowserUse, we rank LLMs side-by-side to compare their ability to solve web navigation tasks!

Users vote for models using GIFs and text outputs to judge task performance.
October 14, 2025 at 6:14 AM