Rohit Saxena
rohit-saxena.bsky.social
Rohit Saxena
@rohit-saxena.bsky.social
PhD student at University of Edinburgh
Long Context | Summarization | Vision and Language | Narratives

https://saxenarohit.github.io/
Reposted by Rohit Saxena
Congrats! Looks like time is a big failure case for these models (cc @neuralnoise.com @aryopg.bsky.social @rohit-saxena.bsky.social )
bsky.app/profile/emil...
May 17, 2025 at 7:07 AM
Can multimodal LLMs truly understand research poster images?📊

🚀 We introduce PosterSum—a new multimodal benchmark for scientific poster summarization!

📂 Dataset: huggingface.co/datasets/rohitsaxena/PosterSum
📜 Paper: arxiv.org/abs/2502.17540
March 10, 2025 at 2:19 PM