Rohit Saxena
rohit-saxena.bsky.social
Rohit Saxena
@rohit-saxena.bsky.social
PhD student at University of Edinburgh
Long Context | Summarization | Vision and Language | Narratives

https://saxenarohit.github.io/
Reposted by Rohit Saxena
Congrats! Looks like time is a big failure case for these models (cc @neuralnoise.com @aryopg.bsky.social @rohit-saxena.bsky.social )
bsky.app/profile/emil...
May 17, 2025 at 7:07 AM
Work done with @neuralnoise.com Frank Keller
March 10, 2025 at 2:19 PM
We tested state-of-the-art multimodal LLMs on this challenging task—and they struggled! 🤖📉

We also propose a new method:
🔥SEGMENT & SUMMARIZE, a training-free approach that outperforms existing models by:
🔹 Segmenting the poster into logical regions
🔹 Performing local & global summarization
March 10, 2025 at 2:19 PM
📊 PosterSum features 16,305 poster-abstract pairs from major ML conferences.

Task: Summarize a research poster image into a concise abstract summary.
March 10, 2025 at 2:19 PM
🙋‍♂️
November 20, 2024 at 5:20 PM
I'd love to be added!
Thanks
November 20, 2024 at 12:15 PM
Would love to be added!
November 20, 2024 at 12:08 PM
Hello, can you please add me? Thanks
November 20, 2024 at 11:59 AM
I'd love to be added!
Thanks
November 20, 2024 at 11:48 AM