Lightnews — Scholar-powered news

Stephen with a ph...D

@stephenmcgeephd.bsky.social

32 followers 390 following 7 posts

Data nerd focused on integrating AI into healthcare genetics

Stephen with a ph...D

@stephenmcgeephd.bsky.social

Fascinating (albeit ominously titled) dataset for benchmarking emerging multi-modal models - Humanity’s Last Exam

Over 3000 questions gathered from a global collaborative of over 1000 contributors from >50 countries.

lastexam.ai

February 2, 2025 at 2:26 PM

Stephen with a ph...D

@stephenmcgeephd.bsky.social

Microsoft pushed out a smarter retrieval paradigm, Chain-of-Retrieval Augmented Generation (CoRAG): it uses o1-like models to retrieve and reason step by step, refining the query at each step, which improves accuracy and helps escape the limitations of conventional RAG systems #MLSky - arxiv.org/abs/2501.143...

Chain-of-Retrieval Augmented Generation

This paper introduces an approach for training o1-like RAG models that retrieve and reason over relevant information step by step before generating the final answer. Conventional RAG methods usually p...

arxiv.org

January 29, 2025 at 12:28 PM
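
The retrieve-then-refine loop described in the post above can be illustrated with a minimal Python sketch. It is not the paper's training method or any released code: `chain_of_retrieval`, `toy_retrieve`, `toy_next_query`, and the `FACTS` lookup are hypothetical names, and the hand-written `toy_next_query` is only a stand-in for the model that would propose the next query.

```python
from typing import Callable, List, Optional, Tuple

Step = Tuple[str, str]  # (query, retrieved evidence)


def chain_of_retrieval(
    question: str,
    retrieve: Callable[[str], str],
    next_query: Callable[[str, List[Step]], Optional[str]],
    max_steps: int = 3,
) -> List[Step]:
    """Retrieve, inspect the evidence, then refine the query, up to max_steps times."""
    chain: List[Step] = []
    query: Optional[str] = question
    for _ in range(max_steps):
        evidence = retrieve(query)
        chain.append((query, evidence))
        # Assumption: a model would decide here whether another hop is needed.
        query = next_query(question, chain)
        if query is None:
            break
    return chain


# Toy data and stubs, purely illustrative.
FACTS = {
    "author of Book X": "Book X was written by Ada Example.",
    "birthplace of Ada Example": "Ada Example was born in Springfield.",
}


def toy_retrieve(query: str) -> str:
    # Exact-match lookup standing in for a real retriever.
    return FACTS.get(query, "")


def toy_next_query(question: str, chain: List[Step]) -> Optional[str]:
    # Hand-written rule standing in for the model's query refinement.
    last_evidence = chain[-1][1]
    if "written by " in last_evidence:
        author = last_evidence.split("written by ")[1].rstrip(".")
        return f"birthplace of {author}"
    return None  # nothing left to refine, so stop


chain = chain_of_retrieval("author of Book X", toy_retrieve, toy_next_query)
for query, evidence in chain:
    print(f"{query!r} -> {evidence!r}")
```

A single-shot retriever would stop after the first lookup; the loop instead uses the first piece of evidence to form a second, more specific query.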

Stephen with a ph...D

@stephenmcgeephd.bsky.social

Just realized that with images, OpenAI inflates the prompt token count for billing when using GPT-4o-mini, so that an image costs the same as it does with GPT-4o. Not a deal-breaker, by any means, but the cost-efficient promise of 4o-mini seems to be text-specific.

January 29, 2025 at 5:04 AM

Stephen with a ph...D

@stephenmcgeephd.bsky.social

Recently saw a post stating OCR is dead. I don’t think we’re quite there yet. Though multimodal models have dramatically improved structuring clinical docs, OCR still has much to offer where VLMs fall short (bounding box detection, avoiding hallucinations on poor-quality images, etc.)
#MLSky

January 25, 2025 at 2:19 PM