Lightnews — Scholar-powered news

IAMJB

@iamjbd.bsky.social

🧵 What if AI could learn from millions of unlabeled radiology images and reports—and then flexibly adapt to new clinical tasks? In a new comprehensive review in
@radiology_rsna, colleagues at stanford dive into how foundation models (FMs) are set to revolutionize radiology!

March 10, 2025 at 10:44 PM

IAMJB

@iamjbd.bsky.social

"Second, we develop budget forcing to control test-time compute by forcefully terminating the model's thinking process or lengthening it by appending "Wait" multiple times to the model's generation when it tries to end."

What a trick...

February 3, 2025 at 5:54 PM

IAMJB

@iamjbd.bsky.social

Is this the last benchmark before AGI? Humanity's Last Exam (HLE)

🤯 3,000 expert-level questions across 100+ subjects, created by nearly 1,000 subject matter experts globally.

January 25, 2025 at 7:00 PM

IAMJB

@iamjbd.bsky.social

DeepSeek-R1: next level

January 25, 2025 at 5:14 AM

IAMJB

@iamjbd.bsky.social

𝗔 𝗦𝗶𝗺𝗽𝗹𝗲 𝗚𝘂𝗶𝗱𝗲 𝘁𝗼 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 𝗠𝗲𝗺𝗼𝗿𝘆 🌟

An agent's memory helps it plan and react by leveraging past interactions or external data via prompt context. Here’s a breakdown:

𝟭. Episodic Memory: Logs past actions/interactions (e.g., stored in a vector database for semantic search).

January 24, 2025 at 5:50 PM

IAMJB

@iamjbd.bsky.social

🧩 The future of creativity is elemental. ✨

Kling AI just announced Elements

🌎 First, world building:
Craft your characters, environments, props. Plan your motion and VFX.
🎛️ Then, remixing:
Bring it all together into a cohesive story.

January 19, 2025 at 6:09 PM

IAMJB

@iamjbd.bsky.social

January 17, 2025 at 7:00 PM

IAMJB

@iamjbd.bsky.social

Amazing. Agent Roles:
⛳ PhD Agent: Conducts literature reviews, interprets results, writes reports.
⛳ Postdoc Agent: Plans research, designs experiments.
⛳ ML Engineer Agent: Prepares data, writes, optimizes code.
⛳ Professor Agent: Oversees, refines reports.

January 16, 2025 at 6:00 PM

IAMJB

@iamjbd.bsky.social

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era
>> Hybrid linear-softmax attention working very well at large scale and long-context
filecdn.minimax.chat/_Arxiv_MiniM...

January 15, 2025 at 10:33 PM

IAMJB

@iamjbd.bsky.social

first look into what the Qwen team used to develop QwQ
arxiv.org/pdf/2501.07301

January 15, 2025 at 4:38 AM

IAMJB

@iamjbd.bsky.social

Neat: Representing Long Volumetric Video with Temporal Gaussian Hierarchy

Contrib: Temporal Gaussian Hierarchy representation for long volumetric video.

January 14, 2025 at 2:32 AM

IAMJB

@iamjbd.bsky.social

Nice visualization of RAG vs. Agentic RAG

January 13, 2025 at 5:37 PM

IAMJB

@iamjbd.bsky.social

Neat. Converts images, PDFs, and Office documents to Markdown or JSON using OCR and LLM models, with features for caching, distributed processing, and PII removal

January 12, 2025 at 4:33 AM

IAMJB

@iamjbd.bsky.social

volume rendering made easy and free 😍

January 10, 2025 at 7:23 PM

IAMJB

@iamjbd.bsky.social

How do you even coordinate this?

January 8, 2025 at 9:04 PM

IAMJB

@iamjbd.bsky.social

🚀 PRIME + Eurus-2 beat Qwen2.5-Math-Instruct with 1/10 the data!
✨ Implicit PRM (no labels)
🔄 Online updates, zero overhead
🎯 Token-level rewards + RLOO

Scaling up with 3x more data!

January 5, 2025 at 9:06 PM

IAMJB

@iamjbd.bsky.social

2 OLMo 2 Furious captures every lesson learned since OLMo 1, featuring in-depth explorations of:
• Stable pretraining
• LR annealing, data curricula, and soups
• Tulu post-training
• Compute infrastructure

January 4, 2025 at 3:14 AM

IAMJB

@iamjbd.bsky.social

Top 25 Open Source AI models on Hugging Face in 2025

December 31, 2024 at 12:47 AM

IAMJB

@iamjbd.bsky.social

💥 Gemini 2.0 is on paper-central. Talk with any paper from the 🤗 Hugging Face paper page. Example with GenEx 👇

December 16, 2024 at 6:40 PM

IAMJB

@iamjbd.bsky.social

EasyRef is on 🤗 Hugging Face

After DiffSensei yesterday, @ylecun is once again being style-transferred!

(⚡Llama 3.3) Chat with the paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/zongzhuofan...
🤗 Demo: huggingface.co/spaces/zong...
🤗 Paper: huggingface.co/papers/2412...

December 13, 2024 at 6:04 PM

IAMJB

@iamjbd.bsky.social

SuperCharged Euclid is on 🤗 Hugging Face

Also, this is the best paper heading I’ve seen in quite some time. The 'en tête' looks fantastic.

(⚡Llama 3.3) Chat with the paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/euclid-mult...
🤗 Dataset: huggingface.co/datasets/eu...

December 13, 2024 at 5:51 PM

IAMJB

@iamjbd.bsky.social

Phi-4 technical report is out.

What are the main contributions, how does it compare to SOTA? Find out 👇

🤗 Chat with the paper (⚡Llama 3.3) huggingface.co/spaces/hugg...

December 13, 2024 at 5:43 PM

IAMJB

@iamjbd.bsky.social

InternLM-XComposer-2.5-OmniLive is on 🤗 Hugging Face

(⚡Llama 3.3) Chat with paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/internlm/in...
🤗 Paper: huggingface.co/papers/2412...
🐍 GitHub: github.com/InternLM/In...

December 13, 2024 at 5:35 PM

IAMJB

@iamjbd.bsky.social

Started training my new LLM today and everything is going well.

December 12, 2024 at 7:17 PM

IAMJB

@iamjbd.bsky.social

⚔️ CodeArena is on 🤗 Hugging Face

🤗 Dataset: huggingface.co/datasets/CS...

💡 An excellent way to get an overview in a glimpse: ask paper-central to summarize the scores in a markdown table (⚡by Llama 3.3!)

Try it out for any paper: huggingface.co/spaces/hugg...

December 11, 2024 at 5:47 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news