IAMJB
iamjbd.bsky.social
IAMJB
@iamjbd.bsky.social
🤗 ML at Hugging Face
🌲 Academic Staff at Stanford University (AIMI Center)
🦴 Radiology AI is my stuff
🧵 What if AI could learn from millions of unlabeled radiology images and reports—and then flexibly adapt to new clinical tasks? In a new comprehensive review in
@radiology_rsna, colleagues at stanford dive into how foundation models (FMs) are set to revolutionize radiology!
March 10, 2025 at 10:44 PM
"Second, we develop budget forcing to control test-time compute by forcefully terminating the model's thinking process or lengthening it by appending "Wait" multiple times to the model's generation when it tries to end."

What a trick...
February 3, 2025 at 5:54 PM
Is this the last benchmark before AGI? Humanity's Last Exam (HLE)

🤯 3,000 expert-level questions across 100+ subjects, created by nearly 1,000 subject matter experts globally.
January 25, 2025 at 7:00 PM
DeepSeek-R1: next level
January 25, 2025 at 5:14 AM
𝗔 𝗦𝗶𝗺𝗽𝗹𝗲 𝗚𝘂𝗶𝗱𝗲 𝘁𝗼 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 𝗠𝗲𝗺𝗼𝗿𝘆 🌟

An agent's memory helps it plan and react by leveraging past interactions or external data via prompt context. Here’s a breakdown:

𝟭. Episodic Memory: Logs past actions/interactions (e.g., stored in a vector database for semantic search).
January 24, 2025 at 5:50 PM
🧩 The future of creativity is elemental. ✨

Kling AI just announced Elements

🌎 First, world building:
Craft your characters, environments, props. Plan your motion and VFX.
🎛️ Then, remixing:
Bring it all together into a cohesive story.
January 19, 2025 at 6:09 PM
January 17, 2025 at 7:00 PM

Amazing. Agent Roles:
⛳ PhD Agent: Conducts literature reviews, interprets results, writes reports.
⛳ Postdoc Agent: Plans research, designs experiments.
⛳ ML Engineer Agent: Prepares data, writes, optimizes code.
⛳ Professor Agent: Oversees, refines reports.
January 16, 2025 at 6:00 PM
MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era
>> Hybrid linear-softmax attention working very well at large scale and long-context
filecdn.minimax.chat/_Arxiv_MiniM...
January 15, 2025 at 10:33 PM
first look into what the Qwen team used to develop QwQ
arxiv.org/pdf/2501.07301
January 15, 2025 at 4:38 AM
Neat: Representing Long Volumetric Video with Temporal Gaussian Hierarchy

Contrib: Temporal Gaussian Hierarchy representation for long volumetric video.
January 14, 2025 at 2:32 AM
Nice visualization of RAG vs. Agentic RAG
January 13, 2025 at 5:37 PM
Neat. Converts images, PDFs, and Office documents to Markdown or JSON using OCR and LLM models, with features for caching, distributed processing, and PII removal
January 12, 2025 at 4:33 AM
volume rendering made easy and free 😍
January 10, 2025 at 7:23 PM
How do you even coordinate this?
January 8, 2025 at 9:04 PM
🚀 PRIME + Eurus-2 beat Qwen2.5-Math-Instruct with 1/10 the data!
✨ Implicit PRM (no labels)
🔄 Online updates, zero overhead
🎯 Token-level rewards + RLOO

Scaling up with 3x more data!
January 5, 2025 at 9:06 PM
2 OLMo 2 Furious captures every lesson learned since OLMo 1, featuring in-depth explorations of:
• Stable pretraining
• LR annealing, data curricula, and soups
• Tulu post-training
• Compute infrastructure
January 4, 2025 at 3:14 AM
Top 25 Open Source AI models on Hugging Face in 2025
December 31, 2024 at 12:47 AM
💥 Gemini 2.0 is on paper-central. Talk with any paper from the 🤗 Hugging Face paper page. Example with GenEx 👇
December 16, 2024 at 6:40 PM
EasyRef is on 🤗 Hugging Face

After DiffSensei yesterday, @ylecun is once again being style-transferred!

(⚡Llama 3.3) Chat with the paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/zongzhuofan...
🤗 Demo: huggingface.co/spaces/zong...
🤗 Paper: huggingface.co/papers/2412...
December 13, 2024 at 6:04 PM
SuperCharged Euclid is on 🤗 Hugging Face

Also, this is the best paper heading I’ve seen in quite some time. The 'en tête' looks fantastic.

(⚡Llama 3.3) Chat with the paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/euclid-mult...
🤗 Dataset: huggingface.co/datasets/eu...
December 13, 2024 at 5:51 PM
Phi-4 technical report is out.

What are the main contributions, how does it compare to SOTA? Find out 👇

🤗 Chat with the paper (⚡Llama 3.3) huggingface.co/spaces/hugg...
December 13, 2024 at 5:43 PM
InternLM-XComposer-2.5-OmniLive is on 🤗 Hugging Face

(⚡Llama 3.3) Chat with paper: huggingface.co/spaces/hugg...
🤗 Model: huggingface.co/internlm/in...
🤗 Paper: huggingface.co/papers/2412...
🐍 GitHub: github.com/InternLM/In...
December 13, 2024 at 5:35 PM
Started training my new LLM today and everything is going well.
December 12, 2024 at 7:17 PM
⚔️ CodeArena is on 🤗 Hugging Face

🤗 Dataset: huggingface.co/datasets/CS...

💡 An excellent way to get an overview in a glimpse: ask paper-central to summarize the scores in a markdown table (⚡by Llama 3.3!)

Try it out for any paper: huggingface.co/spaces/hugg...
December 11, 2024 at 5:47 PM