Working on a book about Citizen Diplomacy.
Living in the woods.
Also - being mom to four boys and one baby girl 🤘🏻
Since so many people asked, we are making our article on Agent2Agent (A2A) free to read on @hf.co
🧵
Since so many people asked, we are making our article on Agent2Agent (A2A) free to read on @hf.co
🧵
Memory-efficiency, inference speed, without compromising model quality.
LFMs have been benchmarked on real hardware, proving that they can beat Transformers.
Liquid AI have also just released Hyena Edge👇
Memory-efficiency, inference speed, without compromising model quality.
LFMs have been benchmarked on real hardware, proving that they can beat Transformers.
Liquid AI have also just released Hyena Edge👇
In the new Inference episode, I sat down with Amjad Masad, CEO and co-founder at Replit, to discuss the evolution in coding.
Are we entering a post-coding world?
www.youtube.com/watch?v=PlDe...
In the new Inference episode, I sat down with Amjad Masad, CEO and co-founder at Replit, to discuss the evolution in coding.
Are we entering a post-coding world?
www.youtube.com/watch?v=PlDe...
▪️ Test-Time Reinforcement Learning
▪️ LLMs are Greedy Agents
▪️ Paper2Code
▪️ Efficient Pretraining Length Scaling
▪️ The Sparse Frontier
▪️ Roll the dice & look before you leap
▪️ Discovering and Analyzing Values in Real-World Language Model Interactions
🧵
▪️ Test-Time Reinforcement Learning
▪️ LLMs are Greedy Agents
▪️ Paper2Code
▪️ Efficient Pretraining Length Scaling
▪️ The Sparse Frontier
▪️ Roll the dice & look before you leap
▪️ Discovering and Analyzing Values in Real-World Language Model Interactions
🧵
▪️ Hyena Edge
▪️ Tina: Tiny Reasoning Models via LoRA
▪️ Kimi-Audio
▪️ Aimo-2 winning solution
▪️ Eagle 2.5
▪️ Trillion-7B
▪️ Surya OCR
▪️ ThinkPRM
▪️ Skywork R1V2
🧵
▪️ Hyena Edge
▪️ Tina: Tiny Reasoning Models via LoRA
▪️ Kimi-Audio
▪️ Aimo-2 winning solution
▪️ Eagle 2.5
▪️ Trillion-7B
▪️ Surya OCR
▪️ ThinkPRM
▪️ Skywork R1V2
🧵
Yet even with the loud launch and 50 big-name partners, Google's A2A remains underappreciated. Why?
Here are several reasons 👇
www.turingpost.com/p/a2a
Yet even with the loud launch and 50 big-name partners, Google's A2A remains underappreciated. Why?
Here are several reasons 👇
www.turingpost.com/p/a2a
ElevenLabs is working to make such communication possible
We talked to it's co-founder and CEO Mati Staniszewski about what the next few years might look like👇
ElevenLabs is working to make such communication possible
We talked to it's co-founder and CEO Mati Staniszewski about what the next few years might look like👇
▪️ AI as a medium
▪️ AI as a feedback loop
Details🧵
▪️ AI as a medium
▪️ AI as a feedback loop
Details🧵
My top 10:
▪️ AI for Software Engineering
▪️ Inference-Time Scaling for Reward Modeling
▪️ Inference-Time Scaling for Complex Tasks
▪️ Open-Reasoner-Zero
▪️ SynWorld
▪️ Agent S2
▪️ KnowSelf
▪️ ZClip
▪️ MegaScale-Infer
▪️ Scaling Laws in Scientific Discovery
🧵
My top 10:
▪️ AI for Software Engineering
▪️ Inference-Time Scaling for Reward Modeling
▪️ Inference-Time Scaling for Complex Tasks
▪️ Open-Reasoner-Zero
▪️ SynWorld
▪️ Agent S2
▪️ KnowSelf
▪️ ZClip
▪️ MegaScale-Infer
▪️ Scaling Laws in Scientific Discovery
🧵
• Command A
• TransMamba
• HallOumi
• ScholarCopilot
• OThink-MR1
• RIG
• Z1
🧵
• Command A
• TransMamba
• HallOumi
• ScholarCopilot
• OThink-MR1
• RIG
• Z1
🧵
5 types based on how the model reasons:
▪️ Probabilistic
▪️ Rule-based
▪️ Logical
▪️ Abductive
▪️ Fuzzy
4 inference types based on its execution contexts:
▪️ Batch
▪️ Real-time
▪️ Edge
▪️ Cloud
Save the list and check this out for useful resources: huggingface.co/posts/Ksenia...
5 types based on how the model reasons:
▪️ Probabilistic
▪️ Rule-based
▪️ Logical
▪️ Abductive
▪️ Fuzzy
4 inference types based on its execution contexts:
▪️ Batch
▪️ Real-time
▪️ Edge
▪️ Cloud
Save the list and check this out for useful resources: huggingface.co/posts/Ksenia...
It covers important topics, such as:
- Path to advanced AI capabilities
- Sources of misalignment risk
- Technical approaches: safe design patterns, oversight, robust training, etc.
- AI governance
- Testing and evaluating AI failures
Watch 👇
It covers important topics, such as:
- Path to advanced AI capabilities
- Sources of misalignment risk
- Technical approaches: safe design patterns, oversight, robust training, etc.
- AI governance
- Testing and evaluating AI failures
Watch 👇
- Inference time
- Total generation time
- Latency
- Time to First Token (TTFT)
- Time Per Output Token (TPOT)
- Throughput
- Cost per inference
- Scalability
- Accuracy
- Entropy
Learn what they mean and other main things about inference in our article:
- Inference time
- Total generation time
- Latency
- Time to First Token (TTFT)
- Time Per Output Token (TPOT)
- Throughput
- Cost per inference
- Scalability
- Accuracy
- Entropy
Learn what they mean and other main things about inference in our article:
▪️"Dictionary learning"
▪️ Monosemanticity
▪️ "On the Biology of a Large Language Model"
🧵
▪️"Dictionary learning"
▪️ Monosemanticity
▪️ "On the Biology of a Large Language Model"
🧵
• Google's Gemini 2.5
• DeepSeek-V3-0324
• Zhipu's AutoGLM Rumination
• Alibaba's Qwen2.5-Omni
🧵
• Google's Gemini 2.5
• DeepSeek-V3-0324
• Zhipu's AutoGLM Rumination
• Alibaba's Qwen2.5-Omni
🧵
▪️ KAM-CoT
▪️ Multimodal Visualization-of-Thought (MVoT)
▪️ Compositional CoT
▪️ URSA
▪️ MM-Verify
▪️ Duty-Distinct CoT
▪️ Multimodal-CoT
▪️ Graph-of-Thought
▪️ Hypergraph-of-Thought
Save the list, and check this out for more info: huggingface.co/posts/Ksenia...
▪️ KAM-CoT
▪️ Multimodal Visualization-of-Thought (MVoT)
▪️ Compositional CoT
▪️ URSA
▪️ MM-Verify
▪️ Duty-Distinct CoT
▪️ Multimodal-CoT
▪️ Graph-of-Thought
▪️ Hypergraph-of-Thought
Save the list, and check this out for more info: huggingface.co/posts/Ksenia...
Yes, sometimes human is just another callable function in an AI agent's toolbox.
▪️ Human in the loop (HITL) is a design pattern, where humans are built into the decision loop to:
- validate outputs
- steer actions
- override the machine when necessary
🧵
Yes, sometimes human is just another callable function in an AI agent's toolbox.
▪️ Human in the loop (HITL) is a design pattern, where humans are built into the decision loop to:
- validate outputs
- steer actions
- override the machine when necessary
🧵
It's a new attention mechanism that allows models to be 2x faster and cut memory use by 32 times!
▪️ What's the secret?
It uses the same math as Multi-Head Attention (MHA) but applies one clever trick🧵
It's a new attention mechanism that allows models to be 2x faster and cut memory use by 32 times!
▪️ What's the secret?
It uses the same math as Multi-Head Attention (MHA) but applies one clever trick🧵
Researchers from Technion and Google Research defined what this "knowing" means and also explored it and found that:
LLMs often know more than they say but getting them to "say" it can be surprisingly hard
Key findings🧵
Researchers from Technion and Google Research defined what this "knowing" means and also explored it and found that:
LLMs often know more than they say but getting them to "say" it can be surprisingly hard
Key findings🧵
Our top 2
▪️ Xattention
▪️ Inside-Out: Hidden Factual Knowledge in LLMs
▪️ Rwkv-7 "Goose"
▪️ ϕ-Decoding
▪️ Frac-connections
▪️ DAPO
▪️ Reinforcement learning for reasoning in small LLMs
▪️ MetaLadder
▪️ Why do multi-agent LLM systems fail?
🧵
Our top 2
▪️ Xattention
▪️ Inside-Out: Hidden Factual Knowledge in LLMs
▪️ Rwkv-7 "Goose"
▪️ ϕ-Decoding
▪️ Frac-connections
▪️ DAPO
▪️ Reinforcement learning for reasoning in small LLMs
▪️ MetaLadder
▪️ Why do multi-agent LLM systems fail?
🧵
• KBLaM from @microsoft.com
• Fin-R1
• NVIDIA’s Cosmos-Reason1
• NVIDIA’s Cosmos-Transfer1
• M3 by NVIDIA
• Tencent’s T1
• Roblox’ Cube
🧵
• KBLaM from @microsoft.com
• Fin-R1
• NVIDIA’s Cosmos-Reason1
• NVIDIA’s Cosmos-Transfer1
• M3 by NVIDIA
• Tencent’s T1
• Roblox’ Cube
🧵
Lamini, and co-creator of a top AI course on Coursera, was full of incredible insights!
We discussed important topics, such as:
- AI hallucinations
- Agents and RAG hype
- The keys to GenAI
- AI education
www.youtube.com/watch?v=RW01...
Lamini, and co-creator of a top AI course on Coursera, was full of incredible insights!
We discussed important topics, such as:
- AI hallucinations
- Agents and RAG hype
- The keys to GenAI
- AI education
www.youtube.com/watch?v=RW01...
▪️ Original RoPE
▪️ LongRoPE
▪️ LongRoPE2
▪️ MRoPE (Multimodal RoPE)
▪️ DRoPE (Directional RoPE)
▪️ VideoRoPE
▪️ VRoPE
▪️ XPos
Save the list and check this out for the links and more info: huggingface.co/posts/Ksenia...
▪️ Original RoPE
▪️ LongRoPE
▪️ LongRoPE2
▪️ MRoPE (Multimodal RoPE)
▪️ DRoPE (Directional RoPE)
▪️ VideoRoPE
▪️ VRoPE
▪️ XPos
Save the list and check this out for the links and more info: huggingface.co/posts/Ksenia...
It's a full‑fledged agentic ecosystem that lets Qwen models autonomously plan, call functions, and execute complex, multi‑step tasks right out of the box.
Here are examples of Qwen-Agent's applications:
It's a full‑fledged agentic ecosystem that lets Qwen models autonomously plan, call functions, and execute complex, multi‑step tasks right out of the box.
Here are examples of Qwen-Agent's applications: