Eugene Yan
eugeneyan.com
@eugeneyan.com
RecSys, AI, Engineering; Principal Applied Scientist @ Amazon. Led ML @ Alibaba, Lazada, Healthtech Series A. Writing @ eugeneyan.com, aiteratelabs.com.
yeap the plan is to map products to semantic IDs which the model can understand
September 17, 2025 at 3:57 AM
code for data prep, training the RQ-VAE, SASRec, Qwen3-8B, chatting with the model, etc

github.com/eugeneyan/se...
GitHub - eugeneyan/semantic-ids-llm: Semantic IDs: How to train an LLM-Recommender Hybrid with steerability and reasoning on recommendations.
github.com
September 17, 2025 at 2:05 AM
demo of the LLM-recommender hybrid returning both semantic IDs & english, and:
• steering recs via natural language
• explaining the recommendation
• naming the bundle of recommendations
• multi-turn conversation to get recs

watch till the end for the bloopers lol

www.youtube.com/watch?v=_0n4...
LLM-Recomender Hybrid with Steerable Recommendations and Reasoning on Recommendations
YouTube video by Eugene Yan
www.youtube.com
September 17, 2025 at 2:05 AM
For example, given a sequence of items, it can recommend the next best item. But better than that, you can steer the recommendations with natural language! And it can explain why it gave that recommendation, as well as creatively name recommendation bundles.
September 17, 2025 at 2:05 AM
For example, Amazon started implementing the first version of Amazon Prime in late 2004 and announced it on February 2, 2005, six weeks later. An account of how it came about, with lots of anecdotes, is here: vox.com/recode/2019/...

Also this list: patrickcollison.com/fast
The making of Amazon Prime, the internet’s most successful and devastating membership program
An oral history of the subscription service that changed online shopping forever.
vox.com
May 20, 2025 at 2:23 AM
converted all images to webp and hopefully made the site faster. something i wouldn't have bothered with in the past
May 18, 2025 at 11:09 PM
Previously, these tasks weren't worth the effort but now they can be done in hours. What an amazing time to build and play =D
May 18, 2025 at 9:07 PM
oops! thanks for letting me know, fixed!
May 7, 2025 at 2:57 AM
p.s. If you’re interested in topics like this, my friends Ben and Swyx are organizing the AI Engineer World’s Fair in San Francisco from 3rd to 5th June. Come talk to builders deploying AI systems in production. Here’s a big discount for tickets: ti.to/software-3/a...
AI Engineer World's Fair 2025
The AI Engineer World's Fair is the biggest technical AI event of the year, happening Summer 2025, the one place you can meet with ~every major AI lab from OpenAI to Anthropic to Cohere, every AI infr...
ti.to
May 7, 2025 at 12:25 AM
Here's the code for the mcp-server (src), prompts (context), and generated summaries from May 4th (summaries).

github.com/eugeneyan/ne...
GitHub - eugeneyan/news-agents: Building News Agents to Summarize News with MCP, Q, and tmux
github.com
May 7, 2025 at 12:25 AM
Here's a three-minute demo of news-agents in action. It's pretty cool at the 30-second mark how the sub-agents get spawned! We then see the main agent assigning tasks and polling for progress, and finally shutting the sub-agents down when they're done with their assigned tasks.
May 7, 2025 at 12:24 AM