Marco Herzog
banner
marcoher.bsky.social
Marco Herzog
@marcoher.bsky.social
Connecting AI research & Applied AI since 2018 | Previously: ML, NLP, RecSys | Now: GenAI, LLMs, RAG | Natural science, tech and climbing is my thing.
I’ve tested the new llama 3.3 70b 🦙 and the instruction following so far is magnifique. 👌
December 7, 2024 at 2:18 PM
If you are a leader, you shouldn't ignore this paper from MIT for your business and your workforce. This research from a large U.S. firm's R&D department shows real-world effects of AI — positive and negative.

These findings can be adopted by other businesses and departments as well. 🧵

#AI #ML
December 6, 2024 at 9:56 AM
Tencent open sourced this, Impressive! But I think we need to start talking about AI generated content detection on social media more seriously …

This open source is 🤯

More content with other capabilities in 🧵
aivideo.hunyuan.tencent.com
December 4, 2024 at 4:31 PM
This essay from DeepMind is something you shouldn't miss, it made me very excited about the future of science.

A new golden age of discovery
Seizing the AI for Science opportunity

deepmind.google/public-polic...

#AI #ML #Science
A new golden age of discovery
In this essay, we take a tour of how AI is transforming scientific disciplines from genomics to computer science to weather forecasting. Some scientists are training their own AI models, while...
deepmind.google
December 4, 2024 at 4:08 PM
If this is not telling an interesting story, than what does?

Accenture makes more money with GenAI than OpenAI.

GenAI revenue:
OpenAI: $3.4bn
Accenture: $3.7bn

I’m still thinking what that means…
December 3, 2024 at 8:33 PM
Unveiling the true inspiration behind the ‘attention’ operator in Transformers! From Bahdanau’s emails to Karpathy, we learn how attention transformed neural networks. Surprising that ‘Attention is All You Need’ outshines its predecessor by Bahdanau et al. #AI #ML #Transformers
December 3, 2024 at 8:06 PM
Reposted by Marco Herzog
Cursor vs Windsurf

I just spent a week deep-diving into Cursor and Windsurf, so you don’t have to.

Here's everything you need to know 👇🧵
November 28, 2024 at 8:45 PM
By far the best visualization and walkthrough of a Transformer I've ever seen. You can explore the algorithm down to every add & multiply, seeing the whole process in action. With animation and explanation. That was definitely created with a lot of passion and work. By Brendan Bycroft.

#AI #LLM #LM
November 29, 2024 at 1:32 PM
A new update on the AI feed. The feed now has 66 unique negative filters. They remove any form of AI art. There are 186 filters that are filtering content related to the sphere of AI.

bsky.app/profile/did:...

#AI #LLM #ML
November 28, 2024 at 8:48 PM
@bsky.app Impression stats are really missing. Not necessarily visible for everyone on the post, but at least for the post creator.

1. To know how big the audience really is
2. To see the actual quality of post (engagement/Impressions)
November 28, 2024 at 7:52 PM
Do you want to see the new Anthropic Model Context Protocol in action? It was announced just 3 d ago, and we already see great apps. This time build by Alec Velikanov.

Take this project as an example to build something yourself. I'm off, need to cook 7 courses with the stuff Claude ordered.

#AI
November 28, 2024 at 10:24 AM
What can we do about the benchmark fatigue for #LLM? More people I speak to don’t take them seriously anymore, and I can’t blame them. I still hesitate, but I’m about to drop them as well. I think we need something new for #AI eval. Or is there something I’m not aware of?
November 27, 2024 at 11:17 PM
I've updated the filters for the AI feed. Using 186 filters in total now. Feel free to ping me if you want the filter list.

bsky.app/profile/did:...

#AI #LLM #ML
November 27, 2024 at 4:13 PM
Once moving away from naive RAG, chunking becomes pretty important. I use semantic chunking with BERT, and I'm quite happy with it. I didn't use this, but looks promising. Quite handy if you don't use bigger libs like Haystack, LangChain, LLamaIndex etc.

github.com/bhavnicksm/c...

#LLM #RAG #AI
GitHub - bhavnicksm/chonkie: 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library - bhavnicksm/chonkie
github.com
November 27, 2024 at 3:47 PM
I used to think #LLM were just fancy next-token parrots. Turns out they’re secretly doing math. The end of Scaling laws? Surprise! Your #AI can now do symbolic reasoning. I can't be more thrilled to let it do my taxes.

arxiv.org/html/2407.11...

Hopefully it doesn't think in these symbols:
November 27, 2024 at 2:38 PM
I had a talk with Elena Samuylova from Evidently AI and she was super helpful. Evidently is an open-source framework to evaluate, test and monitor ML and LLM-powered systems.

Does anyone have experience with it in prod env or an opinion on other eval tools?
November 25, 2024 at 4:39 PM
An AI starter pack to get quickly into the community by following the most active members. The goal is to get meaningful content fast and engage with the community.

Please tag very active members in the comments to add them into the list.

go.bsky.app/CanJ3xW

#AI #ML #starterPack
November 24, 2024 at 9:57 PM
I like his brutal honesty. 😆

"...in my opinion there's like 5% of an idea and then 95% is like smoke and mirrors trying to couch things in modern words that have already existed for a long time, and they're fundamentally nothing new. I don't want to be too harsh..."

www.youtube.com/watch?v=gfU5...
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (Paper Explained)
YouTube video by Yannic Kilcher
www.youtube.com
November 24, 2024 at 12:07 PM
Quite an unusual benchmark. RE-Bench tests AI Agents against human experts in ML research tasks.

Key findings:
• AI shines in sprints up to 4 h
• Humans lead in longer projects
• Sonnet 3.5 and o1-prev do substantially better than humans given 2 h
• Human improvement is much steeper

#AI #ML
November 23, 2024 at 7:14 PM
Navigating Bluesky's content curation tools 🧭
• Lists: Organize accounts
• Moderated Lists: Tighter content control
• Starter Packs: Onboard new users (max 150)
• Feeds: Custom algorithmic timelines
I was lost at first, but here's what I learned! 🧵👇
November 23, 2024 at 5:15 PM
Actually, an interesting thought for generalization in AI. It seems there is a connection between spacial location and latent context similarly to influence decision-making, despite the task being non-spatial, at least in the hippocampus. Is not Fei-Fei Li currently working on something similar?
Huge congrats to @karyna-mi.bsky.social for her paper published today in Science! She found that the hippocampus is really important for a key strategy we use to make decisions called hidden state inference! 🧪 🧠https://www.science.org/doi/10.1126/science.adq5874 1/7
Hidden state inference requires abstract contextual representations in the ventral hippocampus
The ability to use subjective, latent contextual representations to influence decision-making is crucial for everyday life. The hippocampus is hypothesized to bind together otherwise abstract combinat...
www.science.org
November 23, 2024 at 5:02 AM
Feel free to ping me, if I haven't added you to the RAG - Retrieval Augmented Generation starter pack yet.
go.bsky.app/PUts9PH
November 23, 2024 at 4:58 AM
Feel free to ping me, if I haven't added you to the AI Agents starter pack yet. Enjoy the community.
go.bsky.app/U8M5Rk6
November 23, 2024 at 4:57 AM
Reposted by Marco Herzog
Just created a starter pack to get a steady stream of Applied AI related conversations! Think RAG in Enterprises, Structured Extraction from documents, AI in consumer apps and Agents!

I will keep adding folks to this list! Who else should I add?!

https://buff.ly/3ObxkCD
Applied AI Folks!
Join the conversation
buff.ly
November 17, 2024 at 8:40 AM
🔬 𝗔𝗜 𝗔𝗧 𝗪𝗢𝗥𝗞 — 𝗘𝗻𝗵𝗮𝗻𝗰𝗶𝗻𝗴 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆, 𝗗𝗶𝗺𝗶𝗻𝗶𝘀𝗵𝗶𝗻𝗴 𝗦𝗮𝘁𝗶𝘀𝗳𝗮𝗰𝘁𝗶𝗼𝗻
New MIT study.

The numbers tell a striking story:
• Overall 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝘃𝗶𝘁𝘆 ⬆️ 𝟰𝟰%
• Patent filings ⬆️ 39%
• BUT 𝗷𝗼𝗯 𝘀𝗮𝘁𝗶𝘀𝗳𝗮𝗰𝘁𝗶𝗼𝗻 ⬇ 𝗳𝗼𝗿 𝟴𝟮%

Here's the twist:
• Top performers: 𝗣𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝘃𝗶𝘁𝘆 𝘀𝘂𝗿𝗴𝗲 ⬆️ 𝟴𝟭%
• Bottom third: Minimal gains
November 21, 2024 at 8:51 PM