Pedro Madruga
banner
pedromadruga.com
Pedro Madruga
@pedromadruga.com
Applied AI Scientist (in 🇩🇰 since 2012) • http://pedromadruga.com • Interested in Information Retrieval at scale • Lead AI Scientist at Karnov Group

Opinions are my own.
I am a fan of SDD but it often gets confused with Vibe Coding, specially in companies where innovation is met with resistance.

This articles shares some criticism, yet by someone who actually tried it. I disagree with the article and it seems that HN people in thread do too.

Give SDD a try!
Spec-Driven Development: The Waterfall Strikes Back | Hacker News
Its not a surprise to me that this approach also helps AI coding agents to work more effectively, as in-depth planning is essentially moving the thinking upfront.(I wrote more about this here:…
news.ycombinator.com
November 18, 2025 at 11:01 AM
I have been trying lefthook and damn this thing is fast. Also, the ability to run scripts is handy if you like to keep the config tidy.
GitHub - evilmartians/lefthook: Fast and powerful Git hooks manager for any type of projects.
Fast and powerful Git hooks manager for any type of projects. - evilmartians/lefthook
github.com
November 17, 2025 at 6:08 PM
“Before being technical, science is visionary”
Carlo Rovelli, Reality Is Not What It Seems
November 15, 2025 at 9:28 AM
There are hardly any papers, blog posts or talks that have a general solution for domain-dependent problems.

At best, there are indicators of the path to go, which is why experimentation is key.
November 14, 2025 at 10:00 AM
Business-applied AI development requires a much higher level of experimentation than traditional software development.

This is because AI development requires a very tight connection to domain knowledge. And each domain requires its own approach, for the most part.
November 14, 2025 at 10:00 AM
A short (way too short) yet fantastic reading. Rovelli has the gift of explaining complex things simply.

I have quickly moved to reading The Order of Time, another of his books. The way he explains time is just so accessible - thus mind blowing.
November 11, 2025 at 6:45 AM
It’s reasonably amusing to see the resistance from some traditional software developers to some AI tooling. MCP for example, gets a lot of focus from tech doomers because it’s new.

It’s Ok to not use those don’t fit a use case but until they try it it’s just preemptive criticism.
November 10, 2025 at 11:30 AM
TIL!
I started using the ThunderAI add-on in Thunderbird. Now my local LLM automatically classifies, auto-replies (reviewable), and summarizes emails across multiple accounts. I did not expect to appreciate it this much!

#Thunderbird #ThunderAI #Productivity #PrivacyFirst #LLM
November 7, 2025 at 8:13 PM
Tivemos o Rui que nem rei.
November 4, 2025 at 10:05 PM
The term Agentic Coding is something that resonates quite a lot. But I’ve heard Spec Driven Development and it is spot on.

Both these are almost polar opposite to what Vibe Coding is (imo), even though all are AI powered.

Also, for prototyping: vibe coding all the way.
November 4, 2025 at 1:33 PM
Really great recommendations, thanks!
November 1, 2025 at 8:00 AM
An important component of the success of AI-based products is the ability to intertwine data science and (software) engineering.

Enforcing a separation of those is a recipe for failure.

When it works though, it feels like a dance: there might be toe-stepping but the song is the same.
September 12, 2025 at 7:54 PM
When people ask what am I doing with so many Pi’s
September 7, 2025 at 6:06 PM
Reposted by Pedro Madruga
this seems like a very good idea actually
August 15, 2025 at 1:38 PM
Phi4 didn’t even try
August 12, 2025 at 7:16 PM
So Anthropic is now the enterprise AI market leader. After having been playing with Opus 4 for the last month, this is rather unsurprising.

Open-source LLMs adoption has stalled.

Src: menlovc.com/perspective/...
August 5, 2025 at 5:46 AM
Context engineering (CE) makes sense.

It exposes one of the limitations of RAG: the retrieved chunks lack a structure, thus turning the prompt into a spaghetti of information (inc w/ rerankers).

CE, by providing formatting guidelines, improves the final output - while making testing easier.
August 3, 2025 at 6:07 AM
I've finally clean the dust of my blog and have a few nice blogposts coming in. I'll be writing mostly on Applied AI (scalability, techniques, technical leadership of a successful AI product, etc) from experience in the industry.

Sharing my learnings here: pedromadruga.com/newsletter/
Newsletter
Hi! This newsletter is a way to get the content I write on my blog delivered to your inbox. Alternatively, you can subscribe to my RSS feed. I’ll write about industry-applied Artificial Intelligence,…
pedromadruga.com
July 27, 2025 at 6:00 AM
“In science if you know what you are doing you should not be doing it.

In engineering if you do not know what you are doing you should not be doing it.”

- Richard Hamming, The Art of Doing Science and Engineering
July 25, 2025 at 2:44 PM
Lumo.

Mistral it is. 👍🏼
July 24, 2025 at 4:21 PM
Seems to be using only self-hosted models, according to their Privacy.

Proton (@proton.me)
Proton (@proton.me)
Introducing Lumo, a privacy-first AI built by Proton, where every conversation is confidential ✅ Zero-access encryption ✅ No-logs policy ✅ Open-source and auditable Try @asklumo.proton.me for free,…
bsky.app
July 24, 2025 at 12:05 PM
Reposted by Pedro Madruga
What are your favorite recent papers on using LMs for annotation (especially in a loop with human annotators), synthetic data for task-specific prediction, active learning, and similar?

Looking for practical methods for settings where human annotations are costly.

A few examples in thread ↴
July 23, 2025 at 8:10 AM
One of the best parts of SIGIR2025 were the talks I had with the authors during breaks. So much information to unpack that can only be obtained by being there.

Another great thing was all the amazing work presented.
July 22, 2025 at 9:58 AM