Lightnews — Scholar-powered news

Manuel Sánchez

@manuel-sh.bsky.social

13 followers 15 following 17 posts

Building AI at scale in the enterprise world.

manuelsh.github.io

Posts Replies Media Videos

Manuel Sánchez

@manuel-sh.bsky.social

The error rate of GPT4o is even higher, 1.5%. An agent making 20 calls will have an error rate of 26%! That's not something scalable.

Of course, there are mechanism to reduce it, like the agent running tests over its results, but still this implies more calls. (3/3)

April 17, 2025 at 8:19 AM

Manuel Sánchez

@manuel-sh.bsky.social

But taking the level of the best model, a 0.7% error rate is equivalent to ~13% error rate of an agent that performs 20 calls to that LLM. Many times they do more calls.

13% is a very high error rate if we want to use that at scale. (2/3)

April 17, 2025 at 8:19 AM

Manuel Sánchez

@manuel-sh.bsky.social

linked it from the blog post! thanks!

January 16, 2025 at 10:03 PM

Manuel Sánchez

@manuel-sh.bsky.social

Top in the "How to argue" hierarchy: pointing out a flaw in the central point ;-) just a liiiiiiitle flaw

January 16, 2025 at 9:41 PM

Manuel Sánchez

@manuel-sh.bsky.social

Well explained in this website:
phillipi.github.io/prh/#what_co...

The Platonic Representation Hypothesis

phillipi.github.io

January 12, 2025 at 11:21 PM

Manuel Sánchez

@manuel-sh.bsky.social

Hey, great to hear it! Where can we find it?

January 12, 2025 at 10:31 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news