John Berryman
@jnbrymn.bsky.social
LLM consulting @ https://arcturus-labs.com/

Author x2: https://amzn.to/3TXmDHk, https://amzn.to/3zKIxGG

Formerly Eventbrite, GitHub (code search and Copilot)
In this talk, I'll explore the state-of-the-art in LLM evaluation, covering modern techniques like LLM-as-Judge. I’ll also discuss the essential processes and culture shifts that are required to launch reliable AI products and drive ongoing improvement.
June 25, 2025 at 7:18 PM
Generative AI has made it easier than ever for companies to build products quickly. However, LLMs are inherently nondeterministic and unpredictable. Integrating LLMs into products demands an unprecedented level of quality assurance – requiring new strategies for continuous evaluation.
June 25, 2025 at 7:18 PM
Then you can make a classifier that is tuned to whatever threshold matches a training set most accurately. Easy peasy.

Full post here: arcturus-labs.com/blog/2025/03...
Supercharging LLM Classifications with Logprobs
Turn your LLM into a precision instrument for classification – no fine-tuning required. This post shows how to go beyond simple
arcturus-labs.com
June 12, 2025 at 3:14 PM
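The threshold tuning mentioned in this post could look roughly like the sketch below. It is not taken from the linked post: `best_threshold`, the sample scores, and the sample labels are all made up for illustration, and it assumes you already have a probability for the positive label on each labeled example.

```python
def best_threshold(scores, labels):
    """Pick the cutoff on P(positive) that maximizes accuracy on labeled examples.

    scores: P(positive) for each example (hypothetical input).
    labels: True where the example really is positive.
    """
    if not scores or len(scores) != len(labels):
        raise ValueError("scores and labels must be non-empty and the same length")
    best_t, best_acc = 0.0, -1.0
    # Only the observed scores need to be tried as cutoffs.
    for t in sorted(set(scores)):
        correct = sum((s >= t) == y for s, y in zip(scores, labels))
        acc = correct / len(scores)
        # Strict comparison keeps the lowest cutoff when accuracies tie.
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t, best_acc


# Illustrative data only, not real results.
scores = [0.95, 0.80, 0.65, 0.40, 0.30, 0.10]
labels = [True, True, True, False, True, False]
threshold, accuracy = best_threshold(scores, labels)
```

Accuracy is just one choice of objective here. If false positives cost more than false negatives, you would swap in precision, recall, or a weighted cost instead.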
The idea is really simple: basically you just make a classifier that returns a single token (good/bad, red/green/blue, or 1/2/3/4), but rather than looking at that token, you look at the probabilities of the tokens.
June 12, 2025 at 3:14 PM
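A minimal sketch of that idea, assuming your LLM API can return log-probabilities for the top candidate tokens at the first output position. The `top_logprobs` dict shape, the `label_probabilities` name, and the numbers are assumptions for illustration, not any particular library's API.

```python
import math


def label_probabilities(top_logprobs, labels):
    """Turn token log-probabilities into a normalized distribution over labels.

    top_logprobs: dict mapping candidate tokens to log-probabilities
                  (hypothetical shape; adapt to what your API returns).
    labels: the single-token labels the classifier is allowed to emit.
    """
    # Labels missing from the top-k list are treated as probability 0.
    raw = {
        label: math.exp(top_logprobs[label]) if label in top_logprobs else 0.0
        for label in labels
    }
    total = sum(raw.values())
    if total == 0.0:
        raise ValueError("none of the labels appear in top_logprobs")
    # Renormalize so off-label tokens (e.g. "The") don't count.
    return {label: p / total for label, p in raw.items()}


# Illustrative numbers only.
example = {"good": math.log(0.62), "bad": math.log(0.31), "The": math.log(0.07)}
probs = label_probabilities(example, ["good", "bad"])
```

The result is a confidence score per label rather than a single hard answer, which is what makes the classifier tunable afterwards.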
Read the full deep-dive on why visual reasoning is the next frontier in AI:
arcturus-labs.com/blog/2025/03...

Follow for more.
Visual Reasoning is Coming Soon - Arcturus Labs
From silly cat costumes to world-changing innovations, OpenAI's latest release marks the beginning of something extraordinary. The fascinating world of visual reasoning is emerging, where AI models wi...
arcturus-labs.com
April 7, 2025 at 4:03 AM
Picture this: AI solving physics problems by visualizing objects in motion, or predicting social interactions by imagining body language and facial expressions. That's where we're headed.
April 7, 2025 at 4:03 AM
The key insight: Just like chain-of-thought reasoning transformed how AI thinks through problems with words, visual reasoning will let AI work through problems by creating and analyzing sequences of images.
April 7, 2025 at 4:03 AM
Today's AI can put a detective hat on your cat. Tomorrow's AI will help design your garden, rearrange your furniture, and solve complex spatial puzzles by actually visualizing different scenarios and their outcomes.
April 7, 2025 at 4:03 AM
Thanks for reading our book and posting about it. I'm glad you got something from it!
January 20, 2025 at 7:31 PM
A perfect reply. Thank you Evgeny.
December 27, 2024 at 6:34 AM
I think I'll try this once a year until someone bites :)
December 24, 2024 at 9:48 PM