banner
ahkval.bsky.social
@ahkval.bsky.social
I just wanna know and experience stuff.

Come say hi 🦧 | Exploring LLM agents rn
Reposted
Common pitfalls (with examples) when building AI applications, both from public case studies and my personal experience.

huyenchip.com/2025/01/16/a...

Would love to hear from your experience about the pitfalls you've seen!
Common pitfalls when building generative AI applications
As we’re still in the early days of building applications with foundation models, it’s normal to make mistakes. This is a quick note with examples of some of the most common pitfalls that I’ve seen, b...
huyenchip.com
January 16, 2025 at 10:55 PM
Reposted
Has anyone written an history of standard evaluation sets? How they were created and where does this content ultimately come from?
December 18, 2024 at 10:18 PM
Reposted
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We're open sourcing the full recipe and sharing a detailed blog post 👇
December 16, 2024 at 5:08 PM
Reposted
I'd just like to interject for a moment. What you're referring to as a LLM, is in fact, a LLM / language generation system, or as I've recently taken to calling it, an LLM + LGS. An LLM is not a language generation system unto itself, but rather just one component of a fully functioning language gen
December 7, 2024 at 6:01 AM
Reposted
same
i exclusively consent to my tweets being used for training neural networks. if you are not a neural network, stop reading this immediately
November 28, 2024 at 11:17 AM
One problem I’m consistently running into while building agents is with API responses. A lot of web APIs dump giant payloads intended to be parsed as needed, else ignored.
But for LLMs its all into the ctx_len. Some that are +5k tokens make models tweak and start outputting gibberish/multilingual
November 27, 2024 at 7:04 AM
Reposted
New plugin for sqlite-utils that lets you ask questions of a SQLite database (or a even against one or more CSV/TSV/JSON files) in human language and have an LLM write a SQL query for you to get the answer. simonwillison.net/2024/Nov/25/...
Ask questions of SQLite databases and CSV/JSON files in your terminal
I built a new plugin for my sqlite-utils CLI tool that lets you ask human-language questions directly of SQLite databases and CSV/JSON files on your computer. It’s called sqlite-utils-ask. Here’s …
simonwillison.net
November 25, 2024 at 1:34 AM