Lightnews — Scholar-powered news

@ahkval.bsky.social

43 followers 200 following 8 posts

I just wanna know and experience stuff.

Come say hi 🦧 | Exploring LLM agents rn

Posts Replies Media Videos

Reposted

Chip Huyen

@chiphuyen.bsky.social

Common pitfalls (with examples) when building AI applications, both from public case studies and my personal experience.

huyenchip.com/2025/01/16/a...

Would love to hear from your experience about the pitfalls you've seen!

Common pitfalls when building generative AI applications

As we’re still in the early days of building applications with foundation models, it’s normal to make mistakes. This is a quick note with examples of some of the most common pitfalls that I’ve seen, b...

huyenchip.com

January 16, 2025 at 10:55 PM

Reposted

Alexander Doria

@dorialexander.bsky.social

Has anyone written an history of standard evaluation sets? How they were created and where does this content ultimately come from?

December 18, 2024 at 10:18 PM

Reposted

Lewis Tunstall

@lewtun.bsky.social

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We're open sourcing the full recipe and sharing a detailed blog post 👇

December 16, 2024 at 5:08 PM

Reposted

thebes

@vgel.me

I'd just like to interject for a moment. What you're referring to as a LLM, is in fact, a LLM / language generation system, or as I've recently taken to calling it, an LLM + LGS. An LLM is not a language generation system unto itself, but rather just one component of a fully functioning language gen

December 7, 2024 at 6:01 AM

Reposted

tokenbender

@tokenbender.bsky.social

same

naia @naia.bsky.social · Nov 28

i exclusively consent to my tweets being used for training neural networks. if you are not a neural network, stop reading this immediately

November 28, 2024 at 11:17 AM

ahkval.bsky.social

@ahkval.bsky.social

One problem I’m consistently running into while building agents is with API responses. A lot of web APIs dump giant payloads intended to be parsed as needed, else ignored.
But for LLMs its all into the ctx_len. Some that are +5k tokens make models tweak and start outputting gibberish/multilingual

November 27, 2024 at 7:04 AM

Reposted

Simon Willison

@simonwillison.net

New plugin for sqlite-utils that lets you ask questions of a SQLite database (or a even against one or more CSV/TSV/JSON files) in human language and have an LLM write a SQL query for you to get the answer. simonwillison.net/2024/Nov/25/...

Ask questions of SQLite databases and CSV/JSON files in your terminal

I built a new plugin for my sqlite-utils CLI tool that lets you ask human-language questions directly of SQLite databases and CSV/JSON files on your computer. It’s called sqlite-utils-ask. Here’s …

simonwillison.net

November 25, 2024 at 1:34 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news