The only thing I’m sure about is that I’m rarely able to ask something without getting an overengineered solution and a random new README in my codebase.
Once a project is big enough, they definitely make you more productive compared to vanilla JS, htmx, etc.
But I'm happy I didn't switch earlier; writing frontend code without AI tools must be horrible.
It's a quick test designed to assess your estimation skills: estimator.dylancastillo.co/
This is inspired by @codinghorror's great posts: blog.codinghorror.com/how-good-an...
archive.is/qDc0v
Last year:
💵 I worked on 9 projects with 7 clients. Doubled revenue; costs were up 155%.
💻 Coded 322 days. Wrote 14 blog posts.
🧠 Struggled with focus. Nearly burned out.
📸 I should have taken more photos.
dylancastillo.co/posts/2024-...
After a bit of digging, I realized that it was just due to people misspelling "DeepSeek."
There are now people out there who think that China's top AI is a 💩 that makes charts.
Sounds easy, but happens to everyone.
Here's OpenAI breaking the CoT reasoning of an LLM judge.
Just re-ran Let Me Speak Freely benchmarks with Gemini and got some interesting news:
1. Using constrained decoding seems to lower performance in reasoning tasks.
2. The Generative AI SDK can break your model's reasoning.
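For context, here's a toy sketch of the mechanism behind finding 1: constrained decoding masks the logits of any token that would violate the output grammar, which can suppress exactly the free-form "reasoning" tokens the model would otherwise emit. The mini-vocabulary, grammar, and logit values below are made up for illustration; this is not any real SDK's API.

```python
import math

# Toy vocabulary: some JSON tokens plus free-form "reasoning" tokens.
VOCAB = ["{", "}", '"answer"', ":", "42", "Let me think...", "so"]
FREE_FORM = {"Let me think...", "so"}  # tokens a JSON-only grammar forbids

def allowed_tokens(prefix: str) -> set[str]:
    # Hypothetical JSON-only grammar: every non-JSON token is disallowed,
    # regardless of the prefix generated so far.
    return {t for t in VOCAB if t not in FREE_FORM}

def constrained_argmax(logits: dict[str, float], prefix: str) -> str:
    # Mask disallowed tokens to -inf, then pick greedily.
    allowed = allowed_tokens(prefix)
    masked = {t: (l if t in allowed else -math.inf) for t, l in logits.items()}
    return max(masked, key=masked.get)

# The model's top choice is a reasoning token ("Let me think...", logit 2.0),
# but the constraint forces a JSON token instead.
logits = {"Let me think...": 2.0, "{": 1.5, "42": 0.5,
          "}": -1.0, '"answer"': 0.0, ":": -0.5, "so": 1.8}
print(constrained_argmax(logits, prefix=""))  # "{" — reasoning token suppressed
```

The point: under greedy unconstrained decoding the model would start "thinking out loud"; under the grammar mask it is forced straight into the answer format, which is one plausible reason constrained decoding scores worse on reasoning benchmarks.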
I replicated @willkurt.bsky.social / @dottxtai.bsky.social's rebuttal of "Let Me Speak Freely?" (LMSF) using gpt-4o-mini.
The rebuttal correctly highlights many flaws in the original study, but ironically, LMSF's conclusion still holds.
h/t @alpindale.bsky.social and @danielvanstrien.bsky.social for helping with that goal
1. Silently fixes their spelling mistakes on their iPhone
2. Calculates the fastest route home on Uber
3. Picks the right music for the ride on Spotify
4. Keeps their credit card safe when buying subscriptions
The same 99% will happen here too, but if AI researchers continue to get perma-banned for making available the datasets needed to filter it, it’s going to make this platform unusable.
You can be skeptical about its real-life utility (which I am), but it's hard to bet against such powerful motivators.