Lightnews — Scholar-powered news

Vivek Kalyan

@vivekkalyan.com

Most guides on RAG systems are over-optimizing for latency. If you are using RAG to automate any type of knowledge work, accuracy is much more important.

I explore the idea of spending more compute for RAG systems to significantly improve performance.

www.vivekkalyan.com/writing/scal...

Scaling Compute for RAG

Unlock higher RAG accuracy by strategically spending compute.

www.vivekkalyan.com

January 20, 2025 at 6:44 AM

Vivek Kalyan

@vivekkalyan.com

This is a super high impact project. There are tons of production systems in the real world still running BERT/RoBERTa models from the 2018-2019 era. I'm sincerely hoping these models are easy to finetune; the 8k context length alone will be a good enough reason to upgrade.

Jeremy Howard @howard.fm · Dec 19

I'll get straight to the point.

We trained 2 new models. Like BERT, but modern. ModernBERT.

Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.

It's much faster, more accurate, longer context, and more useful. 🧵

December 19, 2024 at 5:33 PM

Vivek Kalyan

@vivekkalyan.com

We're launching early access for Cartograph (cartograph.app). It takes your codebase, and automatically generates architecture diagrams and documentation for them.

See some demos of open-source repos here (no sign-up required):

cartograph.app/demo

(Reply here if you'd like to see others added)

December 8, 2024 at 3:01 PM

Vivek Kalyan

@vivekkalyan.com

Spent more than 2 hours last night trying to figure out why my solution for day 6 part 2 was working on the examples but not on the test input. Slept on it, and figured it out in 5 minutes when I woke up this morning. 🤦‍♂️

Vivek Kalyan @vivekkalyan.com · Dec 1

Oh look, it's that time of the year. I will be doing Advent of Code 2024 in Rust, hoping to get more experience using Rust to solve a wide range of problems.

github.com/vivekkalyan/...

GitHub - vivekkalyan/advent-of-code-2024: Solving Advent of Code in Rust

Solving Advent of Code in Rust. Contribute to vivekkalyan/advent-of-code-2024 development by creating an account on GitHub.

github.com

December 7, 2024 at 4:23 AM

Vivek Kalyan

@vivekkalyan.com

Oh look, it's that time of the year. I will be doing Advent of Code 2024 in Rust, hoping to get more experience using Rust to solve a wide range of problems.

github.com/vivekkalyan/...

GitHub - vivekkalyan/advent-of-code-2024: Solving Advent of Code in Rust

Solving Advent of Code in Rust. Contribute to vivekkalyan/advent-of-code-2024 development by creating an account on GitHub.

github.com

December 1, 2024 at 8:37 AM

Vivek Kalyan

@vivekkalyan.com

@eugeneyan.bsky.social's blog is a gold mine if you are doing ML/AI in the industry. Writing design docs before ML projects start is an important process that I introduced at my previous org, and the post on design docs was one of the references I used to create our template.

Eugene Yan @eugeneyan.com · Nov 27

Feels good to be mentioned on HN for engineers learning AI 🥰 Helping others is a big reason I write. Here's a list on ML/AI:

## Building AI systems
• Patterns for Building LLM-based Systems: eugeneyan.com/writing/llm-...
• What We’ve Learned From A Year of Building with LLMs: applied-llms.org

Read through this, making flashcards as you go: https://eugeneyan.com/writing/llm-patterns/
Then spin up a RAG-enhanced chatbot using pgvector on your favourite subject, and keep improving it when you learn about cool techniques

---

Lots of people can get impressive demos up and running, but if you want to run AI products in production, you're going to have to do system evals. System evals make sure your product is doing what it says on the box with unquantifiable qualities.
We wrote a zine on system evals without jargon: https://forestfriends.tech
Eugene Yan has written extensively on it https://eugeneyan.com/writing/evals/
Hamel has as well. https://hamel.dev/blog/posts/evals/

November 27, 2024 at 5:36 PM

Vivek Kalyan

@vivekkalyan.com

Methodology paper detailing how to train small, efficient off-topic classifiers using synthetic data from LLMs.

Gabriel Chua @gabrielchua.bsky.social · Nov 27

🚨 new applied ai paper from govtech

LLMs are powerful, but they're prone to off-topic misuse, where users push them beyond their intended scope. Think harmful prompts, jailbreaks, and misuse. So how do we build better guardrails?

arxiv.org/abs/2411.12946

November 27, 2024 at 2:42 AM

Vivek Kalyan

@vivekkalyan.com

The Gemini team really delivered with gemini-exp-1121. It slaps for writing tasks; its outputs avoid the typical AI feel that other models (like ChatGPT) have, something previously only Claude achieved.

November 26, 2024 at 3:57 AM

Vivek Kalyan

@vivekkalyan.com

I'm really tempted to update my blog...

Emily Liu @emilyliu.me · Nov 25

By popular request, here's how to add Bluesky replies as your blog's comment section!

This requires some technical know-how for now, but I'm hoping that we see some no-code solutions for this pop up soon, like Ghost or Wordpress plugins.

emilyliu.me/blog/comments

November 25, 2024 at 8:24 AM

Vivek Kalyan

@vivekkalyan.com

Reading the TÜLU 3 paper from @ai2.bsky.social. It's refreshing to see a research lab treating AI as a real science with full reports, data, code, logs, evals.

Paper: allenai.org/papers/tulu-...
Demo: playground.allenai.org
Code: github.com/allenai/open...
Eval: github.com/allenai/olmes

Notes

allenai.org

November 24, 2024 at 6:04 PM

Vivek Kalyan

@vivekkalyan.com

I spent today making this with my wife, really happy with how it turned out!

November 23, 2024 at 2:32 PM

Vivek Kalyan

@vivekkalyan.com

It feels like I'm inside a black hole, space and time are warping around me.

Theo Sanderson @theo.io · Nov 16

Exploring the full bluesky firehose, in three dimensions: firehose3d.theo.io

November 23, 2024 at 11:33 AM

Vivek Kalyan

@vivekkalyan.com

println!("Hello, {}!", "🦋");

November 23, 2024 at 11:22 AM
