Vivek Kalyan
vivekkalyan.com
Vivek Kalyan
@vivekkalyan.com
Applied AI. Building cartograph.app. Previously: Head of AI @ handshakes.ai 🇸🇬

I write (sparsely) at: vivekkalyan.com
Most guides on RAG systems are over-optimizing for latency. If you are using RAG to automate any type of knowledge work, accuracy is much more important.

I explore the idea of spending more compute for RAG systems to significantly improve performance.

www.vivekkalyan.com/writing/scal...
Scaling Compute for RAG
Unlock higher RAG accuracy by strategically spending compute.
www.vivekkalyan.com
January 20, 2025 at 6:44 AM
This is a super high impact project. There are tons of production models in the real world still running BERT/RoBERTa models from the 2018-2019 era, I'm sincerely hoping these models are easy to finetune, just the 8k context length will be good enough reason to upgrade.
I'll get straight to the point.

We trained 2 new models. Like BERT, but modern. ModernBERT.

Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.

It's much faster, more accurate, longer context, and more useful. 🧵
December 19, 2024 at 5:33 PM
We're launching early access for Cartograph (cartograph.app). It takes your codebase, and automatically generates architecture diagrams and documentation for them.

See some demos of open-source repos here (no sign-up required):

cartograph.app/demo

(Reply here if you'd like to see others added)
December 8, 2024 at 3:01 PM
Spent more than 2 hours yesterday night trying to figure out why my solution for day 6 part 2 was working on the examples but not on the test input. Slept on it, and figured it out in 5mins when I woke up today morning. 🤦‍♂️
December 7, 2024 at 4:23 AM
Oh look, it's that time of the year. I will doing Advent of Code 2024 in Rust, hoping to get more experience using Rust to solve a wide range of problems.

github.com/vivekkalyan/...
GitHub - vivekkalyan/advent-of-code-2024: Solving Advent of Code in Rust
Solving Advent of Code in Rust. Contribute to vivekkalyan/advent-of-code-2024 development by creating an account on GitHub.
github.com
December 1, 2024 at 8:37 AM
@eugeneyan.bsky.social's blog is a gold mine if you are doing ML/AI in the industry. Writing design docs before ML projects start is an important process that I introduced at my prev org, and the post on design docs was one of the references I used to create our template.
Feels good to be mentioned on HN for engineers learning AI 🥰 Helping others is a big reason I write. Here's a list on ML/AI:

## Building AI systems
• Patterns for Building LLM-based Systems: eugeneyan.com/writing/llm-...
• What We’ve Learned From A Year of Building with LLMs: applied-llms.org
November 27, 2024 at 5:36 PM
Methodology paper detailing how to train small, efficient off-topic classifiers using synthetic data from LLMs.
🚨 new applied ai paper from govtech

LLMs are powerful, but they're prone to off-topic misuse, where users push them beyond their intended scope. Think harmful prompts, jailbreaks, and misuse. So how do we build better guardrails?

arxiv.org/abs/2411.12946
November 27, 2024 at 2:42 AM
The Gemini team really delivered with gemini-exp-1121. It slaps for writing tasks, it's outputs avoid the typical AI feel that other models (like ChatGPT) have, something previously only Claude achieved.
November 26, 2024 at 3:57 AM
I'm really tempted to update my blog...
By popular request, here's how to add Bluesky replies as your blog's comment section!

This requires some technical know-how for now, but I'm hoping that we see some no-code solutions for this pop up soon, like Ghost or Wordpress plugins.

emilyliu.me/blog/comments
November 25, 2024 at 8:24 AM
Reading the TÜLU 3 paper from @ai2.bsky.social. It's refreshing to see a research lab treating AI as a real science with full reports, data, code, logs, evals.

Paper: allenai.org/papers/tulu-...
Demo: playground.allenai.org
Code: github.com/allenai/open...
Eval: github.com/allenai/olmes

Notes
allenai.org
November 24, 2024 at 6:04 PM
I spent today making this with wife, really happy how it turned out!
November 23, 2024 at 2:32 PM
It feels like I'm inside a black hole, space and time are warping around me.
Exploring the full bluesky firehose, in three dimensions: firehose3d.theo.io
November 23, 2024 at 11:33 AM
println!("Hello, {}!", "🦋");
November 23, 2024 at 11:22 AM