Caleb Fahlgren
calebfahlgren.hf.co
Caleb Fahlgren
@calebfahlgren.hf.co
SWE @hf.co
It doesn't get easier than this. Why are you writing SQL by yourself when it's almost 2025
December 2, 2024 at 12:48 PM
I did it via

Settings > Account > Handle > I have my own domain

and it should show there!
November 26, 2024 at 4:22 PM
You can literally do the histogram in one line in less than 10 seconds 💨

> from histogram(train, "Average ⬆️")
November 26, 2024 at 12:55 PM
Here's what the model licenses look like:

Lots of great open licenses in there too! 💪
November 26, 2024 at 12:55 PM
Let us know what you think or what you want to see :)

cc: @davidberenstein.bsky.social
November 25, 2024 at 7:54 PM
Let’s go!
November 22, 2024 at 8:03 PM
** log and get out of the way **
November 21, 2024 at 8:36 PM
using supabase theme, @tylerhillery.com would approve
November 21, 2024 at 8:26 PM
ray.so it's great with lots of themes!
Create beautiful images of your code
Turn your code into beautiful images. Choose from a range of syntax colors, hide or show the background, and toggle between a dark and light window.
ray.so
November 21, 2024 at 8:26 PM
Here's the library! Was fun collaborating with
@davidberenstein.bsky.social bringing the datasets and argilla all together!
github.com/cfahlgren1/o...
GitHub - cfahlgren1/observers: Track OpenAI compatible requests to a dataset
Track OpenAI compatible requests to a dataset. Contribute to cfahlgren1/observers development by creating an account on GitHub.
github.com
November 21, 2024 at 8:06 PM
The main three stores are:
• DuckDB (local, SQL over traces)
• Hugging Face Datasets (dataset viewer, sql console)
• Argilla - annotation and filtering UI
November 21, 2024 at 8:06 PM
That’s okay, there are lots of incomplete and even snapshots. The UpVoteWeb reddit dataset is one that comes to mind.

Any data that is more accessible is a win :). My hub stats dataset is just a cron script as well haha

huggingface.co/datasets/Ope...
OpenCo7/UpVoteWeb · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
November 21, 2024 at 5:03 PM
+1 and let me know if you need any help with it @tobilg.com would be nice to have the dataset viewer for it!
November 21, 2024 at 3:34 PM
We just released a library that makes it pretty seamless to send traces and LLM requests to datasets

github.com/cfahlgren1/o...

Would love to hear what you think is missing for prompts?
GitHub - cfahlgren1/observers: Track OpenAI compatible requests to a dataset
Track OpenAI compatible requests to a dataset. Contribute to cfahlgren1/observers development by creating an account on GitHub.
github.com
November 21, 2024 at 3:31 PM