you need to go deeper.
Ask:
✅ Is your retriever surfacing the right chunks?
✅ Is your generator actually using them — or just hallucinating? 🤔
Here is how I like to think about my RAG metrics (inspired by RAGAS)
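A minimal sketch of the two questions above, in plain Python. This is not RAGAS's implementation: the function names, the chunk-ID inputs, and the token-overlap heuristic are all assumptions made for illustration. The first function scores the retriever against a hand-labeled set of relevant chunks; the second is a crude proxy for whether the answer is supported by the retrieved text.

```python
def retrieval_precision_recall(retrieved_ids, relevant_ids):
    """Retriever check: how many retrieved chunks are relevant, and
    how many relevant chunks were retrieved (IDs are hypothetical)."""
    retrieved = set(retrieved_ids)
    relevant = set(relevant_ids)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


def token_overlap_groundedness(answer, contexts):
    """Generator check (toy proxy): fraction of answer tokens that also
    appear somewhere in the retrieved contexts. Real faithfulness
    metrics typically use an LLM judge; this is only a rough stand-in."""
    answer_tokens = answer.lower().split()
    if not answer_tokens:
        return 0.0
    context_tokens = set(" ".join(contexts).lower().split())
    supported = sum(1 for tok in answer_tokens if tok in context_tokens)
    return supported / len(answer_tokens)


if __name__ == "__main__":
    p, r = retrieval_precision_recall(["a", "b", "c", "d"], ["a", "b", "e"])
    print(f"precision={p:.2f} recall={r:.2f}")
    print(token_overlap_groundedness("paris is big", ["paris is the capital"]))
```

A low retrieval score with a high groundedness score points at the retriever; the reverse points at the generator ignoring its context.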
Here are the essential agentic workflows. 👇
It's a leading format for data engineering, data science, and machine learning.
youtube.com/shorts/_CnEK...
This didn't happen to me recently :)
To learn more:
An Empirical Analysis of the Python Package Index (PyPI) - arxiv.org/pdf/1907.11073
blog.checkpoint.com/securing-the...
blog.orsinium.dev/posts/py/pyp...
My Video:
youtube.com/shorts/H1Uja...
Mechanistic interpretability is a useful tool for investigating and fixing the issue.
I am using Transluce's Monitor here:
My video summary: www.youtube.com/shorts/Kuh-i...
Try Monitor: monitor.transluce.org/dashboard
youtube.com/shorts/dt4uX...
What is the best single-node dataframe library?
For Polars check out:
github.com/pola-rs/polars
Polars vs. pandas: What’s the Difference?
blog.jetbrains.com/pycharm/2024...
Database-like ops benchmark - duckdblabs.github.io/db-benchmark/
Short Video:
youtube.com/shorts/8DkIR...
Check out a scientific approach that experiments with model architecture, synthetic datasets, and tasks to understand how language models work.
My short intro: youtu.be/9saXkwHKaLs
Longer Video: ICML 2024 Tutorial by Zeyuan Allen-Zhu - youtu.be/yBL7J0kgldU
the importance of data quality
My video: youtube.com/shorts/-_DGp...
Background:
Hannaneh Hajishirzi - OLMo: Accelerating the Science of Language Modeling (COLM)
www.youtube.com/watch?v=qMTz...
Molmo and PixMo paper - arxiv.org/pdf/2409.17146
Test yourself:
Are you smarter than a language model? -
joel.tools/smarter/
Language modeling game!
rr-lm-game.herokuapp.com
Are You Smarter Than An LLM?
d.erenrich.net/are-you-smar...
My video on the topic:
youtu.be/kXQGivEAF1U
Exploring Mean Error, Mean Squared Error, and Log Loss
youtu.be/S_zxVfKI55c
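A small sketch of the three metrics in plain Python, assuming the standard textbook definitions (the video may present them differently). The sign convention for mean error (prediction minus actual) and the clipping constant `eps` are assumptions.

```python
import math


def mean_error(y_true, y_pred):
    """Average signed error (prediction minus actual).
    Positive and negative errors can cancel each other out."""
    return sum(p - t for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_squared_error(y_true, y_pred):
    """Average squared error; large misses are penalized more heavily."""
    return sum((p - t) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def log_loss(y_true, p_pred, eps=1e-15):
    """Binary cross-entropy for 0/1 labels and predicted probabilities.
    Probabilities are clipped to [eps, 1 - eps] to avoid log(0)."""
    total = 0.0
    for t, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(y_true)


if __name__ == "__main__":
    actual = [1, 2, 3]
    predicted = [2, 2, 2]
    print(mean_error(actual, predicted))          # errors cancel out
    print(mean_squared_error(actual, predicted))  # errors do not cancel
    print(log_loss([1, 0], [0.5, 0.5]))
```

The first two calls use the same predictions on purpose: mean error hides the misses because they cancel, while mean squared error does not.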
The VN1 Forecasting Competition showed winning techniques including time series foundation models, deep learning, statistical methods, machine learning, and ensembling.
Check out the techniques of the top 5 teams.
www.youtube.com/watch?v=CRGA...