rajistics.bsky.socia
rajistics.bsky.social
rajistics.bsky.socia
@rajistics.bsky.social
Fixated on practical AI - short posts and lots of short videos - at Contextual AI was at huggingface, snorkel, datarobot
If you want to really assess your RAG system —
you need to go deeper.

Ask:
✅ Is your retriever surfacing the right chunks?
✅ Is your generator actually using them — or just hallucinating? 🤔

Here is how I like to think about my RAG metrics (inspired by RAGAS)
April 1, 2025 at 2:50 PM
Are you going to chat all day with LLMs? 🤐
Here are the essential agentic workflows. 👇
January 4, 2025 at 10:30 PM
Dive into Parquet

It's a leading format for data engineering, data science, and machine learning.

youtube.com/shorts/_CnEK...
Why Parquet? Introducing the the file format for data engineering and machine learning
YouTube video by Rajistics - data science, AI, and machine learning
youtube.com
December 16, 2024 at 3:02 PM
PyPI Name Squatting

This didn't happen to me recently :)

To learn more:
An Empirical Analysis of the Python Package Index (PyPI) - arxiv.org/pdf/1907.11073

blog.checkpoint.com/securing-the...

blog.orsinium.dev/posts/py/pyp...

My Video:
youtube.com/shorts/H1Uja...
PyPI Name Squatting - How to reserve your python package name
YouTube video by Rajistics - data science, AI, and machine learning
youtube.com
December 13, 2024 at 8:13 PM
Why do language models think 9.11 is greater than 9.9? 🤔
Mechanistic Interpretability is a useful tool for investigation and fixing the issue.
I am using Transluce's Monitor here:
My video summary: www.youtube.com/shorts/Kuh-i...
Try Monitor: monitor.transluce.org/dashboard
Using Mechanistic Interpretability to Steer a Model's Predictions (Transluce's Monitor)
YouTube video by Rajistics - data science, AI, and machine learning
www.youtube.com
December 4, 2024 at 1:32 PM
My ranking of the top 26 algorithms for practical data science, breaking down their strengths, quirks, and when (or if) you should use them.

youtube.com/shorts/dt4uX...
Top 26 Data Science Algorithms
YouTube video by Rajistics - data science, AI, and machine learning
youtube.com
November 30, 2024 at 5:14 PM
Polars verus Pandas
What is the best single node dataframe?

For Polars check out:
github.com/pola-rs/polars

Polars vs. pandas: What’s the Difference?
blog.jetbrains.com/pycharm/2024...

Database-like ops benchmark - duckdblabs.github.io/db-benchmark/

Short Video:
youtube.com/shorts/8DkIR...
Pandas versus Polars: A Quick Comparison of Single Node DataFrame Alternatives
YouTube video by Rajistics - data science, AI, and machine learning
youtube.com
November 29, 2024 at 5:51 PM
The Physics of Language Models
Check out a scientific approach that experiments with model architecture, synthetic datasets, and tasks to understand how language models work.

My short intro: youtu.be/9saXkwHKaLs

Longer Video: ICML 2024 Tutorial by Zeyuan Allen-Zhu - youtu.be/yBL7J0kgldU
Physics of Language Models - Extracting Knowledge
YouTube video by Rajistics - data science, AI, and machine learning
youtu.be
November 24, 2024 at 9:57 PM
Ai2's 700k examples > Meta's 6B examples
the importance of data quality

My video: youtube.com/shorts/-_DGp...

Background:
Hannaneh Hajishirzi - OLMo: Accelerating the Science of Language Modeling (COLM)
www.youtube.com/watch?v=qMTz...
Molmo and PixMo paper -
arxiv.org/pdf/2409.17146
Improving Data Quality in MultiModal Models (Molmo and PixMO)
YouTube video by Rajistics - data science, AI, and machine learning
youtube.com
November 20, 2024 at 7:45 PM
Are you smarter than GPT-3 (you don't have a chance against GPT-4)
Test yourself:

Are you smarter than a language model? -
joel.tools/smarter/
Language modeling game!
rr-lm-game.herokuapp.com
Are You Smarter Than An LLM?
d.erenrich.net/are-you-smar...

My video on the topic:
youtu.be/kXQGivEAF1U
Are you Smarter than a AI Language Model like GPT4?
YouTube video by Rajistics - data science, AI, and machine learning
youtu.be
November 18, 2024 at 3:47 PM
Why do we use LogLoss as an error metric?
Exploring Mean Error, Mean Squared Error, and Log Loss

youtu.be/S_zxVfKI55c
Why Logloss is a better loss function than Mean Squared Error
YouTube video by Rajistics - data science, AI, and machine learning
youtu.be
November 15, 2024 at 4:22 PM
Reposted by rajistics.bsky.socia
Wow. I had no idea on BlueSky you can enable external media so you can watch YouTube videos on this platform. It overcomes the problem of only being able to upload 60 seconds of video here. It’s gets better & better
November 14, 2024 at 8:47 AM
What are the cutting-edge time series approaches? 📈✨
The VN1 Forecasting Competition showed winning techniques including time series foundation models, deep learning, statistical methods, machine learning, and ensembling.
Check out the techniques of the top 5 teams.
www.youtube.com/watch?v=CRGA...
November 14, 2024 at 1:58 PM
4 Techniques for Dimensionality Reduction: PCA, AutoEncoder, TSNE, and UMAP

youtu.be/EHWBP-OQwHk
4 Techniques for Dimensionality Reduction: PCA, AutoEncoder, TSNE, and UMAP
YouTube video by Rajistics - data science, AI, and machine learning
youtu.be
November 11, 2024 at 2:47 PM