Binky 🥰
binky.ai
Binky 🥰
@binky.ai
Private, Local AI Tools for Regular People
America’s mega-tech firms and their destruction of the USA’s internal competitiveness and talent development has consequences.

archive.ph/2025.08.21-2...
archive.ph
August 25, 2025 at 10:47 AM
🧵 AGENTIC AI

"Agentic AI" is a class of artificial intelligence that focuses on autonomous systems that can make decisions and perform tasks WITHOUT human intervention.

Quite literally:

The AI is an "agent" that acts for you.

#LLMZoomcamp 🧵
Agentic AI - Wikipedia
en.wikipedia.org
July 27, 2025 at 5:25 PM
"Our best guess is that SQLite is the second mostly widely deployed software library, after libz."

#LLMZoomcamp

www.sqlite.org/mostdeployed...
Most Widely Deployed SQL Database Engine
www.sqlite.org
July 7, 2025 at 1:44 PM
DuckDB has 0 external dependencies, which is fairly cool.

#LLMZoomcamp

duckdb.org/why_duckdb
Why DuckDB
There are many database management systems (DBMS) out there. But there is no one-size-fits all database system. All take different trade-offs to better adjust to specific use cases. DuckDB is no diffe...
duckdb.org
July 7, 2025 at 1:30 PM
Here's the dlt "quickstart" if you're trying to find it

#LLMZoomcamp

dlthub.com/docs/tutoria...
Build a dlt pipeline | dlt Docs
Build a data pipeline with dlt
dlthub.com
July 7, 2025 at 1:24 PM
Embeddings =

Turning non-numerical data into numerical data, while preserving meaning and context. Similar non-numerical data, when entered into an embedding algorithm, should produce similar numerical data.

#LLMZoomcamp
June 30, 2025 at 1:49 AM
Embedding -

A representation learning technique that maps complex, high-dimensional data into a lower-dimensional vector space of numerical vectors.

Meaningful patterns or relationships are preserved.

See, also: dimensionality reduction.

en.wikipedia.org/wiki/Embeddi...

#LLMZoomcamp
Embedding (machine learning) - Wikipedia
en.wikipedia.org
June 27, 2025 at 6:33 PM
There is current an explosion of vector search tools, as people seek to capitalize on the current absolutely insatiable appetite for LLM software.

#LLMZoomcamp
June 26, 2025 at 2:11 PM
Follow-up to previous thread with some background links about vector search and semantic similarity.

You use vector search tools in order to find documents with similar semantics. i.e. Use vector search to find information you want.

#LLMZoomcamp
June 26, 2025 at 2:09 PM
Semantic Similarity (DEF) -

Ametric defined over a set of documents or terms, where the idea of distance between items is based on the likeness of their meaning or semantic content as opposed to lexicographical similarity.

en.wikipedia.org/wiki/Semanti...

#LLMZoomcamp
Semantic similarity - Wikipedia
en.wikipedia.org
June 26, 2025 at 2:00 PM
LLMs are rapidly advancing in capabilities, and foundational LLMs can be connected to knowledge bases that can provide more specialized context to user queries.

In the coming weeks, we will be talking a lot about:
1. Retrieval-augmented Generation
2. Model Context Protocol Servers

#LLMZoomcamp
Retrieval-augmented generation - Wikipedia
en.wikipedia.org
June 21, 2025 at 7:49 PM
Language models aren't new, but since the release of ChatGPT in 2022, LARGE language models have dominated the conversation.

What sets these models apart is the LARGE number of parameters they use and the LARGE amounts of data they've been trained on.

#LLMZoomcamp

en.wikipedia.org/wiki/Large_l...
Large language model - Wikipedia
en.wikipedia.org
June 21, 2025 at 7:46 PM
Converting words to numerical representations allows us to take advantage of vector spaces.

#LLMZoomcamp

en.wikipedia.org/wiki/Vector_...
Vector space model - Wikipedia
en.wikipedia.org
June 21, 2025 at 7:39 PM
Tokenization = first & last step of text processing & modeling. Text is broken into "tokens" and each token is assigned a numerical representation, or "index", which can be used to feed into a model.

#LLMZoomcamp

docs.mistral.ai/guides/token...
Tokenization | Mistral AI
Tokenization is a fundamental step in LLMs. It is the process of breaking down text into smaller subword units, known as tokens. We recently open-sourced our tokenizer at Mistral AI. This guide will w...
docs.mistral.ai
June 21, 2025 at 7:29 PM