Posting about data, AI, evals, and cognitive science.
eointravers.com
🇮🇪
In early testing, it's 10x, 50x, even 100x faster than existing type checkers. (We've seen >600x speed-ups over Mypy in some real-world projects.)
In early testing, it's 10x, 50x, even 100x faster than existing type checkers. (We've seen >600x speed-ups over Mypy in some real-world projects.)
eointravers.com/blog/job-scr...
eointravers.com/blog/job-scr...
If you take a one-in-n chance n times (e.g. taking 10 one-in-10 chances), what’s the probability that at least one of them will come off?
Somewhat satisfyingly, the answer, regardless of what n is, turns out to be “around 63%”. Here’s why.
(Also at eointravers.com/blog/one-in-n/)
If you take a one-in-n chance n times (e.g. taking 10 one-in-10 chances), what’s the probability that at least one of them will come off?
Somewhat satisfyingly, the answer, regardless of what n is, turns out to be “around 63%”. Here’s why.
(Also at eointravers.com/blog/one-in-n/)
Over the past two years at Unmind, I got to build two AI features: Nova AI wellbeing coach, and AI practitioner matching. It's lovely to see both get a call out in Fast Company's list of the most innovative companies in the workplace for 2025.
www.fastcompany.com/91270254/wor...
Over the past two years at Unmind, I got to build two AI features: Nova AI wellbeing coach, and AI practitioner matching. It's lovely to see both get a call out in Fast Company's list of the most innovative companies in the workplace for 2025.
www.fastcompany.com/91270254/wor...
Oh by the way hippos have migratory testicles.
Oh by the way hippos have migratory testicles.
LLM prompt-based chatbots are easy and flexible, but hard to control. Rule-based bots give you control, but require more manual work and are inflexible. Is there a middle ground? Yes, and it involves graphs.
eointravers.com/blog/structu...
LLM prompt-based chatbots are easy and flexible, but hard to control. Rule-based bots give you control, but require more manual work and are inflexible. Is there a middle ground? Yes, and it involves graphs.
eointravers.com/blog/structu...
In python, `load_dotenv()` doesn't override variables if they're already set. Use `load_dotenv(override=True)` if you want to always use the values from your `.env` file, and save yourself a lot of debugging headaches.
In python, `load_dotenv()` doesn't override variables if they're already set. Use `load_dotenv(override=True)` if you want to always use the values from your `.env` file, and save yourself a lot of debugging headaches.
Turns out: yes!
Thrilled to share our latest preprint where we used FunSearch to automatically discover symbolic cognitive models of behavior.
1/12
Turns out: yes!
Thrilled to share our latest preprint where we used FunSearch to automatically discover symbolic cognitive models of behavior.
1/12
TL;DR: Everyone [1] agrees that evals are crucial, but no one really knows how to do them for chatbots. I'm trying.
[1]: Well, e.g. @eugeneyan.com, @hamel.bsky.social, the other authors of applied-llms.org
TL;DR: Everyone [1] agrees that evals are crucial, but no one really knows how to do them for chatbots. I'm trying.
[1]: Well, e.g. @eugeneyan.com, @hamel.bsky.social, the other authors of applied-llms.org
Don't get an AI degree; the curriculum will be outdated before you graduate. Instead, study math, stats, or physics as your foundation, and stay current with AI through code-focused books, blogs, and papers.
eointravers.com/blog/llm-pro...
eointravers.com/blog/llm-pro...
github.com/pydantic/pyd...
Great to share small snippets of code (e.g: pydantic.run/store/c544b1...).
github.com/pydantic/pyd...
Great to share small snippets of code (e.g: pydantic.run/store/c544b1...).
From a technical perspective, it’s probably our most ambitious project yet. We’re about 800 PRs deep!
From a technical perspective, it’s probably our most ambitious project yet. We’re about 800 PRs deep!