- No chunking, splitting vectorizing bs
- Stateless, no vector DBs etc.
- Supports any model (deepseek, o3-mini et al)
- Reasoning traces
- Metadata filtering
- Typescript, Python support
- No chunking, splitting vectorizing bs
- Stateless, no vector DBs etc.
- Supports any model (deepseek, o3-mini et al)
- Reasoning traces
- Metadata filtering
- Typescript, Python support
~5 tok/sec.
~5 tok/sec.
On GPQA, PhDs with access to the internet got 34% outside their specialty, up to 81% inside. o3 is 87%.
Frontier Math went from the best AI at 2% to 25%
Some other big ones, too
On GPQA, PhDs with access to the internet got 34% outside their specialty, up to 81% inside. o3 is 87%.
Frontier Math went from the best AI at 2% to 25%
Some other big ones, too
Exploring conditional computation and dynamic depth in language models.
rentry.org/conditional_...
Exploring conditional computation and dynamic depth in language models.
rentry.org/conditional_...
A generative physics engine able to generate 4D dynamical worlds powered by a physics simulation platform designed for general-purpose robotics and physical AI applications.
A generative physics engine able to generate 4D dynamical worlds powered by a physics simulation platform designed for general-purpose robotics and physical AI applications.
Easy to use like DUSt3R/MASt3R, from an uncalibrated RGB video it recovers accurate, globally consistent poses & a dense map.
With @ericdexheimer.bsky.social* @ajdavison.bsky.social (*Equal Contribution)
Easy to use like DUSt3R/MASt3R, from an uncalibrated RGB video it recovers accurate, globally consistent poses & a dense map.
With @ericdexheimer.bsky.social* @ajdavison.bsky.social (*Equal Contribution)
An example would be that a simple punctuation mark, or just a single token, can prompt an LLM to produce a different response.
An example would be that a simple punctuation mark, or just a single token, can prompt an LLM to produce a different response.
Blog: ai.meta.com/blog/spdl-fa...
Repo: github.com/facebookrese...
Blog: ai.meta.com/blog/spdl-fa...
Repo: github.com/facebookrese...
They recently published a video on "Building Machine Learning Systems for a Trillion Trillion Floating Point Operations".
Link: www.youtube.com/watch?v=139U...
They recently published a video on "Building Machine Learning Systems for a Trillion Trillion Floating Point Operations".
Link: www.youtube.com/watch?v=139U...
My goal is not to argue who should get credit for what, but to show a progression of closely related ideas over time and across neighboring fields.
1/n
My goal is not to argue who should get credit for what, but to show a progression of closely related ideas over time and across neighboring fields.
1/n
The Federal Reserve
The Southern District of New York
and The Delaware Court of Chancery
The Federal Reserve
The Southern District of New York
and The Delaware Court of Chancery
One such class of functions is called "Bump functions"
1/6