andrew-n-carr.bsky.social
@andrew-n-carr.bsky.social
co-founder leading science at Cartwheel
AI writer for TLDR AI Newsletter
co-founder Arcade

Past - Codegen at OpenAI, Brain at GoogleAI, world-ranked Tetris player
I thought this was an interesting graphic
February 16, 2025 at 10:37 PM
They did it all without Jira... Amazing
December 26, 2024 at 2:55 AM
the mean of a distribution is the point that minimizes the expected squared distance to points drawn from that distribution

I've never thought of the mean as an argmin before, but it's a neat framing!
December 10, 2024 at 11:10 PM
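The argmin framing above is easy to check numerically. Setting the derivative of E[(X - c)^2] with respect to c to zero gives c = E[X], and the sketch below confirms the same thing on samples with a brute-force search. The Gaussian parameters, the seed, and the candidate grid are arbitrary choices for illustration, not something from the post.

```python
import random
import statistics


def mean_squared_distance(center, samples):
    """Average squared distance from `center` to each sample."""
    return sum((x - center) ** 2 for x in samples) / len(samples)


# Illustrative data: the distribution and seed are arbitrary assumptions.
random.seed(0)
samples = [random.gauss(3.0, 2.0) for _ in range(1000)]
sample_mean = statistics.fmean(samples)

# Candidate centers on a grid of width 0.01 around the sample mean.
candidates = [sample_mean + step / 100 for step in range(-200, 201)]

# The candidate with the smallest average squared distance is the argmin.
best_center = min(candidates, key=lambda c: mean_squared_distance(c, samples))

print(sample_mean, best_center)
```

The two printed values coincide: among all candidates, the sample mean itself gives the smallest average squared distance.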
The top-rated ICLR paper (relight) is amazing.

IC-Light V2 is also out on GitHub

github.com/lllyasviel/I...

It's based on the Flux suite of models and has stunning results
IC-Light V2 (Flux-based IC-Light models) · lllyasviel IC-Light · Discussion #98
Note that this post is a work in progress (wip). Maybe I will edit it a lot recently. IC-Light V2 is a series of Flux-based IC-Light models with 16ch VAE and native high resolution. We plan to have...
github.com
December 1, 2024 at 1:57 AM
good workflow

prompt r1-preview -> refine
copy all reasoning traces to claude -> prompt again
copy output and original prompt to o1-preview -> verify

This essentially solves every problem I've thrown at it, from linguistic to mathematical.
November 27, 2024 at 3:16 PM
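The three-step workflow above can be sketched as a small pipeline. This is only an illustration of the hand-off between stages: each "model" is a hypothetical callable that maps a prompt string to a reply string, the function name and prompt wording are made up, and no real vendor API is called.

```python
from typing import Callable, Tuple

# Hypothetical stand-in: any function that takes a prompt and returns text.
Model = Callable[[str], str]


def reason_refine_verify(
    prompt: str, reasoner: Model, refiner: Model, verifier: Model
) -> Tuple[str, str]:
    """Chain three models: reason, then refine, then verify."""
    # Stage 1: the reasoning model produces a trace for the prompt.
    reasoning_trace = reasoner(prompt)

    # Stage 2: a second model sees the prompt plus the full trace.
    refine_request = (
        f"Original prompt:\n{prompt}\n\n"
        f"Reasoning trace from the first model:\n{reasoning_trace}\n\n"
        "Using the trace above, answer the original prompt."
    )
    refined_answer = refiner(refine_request)

    # Stage 3: a third model checks the answer against the original prompt.
    verify_request = (
        f"Original prompt:\n{prompt}\n\n"
        f"Proposed answer:\n{refined_answer}\n\n"
        "Check the proposed answer and point out any errors."
    )
    verdict = verifier(verify_request)
    return refined_answer, verdict
```

In practice each callable would wrap whichever chat client is in use; passing plain functions keeps the sketch independent of any particular SDK.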
Genmo has released LoRA training capabilities for their generative video model Mochi

github.com/genmoai/moch...

Trains quickly on a single 80GB GPU.
November 27, 2024 at 12:24 AM
I am anxious to get my hands on r1 and grok 3.

I've heard some big moves are coming in the first two weeks of December from oai, Anthropic, and Gemini - but I'm more excited about these other two.

They feel meaningfully orthogonal in both approach and group dynamics
November 24, 2024 at 8:29 PM
Gemini Live is essentially just as good as Advanced Voice from oai. And no one is talking about either
November 24, 2024 at 1:41 AM
I've been noodling on a math problem since 2018 or so. I think I finally cracked it after a couple hours with r1-lite
November 23, 2024 at 4:43 AM
Cool new paper from NVIDIA about a hybrid state space + attention model that performs extremely well as a small model. Their 1.5B model even outperforms Llama 3.2 3B

arxiv: arxiv.org/abs/2411.13676
November 22, 2024 at 8:27 PM
👀
November 21, 2024 at 2:47 PM
DeepSeek-R1-Lite-Preview is DeepSeek's answer to o1.

🔍 o1-preview-level performance on AIME & MATH benchmarks.
💡 Transparent thought process in real-time.
🛠️ Open-source models & API coming soon!

🌐 Try it now at chat.deepseek.com
November 20, 2024 at 3:36 PM
Fun probability fact: the probability that two randomly drawn integers are coprime is 6/π², about 61%!
November 20, 2024 at 1:56 AM
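The coprime fact above can be checked by brute force. The limiting value is 6/π² (about 0.608), and the sketch below counts coprime pairs among all pairs of integers up to a cutoff. The cutoff of 500 is an arbitrary choice for illustration; a finite cutoff only approximates the limit.

```python
from math import gcd, pi


def coprime_fraction(limit):
    """Fraction of ordered pairs (a, b), 1 <= a, b <= limit, with gcd(a, b) == 1."""
    coprime_pairs = sum(
        1
        for a in range(1, limit + 1)
        for b in range(1, limit + 1)
        if gcd(a, b) == 1
    )
    return coprime_pairs / limit**2


estimate = coprime_fraction(500)  # arbitrary cutoff chosen for speed
limiting_value = 6 / pi**2

print(f"estimate: {estimate:.4f}, 6/pi^2: {limiting_value:.4f}")
```

Raising the cutoff brings the estimate closer to 6/π², at the cost of a quadratic number of gcd calls.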
I have nothing to say. Just enjoy this validation loss curve for a moment
November 19, 2024 at 11:09 PM
Where are my AI friends at?
November 19, 2024 at 11:06 PM
Are there turnkey machine shops?

Just pay $ and get an automated, garage-sized workshop?
November 19, 2024 at 3:20 AM
When deep learning startups exit:

Marble floors in Monaco glass
Wrist so frozen, yeah it's built to last
Future vision through a tinted mask
Private hangars where I count my stash
Every move calculated like math
Pull up in that Phantom, tinted glass
Stack them queries deep with this KV cash
November 16, 2024 at 3:38 PM
my favorite phrase to hear when interviewing scientists

"and this is the point where I would ask claude ..."
November 16, 2024 at 3:28 AM
Hello world
November 16, 2024 at 1:55 AM