But in the meantime: a lot of jobs are actually a lot like homework.
But in the meantime: a lot of jobs are actually a lot like homework.
The engineering seems to progress much faster than the first principles research, but I’d be happy to be proven wrong.
The engineering seems to progress much faster than the first principles research, but I’d be happy to be proven wrong.
o3 is the GPT-2 of TTC. Costs will come down. Quality will go up. Efficiency will go up. Open source will emerge.
o3 is the GPT-2 of TTC. Costs will come down. Quality will go up. Efficiency will go up. Open source will emerge.
But in the face of the existence of gpt-4o, deepseek v3, and Claude sonnet 3.5 that this process stopped in March of 2023.
You think gpt-4 was the pinnacle?
But in the face of the existence of gpt-4o, deepseek v3, and Claude sonnet 3.5 that this process stopped in March of 2023.
You think gpt-4 was the pinnacle?
So your assertion is wrong.
Sticking to the gpt-4 lineage you are still wrong. 4o is better (according to my private evals and most people’s public evals) and <$ than GPT-4 of last year
So your assertion is wrong.
Sticking to the gpt-4 lineage you are still wrong. 4o is better (according to my private evals and most people’s public evals) and <$ than GPT-4 of last year
I eval these things literally every day against my private dataset and custom task. Not sure what motivates a person to state what everyone knows is false. Carry on.
I eval these things literally every day against my private dataset and custom task. Not sure what motivates a person to state what everyone knows is false. Carry on.
www.reddit.com/r/mlscaling/...
So maybe I’ll need to wait until next summer to demonstrate 1000x. Or maybe tomorrow I’ll have time to find a good example of distillation that is 1000x.
www.reddit.com/r/mlscaling/...
So maybe I’ll need to wait until next summer to demonstrate 1000x. Or maybe tomorrow I’ll have time to find a good example of distillation that is 1000x.
The technique works.
In 2 years it will cost $1720 and 2 years after it will be $17.20.
Have a bit of foresight.
The technique works.
In 2 years it will cost $1720 and 2 years after it will be $17.20.
Have a bit of foresight.
We’ve crossed an important threshold when one can fine-tune a model for any task that a human can do.
It’s not the same as AGI but it is an important milestone.
Few jobs can be automated away with O3-ish technology but most tasks in each job can be (w/ $$$)
We’ve crossed an important threshold when one can fine-tune a model for any task that a human can do.
It’s not the same as AGI but it is an important milestone.
Few jobs can be automated away with O3-ish technology but most tasks in each job can be (w/ $$$)