1. Ask the AI to be honest, and
2. Train on the honest-prompted generations—while pretending the original prompt requested lying!
1980 TFLOP/s peak at FP8
3.35 TB/s memory bandwidth
For LLM decode at batch size 1, you only need to do about ~2 FLOP for each weight you load. But an H100 can perform ~600 FP8 operations in the time it takes 1 byte to move from the HBM to the cache.
1980 TFLOP/s peak at FP8
3.35 TB/s memory bandwidth
For LLM decode at batch size 1, you only need to do about ~2 FLOP for each weight you load. But an H100 can perform ~600 FP8 operations in the time it takes 1 byte to move from the HBM to the cache.
left: 6h ago
right: now
it has image output!
platform.openai.com/docs/models/...
left: 6h ago
right: now
it has image output!
platform.openai.com/docs/models/...
- It's uncommon for models w/same base to get updated cutoff dates. 3.5-3.7 Sonnet and 4o-4.1 are likely examples but there aren't many more.
- GPT-5 scale models don't take that much compute to train per Epoch's estimates
- Significantly lower SimpleQA than 5/5.1
- Long context improvement could indicate architectural changes
- Higher price could reflect higher price of serving
These aren't strong evidence though, I still lean slightly no
- It's uncommon for models w/same base to get updated cutoff dates. 3.5-3.7 Sonnet and 4o-4.1 are likely examples but there aren't many more.
- GPT-5 scale models don't take that much compute to train per Epoch's estimates
- Significantly lower SimpleQA than 5/5.1
- Long context improvement could indicate architectural changes
- Higher price could reflect higher price of serving
These aren't strong evidence though, I still lean slightly no
- Significantly lower SimpleQA than 5/5.1
- Long context improvement could indicate architectural changes
- Higher price could reflect higher price of serving
These aren't strong evidence though, I still lean slightly no
- Keep US far enough ahead to prevent inter-country race to ASI
- Maintain "high fence around a small yard", restricting China's access to military-relevant tech
- Keep US far enough ahead to prevent inter-country race to ASI
- Maintain "high fence around a small yard", restricting China's access to military-relevant tech