📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!
📝: arxiv.org/abs/2502.13962
📈 We consider how models’ confidence in their answers changes as test-time compute increases. Reasoning longer helps models answer more confidently!
📝: arxiv.org/abs/2502.13962
my favorites based on my first listen:
- luther
- reincarnated
- dodger blue
- gloria
my favorites based on my first listen:
- luther
- reincarnated
- dodger blue
- gloria