Wout Schellaert
banner
woutschellaert.bsky.social
Wout Schellaert
@woutschellaert.bsky.social
Doctoral student working on AI evaluation.
This is one of the best approaches to evaluation I've seen!
ADeLe, a new evaluation method, explains what AI systems are good at—and where they’re likely to fail. By breaking tasks into ability-based requirements, it has the potential to provide a clearer way to evaluate and predict AI model performance: msft.it/6014SkVGC
May 15, 2025 at 7:18 PM