I design tools and processes to support principled evaluation of AI systems.
lukeguerdan.com
This work was in collaboration with the amazing team @devsaxena.bsky.social (co-first author), @schancellor.bsky.social, @zstevenwu.bsky.social, and @kenholstein.bsky.social
Thank you for making my first adventure into qualitative research a delightful experience :)
- Protocols to help data scientists identify minimum standards for validity and other criteria, tailored to their specific application context
- Tools designed to help data scientists identify and apply strategies more effectively
For example, they use "swapping" to change target variables when the first one runs into unanticipated challenges, or "composing" to capture complementary dimensions of the concept that a target variable is meant to measure
💡 Simplicity
⚙️ Resource requirements
🎯 Predictive performance
🌎 Portability
In contrast, data scientists are often forced to reconcile their measurement goals with *existing* data