New work presented today at the COLM Workshop on Socially Responsible Language Modelling Research led by Purbid Bambroo and in collaboration with @anamarasovic.bsky.social that probes LLM preference test sets for redundancy and inflated scores.
1/8
New work presented today at the COLM Workshop on Socially Responsible Language Modelling Research led by Purbid Bambroo and in collaboration with @anamarasovic.bsky.social that probes LLM preference test sets for redundancy and inflated scores.
1/8
We propose a new second-order metric for uncertainty quantification in robot learning that we call "Agreement Volatility."
1/5
We propose a new second-order metric for uncertainty quantification in robot learning that we call "Agreement Volatility."
1/5
1/3
1/3