This framework uses a small number of randomly sampled gold standard labels to correct bias in downstream estimates based on error-prone proxies like LLM annotations
This framework uses a small number of randomly sampled gold standard labels to correct bias in downstream estimates based on error-prone proxies like LLM annotations