Marek Suppa
Marek Suppa
@mrshu.bsky.social
𝗛𝗼𝗻𝗲𝘀𝘁𝗟𝗟𝗠

- Introduces 𝙃𝙊𝙉𝙀𝙎𝙀𝙏, a dataset with 930 queries in six categories to evaluate LLM honesty

- Proposes curiosity-driven prompting and two-stage fine-tuning for improving honesty and helpfulness

- Demonstrates up to 124.7% honesty and helpfulness improvement in models like Mistral-7b
December 6, 2024 at 9:06 PM
Multimodal Large Language Models Make Text-to-Image Generative Models Align Better

- VisionPrefer datset captures diverse preferences (prompt-following, aesthetic, fidelity, harmlessness) using multimodal LLMs

- VP-Score model matches human accuracy in preference prediction, guiding model tuning
December 5, 2024 at 10:28 PM
It unfortunately doesn't work that well with short (<200 tokens) responses.

www.nature.com/articles/s41...
November 24, 2024 at 9:47 AM
Does the TULU paper count?

arxiv.org/abs/2306.04751
November 23, 2024 at 9:38 PM