PRISM uses VLMs and iterative in-context learning to automatically generate effective, human-readable prompts using only black-box access to image generation models.
PRISM uses VLMs and iterative in-context learning to automatically generate effective, human-readable prompts using only black-box access to image generation models.
We also find that inference-time compute that is often used to improve model performance can introduce new vulnerabilities and harm robustness.
We also find that inference-time compute that is often used to improve model performance can introduce new vulnerabilities and harm robustness.
With B Trabucco, G Sigurdsson, R Piramuthu
With B Trabucco, G Sigurdsson, R Piramuthu
- 97% accuracy in detecting and filtering harmful content
- 89% success rate in generating feasible tasks
- 82% accuracy in judging successful task completions
- 97% accuracy in detecting and filtering harmful content
- 89% success rate in generating feasible tasks
- 82% accuracy in judging successful task completions
- LLM generates tasks for 150k websites
- LLM agents complete these tasks and produce trajectories
- LLM reviews the trajectories and evaluates their success
- LLM generates tasks for 150k websites
- LLM agents complete these tasks and produce trajectories
- LLM reviews the trajectories and evaluates their success
This minimizes costly intervention calls during training while leveraging PRMs to enhance robustness to off-policy data.
This minimizes costly intervention calls during training while leveraging PRMs to enhance robustness to off-policy data.
Timestamp 36:20 in neurips.cc/virtual/2024...
📎 arxiv.org/abs/2407.01476
#NeurIPS2024 #AdaptiveFoundationModels
Timestamp 36:20 in neurips.cc/virtual/2024...
📎 arxiv.org/abs/2407.01476
#NeurIPS2024 #AdaptiveFoundationModels
Paper: arxiv.org/abs/2410.15153
Code+Dataset: github.com/wrh14/deep_u...
joint work R Wu, C Yadav, K Chaudhuri.
Paper: arxiv.org/abs/2410.15153
Code+Dataset: github.com/wrh14/deep_u...
joint work R Wu, C Yadav, K Chaudhuri.