We introduce IPRO (Iterated Pareto Referent Optimisation)—a principled approach to solving multi-objective problems.
🔗 Paper: arxiv.org/abs/2402.07182
💻 Code: github.com/wilrop/ipro
why must i suffer
why must i suffer
Horrible things happening everywhere apparently
Horrible things happening everywhere apparently
LLMs are trained on just one tokenization per word, but they still understand alternative tokenizations. We show that this can be exploited to bypass safety filters without changing the text itself.
#AI #LLMs #tokenization #alignment
We introduce IPRO (Iterated Pareto Referent Optimisation)—a principled approach to solving multi-objective problems.
🔗 Paper: arxiv.org/abs/2402.07182
💻 Code: github.com/wilrop/ipro
We introduce IPRO (Iterated Pareto Referent Optimisation)—a principled approach to solving multi-objective problems.
🔗 Paper: arxiv.org/abs/2402.07182
💻 Code: github.com/wilrop/ipro
I've put it in its memory, in my details, and I even repeat it in the chat but it's just replying like 👉🥺👈
I've put it in its memory, in my details, and I even repeat it in the chat but it's just replying like 👉🥺👈
French people don't appreciate true genius
French people don't appreciate true genius
Either let me group authors or let me put acknowledgements after the main text. This isn't hard.
Either let me group authors or let me put acknowledgements after the main text. This isn't hard.
Computing a MEDIAN over a list of rows where one of the elements is just an empty array
Computing a MEDIAN over a list of rows where one of the elements is just an empty array
A strong emphasis on theoretically principled algorithms for RLHF followed by motivated practical implementations. Well-written and a clear overview of the relevant background and related work.
10/10 no comments
A strong emphasis on theoretically principled algorithms for RLHF followed by motivated practical implementations. Well-written and a clear overview of the relevant background and related work.
10/10 no comments
Not sure whether to be happy or sad
Not sure whether to be happy or sad