Dialz: A Python Toolkit for Steering Vectors
ArXiv: arxiv.org/abs/2505.06262
Docs: cardiffnlp.github.io/dialz/
Repo: github.com/cardiffnlp/d...
A Python package to help you create, apply and visualise steering vectors for anything you want - from sycophancy to bias.
Dialz: A Python Toolkit for Steering Vectors
ArXiv: arxiv.org/abs/2505.06262
Docs: cardiffnlp.github.io/dialz/
Repo: github.com/cardiffnlp/d...
A Python package to help you create, apply and visualise steering vectors for anything you want - from sycophancy to bias.
Will follow up with favourite papers in blog post form soon!
Will follow up with favourite papers in blog post form soon!