Alignment, Simplified: Steering LLMs with Self-Generated Preferences

Published in Preprint, 2025

Authors: Dyah Adila, Changho Shin, Yijing Zhang, Frederic Sala

Recommended citation: https://arxiv.org/abs/2406.03642