Alignment, Simplified: Steering LLMs with Self-Generated Preferences
Published in Preprint, 2025
Authors: Dyah Adila, Changho Shin, Yijing Zhang, Frederic Sala
Recommended citation: https://arxiv.org/abs/2406.03642
Published in Preprint, 2025
Authors: Dyah Adila, Changho Shin, Yijing Zhang, Frederic Sala
Recommended citation: https://arxiv.org/abs/2406.03642