Pretrained Hybrids with MAD Skills
Published in Conference on Language Modeling (COLM), 2025
Authors: Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala
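Recommended citation: Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi, Sonia Cromp, Chengjun Wu, Chengyu Duan, and Frederic Sala. "Pretrained Hybrids with MAD Skills." Conference on Language Modeling (COLM), 2025. https://arxiv.org/abs/2406.00894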