Pretrained Hybrids with MAD Skills

Published in Conference on Language Modeling (COLM), 2025

Authors: Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala

Recommended citation: https://arxiv.org/abs/2406.00894