Bagel
1. Scalable Generative Cognitive Model (BAGEL)

BAGEL adopts a Mixture-of-Transformer-Experts (MoT) architecture comprising two transformer experts—one dedicated to multimodal understanding and the other to multimodal generation.
2. LightBagel
