Random Thoughts
Basically some unimplemented ideas.
- MoE for unified models.
Text and Image generation. https://arxiv.org/pdf/2510.24711
- Understanding vision with dynamic view
- distill unified models from different models.
- Benchmark World Knowledge in video models.