Moonshot AI Muon and Moonlight LLM
Moonshot AI introduces Muon, a new optimizer, and Moonlight, a model trained with it. Muon enhances large language model training efficiency and stability, achieving superior performance with reduced computational cost. Moonlight outperforms comparable models in various benchmarks, demonstrating Muon's effectiveness. Open-sourcing promotes further research in efficient training methods.