論文 Hugging Face 発表: 2026-05-10 HF ↑22

Model Merging Scaling Laws in Large Language Models

Model Merging Scaling Laws in Large Language Models

著者: Yuanyi Wang, Yanggan Gu, Yiming Zhang, Qi Zhou, Zhaoyi Yan ほか4名

要約

We study empirical scaling laws for language model merging measured by cross-entropy. Despite its wide practical use, merging lacks a quantitative rule that predicts returns as we add experts or scale the model size. We identify a compact power law that links model size and expert number: the size-d…

#llm

同じカテゴリの記事