Luo Fuli makes her debut after joining Xiaomi, explaining how the MiMo-V2-Flash model achieves such fast inference

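December 17: The 2025 Xiaomi Human x Car x Home full-ecosystem partner conference was held today, and Luo Fuli, head of the Xiaomi MiMo large model team, made her first public appearance since joining the company.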

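Yesterday evening Xiaomi made a surprise release of Xiaomi MiMo-V2-Flash, an open-source MoE model with 309B total parameters and 15B active parameters, designed for agentic AI and focused on speed. Many ITHome (IT之家) readers who tried it found its inference to be very fast.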
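To put the two parameter counts in proportion, here is a minimal arithmetic sketch. It uses only the 309B and 15B figures quoted above. The function name is illustrative, and the ratio describes parameter counts only, not measured compute or speed.

```python
def active_fraction(active_params_b: float, total_params_b: float) -> float:
    """Share of the model's parameters that are active per token."""
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    return active_params_b / total_params_b


# Figures quoted in the article: 15B active out of 309B total.
fraction = active_fraction(15, 309)
print(f"Active share per token: {fraction:.2%}")
```

In other words, under 5% of the parameters take part in computing any single token, which is the general reason a sparse MoE model can be cheaper to run than a dense model of the same total size.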

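Luo Fuli explained that the architecture of MiMo-V2-Flash was designed around extreme inference efficiency: 3 layers of MTP (multi-token prediction) accelerate inference through parallel token verification, delivering a 2.0x to 2.6x inference speedup.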
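A simplified model helps relate the number of drafted tokens to a speedup in that range. The sketch below is not Xiaomi's method or measurement. It assumes that each of the 3 drafted tokens is accepted independently with the same probability, that drafting stops at the first rejection, and that the cost of the draft layers is ignored. The function name and the acceptance probabilities are hypothetical.

```python
def expected_tokens_per_step(accept_prob: float, draft_tokens: int = 3) -> float:
    """Expected tokens emitted per verification pass.

    Assumption: each drafted token is accepted independently with
    probability accept_prob, and acceptance stops at the first rejection.
    The verification pass itself always contributes one token, so the
    result is 1 + p + p^2 + ... + p^draft_tokens.
    """
    if not 0.0 <= accept_prob <= 1.0:
        raise ValueError("accept_prob must be in [0, 1]")
    return sum(accept_prob ** i for i in range(draft_tokens + 1))


# Hypothetical acceptance rates, chosen only to illustrate the trend.
for p in (0.5, 0.6, 0.7, 0.8):
    print(f"accept_prob={p:.1f} -> {expected_tokens_per_step(p):.3f} tokens per step")
```

Under these assumptions the upper bound with 3 drafted tokens is 4 tokens per step, so a real-world gain of 2.0x to 2.6x is consistent with drafts that are accepted often but not always.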

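With 309B total parameters (15B active), MiMo-V2-Flash ranks in the top 2 among open-source models worldwide on code and agent benchmarks. It also shows an early ability to simulate the world: it can write an operating system in HTML, simulate the solar system, draw a Christmas tree, and more.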

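Luo Fuli also discussed next-generation agent systems, arguing that such a system is not a "language simulator" but a true "agent" that can understand the world and coexist with it.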

Copyright of this article belongs to the author. Do not reproduce without permission.
