Microsoft Unveils MAI-Voice-1 and MAI-1-Preview Models for Copilot

The Verge • 2025-08-28T21:13:04+00:00

Microsoft introduced two in-house models: MAI-Voice-1, a speech model that can generate a minute of audio in under one second on a single GPU (used in Copilot Daily and Copilot Labs), and MAI-1-preview, a text instruction-following model trained on about 15,000 Nvidia H100 GPUs. The company plans to roll MAI-1-preview into Copilot and says it will orchestrate specialized models focused on consumer-facing experiences.

Read original ↗

Also mentioned in:

Ars Technica — Microsoft Unveils MAI-Voice-1 Speech Model and MAI-1 LLM to Power Copilot
Mpost — Microsoft Unveils MAI-Voice-1 Speech AI and MAI-1-Preview Foundation Model