Microsoft Unveils MAI-Voice-1 and MAI-1-Preview Models for Copilot
Microsoft introduced two in-house models: MAI-Voice-1, a speech model that can generate a minute of audio in under one second on a single GPU (used in Copilot Daily and Copilot Labs), and MAI-1-preview, a text instruction-following model trained on about 15,000 Nvidia H100 GPUs. The company plans to roll MAI-1-preview into Copilot and says it will orchestrate specialized models focused on consumer-facing experiences.
Also mentioned in:
- Ars Technica — Microsoft Unveils MAI-Voice-1 Speech Model and MAI-1 LLM to Power Copilot
- Mpost — Microsoft Unveils MAI-Voice-1 Speech AI and MAI-1-Preview Foundation Model