Microsoft Unveils MAI-Voice-1 and MAI-1-Preview Models for Copilot

The Verge •

Microsoft introduced two in-house models: MAI-Voice-1, a speech model that can generate a minute of audio in under one second on a single GPU (used in Copilot Daily and Copilot Labs), and MAI-1-preview, a text instruction-following model trained on about 15,000 Nvidia H100 GPUs. The company plans to roll MAI-1-preview into Copilot and says it will orchestrate specialized models focused on consumer-facing experiences.

Read original ↗

Also mentioned in:

  • Ars Technica — Microsoft Unveils MAI-Voice-1 Speech Model and MAI-1 LLM to Power Copilot
  • Mpost — Microsoft Unveils MAI-Voice-1 Speech AI and MAI-1-Preview Foundation Model