Microsoft launches 3 MAI models for transcription, voice, image generation
NewsBytes | April 3, 2026 1:39 AM CST
Microsoft's MAI models faster, more accurate
MAI-Transcribe-1 handles speech-to-text in 25 languages and beats Google's Gemini 3.1 Flash and OpenAI's GPT-Transcribe on accuracy. Priced at $0.36 per hour, it is also 2.5 times faster than Microsoft's existing Azure Fast offering.
MAI-Voice-1 lets you build custom voices quickly, at $22 per million characters, while MAI-Image-2 creates images twice as fast as before, at $33 per million image tokens.
These upgrades are also making their way into apps like Copilot, Bing, and PowerPoint.