Microsoft is increasing its AI footprint with the discharge of two new fashions that its groups skilled utterly in-house. MAI-Voice-1 is the tech main’s first pure speech era mannequin, whereas MAI-1-preview is text-based and is the corporate’s first basis mannequin skilled end-to-end. MAI-Voice-1 is at present getting used within the Copilot Each day and Podcast options. Microsoft has made MAI-1-preview accessible for public assessments on LMArena, and can start previewing it in choose Copilot conditions within the coming weeks.
In an interview with Semafor, Microsoft AI division chief Mustafa Suleyman stated the pair of fashions was developed with a concentrate on effectivity and cost-effectiveness. MAI-Voice-1 runs on a single GPU and MAI-1-preview was skilled on about 15,000 Nvidia H-100 GPUs. For context, different fashions, corresponding to xAI’s Grok, took greater than 100,000 of these chips for coaching. “More and more, the artwork and craft of coaching fashions is choosing the right information and never losing any of your flops on pointless tokens that didn’t really train your mannequin very a lot,” Suleyman stated.
Though it’s getting used to check the in-house fashions, Microsoft Copilot is primarily constructed on OpenAI’s GPT tech. The choice to construct its personal fashions, regardless of having sunk billion-dollar investments within the newer AI firm, signifies that Microsoft needs to be an unbiased competitor on this house. Whereas that would take time to achieve parity with the businesses which have emerged as forerunners in AI growth, Suleyman informed Semafor that Microsoft has “an infinite five-year roadmap that we’re investing in quarter after quarter.” With some issues arising that AI might be going through a bubble-pop, Microsoft’s timeline will have to be aggressive to make sure taking the unbiased path is worth it.