Why Microsoft AI's approach is right time, right place

Published June 3, 2026

Microsoft AI launched seven in-house foundation models at Microsoft Build 2026. Comparing benchmarks and dwelling on the freedom the company has now that it's out of its OpenAI contract is the easy storyline. Microsoft is playing catch-up in foundation models, but the bigger story is that the company has the right approach at the right time.

At Build, Microsoft AI CEO Mustafa Suleyman took the wraps off the company's new in-house models.

MAI-Image-2.5 / 2.5-Flash (image generation & editing)
MAI-Transcribe-1.5 (speech-to-text)
MAI-Voice-2 / Voice-2-Flash (speech generation)
MAI-Thinking-1 (35B-parameter reasoning model)
MAI-Code-1-Flash (5B coding model)

"These models are all built with real attention to detail and a commitment to making very practical and efficient tools that are tuned to just how you work in the real world," said Suleyman.

Microsoft CEO Satya Nadella emphasized real-world use cases and co-designed models that can adapt, "hill climb" and learn your enterprise from the bottom up, and noted that they are trained on verified and licensed data. Nadella said:

"We believe that time has come for every company to move from consuming a frontier model to fully participating in the frontier ecosystem. You can have your own private evals and outcomes."

Why does this Microsoft approach to models work? The Microsoft AI models and message land just as enterprises are freaking out over token costs, just as they are worried about being locked in with Anthropic or OpenAI and just as they're still hunting for returns and business outcomes. The free spending on AI is going to end in 3, 2, 1.

Microsoft's job in the frontier equation is to provide the scaffolding for models that you can differentiate with IP you own and control, and run efficiently, said Nadella, who noted that token costs and performance are critical. Microsoft announced a partnership with the Mayo Clinic to highlight the co-designed model approach. Microsoft also rolled out a private tuning service.

The introduction of Frontier Tuning from Microsoft means that MAI models can adapt to your workflows and data. Microsoft said Frontier Tuning has shown that custom models are better and more efficient.

Here are the themes of Microsoft AI's approach from the Build 2026 keynote and blogs.

Microsoft is focusing in-house models on the edge as well as the cloud. Nadella outlined Microsoft's small models that ship with Windows and run on device. Aion-1.0-Instruct (reasoning SLM) and Aion 1.0 Plan (planning model) combine to form a "full local agentic loop without having to round trip to the cloud," said Nadella.

These models will become more critical as edge inference becomes the norm. It doesn't hurt that Microsoft can preinstall those small models across the Windows ecosystem.

Efficiency matters. Microsoft AI's models are tuned for efficiency on Maia 200, Microsoft's in-house AI accelerator. Suleyman said: "We have been carefully co-designing our models with our own silicon. We've optimized MAI Thinking One on our Maia 200 chip and benchmarked it. On top of the 30% performance improvement, we're now seeing a further 1.4x performance per watt gain when we run our MAI models on the Maia 200 end to end."

Microsoft's in-house foundation models can serve various use cases and deliver internal ROI by running across the company's products. Microsoft's models are doing local inference across multiple applications. Microsoft's approach to its foundation models may also prove useful to enterprises looking to tailor them to individual use cases. "We now have a roster of seven new world-class models to keep everybody working at the absolute frontier, and we're really looking forward to everybody being able to co-create your own unique agents adapted to you that you'll control. I really feel like this is a new era in AI, an era of AI that you control on your terms," said Suleyman.

Larry Dignan

Editor in Chief of Constellation Insights
Constellation Research

