Anthropic wants to run Claude models on Microsoft's Maia chip
Briefly

Anthropic is in early-stage talks with Microsoft about using Azure Maia servers for AI inference. Maia 200 is designed to run inference efficiently: it executes existing AI models rather than training new ones. Anthropic is a major Microsoft customer, with $30 billion committed to Azure spending, and Microsoft has already achieved cost savings using Maia for Copilot tools that support both OpenAI and Anthropic models. The two companies also have a broader investment relationship, including Microsoft's planned investment of up to $5 billion and a partnership under which Microsoft and Nvidia will jointly invest up to $15 billion. Microsoft is allocating additional Nvidia capacity and new server clusters to Anthropic, alongside a Copilot deal for Claude models.
"Anthropic is in talks with Microsoft about using servers with Azure Maia, Microsoft's proprietary AI chip. According to The Information, the negotiations are still in the early stages. Anthropic is one of Microsoft's largest customers, having committed $30 billion in Azure spending. The chip in question is the Maia 200, which Microsoft developed specifically for inference, that is, for running existing AI models as efficiently as possible rather than training new ones."
"Microsoft has already realized cost savings with this chip through Copilot tools, which support both OpenAI and Anthropic models. We recently discussed this AI processor with Andrew Wall, General Manager of Azure Maia at Microsoft."
Strong investment relationship
"A substantial financial relationship already exists between Microsoft and Anthropic. Late last year, Microsoft announced plans to invest up to $5 billion in Anthropic."
"In November, a strategic partnership with Nvidia was also announced, under which Microsoft and Nvidia will jointly invest up to $15 billion in Anthropic. In turn, Anthropic has committed to $30 billion in Azure spending. Part of that collaboration involves the use of Claude models for Microsoft Copilot, a deal worth at least $500 million. Microsoft has also allocated additional existing Nvidia server capacity to Anthropic and is building new server clusters specifically for the AI company."
"Following the example of Google's TPUs and Amazon's Trainium, Microsoft aims to build its own chip ecosystem. Maia currently focuses on inference and on reducing the cost of running models. Microsoft does not yet have an AI model that competes with the strongest LLMs, but its Microsoft AI (MAI) team, established in March 2024, would like to achieve this eventually."
Read at Techzine Global