Microsoft has unveiled Magma, an innovative AI base model that seamlessly integrates visual and language processing to control software and robotic systems. Unlike previous multimodal AI models that segregated perception and control capabilities, Magma operates as a unified entity that can autonomously create plans based on user goals. Collaboratively developed by teams from prestigious institutions, this project positions itself as a major leap towards agentic AI, which can take initiative beyond simple queries, performing complex tasks across physical and digital realms.
Microsoft’s Magma integrates visual and language processing to create agentic AI that autonomously plans and acts, signaling a significant advancement in multimodal AI capabilities.
Collection
[
|
...
]