Elon Musk Monitor

Microsoft Announces Magma Foundation Model That Can Complete Multimodal Agentic Tasks

By elonmusk, February 21, 2025

Microsoft researchers announced a new foundation model on Wednesday that can perform agentic functions. Dubbed Magma, the artificial intelligence (AI) model is pre-trained on a large collection of datasets spanning text, images, video, and spatial data. The Redmond-based tech giant said that Magma extends vision-language (VL) models: it can not only understand multimodal information but also plan and act on it. The model can be used for a wide range of tasks, including computer vision, user interface (UI) navigation, and robot manipulation.

Microsoft Announces Magma Foundation Model

In a GitHub post, Microsoft researchers detailed the new Magma foundation model. Foundation models are large models pre-trained on broad data that can be adapted to many downstream tasks, and they often become the baseline for other models in a series. What sets Magma apart is the breadth of its pre-training data, which covers images, videos, and spatial data in addition to text.

The researchers stated that Magma is built on the Llama 3 AI model. On top of that base, Magma is equipped with the ability to plan and act in the visual-spatial world, so it can execute actions rather than only generate outputs like a chatbot.

Paired with camera sensors, Magma can serve as a computer vision chatbot that offers information about the world it views. It can also control the UI of a device. More interestingly, it can control robots to complete complex tasks using its agentic capabilities.

The researchers attributed these capabilities to the diverse dataset together with two technical components: Set-of-Mark and Trace-of-Mark. The former grounds actions in images, videos, and spatial data by having the model predict numeric marks for actionable targets, such as buttons or robot arms, in image space. The latter exposes the model to temporal video dynamics by having it predict how those marks will move over the coming frames before it takes action. Together, these help the model develop a strong spatial understanding.
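To make the two ideas concrete, here is a minimal Python sketch. It is not Microsoft's implementation: the `Box` type and the functions `assign_marks`, `ground_action`, and `future_trace` are hypothetical names invented for illustration, and the sketch assumes that a Set-of-Mark prediction is simply a mark number and that a Trace-of-Mark target is the future positions of one mark.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hypothetical candidate region (a button, a gripper) in pixel coordinates."""
    x0: int
    y0: int
    x1: int
    y1: int

    def center(self) -> tuple[float, float]:
        return ((self.x0 + self.x1) / 2, (self.y0 + self.y1) / 2)


def assign_marks(boxes: list[Box]) -> dict[int, Box]:
    """Set-of-Mark style labeling: give each candidate region a numeric mark."""
    return {mark: box for mark, box in enumerate(boxes, start=1)}


def ground_action(marks: dict[int, Box], predicted_mark: int) -> tuple[float, float]:
    """Map a predicted mark number back to a pixel location to act on."""
    return marks[predicted_mark].center()


def future_trace(
    track: list[tuple[float, float]], t: int, horizon: int
) -> list[tuple[float, float]]:
    """Trace-of-Mark style target (assumed form): where one mark sits
    in the `horizon` frames that follow frame `t`."""
    return track[t + 1 : t + 1 + horizon]


# Two candidate UI elements; the model is assumed to answer with mark 2.
marks = assign_marks([Box(0, 0, 10, 10), Box(20, 20, 40, 60)])
click_point = ground_action(marks, predicted_mark=2)
```

The point of the design is that the model only has to output a small integer or a short list of positions instead of raw pixel coordinates for every possible action.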

Microsoft researchers also shared benchmark scores for the AI model based on internal testing. According to the company, Magma achieved competitive scores across all the agentic evaluation tests, outperforming models from OpenAI, Alibaba, and Google. The company has not publicly released Magma as of now.


