Close Menu
Elon Musk Monitor
  • Home
  • Elon Musk
  • AI
  • Cybertruck
    • DOGE & Cryptocurrency
    • Financial & Business
  • Grok
    • Hyperloop & Urban Mobility
    • Innovations & Future Projects
  • Mars Colonization
  • Neuralink
    • Philanthropy & Humanitarian Efforts
    • Public Perception & Cultural Impact
    • SolarCity & Renewable Energy
  • SpaceX
  • Starlink
  • Tesla
    • The Boring Company
  • X

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Bitcoin Whales Pull 4,500 BTC From Binance, Hinting At Incoming Rally

June 18, 2025

Enlight secures financing for Gecama hybridisation – reNews

June 18, 2025

Canva Adds Create a Video Clip Feature Powered by Google’s Veo 3 Video Generation AI Model

June 18, 2025
Facebook X (Twitter) Instagram
Elon Musk Monitor
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • Home
  • Elon Musk
  • AI
  • Cybertruck
    • DOGE & Cryptocurrency
    • Financial & Business
  • Grok
    • Hyperloop & Urban Mobility
    • Innovations & Future Projects
  • Mars Colonization
  • Neuralink
    • Philanthropy & Humanitarian Efforts
    • Public Perception & Cultural Impact
    • SolarCity & Renewable Energy
  • SpaceX
  • Starlink
  • Tesla
    • The Boring Company
  • X
Elon Musk Monitor
Home » Meta Llama 4 Scout and Maverick AI Models With MoE Architecture Released
Grok

Meta Llama 4 Scout and Maverick AI Models With MoE Architecture Released

elonmuskBy elonmuskApril 7, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link


Meta introduced the first artificial intelligence (AI) models in the Llama 4 family on Saturday. The Menlo Park-based tech giant released two models — Llama 4 Scout and Llama 4 Maverick — with native multimodal capabilities to the open community. The company says these are the first open models built with Mixture-of-Experts (MoE) architecture. Compared to the predecessor, these come with higher context windows and better power efficiency. Alongside, Meta also previewed Llama 4 Behemoth, the largest AI model in the family unveiled so far.

In a blog post, the tech giant detailed its new AI models. Just like the previous Llama models, the Llama 4 Scout and Llama 4 Maverick are open-source AI models and can be downloaded via its Hugging Face listing or the dedicated Llama website. Starting today, users can also experience the Llama 4 AI models in WhatsApp, Messenger, Instagram Direct, and on the Meta.AI website.

The Llama 4 Scout is a 17 billion active parameter model with 16 experts, whereas the Maverick model comes with 17 billion active parameters and 128 experts. Scout is said to be able to run on a single Nvidia H100 GPU. Additionally, the company claimed that the previewed Llama 4 Behemoth outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on several benchmarks. Meta said the Behemoth model, with 288 billion active parameters and 16 experts, was not released as it is still being trained.

llama 4 moe Llama 4 MoE Architecture

The MoE architecture in Llama 4 AI models
Photo Credit: Meta

 

Coming to the architecture, the Llama 4 models are built on an MoE architecture. The MoE architecture activates only a fraction of the total parameters based on the requirement of the initial prompt, which makes it more compute efficient for training and inference. In the pre-training phase, Meta also used new techniques such as early fusion to integrate text and vision tokens simultaneously, and MetaP to set critical model hyper-parameters and initialisation scales.

For post-training, Meta chose to start the process with lightweight supervised fine-tuning (SFT), followed by online reinforcement learning (RL) and lightweight direct preference optimisation (DPO). The sequence was chosen to not over-constrain the model. The researchers also performed SFT on only 50 percent of the “harder” dataset.

Based on internal testing, the company claimed that the Maverick model outperforms Gemini 2.0 Flash, DeepSeek v3.1, and GPT-4o on the MMMU (image reasoning), ChartQA (image understanding), GPQA Diamond (reasoning and knowledge), and MTOB (long context) benchmarks.

On the other hand, the Scout model is said to outperform Gemma 3, Mistral 3.1, and Gemini 2.0 on the MMMU, ChartQA, MMLU (reasoning and knowledge), GPQA Diamond, and MTOB benchmarks.

Meta has also taken steps to make the AI models safer in both the pre-training and post-training processes. In pre-training, the researchers used data filtering methods to ensure harmful data was not added to its knowledge base. In post-training, the researchers added open-source safety tools such as Llama Guard and Prompt Guard to protect the model from external attacks. Additionally, the researchers have also stress-tested the models internally and have allowed red-teaming of the Llama 4 Scout and Maverick models.

Notably, the models are available to the open community with a permissive Llama 4 licence. It allows both academic and commercial usage of the models, however, Meta no longer allows companies with more than 700 million monthly active users to access its AI models.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
elonmusk
  • Website

Related Posts

Canva Adds Create a Video Clip Feature Powered by Google’s Veo 3 Video Generation AI Model

June 18, 2025

Google’s Gemini 2.5 Pro and Flash AI Models Are Now Generally Available to All Users

June 18, 2025

ChatGPT on WhatsApp Gets Upgraded With Image Generation Capability

June 18, 2025
Leave A Reply Cancel Reply

Don't Miss
Cybertruck

Tesla Cybertruck police truck donor revealed

A batch of Tesla Cybertrucks were recently revealed to be a donation to the Las…

Tesla upgrades its ridiculous Cybertruck wiper after owners report issue

February 27, 2025

Tesla Cybertruck contract with State Dept. may have been modified after Biden admin

February 26, 2025

This Tesla Cybertruck feature helped it earn a ‘Best Tech’ award

February 25, 2025
Top Posts

Bitcoin Whales Pull 4,500 BTC From Binance, Hinting At Incoming Rally

June 18, 2025

Ethereum Price at Risk of Downside Break as Bears Test Key Support

June 18, 2025

XRP Price Slides Under Support Level, Selling Pressure Intensifies

June 18, 2025

$150,000 Bitcoin Is In Play—Unless This One Macro Metric Snaps

June 18, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to Elon Musk Monitor, your go-to source for comprehensive, up-to-date information on the life, work, and innovations of one of the most influential figures in the world today—Elon Musk. Our mission is to keep you informed about Musk’s ventures and projects, ranging from electric vehicles to space exploration, and everything in between. Whether you’re a tech enthusiast, investor, or simply curious about Musk’s impact on the world, we’ve got you covered.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

Bitcoin Whales Pull 4,500 BTC From Binance, Hinting At Incoming Rally

June 18, 2025

Ethereum Price at Risk of Downside Break as Bears Test Key Support

June 18, 2025

XRP Price Slides Under Support Level, Selling Pressure Intensifies

June 18, 2025
Most Popular

How I met my partner on X/Twitter

February 8, 2025

DOGE staffer resigns after racist posts uncovered. Elon Musk might bring him back.

February 9, 2025

OpenAI accuses DeepSeek of stealing data, internet digs into the ‘irony’

February 9, 2025
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 elonmuskmonitor. Designed by elonmuskmonitor.

Type above and press Enter to search. Press Esc to cancel.