Elon Musk Monitor

Alibaba Qwen 2.5 Vision Language Model Released in a Smaller Size, Packs Agentic Capabilities

By elonmusk | March 26, 2025


Alibaba’s Qwen team added another artificial intelligence (AI) model to the Qwen 2.5 family on Monday. Dubbed Qwen 2.5-VL-32B-Instruct, the AI model comes with improved performance and optimisations. It is a vision language model with 32 billion parameters, and joins the three billion, seven billion, and 72 billion parameter models in the Qwen 2.5 family. Like all previous models by the team, it is an open-source AI model available under a permissive licence.

Alibaba Releases Qwen 2.5-VL-32B AI Model

In a blog post, the Qwen team detailed the company’s latest vision language model (VLM). It is more capable than the Qwen 2.5 3B and 7B models, and smaller than the foundation 72B model. Older versions of the large language model (LLM) outperformed DeepSeek-V3, and the 32B model is said to outperform similarly sized systems from Google and Mistral.

Coming to its features, the Qwen 2.5-VL-32B-Instruct has an adjusted output style that provides more detailed and better-formatted responses. The researchers claimed that the responses are closely aligned with human preferences. Mathematical reasoning capability has also been improved, and the AI model can solve more complex problems.

The accuracy of image understanding capability and reasoning-focused analysis, including image parsing, content recognition, and visual logic deduction, has also been improved.

[Figure: Qwen 2.5-VL-32B-Instruct benchmark results. Photo Credit: Qwen]

Based on internal testing, the Qwen 2.5-VL-32B is claimed to have surpassed the capabilities of comparable models, such as Mistral-Small-3.1-24B and Google’s Gemma-3-27B, on the MMMU, MMMU-Pro, and MathVista benchmarks. Interestingly, the LLM was also claimed to have outperformed the much larger Qwen 2-VL-72B model on the MM-MT-Bench.

The Qwen team highlights that the latest model can act directly as a visual agent that reasons and directs tools. It is inherently capable of computer use and phone use. It accepts text, images, and videos more than an hour long as input. It also supports JSON and structured outputs.
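To illustrate why structured output matters for agentic use, here is a minimal Python sketch of how a calling program might validate a JSON reply before acting on it. The schema (a list of objects with `label` and `bbox_2d` keys), the function name `parse_detections`, and the sample reply are assumptions made for this example; they are not taken from the Qwen team's blog post.

```python
import json


def parse_detections(reply: str) -> list[dict]:
    """Validate a JSON reply listing detected objects (hypothetical schema)."""
    data = json.loads(reply)  # raises json.JSONDecodeError on malformed output
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of detections")

    detections = []
    for item in data:
        label = item["label"]
        box = item["bbox_2d"]
        # A box is assumed to be [x1, y1, x2, y2] in pixel coordinates.
        if len(box) != 4 or not all(isinstance(v, (int, float)) for v in box):
            raise ValueError(f"bad bounding box for {label!r}: {box!r}")
        x1, y1, x2, y2 = box
        if x2 <= x1 or y2 <= y1:
            raise ValueError(f"empty bounding box for {label!r}: {box!r}")
        detections.append({"label": label, "box": (x1, y1, x2, y2)})
    return detections


# Hypothetical reply, written by hand for this example.
reply = '[{"label": "search button", "bbox_2d": [120, 40, 180, 72]}]'
print(parse_detections(reply))
```

A tool-using agent would typically run a check like this before turning a box into a click or tap, so that a malformed reply fails loudly instead of triggering a wrong action.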

The baseline architecture and training remain the same as the older Qwen 2.5 models; however, the researchers implemented dynamic FPS sampling to enable the model to comprehend videos at varying sampling rates. Another enhancement lets it pinpoint specific moments in a video by gaining an understanding of temporal sequence and speed.
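The idea of sampling a video at different rates can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Qwen team's implementation: the function name `sample_frame_indices`, its parameters, and the optional frame budget are all hypothetical.

```python
def sample_frame_indices(duration_s, native_fps, target_fps, max_frames=None):
    """Pick evenly spaced frame indices for a clip sampled at target_fps.

    If max_frames is given, the effective rate is lowered so that a long
    video still fits the budget (an assumption made for this sketch).
    """
    if duration_s <= 0 or native_fps <= 0 or target_fps <= 0:
        raise ValueError("duration and frame rates must be positive")

    total_frames = int(duration_s * native_fps)
    # Never sample faster than the video was recorded.
    effective_fps = min(target_fps, native_fps)
    n_frames = round(duration_s * effective_fps)
    if max_frames is not None:
        n_frames = min(n_frames, max_frames)
    n_frames = max(1, min(n_frames, total_frames))

    # Integer arithmetic keeps the spacing exact.
    return [i * total_frames // n_frames for i in range(n_frames)]


# A 10-second clip recorded at 30 fps, sampled at 2 fps.
indices = sample_frame_indices(10, 30, 2)
print(len(indices), indices[:3])
```

Dividing each index by the native frame rate gives the timestamp of the sampled frame, which is the kind of information a model needs in order to relate what it sees to a specific moment in the video.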

Qwen 2.5-VL-32B-Instruct is available to download from GitHub and its Hugging Face listing. The model comes with an Apache 2.0 licence, which allows both academic and commercial usage.


