Skip to content
/
OpenRouter
© 2026 OpenRouter, Inc

Product

  • Chat
  • Rankings
  • Apps
  • Models
  • Providers
  • Pricing
  • Enterprise
  • Labs

Company

  • About
  • Announcements
  • CareersHiring
  • Privacy
  • Terms of Service
  • Support
  • State of AI
  • Works With OR
  • Data

Developer

  • Documentation
  • API Reference
  • SDK
  • Status

Connect

  • Discord
  • GitHub
  • LinkedIn
  • X
  • YouTube
Favicon for qwen

Qwen: Qwen3 VL 32B Instruct

qwen/qwen3-vl-32b-instruct

ChatCompare

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text comprehension, enabling fine-grained spatial reasoning, document and scene analysis, and long-horizon video understanding.Robust OCR in 32 languages, and enhanced multimodal fusion through Interleaved-MRoPE and DeepStack architectures. Optimized for agentic interaction and visual tool use, Qwen3-VL-32B delivers state-of-the-art performance for complex real-world multimodal tasks.

Modalities

Input Price

35% off

$0.104per 1M

Output Price

35% off

$0.416per 1M

Context

131K

Weekly Tokens

21.8B

Released

Oct 23, 2025

Overview
Playground
Providers
Performance
Pricing
Benchmarks
Apps
Activity
Uptime
API

Performance for Qwen3 VL 32B Instruct

Compare different providers across OpenRouter