Qwen3 VL Reranker 8B
Summary: Qwen3 VL Reranker 8B is a multimodal reranking model by Alibaba's Qwen team that scores the relevance between a query and a set of candidate documents, supporting both text and images. Supporting over 30 languages and a 32,768-token context window, this model excels as a precision refinement step in two-stage retrieval pipelines, making it ideal for applications such as multimodal search, visual document retrieval, and cross-lingual reranking where high-precision relevance scoring is required.
Intelligence
Speed
Sovereignty
Input
Output
![]()
![]()
![]()
![]()
![]()
High
Medium
Medium
Text, Image
Score
Central parameters
Description: Multimodal reranking model by Alibaba's Qwen team, scoring query-document relevance from text and image inputs across 30+ languages.
Model identifier: Qwen/Qwen3-VL-Reranker-8B
IONOS CLOUD AI Model Hub Lifecycle and Alternatives
IONOS Launch
End of Life
Alternative
Successor
May 12, 2026
N/A
Origin
Provider
Country
License
Flavor
Release
Technology
Input Length
Parameters
Tensor Type
Multilingual
Further details
Image tokenisation: Images are tokenised at 1 token per 32×32 pixel block. Images larger than 1,310,720 pixels are downscaled proportionally before tokenisation. The token cost for a given image is:
For example, a 1296×1936 px image (2,507,616 px) is downscaled to fit within 1,310,720 px, resulting in approximately 1,247 tokens. With a 32,768-token context window and ~200 tokens of query and prompt overhead, a single document can contain approximately 25 images of that resolution.
Modalities
Text
Image
Audio
Input and output
Input
Not supported
Endpoints
Chat Completions
Rerank
Not supported
v1/rerank
Features
Streaming
Reasoning
Tool calling
Not supported
Not supported
Not supported
Usage example
Rerank
The following example demonstrates how to rerank a set of documents by relevance to a query using Qwen3 VL Reranker 8B.
API Endpoint: POST https://openai.inference.de-txl.ionos.com/v1/rerank
Request:
Response:
Multimodal rerank
The following example demonstrates how to rerank image-only documents using Qwen3 VL Reranker 8B. The first document is a hosted image URL; the second is a base64-encoded image.
API Endpoint: POST https://openai.inference.de-txl.ionos.com/v1/rerank
Request:
Response:
Rate limits
Rate limits ensure fair usage and reliable access to the AI Model Hub. In addition to the contract-wide rate limits, no model-specific limits apply.
Last updated
Was this helpful?