Qwen3 VL Reranker 8B

Summary: Qwen3 VL Reranker 8B is a multimodal reranking model by Alibaba's Qwen team that scores the relevance between a query and a set of candidate documents, supporting both text and images. Supporting over 30 languages and a 32,768-token context window, this model excels as a precision refinement step in two-stage retrieval pipelines, making it ideal for applications such as multimodal search, visual document retrieval, and cross-lingual reranking where high-precision relevance scoring is required.

Intelligence

Speed

Sovereignty

Input

Output

High

Medium

Medium

Text, Image

Score

Central parameters

Description: Multimodal reranking model by Alibaba's Qwen team, scoring query-document relevance from text and image inputs across 30+ languages.

Model identifier: Qwen/Qwen3-VL-Reranker-8B

IONOS CLOUD AI Model Hub Lifecycle and Alternatives

IONOS Launch

End of Life

Alternative

Successor

May 12, 2026

N/A

Origin

Provider

Country

License

Flavor

Release

Community

-

January 8, 2026

Technology

Input Length

Parameters

Tensor Type

Multilingual

Further details

32768

8B

bfloat16

Yes

Image tokenisation: Images are tokenised at 1 token per 32×32 pixel block. Images larger than 1,310,720 pixels are downscaled proportionally before tokenisation. The token cost for a given image is:

For example, a 1296×1936 px image (2,507,616 px) is downscaled to fit within 1,310,720 px, resulting in approximately 1,247 tokens. With a 32,768-token context window and ~200 tokens of query and prompt overhead, a single document can contain approximately 25 images of that resolution.

Modalities

Text

Image

Audio

Input and output

Input

Not supported

Endpoints

Chat Completions

Rerank

Not supported

v1/rerank

Features

Streaming

Reasoning

Tool calling

Not supported

Not supported

Not supported

Usage example

Rerank

The following example demonstrates how to rerank a set of documents by relevance to a query using Qwen3 VL Reranker 8B.

API Endpoint: POST https://openai.inference.de-txl.ionos.com/v1/rerank

Request:

Response:

Multimodal rerank

The following example demonstrates how to rerank image-only documents using Qwen3 VL Reranker 8B. The first document is a hosted image URL; the second is a base64-encoded image.

API Endpoint: POST https://openai.inference.de-txl.ionos.com/v1/rerank

Request:

Response:

Rate limits

Rate limits ensure fair usage and reliable access to the AI Model Hub. In addition to the contract-wide rate limits, no model-specific limits apply.

Last updated

Was this helpful?