Generate chat completions using Qwen ILM model with image inputs.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| model | string | Yes | - | The model to use (e.g., qwen:qwen3-vl-plus) |
| messages | array | Yes | - | Array of message objects. Each message contains: - role: Role type, values: system, user- content: Message content, can be a string or array. Array items can contain:- image: Image URL or base64 encoded image- text: Text content |
| temperature | number | No | 0.7 | Controls randomness: 0.0-2.0, higher = more random |
| max_tokens | integer | No | 1024 | Maximum number of tokens to generate |
| top_p | number | No | 0.9 | Nucleus sampling: 0.0-1.0, consider tokens with top_p probability mass |
| frequency_penalty | number | No | 0.0 | Reduces repetition of frequent tokens: -2.0 to 2.0 |
| presence_penalty | number | No | 0.0 | Increases likelihood of new topics: -2.0 to 2.0 |
| n | integer | No | 1 | Number of completions to generate |
| stream | boolean | No | false | Whether to stream the response |
| stop | string | array | null | No | null | Stop sequences. Can be a string, array of strings, or null |
| extra_body | object | No | - | Additional body parameters. Contains: - metadata: Metadata object- enable_thinking: Boolean to enable thinking mode- thinking_budget: Integer value for thinking budget- response_format: Response format configuration- type: Format type (json_schema)- json_schema: JSON schema object- name: Schema name- schema: JSON schema definition |
| Parameter | Type | Description |
|---|---|---|
| id | string | Unique identifier for the completion |
| object | string | Object type, always “completion” |
| model | string | The model used for the completion |
| created_at | integer | Unix timestamp of when the completion was created |
| status | string | Status of the completion (e.g., “completed”) |
| choices | array | Array of completion choices |
| choices[].text | string | Text content of the completion |
| choices[].index | integer | Index of the choice in the choices array |
| usage | object | Token usage information |
| usage.input_tokens | integer | Number of input tokens used |
| usage.output_tokens | integer | Number of output tokens generated |
| usage.total_tokens | integer | Total number of tokens used |
| meta | object | Metadata about the completion |
| meta.provider | string | Provider name (e.g., “qwen”) |
| meta.provider_model | string | Provider-specific model name |
The model to use (e.g., qwen:qwen3-vl-plus)
"qwen:qwen3-vl-plus"
Array of message objects
Controls randomness: 0.0-2.0, higher = more random
0 <= x <= 2Maximum number of tokens to generate
Nucleus sampling: 0.0-1.0
0 <= x <= 1Reduces repetition of frequent tokens: -2.0 to 2.0
-2 <= x <= 2Increases likelihood of new topics: -2.0 to 2.0
-2 <= x <= 2Number of completions to generate
Whether to stream the response
Stop sequences
Chat completion response with structured output
Unique identifier for the completion
"6612267a-d08f-9ea0-a254-7bcea6339f49"
Object type, always 'completion'
"completion"
The model used for the completion
"qwen3-vl-plus"
Unix timestamp of when the completion was created
1767094289
Status of the completion
"completed"