OpenAI Compatible Endpoints
Please provide header value as 'Bearer ' and don't forget to add 'Bearer' HTTP Authorization Scheme before the token.
ID of the model to use
An object specifying the format that the model must output. Use json_object for JSON mode or json_schema to enforce a specific schema (Structured Outputs). If omitted, default text output is used.
The sampling temperature to be used
1An alternative to sampling with temperature
-1The number of chat completion choices to generate for each input message
1If set to true, it sends partial message deltas
falseUp to 4 sequences where the API will stop generating further tokens
The maximum number of tokens to generate in the chat. This value is now deprecated in favor of max_completion_tokens completion
16An upper bound for the number of tokens that can be generated for a completion, including visible output tokens
16It is used to penalize new tokens based on their existence in the text so far
0It is used to penalize new tokens based on their frequency in the text so far
0Used to modify the probability of specific tokens appearing in the completion
A unique identifier representing your end-user
Controls which (if any) tool is called by the model.
none means the model will not call any tool and instead generates a message.
auto means the model can pick between generating a message or calling one or more tools.
required means the model must call one or more tools.
Specifying a particular tool via {"type": "function", "function": {"name": "my_function"}} forces the model to call that tool.
none is the default when no tools are present. auto is the default if tools are present.
none means the model will not call any tool and instead generates a message. auto means the model can pick between generating a message or calling one or more tools. required means the model must call one or more tools.
Successful operation
Bad request
Rate limit exceeded. Retry after the number of seconds indicated by the Retry-After header. Limits are contract-specific; check X-RateLimit-Limit and X-RateLimit-Burst in the response headers for the values that apply to your contract.
Server error
Please provide header value as 'Bearer ' and don't forget to add 'Bearer' HTTP Authorization Scheme before the token.
ID of the model to use
The prompt to generate completions from
The sampling temperature to be used
An alternative to sampling with temperature
The number of chat completion choices to generate for each input message
If set to true, it sends partial message deltas
Up to 4 sequences where the API will stop generating further tokens
The maximum number of tokens to generate in the chat completion
It is used to penalize new tokens based on their existence in the text so far
It is used to penalize new tokens based on their frequency in the text so far
Used to modify the probability of specific tokens appearing in the completion
A unique identifier representing your end-user
Successful operation
Bad request
Rate limit exceeded. Retry after the number of seconds indicated by the Retry-After header. Limits are contract-specific; check X-RateLimit-Limit and X-RateLimit-Burst in the response headers for the values that apply to your contract.
Server error
Please provide header value as 'Bearer ' and don't forget to add 'Bearer' HTTP Authorization Scheme before the token.
Successful operation
Rate limit exceeded. Retry after the number of seconds indicated by the Retry-After header. Limits are contract-specific; check X-RateLimit-Limit and X-RateLimit-Burst in the response headers for the values that apply to your contract.
Please provide header value as 'Bearer ' and don't forget to add 'Bearer' HTTP Authorization Scheme before the token.
ID of the model to use. Please check /v1/models for available models
The prompt to generate images from
The number of images to generate. Defaults to 1.
1The size of the image to generate.
Defaults to "1024*1024".
Must be one of "1024*1024", "1792*1024", or "1024*1792".
The maximum supported resolution is "1792*1024"
1024*1024The format of the response.
b64_jsonPossible values: A unique identifier representing your end-user
Successful operation
Bad request
Rate limit exceeded. Retry after the number of seconds indicated by the Retry-After header. Limits are contract-specific; check X-RateLimit-Limit and X-RateLimit-Burst in the response headers for the values that apply to your contract.
Server error
Please provide header value as 'Bearer ' and don't forget to add 'Bearer' HTTP Authorization Scheme before the token.
ID of the model to use. Please check /v1/models for available models
The input text to create an embedding for (single string)
The input text to create embeddings for (list of strings)
Successful operation
Bad request
Rate limit exceeded. Retry after the number of seconds indicated by the Retry-After header. Limits are contract-specific; check X-RateLimit-Limit and X-RateLimit-Burst in the response headers for the values that apply to your contract.
Server error
Last updated
Was this helpful?