Create a response

Creates a streaming or non-streaming response using the OpenAI Responses API format. Supports text, images, files, audio, video, function calling, web search, file search, code interpreter, reasoning, and more.

Authentication

Authorization

string

required

Bearer token. Use your API key as the bearer token in the Authorization header.Format: Bearer <SUNRA_KEY>

Request

This endpoint expects an object.

model

string

required

Model ID used to generate the response. Browse available models at sunra.ai/models.

input

string | object[]

Input for the response request. Can be a string or an array of input items including messages, function calls, function call outputs, reasoning items, and output messages.

Show input item types

role

string

required

The role of the message author. Supported values: user, assistant, system, developer.

content

string | object[]

The content of the message. Can be a string or an array of content parts.

Show content part types

InputText
InputImage
InputFile
InputAudio
InputVideo

type

string

required

Value: input_text.

text

string

required

The text content.

type

string

required

Value: input_image.

image_url

string | null

The URL of the image, or a base64-encoded data URI.

detail

string

required

Image detail level. Supported values: auto, high, low.

type

string

required

Value: input_file.

file_id

string | null

The ID of a previously uploaded file.

file_data

string

Base64-encoded file data.

filename

string

The name of the file.

file_url

string

The URL of the file.

type

string

required

Value: input_audio.

input_audio

object

required

Audio input data.

Show properties

data

string

required

Base64-encoded audio data.

format

string

required

Audio format. Supported values: mp3, wav.

type

string

required

Value: input_video.

video_url

string

required

A base64 data URL or remote URL that resolves to a video file.

type

string

Value: message. Optional for easy input messages.

phase

string

The phase of an assistant message. Supported values: commentary, final_answer. For follow-up requests, preserve and resend phase on all assistant messages.

string

required

The ID of the message item.

type

string

Value: message.

role

string

required

The role. Supported values: user, system, developer.

content

object[] | null

Array of content parts (input_text, input_image, input_file, input_audio, input_video).

string

required

The ID of the output message.

type

string

required

Value: message.

role

string

required

Value: assistant.

status

string

Status of the message. Supported values: completed, incomplete, in_progress.

content

string | object[]

required

Array of content items (output_text, refusal).

phase

string

The phase. Supported values: commentary, final_answer.

type

string

required

Value: function_call.

string

required

The unique ID of the function call item.

call_id

string

required

The call ID to match with the function call output.

name

string

required

The name of the function.

arguments

string

required

The arguments in JSON string format.

status

string

Status. Supported values: in_progress, completed, incomplete.

type

string

required

Value: function_call_output.

call_id

string

required

The call ID of the function call being responded to.

output

string | object[]

required

The output of the function call. Can be a string or array of content parts (input_text, input_image, input_file).

string | null

Optional ID for the output item.

status

string

Status. Supported values: in_progress, completed, incomplete.

type

string

required

Value: reasoning.

string

required

The unique ID of the reasoning item.

summary

object[]

Array of reasoning summary text items, each with type: "summary_text" and text.

content

object[] | null

Array of reasoning text content items, each with type: "reasoning_text" and text.

encrypted_content

string | null

Encrypted reasoning content for models that support it.

status

string

Status. Supported values: completed, incomplete, in_progress.

signature

string | null

A signature for the reasoning content, used for verification.

instructions

string | null

Inserts a system (or developer) message as the first item in the model’s context. When used with input, the instructions are inserted at the start of the input.

stream

boolean

default:false

If set to true, the response will be streamed using server-sent events (SSE).

max_output_tokens

number | null

An upper bound for the number of output tokens, including visible output tokens and reasoning tokens.

temperature

number | null

Sampling temperature between 0 and 2. Higher values increase randomness.

top_p

number | null

Nucleus sampling parameter. An alternative to sampling with temperature.

top_k

number

Sample only from the top K options for each subsequent token. Used to remove “long tail” low-probability responses.

frequency_penalty

number | null

Number between -2.0 and 2.0. Positive values penalize new tokens based on their existing frequency in the text.

presence_penalty

number | null

Number between -2.0 and 2.0. Positive values penalize new tokens based on whether they already appear in the text.

top_logprobs

integer | null

An integer specifying the number of most likely tokens to return at each token position.

max_tool_calls

integer | null

Maximum number of tool calls the model can make in a single response.

tools

object[]

An array of tools the model may call.

Show tool types

type

string

required

Value: function.

name

string

required

The name of the function.

description

string | null

A description of the function.

parameters

object | null

A JSON Schema object defining the function parameters.

strict

boolean | null

Whether strict schema adherence is enabled.

type

string

required

Value: web_search_preview or web_search_preview_2025_03_11.

search_context_size

string

Size of the search context. Supported values: low, medium, high.

user_location

object

User location information for search personalization.

Show properties

type

string

required

Value: approximate.

city

string | null

City name.

country

string | null

Country name.

region

string | null

Region/state name.

timezone

string | null

IANA timezone name.

type

string

required

Value: web_search or web_search_2025_08_26.

search_context_size

string

Size of the search context. Supported values: low, medium, high.

filters

object | null

Domain filters for search results.

Show properties

allowed_domains

string[] | null

List of allowed domains to restrict search results to.

user_location

object

User location information for search personalization.

type

string

required

Value: file_search.

vector_store_ids

string[]

required

IDs of vector stores to search.

filters

object

Filters for file search. Can be a comparison filter (eq, ne, gt, gte, lt, lte) or a compound filter (and, or).

max_num_results

integer

Maximum number of results to return.

ranking_options

object

Ranking options for search results.

Show properties

ranker

string

Ranker to use. Supported values: auto, default-2024-11-15.

score_threshold

number

Minimum score threshold for results.

type

string

required

Value: computer_use_preview.

display_height

number

required

Display height in pixels.

display_width

number

required

Display width in pixels.

environment

string

required

The environment. Supported values: windows, mac, linux, ubuntu, browser.

type

string

required

Value: code_interpreter.

container

string | object

required

Container configuration. Can be a container ID string or an object.

Show properties (when object)

type

string

required

Value: auto.

file_ids

string[]

File IDs to make available in the container.

memory_limit

string | null

Memory limit. Supported values: 1g, 4g, 16g, 64g.

type

string

required

Value: mcp.

server_label

string

required

A label for the MCP server.

server_url

string

The URL of the MCP server.

allowed_tools

string[] | object

Tools the model is allowed to use from this server.

require_approval

string | object

Approval requirements for tool calls. String values: always, never. Can also be an object with never and always lists.

headers

object | null

Custom headers to include in requests to the MCP server.

server_description

string

Description of the MCP server.

type

string

required

Value: image_generation.

background

string

Background type. Supported values: transparent, opaque, auto.

model

string

Model to use. Supported values: gpt-image-1, gpt-image-1-mini.

quality

string

Image quality. Supported values: low, medium, high, auto.

size

string

Image size. Supported values: 1024x1024, 1024x1536, 1536x1024, auto.

output_format

string

Output format. Supported values: png, webp, jpeg.

moderation

string

Moderation level. Supported values: auto, low.

output_compression

number

Compression level for output.

partial_images

number

Number of partial images to return during generation.

tool_choice

string | object

Controls tool selection behavior. String values: none, auto, required. Can also specify a particular function or tool type.

Show object variants

function
web_search_preview

type

string

required

Value: function.

name

string

required

The name of the function to use.

type

string

required

Value: web_search_preview or web_search_preview_2025_03_11.

parallel_tool_calls

boolean | null

Whether to allow the model to run tool calls in parallel.

text

object

Configuration for text response format.

Show properties

format

object

The text format configuration.

Show format types

text
json_object
json_schema

type

string

required

Value: text.

type

string

required

Value: json_object.

type

string

required

Value: json_schema.

name

string

required

The name of the response format.

schema

object

required

The JSON schema definition.

description

string

Description of the schema.

strict

boolean | null

Whether strict schema adherence is enabled.

verbosity

string | null

Controls the verbosity of the text output. Supported values: high, medium, low.

reasoning

object

Configuration for reasoning output.

Show properties

effort

string

Constrains effort on reasoning. Supported values: xhigh, high, medium, low, minimal, none.

summary

string

Controls reasoning summary verbosity. Supported values: auto, concise, detailed.

max_tokens

number | null

Maximum number of tokens for reasoning.

enabled

boolean | null

Whether reasoning is enabled.

modalities

string[]

Output modalities for the response. Supported values: text, image.

previous_response_id

string | null

The ID of a previous response to use as context for this request.

include

string[]

Additional fields to include in the response. Supported values: file_search_call.results, message.input_image.image_url, computer_call_output.output.image_url, reasoning.encrypted_content, code_interpreter_call.outputs.

store

boolean

Whether to store the generated response for later retrieval.

service_tier

string

The service tier to use for this request. Supported values: auto.

truncation

string

Truncation strategy. Supported values: auto, disabled.

background

boolean | null

Whether to run the request in the background.

metadata

object

Set of key-value pairs that can be attached to the response. Keys must be ≤64 characters. Values must be ≤512 characters. Maximum 16 pairs allowed.

user

string

A unique identifier representing your end-user. Maximum of 128 characters.

Response

Successful response object.

string

Unique response identifier.

object

string

The object type. Always response.

created_at

number

Unix timestamp (in seconds) of when the response was created.

completed_at

number | null

Unix timestamp (in seconds) of when the response completed.

status

string

The status of the response. Possible values: completed, incomplete, in_progress, failed, cancelled, queued.

model

string

The model used for generating the response.

output

object[]

An array of output items generated by the model.

Show output item types

type

string

Value: message.

string

The unique ID of the output message.

role

string

Always assistant.

status

string

Status of the message. Possible values: completed, incomplete, in_progress.

content

object[]

The content of the output message.

Show content types

OutputText
Refusal

type

string

Value: output_text.

text

string

The generated text content.

annotations

object[]

Annotations for the content. Types include:

file_citation: {type, file_id, filename, index}
url_citation: {type, url, title, start_index, end_index}
file_path: {type, file_id, index}

logprobs

object[]

Log probability information for output tokens. Each item contains token, bytes, logprob, and top_logprobs.

type

string

Value: refusal.

refusal

string

The refusal message.

phase

string

The phase of the message. Possible values: commentary, final_answer.

type

string

Value: reasoning.

string

The unique ID of the reasoning item.

content

object[] | null

Array of reasoning text items, each with type: "reasoning_text" and text.

summary

object[]

Array of reasoning summary items, each with type: "summary_text" and text.

encrypted_content

string | null

Encrypted reasoning content.

status

string

Status. Possible values: completed, incomplete, in_progress.

signature

string | null

A signature for the reasoning content, used for verification.

format

string | null

The format of the reasoning content. Possible values: unknown, openai-responses-v1, azure-openai-responses-v1, xai-responses-v1, anthropic-claude-v1, google-gemini-v1.

type

string

Value: function_call.

string

The unique ID of the function call.

name

string

The name of the function called.

arguments

string

The arguments in JSON string format.

call_id

string

The call ID for matching with function call output.

status

string

Status. Possible values: completed, incomplete, in_progress.

type

string

Value: web_search_call.

string

The unique ID of the web search call.

action

object

The search action. Types include:

search: {type, query, queries?, sources?}
open_page: {type, url}
find_in_page: {type, pattern, url}

status

string

Status. Possible values: completed, searching, in_progress, failed.

type

string

Value: file_search_call.

string

The unique ID of the file search call.

queries

string[]

The search queries used.

status

string

Status. Possible values: completed, searching, in_progress, failed.

type

string

Value: image_generation_call.

string

The unique ID of the image generation call.

result

string | null

The generated image data (base64).

status

string

Status. Possible values: in_progress, completed, generating, failed.

output_text

string

Convenience field containing the concatenated text output from all output messages.

incomplete_details

object | null

Details about why the response is incomplete, if applicable.

Show properties

reason

string

The reason. Possible values: max_output_tokens, content_filter.

error

object | null

An error object if the generation failed.

Show properties

code

string

Error code. Possible values: server_error, rate_limit_exceeded, invalid_prompt, vector_store_timeout, invalid_image, invalid_image_format, invalid_base64_image, invalid_image_url, image_too_large, image_too_small, image_parse_error, image_content_policy_violation, invalid_image_mode, image_file_too_large, unsupported_image_media_type, empty_image_file, failed_to_download_image, image_file_not_found.

message

string

Human-readable error message.

usage

object

Token usage statistics for the response.

Show properties

input_tokens

number

The number of input tokens.

output_tokens

number

The number of output tokens.

total_tokens

number

The total number of tokens.

input_tokens_details

object

Breakdown of input tokens.

Show properties

cached_tokens

number

The number of cached tokens.

output_tokens_details

object

Breakdown of output tokens.

Show properties

reasoning_tokens

number

The number of reasoning tokens.

temperature

number | null

The sampling temperature used.

top_p

number | null

The nucleus sampling value used.

max_output_tokens

number | null

The max output tokens setting used.

top_logprobs

number

The top logprobs setting used.

max_tool_calls

number | null

The max tool calls setting used.

presence_penalty

number | null

The presence penalty used.

frequency_penalty

number | null

The frequency penalty used.

instructions

string | object[] | null

The instructions/system message used.

metadata

object

The metadata attached to the response.

tools

object[]

The tools configuration used.

tool_choice

string | object

The tool choice configuration used.

parallel_tool_calls

boolean

Whether parallel tool calls was enabled.

reasoning

object

The reasoning configuration used.

service_tier

string

The service tier used. Possible values: auto, default, flex, priority, scale.

store

boolean

Whether the response was stored.

truncation

string

The truncation strategy used. Possible values: auto, disabled.

text

object

The text format configuration used.

previous_response_id

string | null

The ID of the previous response used as context.

background

boolean | null

Whether the request ran in the background.

curl -X POST https://api-llm.sunra.ai/v1/responses \
  -H "Authorization: Bearer <SUNRA_KEY>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-4o",
    "input": [
      {
        "type": "message",
        "role": "user",
        "content": "Hello, how are you?"
      }
    ]
  }'

{
  "id": "resp-abc123",
  "object": "response",
  "created_at": 1704067200,
  "completed_at": 1704067201,
  "status": "completed",
  "model": "openai/gpt-4o",
  "output": [
    {
      "type": "message",
      "id": "msg_abc123",
      "role": "assistant",
      "status": "completed",
      "content": [
        {
          "type": "output_text",
          "text": "Hello! I'm doing well, thank you for asking. How can I help you today?",
          "annotations": []
        }
      ]
    }
  ],
  "output_text": "Hello! I'm doing well, thank you for asking. How can I help you today?",
  "incomplete_details": null,
  "error": null,
  "temperature": 1.0,
  "top_p": 1.0,
  "max_output_tokens": null,
  "top_logprobs": 0,
  "presence_penalty": null,
  "frequency_penalty": null,
  "instructions": null,
  "metadata": {},
  "tools": [],
  "tool_choice": "auto",
  "parallel_tool_calls": true,
  "reasoning": null,
  "service_tier": "auto",
  "store": true,
  "truncation": "disabled",
  "text": {
    "format": {
      "type": "text"
    }
  },
  "usage": {
    "input_tokens": 15,
    "output_tokens": 18,
    "total_tokens": 33,
    "input_tokens_details": {
      "cached_tokens": 0
    },
    "output_tokens_details": {
      "reasoning_tokens": 0
    }
  }
}

Chat

Anthropic Messages

Responses

Create a response

Authentication

Request

Response

Chat

Anthropic Messages

Responses

​Authentication

​Request

​Response

Authentication

Request

Response