OpenAI Responses
Create responses using the OpenAI Responses API format
Creates a response using the OpenAI Responses API format (Documentation Index
Fetch the complete documentation index at: https://www.edgee.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
POST /v1/responses). This endpoint is compatible with OpenAI’s Responses API, making it easy to use with tools like the Codex CLI.Authorizations
Headers
Comma-separated list of tags for categorizing and filtering requests in analytics and logs. Example: production,agent,codex
Enable debug mode to include additional debugging information in the response.
Compression bundle to apply.
claude, opencode, cursor, codex Body
ID of the model to use, with provider prefix. Format: {author_id}/{model_id}.
"openai/gpt-4o"
The input to the model. Either a plain string (treated as a single user message) or a flat array of typed input items (messages, function calls, function call outputs).
System-level instruction prepended to the conversation. An alternative to including a system role message in the input array.
If set, the response is streamed as Server-Sent Events (SSE).
Maximum number of tokens to generate.
x >= 1Tools available to the model. Uses the Responses API flat format (no nested function key).
Controls tool selection.
auto, none Sampling temperature between 0 and 2. Higher values produce more random outputs.
0 <= x <= 2Nucleus sampling probability. Alternative to temperature.
List of string tags for categorizing and filtering requests in analytics and logs. Can also be sent via the X-Edgee-Tags header.
Enable debug mode to include additional information in the response.
List of Edgee-managed tool IDs to include automatically (e.g. edgee_current_time, edgee_generate_uuid). Each ID must be activated for your API key.
["edgee_current_time", "edgee_generate_uuid"]Pending operation ID when continuing a conversation after Edge Tool execution. The gateway injects stored Edge Tool results into the conversation history.
Selects the compression bundle to apply to the request. Equivalent to the X-Edgee-Compression-Model header.
claude, opencode, codex, cursor Response
Response created successfully
Unique identifier for the response, prefixed with resp_.
"resp_abc123"
response completed for non-streaming responses; in_progress is emitted on the initial response.created SSE event.
completed, in_progress Unix timestamp (as a float) of when the response was created.
1677652288
The model used to generate the response.
"openai/gpt-4o"
Array of output items produced by the model.
Token usage statistics for the response.