The OpenAI Responses API is used to create model responses. It supports multi-turn dialogue, tool calls, inference, and other functionalities.
Authentication is done using Bearer Token.
Format: Authorization: Bearer sk-xxxxxx
Model ID used to generate the response
input: string or array
A system (or developer) message inserted into the model's context.
An upper bound for the number of tokens that can be generated for a response, including visible output tokens and reasoning tokens.
What sampling temperature to use, between 0 and 2.
An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass.
If set to true, the model response data will be streamed to the client as it is generated using server-sent events.
The unique ID of the previous response to the model. Use this to create multi-turn conversations. Learn more about conversation state.
auto, disabled