Google Chat Completions
Call the Gemini family of models through an OpenAI-compatible interface.
Unified Interface
Google models are called through the same /v1/chat/completions endpoint; the request and response formats are identical to OpenAI's.
Request
POST https://api.clawdrouter.com/v1/chat/completions
Request Headers
| Name | Required | Type | Description |
|---|---|---|---|
| Authorization | Yes | string | Bearer YOUR_API_KEY |
| Content-Type | Yes | string | application/json |
| Request-Id | No | string | A unique business identifier generated by the client system |
Request Body
The core parameters match OpenAI Chat Completions. Google models additionally support the following parameters:
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| model | string | Yes | — | Model identifier; see the model list |
| messages | array | Yes | — | List of conversation messages |
| stream | boolean | No | false | Whether to enable streaming output |
| reasoning_effort | string | No | — | Thinking-effort control: low / medium / high. Controls the reasoning depth of Gemini thinking models |
| thinking | object | No | — | Thinking-mode control. Set {"type": "enabled", "budget_tokens": 0} to disable thinking |
| stream_options | object | No | — | Streaming options. {"include_usage": true} includes token usage in the streaming response |
For the full parameter reference, see OpenAI Chat Completions parameters.
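As a sketch, the parameters in the table above can be assembled into a request body like the following. The helper name and its defaults are illustrative, not part of the API; only the field names and values come from the table:

```python
def build_request(prompt, reasoning_effort=None, disable_thinking=False, stream=False):
    """Assemble a /v1/chat/completions request body (illustrative sketch)."""
    body = {
        "model": "gemini-2.5-flash",
        "messages": [{"role": "user", "content": prompt}],
        "stream": stream,
    }
    if stream:
        # Include token usage in the final streaming chunk.
        body["stream_options"] = {"include_usage": True}
    if reasoning_effort is not None:
        # "low" / "medium" / "high"
        body["reasoning_effort"] = reasoning_effort
    if disable_thinking:
        # A thinking budget of 0 tokens switches thinking off.
        body["thinking"] = {"type": "enabled", "budget_tokens": 0}
    return body
```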
Request Examples
Example 1: streaming request (with thinking)
```shell
curl https://api.clawdrouter.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "gemini-2.5-flash",
    "messages": [
      {"role": "user", "content": "Introduce quantum computing"}
    ],
    "stream": true,
    "stream_options": {"include_usage": true},
    "reasoning_effort": "low"
  }'
```
Streaming response (thinking content is returned in the reasoning_content field):
```
data: {
  "id": "gCglaaHuCby-694Pv5bo0Q0",
  "created": 1764042881,
  "model": "gemini-2.5-flash",
  "object": "chat.completion.chunk",
  "choices": [
    {
      "index": 0,
      "delta": {
        "role": "assistant",
        "reasoning_content": "**Initiating Conceptual Breakdown**\n\nI'm starting by dissecting the prompt...",
        "content": null
      },
      "finish_reason": null
    }
  ]
}

data: {
  "id": "gCglaaHuCby-694Pv5bo0Q0",
  "created": 1764042881,
  "model": "gemini-2.5-flash",
  "object": "chat.completion.chunk",
  "choices": [
    {
      "index": 0,
      "delta": {
        "content": "Quantum computing is a new computing paradigm that processes information using the principles of quantum mechanics..."
      },
      "finish_reason": null
    }
  ]
}

data: [DONE]
```
Thinking content
Gemini 2.5 series models support a "thinking" feature. In streaming responses, the model's thinking process is returned in the delta.reasoning_content field, while the final reply is returned in the delta.content field.
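A stream consumer can route the two fields into separate buffers. The sketch below operates on plain chunk dicts shaped like the streaming example above (the function name is illustrative):

```python
def split_stream(chunks):
    """Collect thinking text and reply text from streamed chunk dicts.

    Sketch only: assumes each chunk follows the shape shown above,
    with optional delta.reasoning_content and delta.content fields.
    """
    reasoning, answer = [], []
    for chunk in chunks:
        for choice in chunk.get("choices", []):
            delta = choice.get("delta", {})
            if delta.get("reasoning_content"):
                reasoning.append(delta["reasoning_content"])
            if delta.get("content"):
                answer.append(delta["content"])
    return "".join(reasoning), "".join(answer)
```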
Example 2: non-streaming request
```shell
curl https://api.clawdrouter.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "gemini-2.5-flash",
    "messages": [
      {"role": "user", "content": "Introduce quantum computing"}
    ],
    "stream": false,
    "reasoning_effort": "low"
  }'
```
Response:
```json
{
  "id": "Oiwlacz2CYq4694P3eOo8Q0",
  "created": 1764043828,
  "model": "gemini-2.5-flash",
  "object": "chat.completion",
  "choices": [
    {
      "finish_reason": "stop",
      "index": 0,
      "message": {
        "content": "Quantum computing is a new computing paradigm that processes information using the principles of quantum mechanics...",
        "role": "assistant"
      }
    }
  ],
  "usage": {
    "prompt_tokens": 5,
    "completion_tokens": 1200,
    "total_tokens": 1205
  }
}
```
Response Fields
| Field | Type | Description |
|---|---|---|
| id | string | Unique identifier for this completion |
| object | string | Always chat.completion (non-streaming) or chat.completion.chunk (streaming) |
| created | integer | Unix timestamp of creation |
| model | string | The model version actually used |
| choices | array | List of completion results |
| choices[].message | object | The message generated by the model (non-streaming) |
| choices[].delta | object | Streaming incremental content; may contain content and reasoning_content |
| choices[].finish_reason | string | Stop reason: stop (normal completion) / length (length limit reached) / tool_calls (tool invocation) |
| usage | object | Token usage statistics (streaming requires stream_options.include_usage: true) |
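With include_usage set, the usage object typically arrives only on the final streaming chunk. A sketch of pulling it out of a sequence of chunk dicts (function name is illustrative; field names come from the table above):

```python
def final_usage(chunks):
    """Return the usage dict from a stream of chunk dicts, or None.

    Sketch: with stream_options.include_usage set, intermediate chunks
    carry a null usage; only the last chunk holds the real counts.
    """
    usage = None
    for chunk in chunks:
        if chunk.get("usage"):
            usage = chunk["usage"]
    return usage
```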
Example 3: disabling thinking mode
If you do not need the model's thinking process, disable it with the thinking parameter:
```shell
curl https://api.clawdrouter.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "gemini-2.5-flash",
    "messages": [
      {"role": "user", "content": "Introduce quantum computing"}
    ],
    "stream": true,
    "thinking": {
      "type": "enabled",
      "budget_tokens": 0
    },
    "stream_options": {"include_usage": true}
  }'
```
With thinking disabled, the response no longer contains the reasoning_content field; the model outputs its reply directly.
Using the Python SDK
```python
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.clawdrouter.com/v1",
)

# Basic call
completion = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[
        {"role": "user", "content": "Introduce quantum computing"}
    ],
)
print(completion.choices[0].message.content)

# Streaming call
stream = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[
        {"role": "user", "content": "Introduce quantum computing"}
    ],
    stream=True,
    extra_body={"reasoning_effort": "low"},
)
for chunk in stream:
    # Guard against chunks with an empty choices list
    # (e.g. the final usage chunk when include_usage is enabled).
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")
```
Using the Node.js SDK
```javascript
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "YOUR_API_KEY",
  baseURL: "https://api.clawdrouter.com/v1",
});

async function main() {
  // Basic call
  const completion = await client.chat.completions.create({
    model: "gemini-2.5-flash",
    messages: [
      { role: "user", content: "Introduce quantum computing" },
    ],
  });
  console.log(completion.choices[0].message.content);
}

main();
```
Streaming call (with thinking)
```javascript
async function streamWithThinking() {
  const stream = await client.chat.completions.create({
    model: "gemini-2.5-flash",
    messages: [
      { role: "user", content: "Introduce quantum computing" },
    ],
    stream: true,
    stream_options: { include_usage: true },
    reasoning_effort: "low",
  });

  for await (const chunk of stream) {
    // The final usage chunk has an empty choices array, hence the optional chaining.
    const delta = chunk.choices[0]?.delta;
    if (delta?.reasoning_content) {
      process.stdout.write(`[thinking] ${delta.reasoning_content}`);
    }
    if (delta?.content) {
      process.stdout.write(delta.content);
    }
  }
}

streamWithThinking();
```