openai/gpt-audio
上下文长度: 128,000
text+audio->text+audio
GPT
2026-01-20 更新
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.