OpenAIJan 19, 2026
OpenAI: GPT Audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.
Context Window
128K
tokens
Max Output
16K
tokens
Released
Jan 19, 2026
Arena Rank
—
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Frequency Penalty
Reduce repetition
Logit Bias
Adjust token weights
Log Probs
Token probabilities
Max Tokens
Output length limit
Presence Penalty
Encourage new topics
Response Format
JSON mode / structured output
Seed
Deterministic outputs
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
top_logprobs
Top P
Nucleus sampling
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| OpenRouter | $2.50 | $10.00 | — |
Model IDs
OpenRouter
openai/gpt-audioRelated Models
Other#66
Meituan: LongCat Flash Chat
131K ctxFree/1M in
Other#72
Xiaomi: MiMo-V2-Flash
262K ctx$0.09/1M in
Other#106
Prime Intellect: INTELLECT-3
131K ctx$0.20/1M in
OpenAI#108
OpenAI: gpt-oss-120b (free)
131K ctxFree/1M in
Other#139
AllenAI: Olmo 3.1 32B Instruct
66K ctx$0.20/1M in
OpenAI#159
OpenAI: gpt-oss-20b (free)
131K ctxFree/1M in