OpenAIJan 19, 2026

OpenAI: GPT Audio

Name: OpenAI: GPT Audio
Price: 2.5 USD
Author: OpenAI

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

Context Window

128K

tokens

Max Output

16K

tokens

Released

Jan 19, 2026

Arena Rank

—

Capabilities

👁Vision

🧠Reasoning

🔧Tool Calling

⚡Prompt Caching

🖥Computer Use

🎨Image Generation

Supported Parameters

Frequency Penalty

Reduce repetition

Logit Bias

Adjust token weights

Log Probs

Token probabilities

Max Tokens

Output length limit

Presence Penalty

Encourage new topics

Response Format

JSON mode / structured output

Seed

Deterministic outputs

Stop Sequences

Custom stop tokens

structured_outputs

Temperature

Controls randomness

top_logprobs

Top P

Nucleus sampling

Pricing Comparison

Router	Input / 1M	Output / 1M	Cached Input / 1M
OpenRouter	$2.50	$10.00	—

Model IDs

OpenRouteropenai/gpt-audio

Compare with another model

Related Models

Other#66

Capabilities

Supported Parameters

Pricing Comparison

Model IDs

Related Models

Meituan: LongCat Flash Chat

Xiaomi: MiMo-V2-Flash

Prime Intellect: INTELLECT-3

OpenAI: gpt-oss-120b (free)

AllenAI: Olmo 3.1 32B Instruct

OpenAI: gpt-oss-20b (free)