Poe API

GPT-Audio

OFFICIAL

OpenAI's gpt-audio model, brought to Poe as a server bot! This model accepts text and audio inputs and can respond with natural-sounding speech. Learn more at https://platform.openai.com/docs/models/gpt-audio. When audio output is on, responses include both a voice recording and a text transcript. When audio output is off, one of the messages in the conversation must include an audio file attachment (audio in previous messages from either you or the bot is fine). Optional parameters: - enable_audio_output: either true or false to control whether responses are spoken aloud (default: true) - voice: the name of a voice for voice outputs: "alloy", "ash", "ballad", "cedar", "coral", "echo", "fable", "marin", "nova", "onyx", "sage", or "shimmer" (default: marin) - model_snapshot: select a specific model version - auto_max_tokens: Let the model decide the max number of tokens to output (enabled by default). Disable this option to manually specify a token limit between 10 and 16,384 using the max_tokens parameter.

Build with GPT-Audio using the Poe API

Start by creating an API key, for use with any bot on Poe:

Generate API key

See the full documentation for comprehensive guidance on getting started.

More from Bina AI

OpenAI-TTS-1-HD

OpenAI-TTS-1

GPT-Audio-1.5

GPT-Audio

GPT-Audio-Mini

GPT-4o-Mini-Audio

GPT-4o-Audio-Preview