Poe API
GPT-Audio
OpenAI's gpt-audio model, brought to Poe as a server bot! This model accepts text and audio inputs and can respond with natural-sounding speech. Learn more at https://platform.openai.com/docs/models/gpt-audio.
When audio output is on, responses include both a voice recording and a text transcript. When audio output is off, one of the messages in the conversation must include an audio file attachment (audio in previous messages from either you or the bot is fine).
Optional parameters:
- enable_audio_output: either true or false to control whether responses are spoken aloud (default: true)
- voice: the name of a voice for voice outputs: "alloy", "ash", "ballad", "cedar", "coral", "echo", "fable", "marin", "nova", "onyx", "sage", or "shimmer" (default: marin)
- model_snapshot: select a specific model version
- auto_max_tokens: Let the model decide the max number of tokens to output (enabled by default). Disable this option to manually specify a token limit between 10 and 16,384 using the max_tokens parameter.
Powered by a server managed by @binaai. Learn more
Build with GPT-Audio using the Poe API
Start by creating an API key, for use with any bot on Poe:
See the full documentation for comprehensive guidance on getting started.
More from Bina AI
OpenAI-TTS-1-HD
OpenAI-TTS-1
GPT-Audio-1.5
GPT-Audio
GPT-Audio-Mini
GPT-4o-Mini-Audio
GPT-4o-Audio-Preview