Step-Audio-TTS-3B

Brief: Convert text to speech using the Step-Audio-TTS-3B model.

Overview

Method: POST
Path: /v1/audio/speech
Content-Type: application/json

Authentication

Header: Authorization: Bearer <token>
Optional Header: Accept: */*

Request Body Parameters

Parameter	Type	Required	Description
model	string	Yes	Model name, set to `Step-Audio-TTS-3B`
input	string	Yes	Text to convert into speech
extra_body	object	No	Additional settings object for reference text or voice imitation audio
extra_body.prompt_text	string	No	Reference text used to guide the voice imitation, supports `zh` or `en`
extra_body.prompt_audio_url	string	No	Voice imitation audio URL, supports WAV format only

curl Example

bash

curl -X POST "https://api.gpt.ge/v1/audio/speech" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxxx" \
  -H "Accept: */*" \
  -d '{
    "model": "Step-Audio-TTS-3B",
    "input": "Hello, I am the morphogen TTS assistant. Thank you for using our API service!",
    "extra_body": {
      "prompt_text": "zh",
      "prompt_audio_url": "https://example.com/path/to/voice.wav"
    }
  }'

JavaScript (fetch) Example

javascript

fetch('https://api.gpt.ge/v1/audio/speech', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    'Authorization': 'Bearer sk-xxxx',
    'Accept': '*/*'
  },
  body: JSON.stringify({
    model: 'Step-Audio-TTS-3B',
    input: 'Hello, I am the morphogen TTS assistant. Thank you for using our API service!',
    extra_body: {
      prompt_text: 'zh',
      prompt_audio_url: 'https://example.com/path/to/voice.wav'
    }
  })
}).then(r => r.blob()).then(console.log);

Python (requests) Example

python

import requests

response = requests.post(
    'https://api.gpt.ge/v1/audio/speech',
    headers={
        'Content-Type': 'application/json',
        'Authorization': 'Bearer sk-xxxx',
        'Accept': '*/*'
    },
    json={
        'model': 'Step-Audio-TTS-3B',
        'input': 'Hello, I am the morphogen TTS assistant. Thank you for using our API service!',
        'extra_body': {
            'prompt_text': 'zh',
            'prompt_audio_url': 'https://example.com/path/to/voice.wav'
        }
    }
)
print(response.status_code)
print(response.content[:20])

Response Example (200)

text

<binary audio data returned directly, such as MP3 or WAV file content>

Note: The response usually returns the audio file content directly in the selected audio format.

Step-Audio-TTS-3B ​

Overview ​

Authentication ​

Request Body Parameters ​

curl Example ​

JavaScript (fetch) Example ​

Python (requests) Example ​

Response Example (200) ​

Step-Audio-TTS-3B

Overview

Authentication

Request Body Parameters

curl Example

JavaScript (fetch) Example

Python (requests) Example

Response Example (200)