Fish Audio Audio Cloning
Audio
Fish Audio Audio Cloning
POST
Fish Audio Audio Cloning
The Fish Audio API is used to create voice models (voice cloning).
Request Headers
Enum value:
application/jsonBearer authentication format: Bearer {{API Key}}.
Request Body
Model type. tts represents text-to-speech.Available value:
ttsAllowed value: "tts"The model title or name.
The model training mode. For TTS models, fast means the model is available immediately after creation.Available value:
fastAllowed value: "fast"Upload voice files used to fine-tune the model.
Model visibility. public will be displayed on the discovery page, unlist allows anyone with the link to access it, and private is visible only to the creator.Available values:
public, unlist, privateModel description.
Model cover image. This is required if the model is public.
Text corresponding to the voices. If not specified, ASR (automatic speech recognition) will be performed on the voices.
Model tags.
Enhance audio quality.
Response Information
The unique identifier of the created model.
Model type.Available values:
svc, ttsThe model title or name.
Model description.
The URL of the model cover image.
The current state of the model.Available values:
created, training, trained, failedModel tags.
The timestamp when the model was created.
The timestamp when the model was last updated.
The model visibility setting.Available values:
public, unlist, privateThe number of likes the model has received.
The number of favorites/bookmarks the model has received.
The number of times the model has been shared.
The number of tasks associated with the model.
Information about the model author.
The training mode used by the model.Available values:
fast, fullSample data associated with the model.
Languages supported by the model.
Whether the visibility setting is locked.
Whether the current user has unliked this model.
Whether the current user has liked this model.
Whether the current user has favorited/bookmarked this model.