Fish Speech [1][2] (Later)

git clone --depth=1 https://github.com/fishaudio/fish-speech
cd fish-speech
uv venv --python 3.12
.venv\Scripts\activate.bat
uv pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
uv pip install https://github.com/AnyaCoder/fish-speech/releases/download/v0.1.0/triton_windows-0.1.0-py3-none-any.whl
uv pip install -e .
uv pip install hf_transfer
  1. Create folder checkpoints\.

  2. Download fishaudio/openaudio-s1-mini into checkpoints\openaudio-s1-mini.
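The two steps above can also be scripted. The following is a minimal sketch, not the project's documented method: it assumes the `huggingface_hub` package is available in the virtual environment, and the helper names `target_dir` and `download` are hypothetical. Setting `HF_HUB_ENABLE_HF_TRANSFER` asks the hub client to use the `hf_transfer` package installed earlier; depending on the installed `huggingface_hub` version, the variable may have no effect.

```python
import os
from pathlib import Path

# Repository named in step 2 above.
REPO_ID = "fishaudio/openaudio-s1-mini"


def target_dir(repo_id: str, root: str = "checkpoints") -> Path:
    """Map 'owner/name' to '<root>/<name>', the layout used in the steps above."""
    return Path(root) / repo_id.split("/")[-1]


def download(repo_id: str = REPO_ID, root: str = "checkpoints") -> Path:
    """Create the checkpoints folder and fetch the model snapshot into it.

    Hypothetical helper: assumes huggingface_hub is installed.
    """
    # The variable is read when huggingface_hub is imported, so set it first.
    os.environ.setdefault("HF_HUB_ENABLE_HF_TRANSFER", "1")
    from huggingface_hub import snapshot_download

    dest = target_dir(repo_id, root)
    dest.mkdir(parents=True, exist_ok=True)  # step 1: create checkpoints\...
    snapshot_download(repo_id=repo_id, local_dir=str(dest))  # step 2
    return dest
```

Calling `download()` from the fish-speech folder should leave the files in checkpoints\openaudio-s1-mini, which is the path the launch command expects.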

Usage

As CLI.

As GUI:

python -m tools.run_webui --llama-checkpoint-path "checkpoints/openaudio-s1-mini" --decoder-checkpoint-path "checkpoints/openaudio-s1-mini/codec.pth" --decoder-config-name modded_dac_vq
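The launch command above points at the model folder and at codec.pth inside it. A small pre-flight check can catch a missing or misplaced download before the web UI starts. This is a sketch that assumes only the paths shown in that command; the helper name `missing_checkpoint_files` is hypothetical and not part of Fish Speech, and the model folder may need other files that this check does not know about.

```python
from pathlib import Path


def missing_checkpoint_files(model_dir, required=("codec.pth",)):
    """Return the names from `required` that are not regular files in `model_dir`.

    Only codec.pth is checked by default because it is the only file the
    launch command names explicitly.
    """
    base = Path(model_dir)
    return [name for name in required if not (base / name).is_file()]
```

For example, `missing_checkpoint_files("checkpoints/openaudio-s1-mini")` returns an empty list when codec.pth is in place, and `["codec.pth"]` when the file or the whole folder is absent.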

If the server port is already in use, edit API_FLAGS.txt to change it.
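To find out whether a port is taken before editing the file, a quick probe with the standard library is enough. This is a sketch under assumptions: the helper `port_is_free` is hypothetical, and the port to test is whichever one your configuration names, since the source does not state it.

```python
import socket


def port_is_free(port, host="127.0.0.1"):
    """Return True if a TCP socket can currently be bound to (host, port)."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        try:
            probe.bind((host, port))
        except OSError:
            # Binding failed, typically because another process holds the port.
            return False
        return True
```

For instance, `port_is_free(7860)` checks the port Gradio usually defaults to; whether the web UI here uses that port is an assumption, so substitute the value from your own setup.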