rocm-chatterbox-whisper/Dockerfile.rocm
scott 2b1398109d
Switch to ONNX runtime with chatterbox-turbo-ONNX (fp16)
Replaces the PyTorch/chatterbox-tts stack with direct ONNX inference
using ResembleAI/chatterbox-turbo-ONNX fp16 weights.

- engine.py: full rewrite — ONNX sessions, autoregressive KV-cache LM
  loop, voice conditionals cache via speech_encoder outputs
- wyoming_handler.py: remove torch dep, use np.asarray for audio bytes
- requirements-rocm-init.txt: onnxruntime-rocm replaces torch wheels
- requirements-rocm.txt: drop chatterbox/torch deps, keep audio utils
- Dockerfile.rocm: remove chatterbox-tts install step

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-06 19:08:34 -04:00


FROM rocm/dev-ubuntu-22.04:6.1
ENV DEBIAN_FRONTEND=noninteractive \
    PYTHONDONTWRITEBYTECODE=1 \
    PYTHONUNBUFFERED=1 \
    HF_HOME=/app/hf_cache \
    PIP_NO_CACHE_DIR=1
RUN apt-get update && apt-get install -y --no-install-recommends \
        python3 \
        python3-pip \
        python3-dev \
        git \
        ffmpeg \
        libsndfile1 \
    && rm -rf /var/lib/apt/lists/*
WORKDIR /app
# Step 1: Install onnxruntime-rocm first so it claims the onnxruntime namespace
# before any other package can pull in the CPU-only onnxruntime wheel.
COPY requirements-rocm-init.txt .
RUN pip3 install -r requirements-rocm-init.txt
# Step 2: Install remaining dependencies.
COPY requirements-rocm.txt .
RUN pip3 install -r requirements-rocm.txt
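# Optional sanity check (a sketch, not part of the original build): list the
# installed onnxruntime distributions after both install steps. pip tracks
# onnxruntime-rocm and onnxruntime as separate packages, so if Step 2 pulled
# in the CPU-only wheel, both names would appear in this output. The step
# fails the build only if no onnxruntime package is installed at all.
RUN pip3 list | grep -i onnxruntime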
# Application source
COPY engine.py config.py wyoming_handler.py wyoming_voices.py main.py ./
# Default config (can be overridden by volume mount)
COPY config.yaml .
# Create default directories
RUN mkdir -p voices reference_audio hf_cache
EXPOSE 10200
CMD ["python3", "main.py"]